Mireia Díez

Orcid: 0000-0001-7894-8377

According to our database1, Mireia Díez authored at least 57 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information?
CoRR, 2024

2023
DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors.
CoRR, 2023

Discriminative Training of VBx Diarization.
CoRR, 2023

DiaCorrect: Error Correction Back-end For Speaker Diarization.
CoRR, 2023

Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization.
CoRR, 2023

Multi-Speaker and Wide-Band Simulated Conversations as Training Data for End-to-End Neural Diarization.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: Theory, implementation and analysis on standard tasks.
Comput. Speech Lang., 2022

From Simulated Mixtures to Simulated Conversations as Training Data for End-to-End Neural Diarization.
Proceedings of the Interspeech 2022, 2022

Speaker adaptation for Wav2vec2 based dysarthric ASR.
Proceedings of the Interspeech 2022, 2022

2021
Analysis of the but Diarization System for Voxconverse Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Analysis of Speaker Diarization Based on Bayesian HMM With Eigenvoice Priors.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

End-to-end DNN based text-independent speaker recognition for long and short utterances.
Comput. Speech Lang., 2020

13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE.
Comput. Speech Lang., 2020


But System for the Second Dihard Speech Diarization Challenge.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Optimizing Bayesian Hmm Based X-Vector Clustering for the Second Dihard Speech Diarization Challenge.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Bayesian HMM Based x-Vector Clustering for Speaker Diarization.
Proceedings of the Interspeech 2019, 2019

2018
Analysis of BUT-PT Submission for NIST LRE 2017.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Speaker Diarization based on Bayesian HMM with Eigenvoice Priors.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

BUT System for DIHARD Speech Diarization Challenge 2018.
Proceedings of the Interspeech 2018, 2018

End-to-End DNN Based Speaker Recognition Inspired by I-Vector and PLDA.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Analysis of Score Normalization in Multilingual Speaker Recognition.
Proceedings of the Interspeech 2017, 2017

MGB-3 but system: Low-resource ASR on Egyptian YouTube data.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
KALAKA-3: a database for the assessment of spoken language recognition technology on YouTube audios.
Lang. Resour. Evaluation, 2016

2014
On the Projection of PLLRs for Unbounded Feature Distributions in Spoken Language Recognition.
IEEE Signal Process. Lett., 2014

On the Complementarity of Phone Posterior Probabilities for Improved Speaker Recognition.
IEEE Signal Process. Lett., 2014

GTTS-EHU Systems for QUESST at MediaEval 2014.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

KALAKA-3: a database for the recognition of spoken European languages on YouTube audios.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

PLLR features in language recognition system for RATS.
Proceedings of the INTERSPEECH 2014, 2014

New insight into the use of phone log-likelihood ratios as features for language recognition.
Proceedings of the INTERSPEECH 2014, 2014

On the complementarity of short-time fourier analysis windows of different lengths for improved language recognition.
Proceedings of the INTERSPEECH 2014, 2014

Optimizing PLLR Features for Spoken Language Recognition.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

High-performance Query-by-Example Spoken Term Detection on the SWS 2013 evaluation.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Language Recognition on Albayzin 2010 LRE using PLLR features.
Proces. del Leng. Natural, 2013

GTTS Systems for the SWS Task at MediaEval 2013.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

Handling recordings acquired simultaneously over multiple channels with PLDA.
Proceedings of the INTERSPEECH 2013, 2013

The albayzin 2012 language recognition evaluation.
Proceedings of the INTERSPEECH 2013, 2013

Using phone log-likelihood ratios as features for speaker recognition.
Proceedings of the INTERSPEECH 2013, 2013

Dimensionality reduction of phone log-likelihood ratio features for spoken language recognition.
Proceedings of the INTERSPEECH 2013, 2013


2012
On the use of phone log-likelihood ratios as features in spoken language recognition.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Evaluation of spoken language recognition technology using broadcast speech: performance and challenges.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

GTTS System for the Spoken Web Search Task at MediaEval 2012.
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

KALAKA-2: a TV Broadcast Speech Database for the Recognition of Iberian Languages in Clean and Noisy Environments.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Using Time-Synchronous Phone Co-occurrences in a SVM-Phonotactic Dialect Recognition System.
Proceedings of the INTERSPEECH 2012, 2012

The BLZ Submission to the NIST 2011 LRE: Data Collection, System Development and Performance.
Proceedings of the INTERSPEECH 2012, 2012

The EHU Systems for the NIST 2011 Language Recognition Evaluation.
Proceedings of the INTERSPEECH 2012, 2012

Study of Different Backends in a State-Of-the-Art Language Recognition System.
Proceedings of the INTERSPEECH 2012, 2012

2011
A Spoken Document Retrieval System for TV Broadcast News in Spanish and Basque.
Proces. del Leng. Natural, 2011

Spoken language recognition in conversational telephone speech and TV broadcast news (GLOSA).
Proces. del Leng. Natural, 2011

The Albayzin 2010 Language Recognition Evaluation.
Proceedings of the INTERSPEECH 2011, 2011

On the Use of Dot Scoring for Speaker Diarization.
Proceedings of the Pattern Recognition and Image Analysis - 5th Iberian Conference, 2011


2010
Search and access to information contained in the speech of multimedia resources.
Proces. del Leng. Natural, 2010

Verification of the four Spanish official languages on TV show recordings.
Proces. del Leng. Natural, 2010

KALAKA: A TV Broadcast Speech Database for the Evaluation of Language Recognition Systems.
Proceedings of the International Conference on Language Resources and Evaluation, 2010


  Loading...