Mireia Díez
Orcid: 0000-0001-7894-8377
According to our database1,
Mireia Díez
authored at least 61 papers
between 2010 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization.
CoRR, 2024
Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information?
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Multi-Speaker and Wide-Band Simulated Conversations as Training Data for End-to-End Neural Diarization.
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: Theory, implementation and analysis on standard tasks.
Comput. Speech Lang., 2022
From Simulated Mixtures to Simulated Conversations as Training Data for End-to-End Neural Diarization.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 6th International Conference, 2022
2021
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
IEEE ACM Trans. Audio Speech Lang. Process., 2020
End-to-end DNN based text-independent speaker recognition for long and short utterances.
Comput. Speech Lang., 2020
13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE.
Comput. Speech Lang., 2020
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Optimizing Bayesian Hmm Based X-Vector Clustering for the Second Dihard Speech Diarization Challenge.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
2018
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
2016
KALAKA-3: a database for the assessment of spoken language recognition technology on YouTube audios.
Lang. Resour. Evaluation, 2016
2014
On the Projection of PLLRs for Unbounded Feature Distributions in Spoken Language Recognition.
IEEE Signal Process. Lett., 2014
On the Complementarity of Phone Posterior Probabilities for Improved Speaker Recognition.
IEEE Signal Process. Lett., 2014
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014
KALAKA-3: a database for the recognition of spoken European languages on YouTube audios.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
New insight into the use of phone log-likelihood ratios as features for language recognition.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
On the complementarity of short-time fourier analysis windows of different lengths for improved language recognition.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 22nd International Conference on Pattern Recognition, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
Proces. del Leng. Natural, 2013
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Dimensionality reduction of phone log-likelihood ratio features for spoken language recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the International Conference on Biometrics, 2013
2012
On the use of phone log-likelihood ratios as features in spoken language recognition.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Evaluation of spoken language recognition technology using broadcast speech: performance and challenges.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012
KALAKA-2: a TV Broadcast Speech Database for the Recognition of Iberian Languages in Clean and Noisy Environments.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012
Using Time-Synchronous Phone Co-occurrences in a SVM-Phonotactic Dialect Recognition System.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
The BLZ Submission to the NIST 2011 LRE: Data Collection, System Development and Performance.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
2011
Proces. del Leng. Natural, 2011
Spoken language recognition in conversational telephone speech and TV broadcast news (GLOSA).
Proces. del Leng. Natural, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the Pattern Recognition and Image Analysis - 5th Iberian Conference, 2011
Multi-site heterogeneous system fusions for the Albayzin 2010 Language Recognition Evaluation.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011
2010
Proces. del Leng. Natural, 2010
Proces. del Leng. Natural, 2010
KALAKA: A TV Broadcast Speech Database for the Evaluation of Language Recognition Systems.
Proceedings of the International Conference on Language Resources and Evaluation, 2010