Doroteo T. Toledano
Orcid: 0000-0003-1159-6455Affiliations:
- Autonomous University of Madrid, AUDIAS, Spain
According to our database1,
Doroteo T. Toledano
authored at least 78 papers
between 1997 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on linkedin.com
-
on orcid.org
On csauthors.net:
Bibliography
2024
Whisper-based spoken term detection systems for search on speech ALBAYZIN evaluation challenge.
EURASIP J. Audio Speech Music. Process., December, 2024
Enhancing Conformer-Based Sound Event Detection Using Frequency Dynamic Convolutions and BEATs Audio Embeddings.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Voxceleb-ESP: preliminary experiments detecting Spanish celebrities from their voices.
CoRR, 2024
2022
Source Separation for Sound Event Detection in Domestic Environments using Jointly Trained Models.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022
Proceedings of the 6th International Conference, 2022
2021
A Multi-Resolution CRNN-Based Approach for Semi-Supervised Sound Event Detection in DCASE 2020 Challenge.
IEEE Access, 2021
Proceedings of the Fifth International Conference, 2021
An analysis of Sound Event Detection under acoustic degradation using multi-resolution systems.
Proceedings of the Fifth International Conference, 2021
Query-by-Example Spoken Term Detection using Attentive Pooling Networks at ALBAYZIN 2020 Evaluation: The AUDIAS-UAM System.
Proceedings of the Fifth International Conference, 2021
Multiple Feature Resolutions for Different Polyphonic Sound Detection Score Scenarios in DCASE 2021 Task 4.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021
2020
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020
2019
Estudio sobre documentos reutilizables como recursos lingüísticos en el marco del desarrollo del Plan de Impulso de las Tecnologías del Lenguaje.
Proces. del Leng. Natural, 2019
Search on speech from spoken queries: the Multi-domain International ALBAYZIN 2018 Query-by-Example Spoken Term Detection Evaluation.
EURASIP J. Audio Speech Music. Process., 2019
ALBAYZIN 2018 spoken term detection evaluation: a multi-domain international evaluation in Spanish.
EURASIP J. Audio Speech Music. Process., 2019
Exploring convolutional, recurrent, and hybrid deep neural networks for speech and music detection in a large audio dataset.
EURASIP J. Audio Speech Music. Process., 2019
2018
EURASIP J. Audio Speech Music. Process., 2018
DNN-based Embeddings for Speaker Diarization in the AuDIaS-UAM System for the Albayzin 2018 IberSPEECH-RTVE Evaluation.
Proceedings of the Fourth International Conference, 2018
Audio event detection on Google's Audio Set database: Preliminary results using different types of DNNs.
Proceedings of the Fourth International Conference, 2018
AUDIAS-CEU: A Language-independent approach for the Query-by-Example Spoken Term Detection task of the Search on Speech ALBAYZIN 2018 evaluation.
Proceedings of the Fourth International Conference, 2018
2017
ALBAYZIN 2016 spoken term detection evaluation: an international open competitive evaluation in Spanish.
EURASIP J. Audio Speech Music. Process., 2017
2016
Comparison of ALBAYZIN query-by-example spoken term detection 2012 and 2014 evaluations.
EURASIP J. Audio Speech Music. Process., 2016
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2016
2015
Proceedings of the Encyclopedia of Biometrics, Second Edition, 2015
Proceedings of the Encyclopedia of Biometrics, Second Edition, 2015
Spoken term detection ALBAYZIN 2014 evaluation: overview, systems, results, and discussion.
EURASIP J. Audio Speech Music. Process., 2015
Comput. Math. Methods Medicine, 2015
An end-to-end approach to language identification in short utterances using convolutional neural networks.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
2014
Comput. Speech Lang., 2014
Analysis of voice features related to obstructive sleep apnoea and their application in diagnosis support.
Comput. Speech Lang., 2014
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014
2013
Query-by-Example Spoken Term Detection ALBAYZIN 2012 evaluation: overview, systems, results, and discussion.
EURASIP J. Audio Speech Music. Process., 2013
2012
Mejorando el acceso, el análisis y la visibilidad de la Información y los contenidos Multilingues y Multimedia en Red para la Comunidad de Madrid.
Proces. del Leng. Natural, 2012
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012
2011
Analyzing Training Dependencies and Posterior Fusion in Discriminant Classification of Apnea Patients Based on Sustained and Connected Speech.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
2010
Multilevel and Session Variability Compensated Language Recognition: ATVS-UAM Systems at NIST LRE 2009.
IEEE J. Sel. Top. Signal Process., 2010
A Study of the Influence of Speech Type on Automatic Language Recognition Performance.
Proceedings of the International Conference on Language Resources and Evaluation, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 20th International Conference on Pattern Recognition, 2010
2009
IEEE Trans. Speech Audio Process., 2009
Assessment of Severe Apnoea through Voice Analysis, Automatic Speech, and Speaker Recognition Techniques.
EURASIP J. Adv. Signal Process., 2009
Severe Apnoea Detection using Speaker Recognition Techniques.
Proceedings of the BIOSIGNALS 2009, 2009
2008
Herramientas de anotación de corpus de habla espontánea del Laboratorio de Lingística Informática de la UAM.
Proces. del Leng. Natural, 2008
Proceedings of the Odyssey 2008: The Speaker and Language Recognition Workshop, 2008
Proceedings of the International Conference on Language Resources and Evaluation, 2008
Design of a Multimodal Database for Research on Automatic Detection of Severe Apnoea Cases.
Proceedings of the International Conference on Language Resources and Evaluation, 2008
Developing a Phonemic and Syllabic Frequency Inventory for Spontaneous Spoken Castilian Spanish and their Comparison to Text-Based Inventories.
Proceedings of the International Conference on Language Resources and Evaluation, 2008
Proceedings of the International Conference on Language Resources and Evaluation, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
2007
Emulating DNA: Rigorous Quantification of Evidential Weight in Transparent and Testable Forensic Speaker Recognition.
IEEE Trans. Speech Audio Process., 2007
IEEE Signal Process. Lett., 2007
Ann. des Télécommunications, 2007
Multivariate Cepstral Feature Compensation on Band-limited Data for Robust Speech Recognition.
Proceedings of the 16th Nordic Conference of Computational Linguistics, 2007
Improved language recognition using better phonetic decoders and fusion with MFCC and SDC features.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
2006
IEEE Trans. Speech Audio Process., 2006
Interact. Comput., 2006
Proceedings of the Odyssey 2006: The Speaker and Language Recognition Workshop, 2006
Proceedings of the Odyssey 2006: The Speaker and Language Recognition Workshop, 2006
Unsupervised Class-Based Feature Compensation for Time-Variable Bandwidth-Limited Speech.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
On the relationship between phonetic modeling precision and phonetic speaker recognition accuracy.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Statistical class-based MFCC enhancement of filtered and band-limited speech for robust ASR.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Proceedings of the ISCA Tutorial and Research Workshop (ITRW) on Disfluency in Spontaneous Speech, 2005
2003
2002
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002
2001
Local refinement of phonetic boundaries: a general framework and its application using different transition models.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
2000
Proceedings of the IEEE International Conference on Acoustics, 2000
1998
Trying to mimic human segmentation of speech using HMM and fuzzy logic post-correction rules.
Proceedings of the Third ESCA/COCOSDA Workshop on Speech Synthesis, 1998
1997
Automatic alternative transcription generation and vocabulary selection for flexible word recognizers.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997