Doroteo T. Toledano

Daniel Ramos-Castro

CoRR, January, 2025

2024

Whisper-based spoken term detection systems for search on speech ALBAYZIN evaluation challenge.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., December, 2024

Enhancing Conformer-Based Sound Event Detection Using Frequency Dynamic Convolutions and BEATs Audio Embeddings.

[BibT_eX]

[DOI]

Sara Barahona

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Voxceleb-ESP: preliminary experiments detecting Spanish celebrities from their voices.

[BibT_eX]

[DOI]

Beltrán Labrador

Manuel Otero-Gonzalez

Daniel Ramos-Castro

CoRR, 2024

2022

Source Separation for Sound Event Detection in Domestic Environments using Jointly Trained Models.

[BibT_eX]

[DOI]

Katerina Zmolíková

Sergio Izquierdo del Alamo

Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022

Efficient Transformers for End-to-End Neural Speaker Diarization.

[BibT_eX]

[DOI]

Beltrán Labrador

Proceedings of the 6th International Conference, 2022

2021

BiosecurID: a multimodal biometric database.

[BibT_eX]

[DOI]

CoRR, 2021

A Multi-Resolution CRNN-Based Approach for Semi-Supervised Sound Event Detection in DCASE 2020 Challenge.

[BibT_eX]

[DOI]

María Pilar Fernández-Gallego

IEEE Access, 2021

A study of data augmentation for increased ASR robustness against packet losses.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference, 2021

An analysis of Sound Event Detection under acoustic degradation using multi-resolution systems.

[BibT_eX]

[DOI]

Daniel Ramos-Castro

Juan Ignacio Álvarez-Trejos

Proceedings of the Fifth International Conference, 2021

Query-by-Example Spoken Term Detection using Attentive Pooling Networks at ALBAYZIN 2020 Evaluation: The AUDIAS-UAM System.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference, 2021

Multiple Feature Resolutions for Different Polyphonic Sound Detection Score Scenarios in DCASE 2021 Task 4.

[BibT_eX]

[DOI]

Sergio Segovia

Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

2020

A Multi-Resolution Approach to Sound Event Detection in DCASE 2020 Task4.

[BibT_eX]

[DOI]

Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

2019

Estudio sobre documentos reutilizables como recursos lingüísticos en el marco del desarrollo del Plan de Impulso de las Tecnologías del Lenguaje.

[BibT_eX]

[DOI]

Leonardo Campillos Llanos

Ana Valverde

Proces. del Leng. Natural, 2019

Search on speech from spoken queries: the Multi-domain International ALBAYZIN 2018 Query-by-Example Spoken Term Detection Evaluation.

[BibT_eX]

[DOI]

Luis Javier Rodríguez-Fuentes

Mikel Peñagarikano

EURASIP J. Audio Speech Music. Process., 2019

ALBAYZIN 2018 spoken term detection evaluation: a multi-domain international evaluation in Spanish.

[BibT_eX]

[DOI]

Luis Javier Rodríguez-Fuentes

Ana R. Montalvo

Jose M. Ramirez

Mikel Peñagarikano

EURASIP J. Audio Speech Music. Process., 2019

Exploring convolutional, recurrent, and hybrid deep neural networks for speech and music detection in a large audio dataset.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2019

2018

ALBAYZIN Query-by-example Spoken Term Detection 2016 evaluation.

[BibT_eX]

[DOI]

Jorge Proença

Fernando Perdigão

Fernando García-Granada

Emilio Sanchis

Anna Pompili

Alberto Abad

EURASIP J. Audio Speech Music. Process., 2018

DNN-based Embeddings for Speaker Diarization in the AuDIaS-UAM System for the Albayzin 2018 IberSPEECH-RTVE Evaluation.

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference, 2018

Audio event detection on Google's Audio Set database: Preliminary results using different types of DNNs.

[BibT_eX]

[DOI]

Javier Darna-Sequeiros

Proceedings of the Fourth International Conference, 2018

AUDIAS-CEU: A Language-independent approach for the Query-by-Example Spoken Term Detection task of the Search on Speech ALBAYZIN 2018 evaluation.

[BibT_eX]

[DOI]

Maria Cabello

Proceedings of the Fourth International Conference, 2018

2017

ALBAYZIN 2016 spoken term detection evaluation: an international open competitive evaluation in Spanish.

[BibT_eX]

[DOI]

Alejandro Coucheiro-Limeres

Luis Serrano

Inma Hernáez

Javier Ferreiros

Julia Olcoz

Jorge Llombart

EURASIP J. Audio Speech Music. Process., 2017

2016

Comparison of ALBAYZIN query-by-example spoken term detection 2012 and 2014 evaluations.

[BibT_eX]

[DOI]

María Pilar Fernández-Gallego

Carmen García-Mateo

EURASIP J. Audio Speech Music. Process., 2016

Detection of Publicity Mentions in Broadcast Radio: Preliminary Results.

[BibT_eX]

[DOI]

Álvaro Mesa-Castellanos

Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2016

2015

Speech Analysis.

[BibT_eX]

[DOI]

Proceedings of the Encyclopedia of Biometrics, Second Edition, 2015

Publisher's Erratum to: Voice Device.

[BibT_eX]

[DOI]

Proceedings of the Encyclopedia of Biometrics, Second Edition, 2015

Voice Device.

[BibT_eX]

[DOI]

Proceedings of the Encyclopedia of Biometrics, Second Edition, 2015

Speaker Features.

[BibT_eX]

[DOI]

Proceedings of the Encyclopedia of Biometrics, Second Edition, 2015

Spoken term detection ALBAYZIN 2014 evaluation: overview, systems, results, and discussion.

[BibT_eX]

[DOI]

Julián D. Echeverry-Correa

Carmen García-Mateo

Antonio Cardenal López

Alejandro Coucheiro-Limeres

Julia Olcoz

Antonio Miguel

EURASIP J. Audio Speech Music. Process., 2015

Speech Signal and Facial Image Processing for Obstructive Sleep Apnea Assessment.

[BibT_eX]

[DOI]

Fernando Espinoza-Cuadros

Comput. Math. Methods Medicine, 2015

An end-to-end approach to language identification in short utterances using convolutional neural networks.

[BibT_eX]

[DOI]

Rubén Zazo-Candil

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014

Feature analysis for discriminative confidence estimation in spoken term detection.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2014

Analysis of voice features related to obstructive sleep apnoea and their application in diagnosis support.

[BibT_eX]

[DOI]

Ana Montero Benavides

Comput. Speech Lang., 2014

ATVS-CSLT-HCTLab System for NIST 2013 Open Keyword Search Evaluation.

[BibT_eX]

[DOI]

Dong Wang

Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014

2013

Query-by-Example Spoken Term Detection ALBAYZIN 2012 evaluation: overview, systems, results, and discussion.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2013

2012

Mejorando el acceso, el análisis y la visibilidad de la Información y los contenidos Multilingues y Multimedia en Red para la Comunidad de Madrid.

[BibT_eX]

[DOI]

Felisa Verdejo

Raquel Martínez-Unanue

Pablo Castells

Darwin Patricio Córdova Lucero

Víctor Fresno-Fernández

Proces. del Leng. Natural, 2012

Preliminary Results of Alignment of Text and Audio in News and Songs.

[BibT_eX]

[DOI]

Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012

Using HMM to Detect Speakers with Severe Obstructive Sleep Apnoea Syndrome.

[BibT_eX]

[DOI]

Ana Montero Benavides

Alejandra Fernández

Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012

2011

Analyzing Training Dependencies and Posterior Fusion in Discriminant Classification of Apnea Patients Based on Sustained and Connected Speech.

[BibT_eX]

[DOI]

Fernando Alonso-Fernandez

Javier Caminero

Eduardo López

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

2010

BiosecurID: a multimodal biometric database.

[BibT_eX]

[DOI]

Juan A. Sigüenza

Javier Garrido Salas

Pattern Anal. Appl., 2010

Multilevel and Session Variability Compensated Language Recognition: ATVS-UAM Systems at NIST LRE 2009.

[BibT_eX]

[DOI]

Ignacio López-Moreno

Javier Franco-Pedroso

Guillermo Gonzalez-Caravaca

IEEE J. Sel. Top. Signal Process., 2010

A Study of the Influence of Speech Type on Automatic Language Recognition Performance.

[BibT_eX]

[DOI]

Daniel Hernández López

Proceedings of the International Conference on Language Resources and Evaluation, 2010

Augmented set of features for confidence estimation in spoken term detection.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Phone-Conditioned Suboptimal Wiener Filtering.

[BibT_eX]

[DOI]

Maria Puertas

Proceedings of the 20th International Conference on Pattern Recognition, 2010

2009

Speech Analysis.

[BibT_eX]

[DOI]

Proceedings of the Encyclopedia of Biometrics, 2009

Voice Device.

[BibT_eX]

[DOI]

Proceedings of the Encyclopedia of Biometrics, 2009

Speaker Features.

[BibT_eX]

[DOI]

Proceedings of the Encyclopedia of Biometrics, 2009

Feature Compensation Techniques for ASR on Band-Limited Speech.

[BibT_eX]

[DOI]

Nicolás Morales

John H. L. Hansen

Javier Garrido Salas

IEEE Trans. Speech Audio Process., 2009

Assessment of Severe Apnoea through Voice Analysis, Automatic Speech, and Speaker Recognition Techniques.

[BibT_eX]

[DOI]

EURASIP J. Adv. Signal Process., 2009

Severe Apnoea Detection using Speaker Recognition Techniques.

[BibT_eX]

Proceedings of the BIOSIGNALS 2009, 2009

2008

Herramientas de anotación de corpus de habla espontánea del Laboratorio de Lingística Informática de la UAM.

[BibT_eX]

[DOI]

José María Guirao

Proces. del Leng. Natural, 2008

Phoneme and sub-phoneme t-normalization for text-dependent speaker recognition.

[BibT_eX]

[DOI]

Cristina Esteve-Elizalde

Proceedings of the Odyssey 2008: The Speaker and Language Recognition Workshop, 2008

BioSec Multimodal Biometric Database in Text-Dependent Speaker Recognition.

[BibT_eX]

[DOI]

Daniel Hernández López

Cristina Esteve-Elizalde

Julian Fiérrez

Proceedings of the International Conference on Language Resources and Evaluation, 2008

Design of a Multimodal Database for Research on Automatic Detection of Severe Apnoea Cases.

[BibT_eX]

[DOI]

Guillermo Portillo

Proceedings of the International Conference on Language Resources and Evaluation, 2008

Developing a Phonemic and Syllabic Frequency Inventory for Spontaneous Spoken Castilian Spanish and their Comparison to Text-Based Inventories.

[BibT_eX]

[DOI]

Raùl de la Torre

Marta Garrote Salazar

José María Guirao

Proceedings of the International Conference on Language Resources and Evaluation, 2008

rre STC-TIMIT: Generation of a Single-channel Telephone Corpus.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Language Resources and Evaluation, 2008

MAP and sub-word level t-norm for text-dependent speaker recognition.

[BibT_eX]

[DOI]

Daniel Hernández López

Cristina Esteve-Elizalde

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Anchor-model fusion for language recognition.

[BibT_eX]

[DOI]

Ignacio López-Moreno

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007

Emulating DNA: Rigorous Quantification of Evidential Weight in Transparent and Testable Forensic Speaker Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2007

Blind Feature Compensation for Time-Variant Band-Limited Speech Recognition.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2007

Biosec baseline corpus: A multimodal biometric database.

[BibT_eX]

[DOI]

Julian Fiérrez-Aguilar

Pattern Recognit., 2007

Beyond objective performance evaluation in multimodal biometric systems.

[BibT_eX]

[DOI]

Álvaro Hernández Trapote

David Díaz Pardo de Vera

Ann. des Télécommunications, 2007

Multivariate Cepstral Feature Compensation on Band-limited Data for Robust Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 16th Nordic Conference of Computational Linguistics, 2007

Improved language recognition using better phonetic decoders and fusion with MFCC and SDC features.

[BibT_eX]

[DOI]

Alejandro Abejón-Gonzalez

Danilo Spada

Ismael Mateos-Garcia

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

2006

Initialization, training, and context-dependency in HMM-based formant tracking.

[BibT_eX]

[DOI]

Jesús Gómez Villardebó

IEEE Trans. Speech Audio Process., 2006

Usability evaluation of multi-modal biometric verification systems.

[BibT_eX]

[DOI]

Álvaro Hernández Trapote

Interact. Comput., 2006

Exploring PPRLM performance for NIST 2005 Language Recognition Evaluation.

[BibT_eX]

[DOI]

Alberto Montero-Asenjo

Proceedings of the Odyssey 2006: The Speaker and Language Recognition Workshop, 2006

Using Data-driven and Phonetic Units for Speaker Verification.

[BibT_eX]

[DOI]

Asmaa El Hannani

Dijana Petrovska-Delacrétaz

Alberto Montero-Asenjo

Jean Hennebert

Proceedings of the Odyssey 2006: The Speaker and Language Recognition Workshop, 2006

Unsupervised Class-Based Feature Compensation for Time-Variable Bandwidth-Limited Speech.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005

On the relationship between phonetic modeling precision and phonetic speaker recognition accuracy.

[BibT_eX]

[DOI]

Carlos Fombella

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Statistical class-based MFCC enhancement of filtered and band-limited speech for robust ASR.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

MFCC Compensation for Improved Recognition of Filtered and Band-Limited Speech.

[BibT_eX]

[DOI]

Nicolás Morales

John H. L. Hansen

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Acoustic-phonetic decoding of different types of spontaneous speech in Spanish.

[BibT_eX]

[DOI]

José Colás Pasamontes

Javier Garrido Salas

Proceedings of the ISCA Tutorial and Research Workshop (ITRW) on Disfluency in Spontaneous Speech, 2005

2003

Automatic phonetic segmentation.

[BibT_eX]

[DOI]

Luis Villarrubia Grande

IEEE Trans. Speech Audio Process., 2003

2002

HMMs for Automatic Phonetic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

2001

Local refinement of phonetic boundaries: a general framework and its application using different transition models.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000

Neural network boundary refining for automatic speech segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2000

1998

Trying to mimic human segmentation of speech using HMM and fuzzy logic post-correction rules.

[BibT_eX]

[DOI]

Miguel Ángel Rodríguez Crespo

José Gregorio Escalada Sardina

Proceedings of the Third ESCA/COCOSDA Workshop on Speech Synthesis, 1998

1997

Automatic alternative transcription generation and vocabulary selection for flexible word recognizers.

[BibT_eX]

[DOI]

Luis Villarrubia