Alfonso Ortega Giménez

Dayana Ribas

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

ViVoLAB Speaker Diarization System for the DIHARD 2019 Challenge.

[BibT_eX]

[DOI]

Pablo Gimeno

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Optimization of False Acceptance/Rejection Rates and Decision Threshold for End-to-End Text-Dependent Speaker Verification Systems.

[BibT_eX]

[DOI]

Victoria Mingote

Dayana Ribas

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Language Recognition Using Triplet Neural Networks.

[BibT_eX]

[DOI]

Victoria Mingote

Mitchell McLaren

Mahesh Kumar Nandwana

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Progressive Speech Enhancement with Residual Connections.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Speech Enhancement with Wide Residual Networks in Reverberant Environments.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018

AMIC: Affective multimedia analytics with inclusive and natural communication.

[BibT_eX]

[DOI]

Proces. del Leng. Natural, 2018

Text-to-Pictogram Summarization for Augmentative and Alternative Communication.

[BibT_eX]

[DOI]

Proces. del Leng. Natural, 2018

Estimation of the Number of Speakers with Variational Bayesian PLDA in the DIHARD Diarization Challenge.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Phonetic Variability Influence on Short Utterances in Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference, 2018

In-domain Adaptation Solutions for the RTVE 2018 Diarization Challenge.

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference, 2018

Differentiable Supervector Extraction for Encoding Speaker and Phrase Information in Text Dependent Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference, 2018

Wide Residual Networks 1D for Automatic Text Punctuation.

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference, 2018

A Recurrent Neural Network Approach to Audio Segmentation for Broadcast Domain Data.

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference, 2018

2017

Domain Adaptation of PLDA Models in Broadcast Diarization by Means of Unsupervised Speaker Clustering.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Tied Hidden Factors in Neural Networks for End-to-End Speaker Recognition.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016

Bayesian Networks to Model the Variability of Speaker Verification Scores in Adverse Environments.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Analysis of speech quality measures for the task of estimating the reliability of speaker verification decisions.

[BibT_eX]

[DOI]

Speech Commun., 2016

ASLP-MULAN: Audio speech and language processing for multimedia analytics.

[BibT_eX]

[DOI]

Proces. del Leng. Natural, 2016

Bottleneck Based Front-End for Diarization Systems.

[BibT_eX]

[DOI]

Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2016

2015

Intelligibility Assessment and Speech Recognizer Word Accuracy Rate Prediction for Dysarthric Speakers in a Factor Analysis Subspace.

[BibT_eX]

[DOI]

ACM Trans. Access. Comput., 2015

Albayzín-2014 evaluation: audio segmentation and classification in broadcast news domains.

[BibT_eX]

[DOI]

David Tavarez

Paula Lopez-Otero

Javier Franco-Pedroso

Héctor Delgado

Eva Navas

Laura Docío Fernández

EURASIP J. Audio Speech Music. Process., 2015

Spoofing detection with DNN and one-class SVM for the ASVspoof 2015 challenge.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Variational Bayesian PLDA for speaker diarization in the MGB challenge.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

Low bit rate compression methods of feature vectors for distributed speech recognition.

[BibT_eX]

[DOI]

Speech Commun., 2014

Audio segmentation-by-classification approach based on factor analysis in broadcast news domain.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2014

ViVoLab and CVLab - MediaEval 2014: Violent Scenes Detection Affect Task.

[BibT_eX]

[DOI]

Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Factor analysis with sampling methods for text dependent speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Confidence Measures in Automatic Speech Recognition Systems for Error Detection in Restricted Domains.

[BibT_eX]

[DOI]

Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014

Unsupervised Accent Modeling for Language Identification.

[BibT_eX]

[DOI]

Eduardo Lleida-Solano

Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014

A Preliminary Study of Acoustic Events Classification with Factor Analysis in Meeting Rooms.

[BibT_eX]

[DOI]

Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014

2013

Quality Assessment for Speaker Diarization and Its Application in Speaker Characterization.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2013

TIMPANO: Technology for complex Human-Machine conversational interaction with dynamic learning.

[BibT_eX]

[DOI]

Proces. del Leng. Natural, 2013

The I3a speaker recognition system for NIST SRE12: post-evaluation analysis.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A new Bayesian network to assess the reliability of speaker verification decisions.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Suprasegmental information modelling for autism disorder spectrum and specific language impairment classification.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Broadcast News Segmentation with Factor Analysis System.

[BibT_eX]

[DOI]

Proceedings of the First Workshop on Speech, 2013

Prosodic features and formant modeling for an ivector-based language recognition system.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Segmentation-by-classification system based on factor analysis.

[BibT_eX]

[DOI]

Luis Javier Rodríguez-Fuentes

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

The BLZ Submission to the NIST 2011 LRE: Data Collection, System Development and Performance.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Reliability Estimation of the Speaker Verification Decisions Using Bayesian Networks to Combine Information from Multiple Speech Quality Measures.

[BibT_eX]

[DOI]

Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012

Voice Pathology Detection on the Saarbrücken Voice Database with Calibration and Fusion of Scores Using MultiFocal Toolkit.

[BibT_eX]

[DOI]

Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012

Score Level versus Audio Level Fusion for Voice Pathology Detection on the Saarbrücken Voice Database.

[BibT_eX]

[DOI]

Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012

Evaluation of a New Beam-Search Formant Tracking Algorithm in Noisy Environments.

[BibT_eX]

[DOI]

Dayana Ribas González

José Ramón Calvo de Lara

Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012

Factor Analysis Segmentation and Classification in Broadcast News Domain.

[BibT_eX]

[DOI]

Carlos Vaquero Avilés Casco

Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012

2011

Bayesian Networks for Discrete Observation Distributions in Speech Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2011

Speaker Verification On Summed-Channel Conditions With Confidence Measures.

[BibT_eX]

[DOI]

Eduardo Lleida-Solano

Computación y Sistemas, 2011

Partitioning of Two-Speaker Conversation Datasets.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

I3A Language Recognition System for Albayzin 2010 LRE.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Hierarchical Audio Segmentation with HMM and Factor Analysis in Broadcast News Domain.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Intra-session variability compensation and a hypothesis generation and selection strategy for speaker segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Multi-site heterogeneous system fusions for the Albayzin 2010 Language Recognition Evaluation.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010

Unsupervised Data-Driven Feature Vector Normalization With Acoustic Model Adaptation for Robust Speech Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2010

Confidence measures for speaker segmentation and their relation to speaker verification.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Non-linear predictive vector quantization of feature vectors for distributed speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009

Real-time live broadcast news subtitling system for Spanish.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Graphical models for discrete hidden Markov models in speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Local projections and support vector based feature selection in speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Differential vector quantization of feature vectors for distributed speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Unsupervised training scheme with non-stereo data for empirical feature vector compensation.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008

Capturing Local Variability for Speaker Normalization in Speech Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2008

Feature vector normalization with combined standard and throat microphones for robust ASR.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007

Cepstral Vector Normalization Based on Stereo Data for Robust Speech Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2007

Evaluation of the combined use of MEMLIN and MLLR on the non-native adaptation task of hiwire project database.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

On the jointly unsupervised feature vector normalization and acoustic model compensation for robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Robust speech recognition with on-line unsupervised acoustic feature compensation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006

Study of time and frequency variability in pathological speech and error reduction methods for automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Local transformation models for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Time-dependent cross-probability model for multi-environment model based LInear normalization.

[BibT_eX]

[DOI]

Luis Buera

Juan Arturo Nolazco-Flores

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Stability Control in a Two-Channel Speech Reinforcement System for Vehicles.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005

An adaptive digital method of imbalances cancellation in LINC transmitters.

[BibT_eX]

[DOI]

IEEE Trans. Veh. Technol., 2005

Nonlinear distortion cancellation using LINC transmitters in OFDM systems.

[BibT_eX]

[DOI]

IEEE Trans. Broadcast., 2005

Speech Reinforcement System for Car Cabin Communications.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2005

Acoustic feedback cancellation in speech reinforcement systems for vehicles.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Augmented state space acoustic decoding for modeling local variability in speech.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Robust speech recognition in cars using phoneme dependent multi-environment linear normalization.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004

Nonlinear distortion cancellation in OFDM systems using an adaptive LINC structure.

[BibT_eX]

[DOI]

Proceedings of the IEEE 15th International Symposium on Personal, 2004

AV@CAR: A Spanish Multichannel Multimodal Corpus for In-Vehicle Automatic Audio-Visual Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Multi-environment models based linear normalization for speech recognition in car conditions.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Residual echo power estimation for speech reinforcement systems in vehicles.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002

Cabin car communication system to improve communications inside a car.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2002

DSP to improve oral communications inside vehicles.

[BibT_eX]

[DOI]

Proceedings of the 11th European Signal Processing Conference, 2002

2001

Acoustic echo control and noise reduction for cabin car communication.

[BibT_eX]

[DOI]