Eduardo Lleida

Orcid: 0000-0001-9137-4013

According to our database1, Eduardo Lleida authored at least 171 papers between 1987 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Audio-Visual Speaker Diarization: Current Databases, Approaches and Challenges.
CoRR, 2024

Defining and Measuring Disentanglement for non-Independent Factors of Variation.
CoRR, 2024

Predefined Prototypes for Intra-Class Separation and Disentanglement.
CoRR, 2024

2023
Class token and knowledge distillation for multi-head self-attention speaker verification systems.
Digit. Signal Process., March, 2023

Automatic Voice Disorder Detection Using Self-Supervised Representations.
IEEE Access, 2023

Improved Vocal Effort Transfer Vector Estimation For Vocal Effort-Robust Speaker Verification.
Proceedings of the 33rd IEEE International Workshop on Machine Learning for Signal Processing, 2023

On the Use of High Frequency Information for Voice Pathology Classification.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Variational Classifier for Unsupervised Anomalous Sound Detection under Domain Generalization.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

On the Problem of Data Availability in Automatic Voice Disorder Detection.
Proceedings of the 16th International Joint Conference on Biomedical Engineering Systems and Technologies, 2023

2022
aDCF Loss Function for Deep Metric Learning in End-to-End Text-Dependent Speaker Verification Systems.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Shouted and whispered speech compensation for speaker verification systems.
Digit. Signal Process., 2022

S3prl-Disorder: Open-Source Voice Disorder Detection System based in the Framework of S3PRL-toolkit.
Proceedings of the 6th International Conference, 2022

Cross-Corpus Speech Emotion Recognition with HuBERT Self-Supervised Representation.
Proceedings of the 6th International Conference, 2022

ViVoLAB System Description for the S2TC IberSPEECH-RTVE 2022 challenge.
Proceedings of the 6th International Conference, 2022

A Study on the Use of wav2vec Representations for Multiclass Audio Segmentation.
Proceedings of the 6th International Conference, 2022

2021
Generalizing AUC Optimization to Multiclass Classification for Audio Segmentation With Limited Training Data.
IEEE Signal Process. Lett., 2021

Progressive loss functions for speech enhancement with deep neural networks.
EURASIP J. Audio Speech Music. Process., 2021

Log-Likelihood-Ratio Cost Function as Objective Loss for Speaker Verification Systems.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Unsupervised Representation Learning for Speech Activity Detection in the Fearless Steps Challenge 2021.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Memory Layers with Multi-Head Attention Mechanisms for Text-Dependent Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2021

Diarization and Identity Attribution Compatibility in the Albayzin 2020 Challenge.
Proceedings of the Fifth International Conference, 2021

ViVoLAB Multimodal Diarization System for RTVE 2020 Challenge.
Proceedings of the Fifth International Conference, 2021

Convolutional Recurrent Neural Networks for Speech Activity Detection in Naturalistic Audio from Apollo Missions.
Proceedings of the Fifth International Conference, 2021

2020
Multiclass audio segmentation based on recurrent neural networks for broadcast domain data.
EURASIP J. Audio Speech Music. Process., 2020

Optimization of the area under the ROC curve using neural network supervectors for text-dependent speaker verification.
Comput. Speech Lang., 2020

Shouted Speech Compensation for Speaker Verification Robust to Vocal Effort Conditions.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Training Speaker Enrollment Models by Network Optimization.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Partial AUC Optimisation Using Recurrent Neural Networks for Music Detection with Limited Training Data.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Knowledge Distillation and Random Erasing Data Augmentation for Text-Dependent Speaker Verification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Phonetically-Aware Embeddings, Wide Residual Networks with Time-Delay Neural Networks and Self Attention Models for the 2018 NIST Speaker Recognition Evaluation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

ViVoLAB Speaker Diarization System for the DIHARD 2019 Challenge.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Optimization of False Acceptance/Rejection Rates and Decision Threshold for End-to-End Text-Dependent Speaker Verification Systems.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Language Recognition Using Triplet Neural Networks.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Progressive Speech Enhancement with Residual Connections.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Speech Enhancement with Wide Residual Networks in Reverberant Environments.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018
AMIC: Affective multimedia analytics with inclusive and natural communication.
Proces. del Leng. Natural, 2018

Text-to-Pictogram Summarization for Augmentative and Alternative Communication.
Proces. del Leng. Natural, 2018

Speaker and language recognition and characterization: Introduction to the CSL special issue.
Comput. Speech Lang., 2018

Estimation of the Number of Speakers with Variational Bayesian PLDA in the DIHARD Diarization Challenge.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Phonetic Variability Influence on Short Utterances in Speaker Verification.
Proceedings of the Fourth International Conference, 2018

In-domain Adaptation Solutions for the RTVE 2018 Diarization Challenge.
Proceedings of the Fourth International Conference, 2018

Differentiable Supervector Extraction for Encoding Speaker and Phrase Information in Text Dependent Speaker Verification.
Proceedings of the Fourth International Conference, 2018

Wide Residual Networks 1D for Automatic Text Punctuation.
Proceedings of the Fourth International Conference, 2018

A Recurrent Neural Network Approach to Audio Segmentation for Broadcast Domain Data.
Proceedings of the Fourth International Conference, 2018

2017
Domain Adaptation of PLDA Models in Broadcast Diarization by Means of Unsupervised Speaker Clustering.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Tied Hidden Factors in Neural Networks for End-to-End Speaker Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016
Bayesian Networks to Model the Variability of Speaker Verification Scores in Adverse Environments.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Analysis of speech quality measures for the task of estimating the reliability of speaker verification decisions.
Speech Commun., 2016

ASLP-MULAN: Audio speech and language processing for multimedia analytics.
Proces. del Leng. Natural, 2016

Bottleneck Based Front-End for Diarization Systems.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2016

2015
Intelligibility Assessment and Speech Recognizer Word Accuracy Rate Prediction for Dysarthric Speakers in a Factor Analysis Subspace.
ACM Trans. Access. Comput., 2015

Albayzín-2014 evaluation: audio segmentation and classification in broadcast news domains.
EURASIP J. Audio Speech Music. Process., 2015

Spoofing detection with DNN and one-class SVM for the ASVspoof 2015 challenge.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Context-Aware Communicator for All.
Proceedings of the Universal Access in Human-Computer Interaction. Access to Today's Technologies, 2015

Variational Bayesian PLDA for speaker diarization in the MGB challenge.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Low bit rate compression methods of feature vectors for distributed speech recognition.
Speech Commun., 2014

Audio segmentation-by-classification approach based on factor analysis in broadcast news domain.
EURASIP J. Audio Speech Music. Process., 2014

ViVoLab and CVLab - MediaEval 2014: Violent Scenes Detection Affect Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Factor analysis with sampling methods for text dependent speaker recognition.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Unsupervised adaptation of PLDA by using variational Bayes methods.
Proceedings of the IEEE International Conference on Acoustics, 2014

Unscented transform for ivector-based noisy speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Unsupervised Training of PLDA with Variational Bayes.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014

Confidence Measures in Automatic Speech Recognition Systems for Error Detection in Restricted Domains.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014

Articulatory Feature Extraction from Voice and Their Impact on Hybrid Acoustic Models.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014

Unsupervised Accent Modeling for Language Identification.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014

A Preliminary Study of Acoustic Events Classification with Factor Analysis in Meeting Rooms.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014

Zero Phase speech representation for robust formant tracking.
Proceedings of the 22nd European Signal Processing Conference, 2014

2013
Quality Assessment for Speaker Diarization and Its Application in Speaker Characterization.
IEEE Trans. Speech Audio Process., 2013

Handling recordings acquired simultaneously over multiple channels with PLDA.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

The I3a speaker recognition system for NIST SRE12: post-evaluation analysis.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A new Bayesian network to assess the reliability of speaker verification decisions.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Suprasegmental information modelling for autism disorder spectrum and specific language impairment classification.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Broadcast News Segmentation with Factor Analysis System.
Proceedings of the First Workshop on Speech, 2013

Handling i-vectors from different recording conditions using multi-channel simplified PLDA in speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Prosodic features and formant modeling for an ivector-based language recognition system.
Proceedings of the IEEE International Conference on Acoustics, 2013

Segmentation-by-classification system based on factor analysis.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
A prelingual tool for the education of altered voices.
Speech Commun., 2012

Bayesian adaptation of PLDA based speaker recognition to domains with scarce development data.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

The BLZ Submission to the NIST 2011 LRE: Data Collection, System Development and Performance.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Reliability Estimation of the Speaker Verification Decisions Using Bayesian Networks to Combine Information from Multiple Speech Quality Measures.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012

Voice Pathology Detection on the Saarbrücken Voice Database with Calibration and Fusion of Scores Using MultiFocal Toolkit.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012

Score Level versus Audio Level Fusion for Voice Pathology Detection on the Saarbrücken Voice Database.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012

Evaluation of a New Beam-Search Formant Tracking Algorithm in Noisy Environments.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012

Factor Analysis Segmentation and Classification in Broadcast News Domain.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012

Beam-Search Formant Tracking Algorithm Based on Trajectory Functions for Continuous Speech.
Proceedings of the Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, 2012

2011
Bayesian Networks for Discrete Observation Distributions in Speech Recognition.
IEEE Trans. Speech Audio Process., 2011

Development of Voice-Based Tools for Accessibility to Computer Services.
Computación y Sistemas, 2011

Speaker Verification On Summed-Channel Conditions With Confidence Measures.
Computación y Sistemas, 2011

Partitioning of Two-Speaker Conversation Datasets.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

I3A Language Recognition System for Albayzin 2010 LRE.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Hierarchical Audio Segmentation with HMM and Factor Analysis in Broadcast News Domain.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Preventing replay attacks on speaker verification systems.
Proceedings of the International Carnahan Conference on Security Technology, 2011

Intra-session variability compensation and a hypothesis generation and selection strategy for speaker segmentation.
Proceedings of the IEEE International Conference on Acoustics, 2011

Detecting Replay Attacks from Far-Field Recordings on Speaker Verification Systems.
Proceedings of the Biometrics and ID Management, 2011


2010
Unsupervised Data-Driven Feature Vector Normalization With Acoustic Model Adaptation for Robust Speech Recognition.
IEEE Trans. Speech Audio Process., 2010

SD-TEAM: Tecnologías de aprendizaje interactivo, autoevaluación y multimodalidad en sistemas de diálogo hablado multidominio.
Proces. del Leng. Natural, 2010

The Alborada-I3A Corpus of Disordered Speech.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Confidence measures for speaker segmentation and their relation to speaker verification.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Non-linear predictive vector quantization of feature vectors for distributed speech recognition.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Speaker Verification in Noisy Environment Using Missing Feature Approach.
Proceedings of the Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, 2010

2009
Tools and Technologies for Computer-Aided Speech and Language Therapy.
Speech Commun., 2009

Analysis of Acoustic Features in Speakers with Cognitive Disorders and Speech Impairments.
EURASIP J. Adv. Signal Process., 2009

Avoiding speaker variability in pronunciation verification of children' disordered speech.
Proceedings of the Second Workshop on Child, Computer and Interaction, 2009

COMUNICA: multilevel tools for Spanish CALL.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2009

An experience with a Spanish second language learning tool in a multilingual environment.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2009

Formant estimation in children's speech and its application for a Spanish speech therapy tool.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2009

Combination of acoustic and lexical speaker adaptation for disordered speech recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Real-time live broadcast news subtitling system for Spanish.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Graphical models for discrete hidden Markov models in speech recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Local projections and support vector based feature selection in speech recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Differential vector quantization of feature vectors for distributed speech recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Unsupervised training scheme with non-stereo data for empirical feature vector compensation.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

A study of pronunciation verification in a speech therapy application.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Capturing Local Variability for Speaker Normalization in Speech Recognition.
IEEE Trans. Speech Audio Process., 2008

A novel corpus of children<sup>2</sup>s disordered speech.
Proceedings of the First Workshop on Child, Computer and Interaction, 2008

COMUNICA - tools for speech and language therapy.
Proceedings of the First Workshop on Child, Computer and Interaction, 2008

Verifying pronunciation accuracy from speakers with neuromuscular disorders.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Feature vector normalization with combined standard and throat microphones for robust ASR.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

E-inclusion technologies for the speech handicapped.
Proceedings of the IEEE International Conference on Acoustics, 2008

Improving dialogue systems in a home automation environment.
Proceedings of the 1st International ICST Conference on Ambient Media and Systems, 2008

2007
Cepstral Vector Normalization Based on Stereo Data for Robust Speech Recognition.
IEEE Trans. Speech Audio Process., 2007

Evaluation of the combined use of MEMLIN and MLLR on the non-native adaptation task of hiwire project database.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

On the jointly unsupervised feature vector normalization and acoustic model compensation for robust speech recognition.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Robust Automatic Speech Recognition Using PD-MEEMLIN.
Proceedings of the Pattern Recognition and Image Analysis, Third Iberian Conference, 2007

Robust speech recognition with on-line unsupervised acoustic feature compensation.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Study of time and frequency variability in pathological speech and error reduction methods for automatic speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Local transformation models for speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Time-dependent cross-probability model for multi-environment model based LInear normalization.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Stability Control in a Two-Channel Speech Reinforcement System for Vehicles.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Speech Reinforcement System for Car Cabin Communications.
IEEE Trans. Speech Audio Process., 2005

Acoustic feedback cancellation in speech reinforcement systems for vehicles.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Augmented state space acoustic decoding for modeling local variability in speech.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Robust speech recognition in cars using phoneme dependent multi-environment linear normalization.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Lip Reading for Robust Speech Recognition on Embedded Devices.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
AV@CAR: A Spanish Multichannel Multimodal Corpus for In-Vehicle Automatic Audio-Visual Speech Recognition.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Multi-environment models based linear normalization for speech recognition in car conditions.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Residual echo power estimation for speech reinforcement systems in vehicles.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Cabin car communication system to improve communications inside a car.
Proceedings of the IEEE International Conference on Acoustics, 2002

DSP to improve oral communications inside vehicles.
Proceedings of the 11th European Signal Processing Conference, 2002

2001
A new method for epoch detection based on the Cohen's class of time frequency representations.
IEEE Signal Process. Lett., 2001

OISTI (An Oral-Interface System to Provide Tourist-Information Inside a Car.
Proceedings of the 2001 International Symposium on Information Technology (ITCC 2001), 2001

Acoustic echo control and noise reduction for cabin car communication.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Utterance verification in continuous speech recognition: decoding and training procedures.
IEEE Trans. Speech Audio Process., 2000

1999
An improved speech endpoint detection system in noisy environments by means of third-order spectra.
IEEE Signal Process. Lett., 1999

Performance comparison of several adaptive schemes for microphone array beamforming.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Microphone array design for robust speech acquisition and recognition.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
Robust continuous speech recognition system based on a microphone array.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Non-quadratic criterion algorithms for speech enhancement.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Speech recognition using automatically derived acoustic baseforms.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
A user-configurable system for voice label recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Likelihood ratio decoding and confidence measures for continuous speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Wavelet transforms for non-uniform speech recogntion systems.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Pitch detection and voiced/unvoiced decision algorithm based on wavelet transforms.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Efficient decoding and training procedures for utterance verification in continuous speech recognition.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
Semantic decoding of speech in constrained domains.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

1993
Albayzin speech database: design of the phonetic corpus.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Out-of-vocabulary word modelling and rejection for keyword spotting.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

TELEMACO - a real time keyword spotting application for voice dialling.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

1992
Smoothing hidden Markov models ay means of a self organizing feature map.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Syllabic fillers for Spanish HMM keyword spotting.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

On the AR modelling of the one-sided autocorrelation sequence for noisy speech recognition.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

1991
Two level continuous speech recognition using demisyllable-based HMM word spotting.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Demisyllable-based HMM spotting for continuous speech recognition.
Proceedings of the 1991 International Conference on Acoustics, 1991

1990
Statistical feature selection for isolated word recognition.
Proceedings of the 1990 International Conference on Acoustics, 1990

1989
Modeling of the analytic spectrum for speech recognition.
Proceedings of the First European Conference on Speech Communication and Technology, 1989

New backpropagation algorithm using quadratic potential functions, and an experiment on isolated word recognition.
Proceedings of the First European Conference on Speech Communication and Technology, 1989

Recognition of numbers and strings of numbers by using demisyllables: one speaker experiment.
Proceedings of the First European Conference on Speech Communication and Technology, 1989

1987
Demisyllable based Spanish number recognition experiments.
Proceedings of the European Conference on Speech Technology, 1987

Speech parametrization and recognition using block and recursive linear prediction with data compression.
Proceedings of the European Conference on Speech Technology, 1987


  Loading...