Alfonso Ortega Giménez
Orcid: 0000-0002-3886-7748Affiliations:
- University of Zaragoza, Spain
According to our database1,
Alfonso Ortega Giménez
authored at least 121 papers
between 2001 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
CoRR, 2024
CoRR, 2024
Explainable by-design Audio Segmentation through Non-Negative Matrix Factorization and Probing.
CoRR, 2024
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024
3MAS: a multitask, multilabel, multidataset semi-supervised audio segmentation model.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Unsupervised multiple domain translation through controlled Disentanglement in variational autoencoder.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Class token and knowledge distillation for multi-head self-attention speaker verification systems.
Digit. Signal Process., March, 2023
IEEE Access, 2023
Improved Vocal Effort Transfer Vector Estimation For Vocal Effort-Robust Speaker Verification.
Proceedings of the 33rd IEEE International Workshop on Machine Learning for Signal Processing, 2023
Variational Classifier for Unsupervised Anomalous Sound Detection under Domain Generalization.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 16th International Joint Conference on Biomedical Engineering Systems and Technologies, 2023
2022
aDCF Loss Function for Deep Metric Learning in End-to-End Text-Dependent Speaker Verification Systems.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Digit. Signal Process., 2022
S3prl-Disorder: Open-Source Voice Disorder Detection System based in the Framework of S3PRL-toolkit.
Proceedings of the 6th International Conference, 2022
Proceedings of the 6th International Conference, 2022
Proceedings of the 6th International Conference, 2022
Proceedings of the 6th International Conference, 2022
Proceedings of the 6th International Conference, 2022
2021
Generalizing AUC Optimization to Multiclass Classification for Audio Segmentation With Limited Training Data.
IEEE Signal Process. Lett., 2021
EURASIP J. Audio Speech Music. Process., 2021
Log-Likelihood-Ratio Cost Function as Objective Loss for Speaker Verification Systems.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Unsupervised Representation Learning for Speech Activity Detection in the Fearless Steps Challenge 2021.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Memory Layers with Multi-Head Attention Mechanisms for Text-Dependent Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the Fifth International Conference, 2021
Proceedings of the Fifth International Conference, 2021
Convolutional Recurrent Neural Networks for Speech Activity Detection in Naturalistic Audio from Apollo Missions.
Proceedings of the Fifth International Conference, 2021
2020
Multiclass audio segmentation based on recurrent neural networks for broadcast domain data.
EURASIP J. Audio Speech Music. Process., 2020
Optimization of the area under the ROC curve using neural network supervectors for text-dependent speaker verification.
Comput. Speech Lang., 2020
Proceedings of the Conversational Dialogue Systems for the Next Decade, 2020
Shouted Speech Compensation for Speaker Verification Robust to Vocal Effort Conditions.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Partial AUC Optimisation Using Recurrent Neural Networks for Music Detection with Limited Training Data.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Knowledge Distillation and Random Erasing Data Augmentation for Text-Dependent Speaker Verification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
Phonetically-Aware Embeddings, Wide Residual Networks with Time-Delay Neural Networks and Self Attention Models for the 2018 NIST Speaker Recognition Evaluation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Optimization of False Acceptance/Rejection Rates and Decision Threshold for End-to-End Text-Dependent Speaker Verification Systems.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
2018
Proces. del Leng. Natural, 2018
Proces. del Leng. Natural, 2018
Estimation of the Number of Speakers with Variational Bayesian PLDA in the DIHARD Diarization Challenge.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the Fourth International Conference, 2018
Proceedings of the Fourth International Conference, 2018
Differentiable Supervector Extraction for Encoding Speaker and Phrase Information in Text Dependent Speaker Verification.
Proceedings of the Fourth International Conference, 2018
Proceedings of the Fourth International Conference, 2018
Proceedings of the Fourth International Conference, 2018
2017
Domain Adaptation of PLDA Models in Broadcast Diarization by Means of Unsupervised Speaker Clustering.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
2016
Bayesian Networks to Model the Variability of Speaker Verification Scores in Adverse Environments.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Analysis of speech quality measures for the task of estimating the reliability of speaker verification decisions.
Speech Commun., 2016
Proces. del Leng. Natural, 2016
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2016
2015
Intelligibility Assessment and Speech Recognizer Word Accuracy Rate Prediction for Dysarthric Speakers in a Factor Analysis Subspace.
ACM Trans. Access. Comput., 2015
Albayzín-2014 evaluation: audio segmentation and classification in broadcast news domains.
EURASIP J. Audio Speech Music. Process., 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
2014
Low bit rate compression methods of feature vectors for distributed speech recognition.
Speech Commun., 2014
Audio segmentation-by-classification approach based on factor analysis in broadcast news domain.
EURASIP J. Audio Speech Music. Process., 2014
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Confidence Measures in Automatic Speech Recognition Systems for Error Detection in Restricted Domains.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014
A Preliminary Study of Acoustic Events Classification with Factor Analysis in Meeting Rooms.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014
2013
Quality Assessment for Speaker Diarization and Its Application in Speaker Characterization.
IEEE Trans. Speech Audio Process., 2013
TIMPANO: Technology for complex Human-Machine conversational interaction with dynamic learning.
Proces. del Leng. Natural, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Suprasegmental information modelling for autism disorder spectrum and specific language impairment classification.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the First Workshop on Speech, 2013
Prosodic features and formant modeling for an ivector-based language recognition system.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
The BLZ Submission to the NIST 2011 LRE: Data Collection, System Development and Performance.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Reliability Estimation of the Speaker Verification Decisions Using Bayesian Networks to Combine Information from Multiple Speech Quality Measures.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012
Voice Pathology Detection on the Saarbrücken Voice Database with Calibration and Fusion of Scores Using MultiFocal Toolkit.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012
Score Level versus Audio Level Fusion for Voice Pathology Detection on the Saarbrücken Voice Database.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012
2011
IEEE Trans. Speech Audio Process., 2011
Computación y Sistemas, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Hierarchical Audio Segmentation with HMM and Factor Analysis in Broadcast News Domain.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Intra-session variability compensation and a hypothesis generation and selection strategy for speaker segmentation.
Proceedings of the IEEE International Conference on Acoustics, 2011
Multi-site heterogeneous system fusions for the Albayzin 2010 Language Recognition Evaluation.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011
2010
Unsupervised Data-Driven Feature Vector Normalization With Acoustic Model Adaptation for Robust Speech Recognition.
IEEE Trans. Speech Audio Process., 2010
Confidence measures for speaker segmentation and their relation to speaker verification.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Non-linear predictive vector quantization of feature vectors for distributed speech recognition.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Differential vector quantization of feature vectors for distributed speech recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Unsupervised training scheme with non-stereo data for empirical feature vector compensation.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
2008
IEEE Trans. Speech Audio Process., 2008
Feature vector normalization with combined standard and throat microphones for robust ASR.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
2007
IEEE Trans. Speech Audio Process., 2007
Evaluation of the combined use of MEMLIN and MLLR on the non-native adaptation task of hiwire project database.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
On the jointly unsupervised feature vector normalization and acoustic model compensation for robust speech recognition.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
2006
Study of time and frequency variability in pathological speech and error reduction methods for automatic speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Time-dependent cross-probability model for multi-environment model based LInear normalization.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
IEEE Trans. Veh. Technol., 2005
IEEE Trans. Broadcast., 2005
IEEE Trans. Speech Audio Process., 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Robust speech recognition in cars using phoneme dependent multi-environment linear normalization.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
2004
Proceedings of the IEEE 15th International Symposium on Personal, 2004
AV@CAR: A Spanish Multichannel Multimodal Corpus for In-Vehicle Automatic Audio-Visual Speech Recognition.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004
Multi-environment models based linear normalization for speech recognition in car conditions.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
2002
Proceedings of the IEEE International Conference on Acoustics, 2002
Proceedings of the 11th European Signal Processing Conference, 2002
2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001