Jesús Villalba
Orcid: 0000-0001-9459-8426Affiliations:
- Johns Hopkins University, Center for Language and Speech Processing, Baltimore, MD, USA
- University of Zaragoza, Spain
According to our database1,
Jesús Villalba
authored at least 117 papers
between 2010 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on linkedin.com
-
on orcid.org
On csauthors.net:
Bibliography
2024
Comput. Biol. Medicine, March, 2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Time-Domain Speech Super-Resolution With GAN Based Modeling for Telephony Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Unraveling Adversarial Examples against Speaker Identification - Techniques for Attack Detection and Victim Model Classification.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024
Discovering Invariant Patterns of Cognitive Decline Via an Automated Analysis of the Cookie Thief Picture Description Task.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024
2023
Interpretable speech features vs. DNN embeddings: What to use in the automatic assessment of Parkinson's disease in multi-lingual scenarios.
Comput. Biol. Medicine, November, 2023
CoRR, 2023
CoRR, 2023
DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Self-FiLM: Conditioning GANs with self-supervised representations for bandwidth extension based speaker recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Do Phonatory Features Display Robustness to Characterize Parkinsonian Speech Across Corpora?
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Segmental SpeechCLIP: Utilizing Pretrained Image-text Models for Audio-Visual Learning.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Advances in Language Recognition in Low Resource African Languages: The JHU-MIT Submission for NIST LRE22.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Clustering Unsupervised Representations as Defense Against Poisoning Attacks on Speech Commands Classification System.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Joint Energy-Based Model for Robust Speech Classification System Against Dirty-Label Backdoor Poisoning Attacks.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
Unsupervised Speech Segmentation and Variable Rate Representation Learning Using Segmental Contrastive Predictive Coding.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Non-Contrastive Self-Supervised Learning for Utterance-Level Information Extraction From Speech.
IEEE J. Sel. Top. Signal Process., 2022
Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser.
CoRR, 2022
A Multi-Modal Array of Interpretable Features to Evaluate Language and Speech Patterns in Different Neurological Disorders.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Vsameter: Evaluation of a New Open-Source Tool to Measure Vowel Space Area and Related Metrics.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Advances in Cross-Lingual and Cross-Source Audio-Visual Speaker Recognition: The JHU-MIT System for NIST SRE21.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022
Advances in Speaker Recognition for Multilingual Conversational Telephone Speech: The JHU-MIT System for NIST SRE20 CTS Challenge.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
End-to-End Neural Speaker Diarization with an Iterative Refinement of Non-Autoregressive Attention-based Attractors.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Defense against Adversarial Attacks on Hybrid Speech Recognition System using Adversarial Fine-tuning with Denoiser.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
Study of Pre-Processing Defenses Against Adversarial Attacks on State-of-the-Art Speaker Recognition Systems.
IEEE Trans. Inf. Forensics Secur., 2021
IEEE Signal Process. Lett., 2021
Proceedings of the Statistical Language and Speech Processing, 2021
Representation Learning to Classify and Detect Adversarial Attacks Against Speaker and Speech Recognition Systems.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Spine2Net: SpineNet with Res2Net and Time-Squeeze-and-Excitation Blocks for Speaker Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Automatic Detection and Assessment of Alzheimer Disease Using Speech and Language Technologies in Low-Resource Scenarios.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Deep Feature CycleGANs: Speaker Identity Preserving Non-Parallel Microphone-Telephone Domain Adaptation for Speaker Verification.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Perceptual Loss Based Speech Denoising with an Ensemble of Audio Pattern Recognition and Self-Supervised Models.
Proceedings of the IEEE International Conference on Acoustics, 2021
Improving Reconstruction Loss Based Speaker Embedding in Unsupervised and Semi-Supervised Scenarios.
Proceedings of the IEEE International Conference on Acoustics, 2021
Focus on the Present: A Regularization Method for the ASR Source-Target Attention Layer.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
2020
State-of-the-art speaker recognition with neural network embeddings in NIST SRE18 and Speakers in the Wild evaluations.
Comput. Speech Lang., 2020
CoRR, 2020
Advances in Speaker Recognition for Telephone and Audio-Visual Data: the JHU-MIT Submission for NIST SRE19.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Black-Box Attacks on Spoofing Countermeasures Using Transferability of Adversarial Examples.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
x-Vectors Meet Adversarial Attacks: Benchmarking Adversarial Robustness in Speaker Verification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
X-Vectors Meet Emotions: A Study On Dependencies Between Emotion and Speaker Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
Listen and Fill in the Missing Letters: Non-Autoregressive Transformer for Speech Recognition.
CoRR, 2019
A forced gaussians based methodology for the differential evaluation of Parkinson's Disease by means of speech processing.
Biomed. Signal Process. Control., 2019
State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Tied Mixture of Factor Analyzers Layer to Combine Frame Level Representations in Neural Speaker Embeddings.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Investigation on Neural Bandwidth Extension of Telephone Speech for Improved Speaker Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019
Language Model Integration Based on Memory Control for Sequence to Sequence Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the 2019 IEEE Global Conference on Signal and Information Processing, 2019
Proceedings of the 2019 IEEE Global Conference on Signal and Information Processing, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
2018
Analysis of speaker recognition methodologies and the influence of kinetic changes to automatically detect Parkinson's Disease.
Appl. Soft Comput., 2018
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018
End-to-End versus Embedding Neural Networks for Language Recognition in Mismatched Conditions.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018
Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Joint Verification-Identification in end-to-end Multi-Scale CNN Framework for Topic Identification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Measuring Uncertainty in Deep Regression Models: The Case of Age Estimation from Speech.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the Fourth International Conference, 2018
2017
Domain Adaptation of PLDA Models in Broadcast Diarization by Means of Unsupervised Speaker Clustering.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
2016
Bayesian Networks to Model the Variability of Speaker Verification Scores in Adverse Environments.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Analysis of speech quality measures for the task of estimating the reliability of speaker verification decisions.
Speech Commun., 2016
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2016
2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014
2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Handling i-vectors from different recording conditions using multi-channel simplified PLDA in speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
Bayesian adaptation of PLDA based speaker recognition to domains with scarce development data.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012
The BLZ Submission to the NIST 2011 LRE: Data Collection, System Development and Performance.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Reliability Estimation of the Speaker Verification Decisions Using Bayesian Networks to Combine Information from Multiple Speech Quality Measures.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012
Voice Pathology Detection on the Saarbrücken Voice Database with Calibration and Fusion of Scores Using MultiFocal Toolkit.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012
2011
Computación y Sistemas, 2011
Towards Fully Bayesian Speaker Recognition: Integrating Out the Between-Speaker Covariance.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Hierarchical Audio Segmentation with HMM and Factor Analysis in Broadcast News Domain.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the International Carnahan Conference on Security Technology, 2011
Proceedings of the Biometrics and ID Management, 2011
Multi-site heterogeneous system fusions for the Albayzin 2010 Language Recognition Evaluation.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011
2010
Confidence measures for speaker segmentation and their relation to speaker verification.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, 2010