Najim Dehak
Orcid: 0000-0002-4489-5753Affiliations:
- MIT, Cambridge, USA
According to our database1,
Najim Dehak
authored at least 183 papers
between 2006 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Comput. Biol. Medicine, March, 2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Time-Domain Speech Super-Resolution With GAN Based Modeling for Telephony Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.
CoRR, 2024
Explainable Metrics for the Assessment of Neurodegenerative Diseases through Handwriting Analysis.
CoRR, 2024
SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis.
CoRR, 2024
Unraveling Adversarial Examples against Speaker Identification - Techniques for Attack Detection and Victim Model Classification.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024
Odyssey 2024 - Speech Emotion Recognition Challenge: Dataset, Baseline Framework, and Results.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024
Discovering Invariant Patterns of Cognitive Decline Via an Automated Analysis of the Cookie Thief Picture Description Task.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Finding Spoken Identifications: Using GPT-4 Annotation for an Efficient and Fast Dataset Creation Pipeline.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
2023
Interpretable speech features vs. DNN embeddings: What to use in the automatic assessment of Parkinson's disease in multi-lingual scenarios.
Comput. Biol. Medicine, November, 2023
CoRR, 2023
CoRR, 2023
DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Self-FiLM: Conditioning GANs with self-supervised representations for bandwidth extension based speaker recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Do Phonatory Features Display Robustness to Characterize Parkinsonian Speech Across Corpora?
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Segmental SpeechCLIP: Utilizing Pretrained Image-text Models for Audio-Visual Learning.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Advances in Language Recognition in Low Resource African Languages: The JHU-MIT Submission for NIST LRE22.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Clustering Unsupervised Representations as Defense Against Poisoning Attacks on Speech Commands Classification System.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Joint Energy-Based Model for Robust Speech Classification System Against Dirty-Label Backdoor Poisoning Attacks.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
Unsupervised Speech Segmentation and Variable Rate Representation Learning Using Segmental Contrastive Predictive Coding.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Non-Contrastive Self-Supervised Learning for Utterance-Level Information Extraction From Speech.
IEEE J. Sel. Top. Signal Process., 2022
Comput. Speech Lang., 2022
Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser.
CoRR, 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
A Multi-Modal Array of Interpretable Features to Evaluate Language and Speech Patterns in Different Neurological Disorders.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Vsameter: Evaluation of a New Open-Source Tool to Measure Vowel Space Area and Related Metrics.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Advances in Cross-Lingual and Cross-Source Audio-Visual Speaker Recognition: The JHU-MIT System for NIST SRE21.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022
Advances in Speaker Recognition for Multilingual Conversational Telephone Speech: The JHU-MIT System for NIST SRE20 CTS Challenge.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
End-to-End Neural Speaker Diarization with an Iterative Refinement of Non-Autoregressive Attention-based Attractors.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Defense against Adversarial Attacks on Hybrid Speech Recognition System using Adversarial Fine-tuning with Denoiser.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
Study of Pre-Processing Defenses Against Adversarial Attacks on State-of-the-Art Speaker Recognition Systems.
IEEE Trans. Inf. Forensics Secur., 2021
What Helps Transformers Recognize Conversational Structure? Importance of Context, Punctuation, and Labels in Dialog Act Recognition.
Trans. Assoc. Comput. Linguistics, 2021
IEEE Signal Process. Lett., 2021
Advances in Parkinson's Disease detection and assessment using voice and speech: A review of the articulatory and phonatory aspects.
Biomed. Signal Process. Control., 2021
Proceedings of the Statistical Language and Speech Processing, 2021
Representation Learning to Classify and Detect Adversarial Attacks Against Speaker and Speech Recognition Systems.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Spine2Net: SpineNet with Res2Net and Time-Squeeze-and-Excitation Blocks for Speaker Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Automatic Detection and Assessment of Alzheimer Disease Using Speech and Language Technologies in Low-Resource Scenarios.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Deep Feature CycleGANs: Speaker Identity Preserving Non-Parallel Microphone-Telephone Domain Adaptation for Speaker Verification.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Align or attend? Toward More Efficient and Accurate Spoken Word Discovery Using Speech-to-Image Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Perceptual Loss Based Speech Denoising with an Ensemble of Audio Pattern Recognition and Self-Supervised Models.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Improving Reconstruction Loss Based Speaker Embedding in Unsupervised and Semi-Supervised Scenarios.
Proceedings of the IEEE International Conference on Acoustics, 2021
Focus on the Present: A Regularization Method for the ASR Source-Target Attention Layer.
Proceedings of the IEEE International Conference on Acoustics, 2021
New tools for the differential evaluation of Parkinson's disease using voice and speech processing.
Proceedings of the Fifth International Conference, 2021
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
Joint Prediction of Truecasing and Punctuation for Conversational Speech in Low-Resource Scenarios.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
2020
Analysis of the Effects of Supraglottal Tract Surgical Procedures in Automatic Speaker Recognition Performance.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Introduction to the Issue on Automatic Assessment of Health Disorders Based on Voice, Speech, and Language Processing.
IEEE J. Sel. Top. Signal Process., 2020
State-of-the-art speaker recognition with neural network embeddings in NIST SRE18 and Speakers in the Wild evaluations.
Comput. Speech Lang., 2020
Comput. Speech Lang., 2020
CoRR, 2020
Advances in Speaker Recognition for Telephone and Audio-Visual Data: the JHU-MIT Submission for NIST SRE19.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Black-Box Attacks on Spoofing Countermeasures Using Transferability of Adversarial Examples.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
That Sounds Familiar: An Analysis of Phonetic Representations Transfer Across Languages.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
x-Vectors Meet Adversarial Attacks: Benchmarking Adversarial Robustness in Speaker Verification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Using State of the Art Speaker Recognition and Natural Language Processing Technologies to Detect Alzheimer's Disease and Assess its Severity.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Punctuation Prediction in Spontaneous Conversations: Can We Mitigate ASR Errors with Retrofitted Word Embeddings?
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
X-Vectors Meet Emotions: A Study On Dependencies Between Emotion and Speaker Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
Listen and Fill in the Missing Letters: Non-Autoregressive Transformer for Speech Recognition.
CoRR, 2019
Speaker Sincerity Detection based on Covariance Feature Vectors and Ensemble Methods.
CoRR, 2019
A forced gaussians based methodology for the differential evaluation of Parkinson's Disease by means of speech processing.
Biomed. Signal Process. Control., 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
MCE 2018: The 1st Multi-Target Speaker Detection and Identification Challenge Evaluation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Improving Emotion Identification Using Phone Posteriors in Raw Speech Waveform Based DNN.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Study of the Performance of Automatic Speech Recognition Systems in Speakers with Parkinson's Disease.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Tied Mixture of Factor Analyzers Layer to Combine Frame Level Representations in Neural Speaker Embeddings.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Investigation on Neural Bandwidth Extension of Telephone Speech for Improved Speaker Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Language Model Integration Based on Memory Control for Sequence to Sequence Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the 2019 IEEE Global Conference on Signal and Information Processing, 2019
Proceedings of the 2019 IEEE Global Conference on Signal and Information Processing, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
2018
Digit. Signal Process., 2018
MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation (MCE) Plan, Dataset and Baseline System.
CoRR, 2018
The JHU Speech LOREHLT 2017 System: Cross-Language Transfer for Situation-Frame Detection.
CoRR, 2018
Analysis of speaker recognition methodologies and the influence of kinetic changes to automatically detect Parkinson's Disease.
Appl. Soft Comput., 2018
IEEE Access, 2018
Building an ASR System for Mboshi Using A Cross-Language Definition of Acoustic Units Approach.
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018
End-to-End versus Embedding Neural Networks for Language Recognition in Mismatched Conditions.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Automatic Speech Recognition and Topic Identification from Speech for Almost-Zero-Resource Languages.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Joint Verification-Identification in end-to-end Multi-Scale CNN Framework for Topic Identification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Characterizing Performance of Speaker Diarization Systems on Far-Field Speech Using Standard Methods.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Measuring Uncertainty in Deep Regression Models: The Case of Age Estimation from Speech.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the Fourth International Conference, 2018
Study of the Automatic Detection of Parkison's Disease Based on Speaker Recognition Technologies and Allophonic Distillation.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018
2017
Language Independent Assessment of Motor Impairments of Patients with Parkinson's Disease Using i-Vectors.
Proceedings of the Text, Speech, and Dialogue - 20th International Conference, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Evaluation of the Neurological State of People with Parkinson's Disease Using i-Vectors.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Multi-view representation learning via gcca for multimodal analysis of Parkinson's disease.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
2016
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
2015
IEEE Signal Process. Lett., 2015
Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
2014
Non-Negative Factor Analysis of Gaussian Mixture Model Weight Adaptation for Language and Dialect Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
2013
IEEE Trans. Speech Audio Process., 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
2011
IEEE Trans. Speech Audio Process., 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
The MIT LL 2010 speaker recognition evaluation system: Scalable language-independent speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
2010
Unsupervised Speaker Adaptation based on the Cosine Similarity for Text-Independent Speaker Verification.
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010
An i-vector Extractor Suitable for Speaker Recognition with both Microphone and Telephone Speech.
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010
2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Support vector machines versus fast scoring in the low-dimensional total variability space for speaker verification.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Comparison of scoring methods used in speaker recognition with Joint Factor Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
2008
IEEE Trans. Speech Audio Process., 2008
Proceedings of the Odyssey 2008: The Speaker and Language Recognition Workshop, 2008
Proceedings of the Odyssey 2008: The Speaker and Language Recognition Workshop, 2008
Comparison between factor analysis and GMM support vector machines for speaker verification.
Proceedings of the Odyssey 2008: The Speaker and Language Recognition Workshop, 2008
Development of the primary CRIM system for the NIST 2008 speaker recognition evaluation.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
2007
IEEE Trans. Speech Audio Process., 2007
Continuous prosodic features and formant modeling with joint factor analysis for speaker verification.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
2006
Proceedings of the Odyssey 2006: The Speaker and Language Recognition Workshop, 2006
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006