Gregory Sell

Hynek Hermansky

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

2020

State-of-the-art speaker recognition with neural network embeddings in NIST SRE18 and Speakers in the Wild evaluations.

[BibT_eX]

[DOI]

Pedro A. Torres-Carrasquillo

Fred Richardson

Réda Dehak

Najim Dehak

Comput. Speech Lang., 2020

Advances in Speaker Recognition for Telephone and Audio-Visual Data: the JHU-MIT Submission for NIST SRE19.

[BibT_eX]

[DOI]

Jesús Antonio Villalba López

Pedro Torres-Carrasquiilo

Saurabh Kataria

Phani Sankar Nidadavolu

Najim Dehak

Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

MagNetO: X-vector Magnitude Estimation Network plus Offset for Improved Speaker Recognition.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

A Practical Two-Stage Training Strategy for Multi-Stream End-to-End Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Jhu-HLTCOE System for the Voxsrc Speaker Recognition Challenge.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Analysis of Robustness of Deep Single-Channel Speech Separation Using Corpora Constructed From Multiple Domains.

[BibT_eX]

[DOI]

Matthew Maciejewski

Yusuke Fujita

Shinji Watanabe

Sanjeev Khudanpur

Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18.

[BibT_eX]

[DOI]

Pedro A. Torres-Carrasquillo

Daniel Povey

Sanjeev Khudanpur

Najim Dehak

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

The JHU Speaker Recognition System for the VOiCES 2019 Challenge.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Speaker Diarization Using Leave-One-Out Gaussian PLDA Clustering of DNN Embeddings.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Performance Monitoring for End-to-End Speech Recognition.

[BibT_eX]

[DOI]

Ruizhi Li

Hynek Hermansky

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Speaker Recognition Benchmark Using the CHiME-5 Corpus.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

x-Vector DNN Refinement with Full-Length Recordings for Speaker Recognition.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Script Identification using Across- and Within-Image Distribution Estimation.

[BibT_eX]

[DOI]

Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

A Synthetic Recipe for OCR.

[BibT_eX]

[DOI]

Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Speaker Recognition for Multi-speaker Conversations Using X-vectors.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Deriving Spectro-temporal Properties of Hearing from Speech Data.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Joint Acoustic and Class Inference for Weakly Supervised Sound Event Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Building Corpora for Single-Channel Speech Separation Across Multiple Domains.

[BibT_eX]

[DOI]

Matthew Maciejewski

Shinji Watanabe

Sanjeev Khudanpur

CoRR, 2018

Spoken Language Recognition using X-vectors.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Language Recognition for Telephone and Video Speech: The JHU HLTCOE Submission for NIST LRE17.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

X-Vectors: Robust DNN Embeddings for Speaker Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Audio-Visual Person Recognition in Multimedia Data From the Iarpa Janus Program.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Scalable out-of-sample extension of graph embeddings using deep neural networks.

[BibT_eX]

[DOI]

Aren Jansen

Vince Lyzinski

Pattern Recognit. Lett., 2017

Extended Variability Modeling and Unsupervised Adaptation for PLDA Speaker Recognition.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Multi-speaker conversations, cross-talk, and diarization for speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Speaker diarization using deep neural network embeddings.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Leveraging side information for speaker identification with the Enron conversational telephone speech collection.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016

Augmented Data Training of Joint Acoustic/Phonotactic DNN i-vectors for NIST LRE15.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Priors for Speaker Counting and Diarization with AHC.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015

Speaker diarization with i-vectors from DNN senone posteriors.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

An evaluation of graph clustering methods for unsupervised term discovery.

[BibT_eX]

[DOI]

Vince Lyzinski

Aren Jansen

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Content-based recommender systems for spoken documents.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Diarization resegmentation in the factor analysis subspace.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014

Speaker diarization with plda i-vector scoring and unsupervised calibration.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Music tonality features for speech/music discrimination.

[BibT_eX]

[DOI]

Pascal Clark

Proceedings of the IEEE International Conference on Acoustics, 2014

Automatic carrier pitch estimation for coherent demodulation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Optimizing coherent demodulation for improved separation of overlapping sources.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

2011

Speech recognitionwith segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

A novel approach using modulation features for multiphone-based speech recognition.

[BibT_eX]

[DOI]

Pascal Clark

Les E. Atlas

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

Solving Demodulation as an Optimization Problem.

[BibT_eX]

[DOI]

Malcolm Slaney

IEEE Trans. Speech Audio Process., 2010

The information content of demodulated speech.

[BibT_eX]

[DOI]