Gregory Sell

According to our database1, Gregory Sell authored at least 43 papers between 2010 and 2021.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2021
Two-Stage Augmentation and Adaptive CTC Fusion for Improved Robustness of Multi-Stream end-to-end ASR.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

2020
State-of-the-art speaker recognition with neural network embeddings in NIST SRE18 and Speakers in the Wild evaluations.
Comput. Speech Lang., 2020

Advances in Speaker Recognition for Telephone and Audio-Visual Data: the JHU-MIT Submission for NIST SRE19.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

MagNetO: X-vector Magnitude Estimation Network plus Offset for Improved Speaker Recognition.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

A Practical Two-Stage Training Strategy for Multi-Stream End-to-End Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Jhu-HLTCOE System for the Voxsrc Speaker Recognition Challenge.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Analysis of Robustness of Deep Single-Channel Speech Separation Using Corpora Constructed From Multiple Domains.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

The JHU Speaker Recognition System for the VOiCES 2019 Challenge.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Speaker Diarization Using Leave-One-Out Gaussian PLDA Clustering of DNN Embeddings.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Performance Monitoring for End-to-End Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Speaker Recognition Benchmark Using the CHiME-5 Corpus.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

x-Vector DNN Refinement with Full-Length Recordings for Speaker Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Script Identification using Across- and Within-Image Distribution Estimation.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

A Synthetic Recipe for OCR.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Speaker Recognition for Multi-speaker Conversations Using X-vectors.
Proceedings of the IEEE International Conference on Acoustics, 2019

Deriving Spectro-temporal Properties of Hearing from Speech Data.
Proceedings of the IEEE International Conference on Acoustics, 2019

Joint Acoustic and Class Inference for Weakly Supervised Sound Event Detection.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Building Corpora for Single-Channel Speech Separation Across Multiple Domains.
CoRR, 2018

Spoken Language Recognition using X-vectors.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Language Recognition for Telephone and Video Speech: The JHU HLTCOE Submission for NIST LRE17.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

X-Vectors: Robust DNN Embeddings for Speaker Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Audio-Visual Person Recognition in Multimedia Data From the Iarpa Janus Program.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Scalable out-of-sample extension of graph embeddings using deep neural networks.
Pattern Recognit. Lett., 2017

Extended Variability Modeling and Unsupervised Adaptation for PLDA Speaker Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Multi-speaker conversations, cross-talk, and diarization for speaker recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Speaker diarization using deep neural network embeddings.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Leveraging side information for speaker identification with the Enron conversational telephone speech collection.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Augmented Data Training of Joint Acoustic/Phonotactic DNN i-vectors for NIST LRE15.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Priors for Speaker Counting and Diarization with AHC.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Speaker diarization with i-vectors from DNN senone posteriors.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

An evaluation of graph clustering methods for unsupervised term discovery.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Content-based recommender systems for spoken documents.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Diarization resegmentation in the factor analysis subspace.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Speaker diarization with plda i-vector scoring and unsupervised calibration.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Music tonality features for speech/music discrimination.
Proceedings of the IEEE International Conference on Acoustics, 2014

Automatic carrier pitch estimation for coherent demodulation.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Optimizing coherent demodulation for improved separation of overlapping sources.
Proceedings of the IEEE International Conference on Acoustics, 2013

2011
Speech recognitionwith segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop.
Proceedings of the IEEE International Conference on Acoustics, 2011

A novel approach using modulation features for multiphone-based speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Solving Demodulation as an Optimization Problem.
IEEE Trans. Speech Audio Process., 2010

The information content of demodulated speech.
Proceedings of the IEEE International Conference on Acoustics, 2010


  Loading...