Nanxin Chen

Piotr Zelasko

Laureano Moro-Velázquez

Leibny Paola García-Perera

Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

WaveGrad: Estimating Gradients for Waveform Generation.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

Focus on the Present: A Regularization Method for the ASR Source-Target Attention Layer.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020

State-of-the-art speaker recognition with neural network embeddings in NIST SRE18 and Speakers in the Wild evaluations.

[BibT_eX]

[DOI]

Fred Richardson

Réda Dehak

Pedro A. Torres-Carrasquillo

Jesús Antonio Villalba López

Comput. Speech Lang., 2020

Advances in Speaker Recognition for Telephone and Audio-Visual Data: the JHU-MIT Submission for NIST SRE19.

[BibT_eX]

[DOI]

Leibny Paola García-Perera

Saurabh Kataria

Pedro Torres-Carrasquiilo

Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Mask CTC: Non-Autoregressive End-to-End ASR with CTC and Mask Predict.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2020, 2020

Robust Training of Vector Quantized Bottleneck Models.

[BibT_eX]

[DOI]

Hans J. G. A. Dolfing

Sameer Khurana

Tanel Alumäe

Antoine Laurent

Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

Improving Language Identification for Multilingual Speakers.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

X-Vectors Meet Emotions: A Study On Dependencies Between Emotion and Speaker Recognition.

[BibT_eX]

[DOI]

Raghavendra Pappagari

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Feature Enhancement with Deep Feature Losses for Speaker Verification.

[BibT_eX]

[DOI]

Saurabh Kataria

L. Paola García-Perera

Leibny Paola García-Perera

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Zero-Shot Multi-Speaker Text-To-Speech with State-Of-The-Art Neural Speaker Embeddings.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Listen and Fill in the Missing Letters: Non-Autoregressive Transformer for Speech Recognition.

[BibT_eX]

[DOI]

CoRR, 2019

State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18.

[BibT_eX]

[DOI]

Daniel Povey

Pedro A. Torres-Carrasquillo

Sanjeev Khudanpur

Proceedings of the Interspeech 2019, 2019

The JHU Speaker Recognition System for the VOiCES 2019 Challenge.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2019, 2019

ASSERT: Anti-Spoofing with Squeeze-Excitation and Residual Networks.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2019, 2019

Tied Mixture of Factor Analyzers Layer to Combine Frame Level Representations in Neural Speaker Embeddings.

[BibT_eX]

[DOI]

Nelson Enrique Yalta Soplin

Proceedings of the Interspeech 2019, 2019

A Comparative Study on Transformer vs RNN in Speech Applications.

[BibT_eX]

[DOI]

Ryuichi Yamamoto

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018

Age Estimation in Short Speech Utterances Based on LSTM Recurrent Neural Networks.

[BibT_eX]

[DOI]

Rubén Zazo-Candil

Joaquin Gonzalez-Rodriguez

Pedro A. Torres-Carrasquillo

IEEE Access, 2018

The MIT Lincoln Laboratory / JHU / EPITA-LSE LRE17 System.

[BibT_eX]

[DOI]

Fred Richardson

Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

ESPnet: End-to-End Speech Processing Toolkit.

[BibT_eX]

[DOI]

Nelson Enrique Yalta Soplin

Jahn Heymann

Matthew Wiesner

Adithya Renduchintala

Tsubasa Ochiai

Proceedings of the Interspeech 2018, 2018

End-to-end Deep Neural Network Age Estimation.

[BibT_eX]

[DOI]

Pegah Ghahremani

Proceedings of the Interspeech 2018, 2018

An Investigation of Non-linear i-vectors for Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2018, 2018

Measuring Uncertainty in Deep Regression Models: The Case of Age Estimation from Speech.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Deep Feature Engineering for Noise Robust Spoofing Detection.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2017

End-to-end spoofing detection with raw waveform CLDNNS.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016

Deep features for automatic spoofing detection.

[BibT_eX]

[DOI]

Yanmin Qian

Ricardo Paranhos Velloso Violato

Kai Yu

Speech Commun., 2016

Overview of BTAS 2016 speaker anti-spoofing competition.

[BibT_eX]

[DOI]

Flávio Olmos Simões

M. U. Neto

Marcus de Assis Angeloni

Proceedings of the 8th IEEE International Conference on Biometrics Theory, 2016

2015

Deep feature for text-dependent speaker verification.

[BibT_eX]

[DOI]

Speech Commun., 2015

Multi-task learning for text-dependent speaker verification.

[BibT_eX]

[DOI]