Nanxin Chen

Orcid: 0000-0001-6698-1604

According to our database1, Nanxin Chen authored at least 47 papers between 2014 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
E3 TTS: Easy End-to-End Diffusion-based Text to Speech.
CoRR, 2023

SLM: Bridge the thin gap between speech and text foundation models.
CoRR, 2023

Efficient Adapters for Giant Speech Models.
CoRR, 2023

How to Estimate Model Transferability of Pre-Trained Speech Models?
CoRR, 2023

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages.
CoRR, 2023

Noise2Music: Text-conditioned Music Generation with Diffusion Models.
CoRR, 2023

A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

SLM: Bridge the Thin Gap Between Speech and Text Foundation Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

E3 TTS: Easy End-to-End Diffusion-Based Text To Speech.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Residual Adapters for Few-Shot Text-to-Speech Speaker Adaptation.
CoRR, 2022

Maestro-U: Leveraging Joint Speech-Text Representation Learning for Zero Supervised Speech ASR.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping.
Proceedings of the Interspeech 2022, 2022

2021
Non-Autoregressive Transformer for Speech Recognition.
IEEE Signal Process. Lett., 2021

WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Align-Denoise: Single-Pass Non-Autoregressive Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

WaveGrad: Estimating Gradients for Waveform Generation.
Proceedings of the 9th International Conference on Learning Representations, 2021

Focus on the Present: A Regularization Method for the ASR Source-Target Attention Layer.
Proceedings of the IEEE International Conference on Acoustics, 2021

A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
State-of-the-art speaker recognition with neural network embeddings in NIST SRE18 and Speakers in the Wild evaluations.
Comput. Speech Lang., 2020

Advances in Speaker Recognition for Telephone and Audio-Visual Data: the JHU-MIT Submission for NIST SRE19.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Mask CTC: Non-Autoregressive End-to-End ASR with CTC and Mask Predict.
Proceedings of the Interspeech 2020, 2020

Robust Training of Vector Quantized Bottleneck Models.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

Improving Language Identification for Multilingual Speakers.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

X-Vectors Meet Emotions: A Study On Dependencies Between Emotion and Speaker Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Feature Enhancement with Deep Feature Losses for Speaker Verification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Zero-Shot Multi-Speaker Text-To-Speech with State-Of-The-Art Neural Speaker Embeddings.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Listen and Fill in the Missing Letters: Non-Autoregressive Transformer for Speech Recognition.
CoRR, 2019

State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18.
Proceedings of the Interspeech 2019, 2019

The JHU Speaker Recognition System for the VOiCES 2019 Challenge.
Proceedings of the Interspeech 2019, 2019

ASSERT: Anti-Spoofing with Squeeze-Excitation and Residual Networks.
Proceedings of the Interspeech 2019, 2019

Tied Mixture of Factor Analyzers Layer to Combine Frame Level Representations in Neural Speaker Embeddings.
Proceedings of the Interspeech 2019, 2019

A Comparative Study on Transformer vs RNN in Speech Applications.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Age Estimation in Short Speech Utterances Based on LSTM Recurrent Neural Networks.
IEEE Access, 2018

The MIT Lincoln Laboratory / JHU / EPITA-LSE LRE17 System.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018


End-to-end Deep Neural Network Age Estimation.
Proceedings of the Interspeech 2018, 2018

An Investigation of Non-linear i-vectors for Speaker Verification.
Proceedings of the Interspeech 2018, 2018

Measuring Uncertainty in Deep Regression Models: The Case of Age Estimation from Speech.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Deep Feature Engineering for Noise Robust Spoofing Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

End-to-end spoofing detection with raw waveform CLDNNS.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Deep features for automatic spoofing detection.
Speech Commun., 2016


2015
Deep feature for text-dependent speaker verification.
Speech Commun., 2015

Multi-task learning for text-dependent speaker verification.
Proceedings of the INTERSPEECH 2015, 2015

Robust deep feature for spoofing detection - the SJTU system for ASVspoof 2015 challenge.
Proceedings of the INTERSPEECH 2015, 2015

2014
Development of Early-Warning Model for Intensive Pig Breeding.
Proceedings of the Computer and Computing Technologies in Agriculture VIII, 2014


  Loading...