Guanlong Zhao

Angelo Scorza Scarpati

CoRR, 2023

Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network.

[BibT_eX]

[DOI]

CoRR, 2023

Augmenting Transformer-Transducer Based Speaker Change Detection with Token-Level Training Loss.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Exploring Sequence-to-Sequence Transformer-Transducer Models for Keyword Spotting.

[BibT_eX]

[DOI]

Beltrán Labrador

Ignacio López-Moreno

Angelo Scorza Scarpati

Liam Fowl

Quan Wang

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Accentron: Foreign accent conversion to arbitrary non-native speakers using zero-shot learning.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2022

Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering.

[BibT_eX]

[DOI]

CoRR, 2022

2021

Converting Foreign Accent Speech Without a Reference.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Effects of Voice Type and Task on L2 Learners' Awareness of Pronunciation Errors.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Assessing Posterior-Based Mispronunciation Detection on Field-Collected Recordings from Child Speech Therapy Sessions.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020

Learning Structured Sparse Representations for Voice Conversion.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

LSTM Acoustic Models Learn to Align and Pronounce with Graphemes.

[BibT_eX]

[DOI]

CoRR, 2020

Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Understanding the Effect of Voice Quality and Accent on Talker Similarity.

[BibT_eX]

[DOI]

Anurag Das

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019

Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Golden speaker builder - An interactive tool for pronunciation training.

[BibT_eX]

[DOI]

Speech Commun., 2019

Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018

Improved Techniques for Learning to Dehaze and Beyond: A Collective Study.

[BibT_eX]

[DOI]

CoRR, 2018

PAD-Net: A Perception-Aided Single Image Dehazing Network.

[BibT_eX]

[DOI]

Yu Liu

CoRR, 2018

L2-ARCTIC: A Non-native English Speech Corpus.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Improving Sparse Representations in Exemplar-Based Voice Conversion with a Phoneme-Selective Objective Function.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Accent Conversion Using Phonetic Posteriorgrams.

[BibT_eX]

[DOI]

Sinem Sonsaat

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Voice Conversion Through Residual Warping in a Sparse, Anchor-Based Representation of Speech.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Exemplar selection methods in voice conversion.

[BibT_eX]

[DOI]