Guanlong Zhao

Orcid: 0000-0002-6059-4053

According to our database, Guanlong Zhao authored at least 25 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
DiarizationLM: Speaker Diarization Post-Processing with Large Language Models.
CoRR, 2024

USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Personalizing Keyword Spotting with Speaker Information.
CoRR, 2023

Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network.
CoRR, 2023

Augmenting Transformer-Transducer Based Speaker Change Detection with Token-Level Training Loss.
Proceedings of the IEEE International Conference on Acoustics, 2023

Exploring Sequence-to-Sequence Transformer-Transducer Models for Keyword Spotting.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Accentron: Foreign accent conversion to arbitrary non-native speakers using zero-shot learning.
Comput. Speech Lang., 2022

Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering.
CoRR, 2022

2021
Converting Foreign Accent Speech Without a Reference.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Effects of Voice Type and Task on L2 Learners' Awareness of Pronunciation Errors.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Assessing Posterior-Based Mispronunciation Detection on Field-Collected Recordings from Child Speech Therapy Sessions.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020
Learning Structured Sparse Representations for Voice Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

LSTM Acoustic Models Learn to Align and Pronounce with Graphemes.
CoRR, 2020

Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Understanding the Effect of Voice Quality and Accent on Talker Similarity.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019
Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Golden speaker builder - An interactive tool for pronunciation training.
Speech Commun., 2019

Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018
Improved Techniques for Learning to Dehaze and Beyond: A Collective Study.
CoRR, 2018

PAD-Net: A Perception-Aided Single Image Dehazing Network.
CoRR, 2018

L2-ARCTIC: A Non-native English Speech Corpus.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Improving Sparse Representations in Exemplar-Based Voice Conversion with a Phoneme-Selective Objective Function.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Accent Conversion Using Phonetic Posteriorgrams.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Voice Conversion Through Residual Warping in a Sparse, Anchor-Based Representation of Speech.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Exemplar selection methods in voice conversion.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

