Sung-Feng Huang
Orcid: 0000-0002-9720-811X
According to our database1,
Sung-Feng Huang
authored at least 19 papers
between 2018 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Generative Speech Foundation Model Pretraining for High-Quality Speech Extraction and Restoration.
CoRR, 2024
2023
Personalized Lightweight Text-to-Speech: Voice Cloning with Adaptive Structured Pruning.
Proceedings of the IEEE International Conference on Acoustics, 2023
Maximizing Data Efficiency for Cross-Lingual TTS Adaptation by Self-Supervised Representation Mixing and Embedding Initialization.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
Learning Phone Recognition From Unpaired Audio and Phone Sequences Based on Generative Adversarial Network.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
Non-autoregressive Mandarin-English Code-switching Speech Recognition with Pinyin Mask-CTC and Word Embedding Regularization.
CoRR, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
2020
Self-supervised Pre-training Reduces Label Permutation Instability of Speech Separation.
CoRR, 2020
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
2019
Audio Word2vec: Sequence-to-Sequence Autoencoding for Unsupervised Learning of Audio Segmentation and Representation.
IEEE ACM Trans. Audio Speech Lang. Process., 2019
From Semi-supervised to Almost-unsupervised Speech Recognition with Very-low Resource by Jointly Learning Phonetic Structures from Audio and Text Embeddings.
CoRR, 2019
2018
Improved Audio Embeddings by Adjacency-Based Clustering with Applications in Spoken Term Detection.
CoRR, 2018
Almost-unsupervised Speech Recognition with Close-to-zero Resource Based on Phonetic Structures Learned from Very Small Unpaired Speech and Text Data.
CoRR, 2018
Towards Unsupervised Automatic Speech Recognition Trained by Unaligned Speech and Text only.
CoRR, 2018
Phonetic-and-Semantic Embedding of Spoken words with Applications in Spoken Content Retrieval.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018