We stand with Ukraine

We stand with Ukraine

Sung-Feng Huang

Orcid: 0000-0002-9720-811X

According to our database¹, Sung-Feng Huang authored at least 20 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

2018

2019

2020

2021

2022

2023

2024

2025

0

1

2

3

4

5

6

1

1

2

3

1

2

3

2

1

2

1

1

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits.

[BibT_eX]

[DOI]

Sung-Feng Huang

,

,

,

,

Chao-Han Huck Yang

,

,

Yu-Chiang Frank Wang

,

,

CoRR, January, 2025

2024

Generative Speech Foundation Model Pretraining for High-Quality Speech Extraction and Restoration.

[BibT_eX]

[DOI]

,

Alexander H. Liu

,

,

Sung-Feng Huang

,

,

CoRR, 2024

2023

Personalized Lightweight Text-to-Speech: Voice Cloning with Adaptive Structured Pruning.

[BibT_eX]

[DOI]

Sung-Feng Huang

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2023

Maximizing Data Efficiency for Cross-Lingual TTS Adaptation by Self-Supervised Representation Mixing and Embedding Initialization.

[BibT_eX]

[DOI]

,

Sung-Feng Huang

,

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022

Learning Phone Recognition From Unpaired Audio and Phone Sequences Based on Generative Adversarial Network.

[BibT_eX]

[DOI]

,

,

,

Sung-Feng Huang

,

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2022

Meta-TTS: Meta-Learning for Few-Shot Speaker Adaptive Text-to-Speech.

[BibT_eX]

[DOI]

Sung-Feng Huang

,

,

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2022

Few Shot Cross-Lingual TTS Using Transferable Phoneme Embedding.

[BibT_eX]

[DOI]

,

,

Sung-Feng Huang

,

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021

Meta-TTS: Meta-Learning for Few-Shot Speaker Adaptive Text-to-Speech.

[BibT_eX]

[DOI]

Sung-Feng Huang

,

,

CoRR, 2021

SpeechNet: A Universal Modularized Model for Speech Processing Tasks.

[BibT_eX]

[DOI]

,

,

,

,

,

Sung-Feng Huang

,

,

,

Cheng-Kuang Lee

,

CoRR, 2021

Non-autoregressive Mandarin-English Code-switching Speech Recognition with Pinyin Mask-CTC and Word Embedding Regularization.

[BibT_eX]

[DOI]

,

,

Sung-Feng Huang

,

CoRR, 2021

Stabilizing Label Assignment for Speech Separation by Self-Supervised Pre-Training.

[BibT_eX]

[DOI]

Sung-Feng Huang

,

,

,

,

,

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Non-Autoregressive Mandarin-English Code-Switching Speech Recognition.

[BibT_eX]

[DOI]

,

,

Sung-Feng Huang

,

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020

Self-supervised Pre-training Reduces Label Permutation Instability of Speech Separation.

[BibT_eX]

[DOI]

Sung-Feng Huang

,

,

,

,

,

CoRR, 2020

Pretrained Language Model Embryology: The Birth of ALBERT.

[BibT_eX]

[DOI]

David Cheng-Han Chiang

,

Sung-Feng Huang

,

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019

Audio Word2vec: Sequence-to-Sequence Autoencoding for Unsupervised Learning of Audio Segmentation and Representation.

[BibT_eX]

[DOI]

,

Sung-Feng Huang

,

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2019

From Semi-supervised to Almost-unsupervised Speech Recognition with Very-low Resource by Jointly Learning Phonetic Structures from Audio and Text Embeddings.

[BibT_eX]

[DOI]

,

Sung-Feng Huang

,

,

CoRR, 2019

2018

Improved Audio Embeddings by Adjacency-Based Clustering with Applications in Spoken Term Detection.

[BibT_eX]

[DOI]

Sung-Feng Huang

,

,

,

CoRR, 2018

Almost-unsupervised Speech Recognition with Close-to-zero Resource Based on Phonetic Structures Learned from Very Small Unpaired Speech and Text Data.

[BibT_eX]

[DOI]

,

,

Sung-Feng Huang

,

,

CoRR, 2018

Towards Unsupervised Automatic Speech Recognition Trained by Unaligned Speech and Text only.

[BibT_eX]

[DOI]

,

,

Sung-Feng Huang

,

CoRR, 2018

Phonetic-and-Semantic Embedding of Spoken words with Applications in Spoken Content Retrieval.

[BibT_eX]

[DOI]

,

Sung-Feng Huang

,

,

,

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Loading...