Yongguo Kang

According to our database1, Yongguo Kang authored at least 13 papers between 2004 and 2019.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2019
Multi-reference Tacotron by Intercross Training for Style Disentangling, Transfer and Control in Speech Synthesis.
CoRR, 2019

2018
EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Multi-task WaveNet: A Multi-task Generative Model for Statistical Parametric Speech Synthesis without Fundamental Frequency Conditions.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017
Deep Voice: Real-time Neural Text-to-Speech.
CoRR, 2017

Deep Voice: Real-time Neural Text-to-Speech.
Proceedings of the 34th International Conference on Machine Learning, 2017

2013
Multi-centroidal duration generation algorithm for HMM-based TTS.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

2006
Prosody conversion from neutral speech to emotional speech.
IEEE Trans. Speech Audio Process., 2006

Nonlinear Emotional Prosody Generation and Annotation.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Applying Pitch Target Model to Convert F0 Contour for Expressive Mandarin Speech Synthesis.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Features Importance Analysis for Emotional Speech Classification.
Proceedings of the Affective Computing and Intelligent Interaction, 2005

A Hybrid GMM and Codebook Mapping Method for Spectral Conversion.
Proceedings of the Affective Computing and Intelligent Interaction, 2005

2004
Multi-source based acoustic model for speech synthesis.
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004

A new multicomponent AM-FM demodulation with predicting frequency boundaries and its application to formant estimation.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004


  Loading...