Yao Qian

IEICE Trans. Inf. Syst., 2014

Dynamic facial expression recognition based on K-order emotional intensity model.

Changqin Quan

Fuji Ren

Proceedings of the 2014 IEEE International Conference on Robotics and Biomimetics, 2014

4.3 An 87%-peak-efficiency DVS-capable single-inductor 4-output DC-DC buck converter with ripple-based adaptive off-time control.

Danzhu Lu

Zhiliang Hong

Proceedings of the 2014 IEEE International Solid-State Circuits Conference, 2014

Pitch transformation in neural network based voice conversion.

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

A new Neural Network based logistic regression classifier for improving mispronunciation detection of L2 language learners.

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Modeling DCT parameterized F0 trajectory at intonation phrase level with DNN or decision tree.

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Sequence error (SE) minimization training of neural network for voice conversion.

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

TTS synthesis with bidirectional LSTM based recurrent neural networks.

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

On the training aspects of Deep Neural Network (DNN) for parametric TTS synthesis.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2014

A DNN-based acoustic modeling of tonal language and its application to Mandarin pronunciation training.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2014

2013

A Unified Trajectory Tiling Approach to High Quality Speech Rendering.

Zhi-Jie Yan

IEEE Trans. Speech Audio Process., 2013

A new preprocessing algorithm and local binary pattern based facial expression recognition.

Fuji Ren

Changqin Quan

Proceedings of the 2013 IEEE/SICE International Symposium on System Integration, 2013

A new DNN-based high quality pronunciation evaluation for computer-aided language learning (CALL).

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A fast table lookup based, statistical model driven non-uniform unit selection TTS.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2013

VCCS controlled LDO with small on-chip capacitor.

Proceedings of the IEEE 10th International Conference on ASIC, 2013

2012

Computer-Assisted Audiovisual Language Learning.

Computer, 2012

Tip tap tones: mobile microtraining of Mandarin sounds.

Proceedings of the Mobile HCI '12, 2012

Break index labeling of Mandarin text via syntactic-to-prosodic tree mapping.

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

A unified trajectory tiling approach to high quality TTS and cross-lingual voice transformation.

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Pitch accent detection and prediction with DCT features and CRF model.

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Turning a Monolingual Speaker into Multilingual for a Mixed-language TTS.

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011

Improved Prosody Generation by Maximizing Joint Probability of State and Longer Units.

IEEE Trans. Speech Audio Process., 2011

A New Phonetic Candidate Generator for Improving Search Query Efficiency.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

A frame mapping based HMM approach to cross-lingual voice transformation.

Ji Xu

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2011

Improved F0 modeling and generation in voice conversion.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2011

2010

Automatic prosody prediction and detection with Conditional Random Field (CRF) models.

Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Formant-based frequency warping for improving speaker adaptation in HMM TTS.

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

An HMM trajectory tiling (HTT) approach to high quality TTS.

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Improved modeling for F0 generation and V/U decision in HMM-based TTS.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2010

Rich-context Unit Selection (RUS) approach to high quality TTS.

Zhi-Jie Yan

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2010

An HMM Trajectory Tiling (HTT) Approach to High Quality TTS - Microsoft Entry to Blizzard Challenge 2010.

Proceedings of the Blizzard Challenge 2010, Kansai Science City, Japan, September 25, 2010, 2010

2009

A Cross-Language State Sharing and Mapping Approach to Bilingual (Mandarin-English) TTS.

Hui Liang

IEEE Trans. Speech Audio Process., 2009

A Multi-Space Distribution (MSD) and two-stream tone modeling approach to Mandarin speech recognition.

Speech Commun., 2009

Rich context modeling for high quality HMM-based TTS.

Zhi-Jie Yan

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

A minimum v/u error approach to F0 generation in HMM-based TTS.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Improved prosody generation by maximizing joint likelihood of state and longer units.

Zhizheng Wu

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2009

State mapping for cross-language speaker adaptation in TTS.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2009

2008

Tone-enhanced generalized character posterior probability (GCPP) for Cantonese LVCSR.

Comput. Speech Lang., 2008

Modeling and Generating Tone Contour with Phrase Intonation for Mandarin Chinese Speech.

Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

HMM-Based Mixed-Language (Mandarin-English) Speech Synthesis.

Houwei Cao

Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Prosody for Mandarin speech recognition: a comparative study of read and spontaneous speech.

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

A real-time text to audio-visual speech synthesis system.

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Generating natural F0 trajectory with additive trees.

Hui Liang

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Duration refinement by jointly optimizing state and longer unit likelihood.

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

A cross-language state mapping approach to bilingual (Mandarin-English) TTS.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2008

2007

An HMM-based bilingual (Mandarin-English) TTS.

Hui Liang

Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

Robust F0 modeling for Mandarin speech recognition in noise.

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

2006

Improved Mandarin Speech Recognition by Lattice Rescoring with Enhanced Tone Models.

Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

An HMM-Based Mandarin Chinese Text-To-Speech System.

Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

A multi-space distribution (MSD) approach to speech recognition of tonal languages.

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

2004

Analysis and modeling of F0 contours for Cantonese text-to-speech.

Yujia Li

ACM Trans. Asian Lang. Inf. Process., 2004

Tone information as a confidence measure for improving Cantonese LVCSR.

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2003

Overlapped di-tone modeling for tone recognition in continuous Cantonese speech.

Yujia Li

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002

Acoustical F0 analysis of continuous Cantonese speech.

Yujia Li

Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002

Assigning phrase accent to Chinese Text-to-Speech system.

Fang Chen

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2002

2001

Locating Boundaries for Prosodic Constituents in Unrestricted Mandarin Texts.

Min Chu

Int. J. Comput. Linguistics Chin. Lang. Process., 2001

Segmenting unrestricted Chinese text into prosodic words instead of lexical words.
