Lianhong Cai
According to our database1,
Lianhong Cai
authored at least 139 papers
between 1987 and 2022.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2022
PeerJ Comput. Sci., 2022
Proceedings of the Computer Science and Education - 17th International Conference, 2022
2018
Siamese Recurrent Auto-Encoder Representation for Query-by-Example Spoken Term Detection.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Emotion Recognition from Variable-Length Speech Segments Using Deep Learning on Spectrograms.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Applying Multitask Learning to Acoustic-Phonemic Model for Mispronunciation Detection and Diagnosis in L2 English Speech.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Emphatic Speech Generation with Conditioned Input Layer and Bidirectional LSTMS for Expressive Speech Synthesis.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Multi-modal Multi-scale Speech Expression Evaluation in Computer-Assisted Language Learning.
Proceedings of the Artificial Intelligence and Mobile Services - AIMS 2018, 2018
2017
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Speech Emotion Recognition with Emotion-Pair Based Framework Considering Emotion Distribution Information in Dimensional Emotion Space.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Spectro-Temporal Modelling with Time-Frequency LSTM and Structured Output Layer for Voice Conversion.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Multi-Task Learning for Prosodic Structure Generation Using BLSTM RNN with Structured Output Layer.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Learning cross-lingual knowledge with multilingual BLSTM for emphasis detection with limited training data.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Multi-task learning of structured output layer bidirectional LSTMS for speech synthesis.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
2016
Learning robust uniform features for cross-media social data by using cross autoencoders.
Knowl. Based Syst., 2016
CoRR, 2016
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
THear: Development of a mobile multimodal audiometry application on a cross-platform framework.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Combining CNN and BLSTM to Extract Textual and Acoustic Features for Recognizing Stances in Mandarin Ideological Debate Competition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Expressive Speech Driven Talking Avatar Synthesis with DBLSTM Using Limited Amount of Emotional Bimodal Data.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Heterogeneity-entropy based unsupervised feature learning for personality prediction with cross-media data.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016
Recognizing stances in Mandarin social ideological debates with text and acoustic features.
Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016
Learning cross-lingual information with multilingual BLSTM for speech synthesis of low-resource languages.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Question detection from acoustic features using recurrent neural network with gated recurrent unit.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
A deep bidirectional long short-term memory based multi-scale approach for music dynamic emotion prediction.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Low level descriptors based DBLSTM bottleneck feature for speech driven talking avatar.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
IEEE Trans. Affect. Comput., 2015
Multim. Tools Appl., 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015
HMM-based emphatic speech synthesis for corrective feedback in computer-aided pronunciation training.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Understanding speaking styles of internet speech data with LSTM and low-resource training.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
2014
Synthesizing English emphatic speech for multimodal corrective feedback in computer-aided pronunciation training.
Multim. Tools Appl., 2014
Multim. Tools Appl., 2014
Grading the Severity of Mispronunciations in CAPT Based on Statistical Analysis and Computational Speech Perception.
J. Comput. Sci. Technol., 2014
Sci. China Inf. Sci., 2014
Proceedings of the Social Media Processing - Third National Conference, 2014
Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014
User-level psychological stress detection from social media using deep neural network.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Using conditional random fields to predict focus word pair in spontaneous spoken English.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Improved keyword spotting system by optimizing posterior confidence measure vector using feed-forward neural network.
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014
Acoustics, content and geo-information based sentiment prediction from large-scale networked voice data.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014
Psychological stress detection from cross-media microblog data using Deep Sparse Neural Network.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
2013
Feature Learning with Gaussian Restricted Boltzmann Machine for Robust Speech Recognition.
CoRR, 2013
Proceedings of the ACM Multimedia Conference, 2013
Proceedings of the Ninth International Conference on Natural Computation, 2013
Proceedings of the IEEE International Conference on Image Processing, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
TalkingAndroid: An interactive, multimodal and real-time talking avatar application on mobile phones.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
Comparing feature dimension reduction algorithms for GMM-SVM based speech emotion recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
2012
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Can we understand van gogh's mood?: learning to infer affects from images in social networks.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Adaptive named entity recognition based on conditional random fields with automatic updated dynamic gazetteers.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Detection and emphatic realization of contrastive word pairs for expressive text-to-speech synthesis.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Perceptual clustering based unit selection optimization for concatenative text-to-speech synthesis.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Hierarchical English Emphatic Speech Synthesis Based on HMM with Limited Training Data.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Intention understanding based on multi-source information integration for Chinese Mandarin spoken commands.
Proceedings of the 9th International Conference on Fuzzy Systems and Knowledge Discovery, 2012
Proceedings of the Computational Visual Media - First International Conference, 2012
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
2011
IEEE Trans. Speech Audio Process., 2011
Combining Active and Semi-Supervised Learning for Homograph Disambiguation in Mandarin Text-to-Speech Synthesis.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the Applied Informatics and Communication - International Conference, 2011
2010
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010
Investigation of the relation between acoustic features and articulation - An application to emotional speech analysis.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 20th International Conference on Pattern Recognition, 2010
Proceedings of the Sixth International Conference on Natural Computation, 2010
Proceedings of the International Conference on Image Processing, 2010
The Intelligent Music Editor: Towards an Automated Platform for Music Analysis and Editing.
Proceedings of the Advanced Intelligent Computing Theories and Applications. With Aspects of Artificial Intelligence, 2010
Facial Expression Synthesis Based on Emotion Dimensions for Affective Talking Avatar.
Proceedings of the Modeling Machine Emotions for Realizing Intelligence, 2010
2009
Modeling the Expressivity of Input Text Semantics for Chinese Text-to-Speech Synthesis in a Spoken Dialog System.
IEEE Trans. Speech Audio Process., 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Automatic Emphasis Labeling for Emotional Speech by Measuring Prosody Generation Error.
Proceedings of the Emerging Intelligent Computing Technology and Applications, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
2008
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Proceedings of the Fourth International Conference on Natural Computation, 2008
2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the Advanced Intelligent Computing Theories and Applications. With Aspects of Theoretical and Methodological Issues, 2007
Proceedings of the Advances in Biometrics, International Conference, 2007
Head Movement Synthesis Based on Semantic and Prosodic Features for a Chinese Expressive Avatar.
Proceedings of the IEEE International Conference on Acoustics, 2007
Script Design Based on Decision Tree with Context Vector and Acoustic Distance for Mandarin TTS.
Proceedings of the IEEE International Conference on Acoustics, 2007
Facial Expression Synthesis Using PAD Emotional Parameters for a Chinese Expressive Avatar.
Proceedings of the Affective Computing and Intelligent Interaction, 2007
Proceedings of the Affective Computing and Intelligent Interaction, 2007
2006
IEEE Trans. Speech Audio Process., 2006
IEICE Trans. Inf. Syst., 2006
Modelling the Global acoustic Correlates of Expressivity for Chinese Text-to-speech Synthesis.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006
Prosodic Boundary Prediction Based on Maximum Entropy Model with Error-Driven Modification.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006
Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006
Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006
Modeling the acoustic correlates of expressive elements in text genres for expressive text-to-speech synthesis.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Real-time synthesis of Chinese visual speech and facial expressions using MPEG-4 FAP features in a three-dimensional avatar.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Computational Intelligence, 2006
Proceedings of the Advances in Biometrics, International Conference, 2006
2005
Proceedings of the Advances in Biometric Person Authentication, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Unsupervised auditory scene categorization via key audio effects and information-theoretic co-clustering.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Grapheme-to-Phoneme Conversion Based on a Fast TBL Algorithm in Mandarin TTS Systems.
Proceedings of the Fuzzy Systems and Knowledge Discovery, Second International Conference, 2005
2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 17th International Conference on Pattern Recognition, 2004
Speech emotion classification with the combination of statistic features and temporal features.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
2003
Approach to the Correlation Discovery of Chinese Linguistic Parameters Based on Bayesian Method.
J. Comput. Sci. Technol., 2003
Proceedings of the Advances in Web-Age Information Management, 2003
Proceedings of the IEEE International Conference on Systems, 2003
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003
2002
Proceedings of The Eleventh Text REtrieval Conference, 2002
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002
Proceedings of the First Workshop on Chinese Language Processing, 2002
2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
1998
Proceedings of the 1998 International Symposium on Chinese Spoken Language Processing, 1998
1987
Proceedings of the IEEE International Conference on Acoustics, 1987