Chung-Hsien Wu
Orcid: 0000-0002-3947-2123Affiliations:
- National Cheng Kung University, Tainan, Taiwan
According to our database1,
Chung-Hsien Wu
authored at least 171 papers
between 1991 and 2023.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2023
Empathetic Response Generation Based on Plug-and-Play Mechanism With Empathy Perturbation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Decomposition and Reorganization of Phonetic Information for Speaker Embedding Learning.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Generalization Ability Improvement of Speaker Representation and Anti-Interference for Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Applying Segment-Level Attention on Bi-Modal Transformer Encoder for Audio-Visual Emotion Recognition.
IEEE Trans. Affect. Comput., 2023
Automatic Bipolar Disorder Assessment Using Machine Learning With Smartphone-Based Digital Phenotyping.
IEEE Access, 2023
Speech Enhancement Using Dynamic Learning in Knowledge Distillation via Reinforcement Learning.
IEEE Access, 2023
Temporal and Type Correlation in Digital Phenotyping for Bipolar Disorder State Prediction Using Multitask Self-Supervised Learning.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
Data Selection Based on Phoneme Affinity Matrix for Electrolarynx Speech Recognition.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
2022
CoRR, 2022
Proceedings of the ISPD 2022: International Symposium on Physical Design, Virtual Event, Canada, March 27, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Applying Emotional Keyphrase Correlation for Diversity Enhancement in Empathetic Dialogue Response Generation.
Proceedings of the International Conference on Asian Language Processing, 2022
2021
Speech Emotion Recognition Considering Nonverbal Vocalization in Affective Conversations.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Exploring Macroscopic and Microscopic Fluctuations of Elicited Facial Expressions for Mood Disorder Classification.
IEEE Trans. Affect. Comput., 2021
Transformer-based Empathetic Response Generation Using Dialogue Situation and Advanced-Level Definition of Empathy.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021
Assessment of Bipolar Disorder Using Heterogeneous Data of Smartphone-Based Digital Phenotyping.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the International Conference on Asian Language Processing, 2021
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
2020
Cell-Coupled Long Short-Term Memory With L-Skip Fusion Mechanism for Mood Disorder Detection Through Elicited Audiovisual Features.
IEEE Trans. Neural Networks Learn. Syst., 2020
Sound Events Recognition and Retrieval Using Multi-Convolutional-Channel Sparse Coding Convolutional Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
A Two-Stage Transformer-Based Approach for Variable-Length Abstractive Summarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Attention-Based Response Generation Using Parallel Double Q-Learning for Dialog Policy Decision in a Conversational System.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Detecting Unipolar and Bipolar Depressive Disorders from Elicited Speech Responses Using Latent Affective Structure Model.
IEEE Trans. Affect. Comput., 2020
Combining Deep Embeddings of Acoustic and Articulatory Features for Speaker Identification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Statistics Pooling Time Delay Neural Network Based on X-Vector for Speaker Verification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
Response Selection and Automatic Message-Response Expansion in Retrieval-Based QA Systems using Semantic Dependency Pair Model.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2019
Attention-based convolutional neural network and long short-term memory for short-term detection of mood disorders based on elicited speech responses.
Pattern Recognit., 2019
Follow-Up Question Generation Using Neural Tensor Network-Based Domain Ontology Population in an Interview Coaching System.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Speech Emotion Recognition Using Deep Neural Network Considering Verbal and Nonverbal Speech Sounds.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Sequential Speaker Embedding and Transfer Learning for Text-Independent Speaker Identification.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
2018
Sound Event Recognition Using Auditory-Receptive-Field Binary Pattern and Hierarchical-Diving Deep Belief Network.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Speech Emotion Recognition using Convolutional Neural Network with Audio Word-based Embedding.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Follow-up Question Generation Using Pattern-based Seq2seq with a Small Corpus for Interview Coaching.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Locality-Preserving Complex-Valued Gaussian Process Latent Variable Model for Robust Face Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
IEEE Trans. Inf. Forensics Secur., 2017
Personalized Spontaneous Speech Synthesis Using a Small-Sized Unsegmented Semispontaneous Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Interaction Style Recognition Based on Multi-Layer Multi-View Profile Representation.
IEEE Trans. Affect. Comput., 2017
Coupled HMM-based multimodal fusion for mood disorder detection through elicited audio-visual signals.
J. Ambient Intell. Humaniz. Comput., 2017
Miscommunication handling in spoken dialog systems based on error-aware dialog state detection.
EURASIP J. Audio Speech Music. Process., 2017
Recognition and retrieval of sound events using sparse coding convolutional neural network.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Fully complex deep neural network for phase-incorporating monaural source separation.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Mood detection from daily conversational speech using denoising autoencoder and LSTM.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Personality trait perception from speech signals using multiresolution analysis and convolutional neural networks.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
2016
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Exploiting Turn-Taking Temporal Evolution for Personality Trait Perception in Dyadic Conversations.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Candidate Expansion and Prosody Adjustment for Natural Speech Synthesis Using a Small Corpus.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Unipolar Depression vs. Bipolar Disorder: An Elicitation-Based Approach to Short-Term Detection of Mood Disorder.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Generation of Emotion Control Vector Using MDS-Based Space Transformation for Expressive Speech Synthesis.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Dialog State Tracking and action selection using deep learning mechanism for interview coaching.
Proceedings of the 2016 International Conference on Asian Language Processing, 2016
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
2015
Code-Switching Event Detection by Using a Latent Language Space Model and the Delta-Bayesian Information Criterion.
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Speech Emotion Verification Using Emotion Variance Modeling and Discriminant Scale-Frequency Maps.
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Model Generation of Accented Speech using Model Transformation and Verification for Bilingual Speech Recognition.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2015
Soft Comput., 2015
Affective structure modeling of speech using probabilistic context free grammar for emotion recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Hierarchical modeling of temporal course in emotional expression for speech emotion recognition.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
Emotion recognition of affective speech based on multiple classifiers using acoustic-prosodic information and semantic labels (Extended abstract).
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
2014
Chinese-English Phone Set Construction for Code-Switching ASR Using Acoustic and DNN-Extracted Articulatory Features.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Synthesis of Spontaneous Speech With Syllable Contraction Using State-Based Context-Dependent Voice Transformation.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Polyglot Speech Synthesis Based on Cross-Lingual Frame Selection Using Auditory and Articulatory Features.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Interlocutor personality perception based on BFI profiles and coupled HMMs in a dyadic conversation.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Natural speech synthesis based on hybrid approach with candidate expansion and verification.
Proceedings of the IEEE International Conference on Acoustics, 2014
Emotion recognition of conversational affective speech using temporal course modeling-based error weighted cross-correlation model.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
2013
Speaking Effect Removal on Emotion Recognition From Facial Expressions Based on Eigenface Conversion.
IEEE Trans. Multim., 2013
Two-Level Hierarchical Alignment for Semi-Coupled HMM-Based Audiovisual Emotion Recognition With Temporal Course.
IEEE Trans. Multim., 2013
Personalized Spectral and Prosody Conversion Using Frame-Based Codeword Distribution and Adaptive CRF.
IEEE Trans. Speech Audio Process., 2013
Multim. Syst., 2013
Emotion recognition of conversational affective speech using temporal course modeling.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Facial action unit prediction under partial occlusion based on Error Weighted Cross-Correlation Model.
Proceedings of the IEEE International Conference on Acoustics, 2013
Personalized natural speech synthesis based on retrieval of pitch patterns using hierarchical Fujisaki model.
Proceedings of the IEEE International Conference on Acoustics, 2013
Automatic pronunciation clustering using a World English archive and pronunciation structure analysis.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
2012
ACM Trans. Storage, 2012
Error Weighted Semi-Coupled Hidden Markov Model for Audio-Visual Emotion Recognition.
IEEE Trans. Multim., 2012
Error Diagnosis of Chinese Sentences Using Inductive Learning Algorithm and Decomposition-Based Testing Mechanism.
ACM Trans. Asian Lang. Inf. Process., 2012
Robust dialogue act detection based on partial sentence tree, derivation rule, and spectral clustering algorithm.
EURASIP J. Audio Speech Music. Process., 2012
Alternative hypothesis generation using a weighted kernel feature matrix for ASR substitution error correction.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Hierarchical prosodic pattern selection based on Fujisaki model for natural mandarin speech synthesis.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Phone set construction based on context-sensitive articulatory attributes for code-switching speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
2011
Unsupervised Alignment of News Video and Text Using Visual Patterns and Textual Concepts.
IEEE Trans. Multim., 2011
Speaker Clustering Using Decision Tree-Based Phone Cluster Models With Multi-Space Probability Distributions.
IEEE Trans. Speech Audio Process., 2011
Articulation-Disordered Speech Recognition Using Speaker-Adaptive Acoustic Models and Personalized Articulation Patterns.
ACM Trans. Asian Lang. Inf. Process., 2011
Interruption Point Detection of Spontaneous Speech Using Inter-Syllable Boundary-Based Prosodic Features.
ACM Trans. Asian Lang. Inf. Process., 2011
Emotion Recognition of Affective Speech Based on Multiple Classifiers Using Acoustic-Prosodic Information and Semantic Labels.
IEEE Trans. Affect. Comput., 2011
Proceedings of the Affective Computing and Intelligent Interaction, 2011
Semi-Coupled Hidden Markov Model with State-Based Alignment Strategy for Audio-Visual Emotion Recognition.
Proceedings of the Affective Computing and Intelligent Interaction, 2011
2010
Sentence Correction Incorporating Relative Position and Parse Template Language Models.
IEEE Trans. Speech Audio Process., 2010
Hierarchical Prosody Conversion Using Regression-Based Clustering for Emotional Speech Synthesis.
IEEE Trans. Speech Audio Process., 2010
IEEE Trans. Speech Audio Process., 2010
Exploiting Prosody Hierarchy and Dynamic Features for Pitch Modeling and Generation in HMM-Based Speech Synthesis.
IEEE Trans. Speech Audio Process., 2010
Proceedings of the 16th IEEE International Conference on Embedded and Real-Time Computing Systems and Applications, 2010
Sentence Decomplexification using holistic aspect-based clause detection for long sentence understanding.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010
Error diagnosis using penalized probabilistic FOIL for Chinese as a Second Language learner.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2010), May 30, 2010
Pronunciation variation generation for spontaneous speech synthesis using state-based voice transformation.
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
IEEE Trans. Speech Audio Process., 2009
Story Segmentation and Topic Classification of Broadcast News via a Topic-Based Segmental Model and a Genetic Algorithm.
IEEE Trans. Speech Audio Process., 2009
Improving Structural Statistical Machine Translation for Sign Language With Small Corpus Using Thematic Role Templates as Translation Memory.
IEEE Trans. Speech Audio Process., 2009
Introduction to the Special Issue on Recent Advances in Asian Language Spoken Document Retrieval.
ACM Trans. Asian Lang. Inf. Process., 2009
IEEE Signal Process. Lett., 2009
Extraction of Query Term-related Visual Phrases for News Video Retrieval using Mutual Information.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2009), 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
An Articulation Training System with Intelligent Interface and Multimode Feedbacks to Articulation Disorders.
Proceedings of the 2009 International Conference on Asian Language Processing, 2009
Proceedings of the Affective Information Processing, 2009
2008
Extended probabilistic HAL with close temporal association for psychiatric query document retrieval.
ACM Trans. Inf. Syst., 2008
HAL-Based Evolutionary Inference for Pattern Induction From Psychiatry Web Resources.
IEEE Trans. Evol. Comput., 2008
Stochastic vector mapping-based feature enhancement using prior-models and model adaptation for noisy speech recognition.
Speech Commun., 2008
Ontology-based speech act identification in a bilingual dialog system using partial pattern trees.
J. Assoc. Inf. Sci. Technol., 2008
Video News Retrieval Incorporating Relevant Terms Based on Distribution of Document Frequency.
Proceedings of the Advances in Multimedia Information Processing, 2008
Word Order Correction for Language Transfer Using Relative Position Language Modeling.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Recognition of Syllable-Contracted Words in Spontaneous Speech Using Word Expansion and Duration Information.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Interruption point detection of spontaneous speech using prior knowledge and multiple features.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008
Unsupervised pronunciation grammar growing using knowledge-based and data-driven approaches.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008
Automatic assessment of articulation disorders using confident unit-based model adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2008
2007
IEEE Trans. Multim., 2007
Psychiatric Consultation Record Retrieval Using Scenario-Based Representation and Multilevel Mixture Model.
IEEE Trans. Inf. Technol. Biomed., 2007
Generation of Phonetic Units for Mixed-Language Speech Recognition Based on Acoustic and Contextual Analysis.
IEEE Trans. Computers, 2007
Conversion Function Clustering and Selection Using Linguistic and Spectral Information for Emotional Voice Conversion.
IEEE Trans. Computers, 2007
IEEE Trans. Speech Audio Process., 2007
IEEE Trans. Speech Audio Process., 2007
ACM Trans. Asian Lang. Inf. Process., 2007
Joint Optimization of Word Alignment and Epenthesis Generation for Chinese to Taiwanese Sign Synthesis.
IEEE Trans. Pattern Anal. Mach. Intell., 2007
Proceedings of the Ninth IEEE International Symposium on Multimedia, 2007
Phone Set Generation Based on Acoustic and Contextual Analysis for Multilingual Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
2006
IEEE Trans. Knowl. Data Eng., 2006
IEEE Trans. Circuits Syst. Video Technol., 2006
Edit disfluency detection and correction using a cleanup language model and an alignment model.
IEEE Trans. Speech Audio Process., 2006
IEEE Trans. Speech Audio Process., 2006
Multiple change-point audio segmentation and classification using an MDL-based Gaussian model.
IEEE Trans. Speech Audio Process., 2006
Automatic segmentation and identification of mixed-language speech using delta-BIC and LSA-based GMMs.
IEEE Trans. Speech Audio Process., 2006
ACM Trans. Asian Lang. Inf. Process., 2006
Contextual Maximum Entropy Model for Edit Disfluency Detection of Spontaneous Speech.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006
2005
Speech act modeling and verification of spontaneous speech with disfluency in a spoken dialogue system.
IEEE Trans. Speech Audio Process., 2005
ACM Trans. Asian Lang. Inf. Process., 2005
IEEE Intell. Syst., 2005
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005
Proceedings of the Affective Computing and Intelligent Interaction, 2005
Proceedings of the Affective Computing and Intelligent Interaction, 2005
Proceedings of the Affective Computing and Intelligent Interaction, 2005
Proceedings of the Affective Computing and Intelligent Interaction, 2005
2004
Acoustic Feature Analysis and Discriminative Modeling of Filled Pauses for Spontaneous Speech Recognition.
J. VLSI Signal Process., 2004
Recovery from false rejection using statistical partial pattern trees for sentence verification.
Speech Commun., 2004
Error-Tolerant Sign Retrieval Using Visual Features and Maximum A Posteriori Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., 2004
2002
Meaningful term extraction and discriminative term selection in text categorization via unknown-word methodology.
ACM Trans. Asian Lang. Inf. Process., 2002
Speech act modeling in a spoken dialog system using a fuzzy fragment-class Markov model.
Speech Commun., 2002
Generation of robust phonetic set and decision tree for Mandarin using chi-square testing.
Speech Commun., 2002
Text-to-Visual Speech Synthesis for General Objects Using Parameter-Based Lip Models.
Proceedings of the Advances in Multimedia Information Processing, 2002
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002
2001
Automatic generation of synthesis units and prosodic information for Chinese concatenative synthesis.
Speech Commun., 2001
Multi-keyword spotting of telephone speech using a fuzzy search algorithm and keyword-driven two-level CBSM.
Speech Commun., 2001
Proceedings of the Advances in Multimedia Information Processing, 2001
1997
A novel two-level method for the computation of the LSP frequencies using a decimation-in-degree algorithm.
IEEE Trans. Speech Audio Process., 1997
1991
A hierarchical neural network model based on a C/V segmentation algorithm for isolated Mandarin speech recognition.
IEEE Trans. Signal Process., 1991
A shunting multilayer perceptron network for confusing/composite pattern recognition.
Pattern Recognit., 1991
Speaker-Independent Recognition of isolated Words using concatenated Neural Networks.
Int. J. Pattern Recognit. Artif. Intell., 1991