Chung-Hsien Wu

Orcid: 0000-0002-3947-2123

Affiliations:
  • National Cheng Kung University, Tainan, Taiwan


According to our database1, Chung-Hsien Wu authored at least 270 papers between 1990 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Development of a Taiwanese Speech Synthesis System Using Hidden Markov Models and a Robust Tonal Phoneme Corpus.
J. Inf. Sci. Eng., March, 2024

Digital Phenotyping-Based Bipolar Disorder Assessment Using Multiple Correlation Data Imputation and Lasso-MLP.
IEEE Trans. Affect. Comput., 2024

Dynamic Sampling-Based Meta-Learning Using Multilingual Acoustic Data for Under-Resourced Speech Recognition.
IEEE Access, 2024

2023
Empathetic Response Generation Based on Plug-and-Play Mechanism With Empathy Perturbation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Decomposition and Reorganization of Phonetic Information for Speaker Embedding Learning.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Generalization Ability Improvement of Speaker Representation and Anti-Interference for Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Applying Segment-Level Attention on Bi-Modal Transformer Encoder for Audio-Visual Emotion Recognition.
IEEE Trans. Affect. Comput., 2023

Automatic Bipolar Disorder Assessment Using Machine Learning With Smartphone-Based Digital Phenotyping.
IEEE Access, 2023

Speech Enhancement Using Dynamic Learning in Knowledge Distillation via Reinforcement Learning.
IEEE Access, 2023

Speech Emotion Recognition using Decomposed Speech via Multi-task Learning.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Temporal and Type Correlation in Digital Phenotyping for Bipolar Disorder State Prediction Using Multitask Self-Supervised Learning.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Data Selection Based on Phoneme Affinity Matrix for Electrolarynx Speech Recognition.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
CREER: A Large-Scale Corpus for Relation Extraction and Entity Recognition.
CoRR, 2022

Linear-time Mixed-Cell-Height Legalization for Minimizing Maximum Displacement.
Proceedings of the ISPD 2022: International Symposium on Physical Design, Virtual Event, Canada, March 27, 2022

Memory-Efficient Multi-Step Speech Enhancement with Neural ODE.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Applying Emotional Keyphrase Correlation for Diversity Enhancement in Empathetic Dialogue Response Generation.
Proceedings of the International Conference on Asian Language Processing, 2022

2021
Speech Emotion Recognition Considering Nonverbal Vocalization in Affective Conversations.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Exploring Macroscopic and Microscopic Fluctuations of Elicited Facial Expressions for Mood Disorder Classification.
IEEE Trans. Affect. Comput., 2021

Transformer-based Empathetic Response Generation Using Dialogue Situation and Advanced-Level Definition of Empathy.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Assessment of Bipolar Disorder Using Heterogeneous Data of Smartphone-Based Digital Phenotyping.
Proceedings of the IEEE International Conference on Acoustics, 2021

Latent Attribute Control for Story Generation.
Proceedings of the International Conference on Asian Language Processing, 2021

Ensemble of One Model: Creating Model Variations for Transformer with Layer Permutation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Task-Aware BERT-based Sentiment Analysis from Multiple Essences of the Text.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Improvement of Spatial Ambiguity in Multi-Channel Speech Separation Using Channel Attention.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Speech Enhancement Based on Masking Approach Considering Speech Quality and Acoustic Confidence for Noisy Speech Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Cell-Coupled Long Short-Term Memory With L-Skip Fusion Mechanism for Mood Disorder Detection Through Elicited Audiovisual Features.
IEEE Trans. Neural Networks Learn. Syst., 2020

Sound Events Recognition and Retrieval Using Multi-Convolutional-Channel Sparse Coding Convolutional Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

A Two-Stage Transformer-Based Approach for Variable-Length Abstractive Summarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Attention-Based Response Generation Using Parallel Double Q-Learning for Dialog Policy Decision in a Conversational System.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Detecting Unipolar and Bipolar Depressive Disorders from Elicited Speech Responses Using Latent Affective Structure Model.
IEEE Trans. Affect. Comput., 2020

Combining Deep Embeddings of Acoustic and Articulatory Features for Speaker Identification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Statistics Pooling Time Delay Neural Network Based on X-Vector for Speaker Verification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Natural Language Processing Methods for Detection of Influenza-Like Illness from Chief Complaints.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Attentively-Coupled Long Short-Term Memory for Audio-Visual Emotion Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Acoustic and Textual Data Augmentation for Code-Switching Speech Recognition in Under-Resourced Language.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Response Selection and Automatic Message-Response Expansion in Retrieval-Based QA Systems using Semantic Dependency Pair Model.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2019

Attention-based convolutional neural network and long short-term memory for short-term detection of mood disorders based on elicited speech responses.
Pattern Recognit., 2019

Follow-Up Question Generation Using Neural Tensor Network-Based Domain Ontology Population in an Interview Coaching System.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Speech Emotion Recognition Using Deep Neural Network Considering Verbal and Nonverbal Speech Sounds.
Proceedings of the IEEE International Conference on Acoustics, 2019

Automatic Ontology Population Using Deep Learning for Triple Extraction.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Sequential Speaker Embedding and Transfer Learning for Text-Independent Speaker Identification.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Sound Event Recognition Using Auditory-Receptive-Field Binary Pattern and Hierarchical-Diving Deep Belief Network.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Speech Emotion Recognition using Convolutional Neural Network with Audio Word-based Embedding.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Follow-up Question Generation Using Pattern-based Seq2seq with a Small Corpus for Interview Coaching.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Attention-Based Dialog State Tracking for Conversational Interview Coaching.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Locality-Preserving Complex-Valued Gaussian Process Latent Variable Model for Robust Face Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Speaker Identification Using Discriminative Features and Sparse Representation.
IEEE Trans. Inf. Forensics Secur., 2017

Personalized Spontaneous Speech Synthesis Using a Small-Sized Unsegmented Semispontaneous Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Interaction Style Recognition Based on Multi-Layer Multi-View Profile Representation.
IEEE Trans. Affect. Comput., 2017

Coupled HMM-based multimodal fusion for mood disorder detection through elicited audio-visual signals.
J. Ambient Intell. Humaniz. Comput., 2017

Miscommunication handling in spoken dialog systems based on error-aware dialog state detection.
EURASIP J. Audio Speech Music. Process., 2017

Recognition and retrieval of sound events using sparse coding convolutional neural network.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Speech emotion recognition with ensemble learning methods.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Fully complex deep neural network for phase-incorporating monaural source separation.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Mood detection from daily conversational speech using denoising autoencoder and LSTM.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Personality trait perception from speech signals using multiresolution analysis and convolutional neural networks.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Mood disorder identification using deep bottleneck features of elicited speech.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Compressive Sensing-Based Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Exploiting Turn-Taking Temporal Evolution for Personality Trait Perception in Dyadic Conversations.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Improving Mandarin Prosody Generation Using Alternative Smoothing Techniques.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Candidate Expansion and Prosody Adjustment for Natural Speech Synthesis Using a Small Corpus.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Detection of mood disorder using speech emotion profiles and LSTM.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Dialog state tracking for interview coaching using two-level LSTM.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Unipolar Depression vs. Bipolar Disorder: An Elicitation-Based Approach to Short-Term Detection of Mood Disorder.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Generation of Emotion Control Vector Using MDS-Based Space Transformation for Expressive Speech Synthesis.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Dialog State Tracking and action selection using deep learning mechanism for interview coaching.
Proceedings of the 2016 International Conference on Asian Language Processing, 2016

Speech emotion classification using multiple kernel Gaussian process.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015
Code-Switching Event Detection by Using a Latent Language Space Model and the Delta-Bayesian Information Criterion.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Speech Emotion Verification Using Emotion Variance Modeling and Discriminant Scale-Frequency Maps.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Model Generation of Accented Speech using Model Transformation and Verification for Bilingual Speech Recognition.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2015

Sentence extraction with topic modeling for question-answer pair generation.
Soft Comput., 2015

Fluent personalized speech synthesis with prosodic word-level spontaneous speech generation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Affective structure modeling of speech using probabilistic context free grammar for emotion recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Hierarchical modeling of temporal course in emotional expression for speech emotion recognition.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Emotion recognition of affective speech based on multiple classifiers using acoustic-prosodic information and semantic labels (Extended abstract).
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

2014
Chinese-English Phone Set Construction for Code-Switching ASR Using Acoustic and DNN-Extracted Articulatory Features.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Synthesis of Spontaneous Speech With Syllable Contraction Using State-Based Context-Dependent Voice Transformation.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Exploiting Psychological Factors for Interaction Style Recognition in Spoken Conversation.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Polyglot Speech Synthesis Based on Cross-Lingual Frame Selection Using Auditory and Articulatory Features.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Interlocutor personality perception based on BFI profiles and coupled HMMs in a dyadic conversation.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Natural speech synthesis based on hybrid approach with candidate expansion and verification.
Proceedings of the IEEE International Conference on Acoustics, 2014

Emotion recognition of conversational affective speech using temporal course modeling-based error weighted cross-correlation model.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Emoticon recommendation in microblog using affective trajectory model.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
Speaking Effect Removal on Emotion Recognition From Facial Expressions Based on Eigenface Conversion.
IEEE Trans. Multim., 2013

Two-Level Hierarchical Alignment for Semi-Coupled HMM-Based Audiovisual Emotion Recognition With Temporal Course.
IEEE Trans. Multim., 2013

Personalized Spectral and Prosody Conversion Using Frame-Based Codeword Distribution and Adaptive CRF.
IEEE Trans. Speech Audio Process., 2013

Multiple visual concept discovery using concept-based visual word clustering.
Multim. Syst., 2013

HMM-based Mandarin Singing Voice Synthesis Using Tailored Synthesis Units and Question Sets.
Int. J. Comput. Linguistics Chin. Lang. Process., 2013

Residual compensation based on articulatory feature-based phone clustering for hybrid Mandarin speech synthesis.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Speaker-based accented English clustering using a world English archive.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2013

合成單元與問題集之定義於隱藏式馬可夫模型中文歌聲合成系統之建立 (Synthesis Unit and Question Set Definition for Mandarin HMM-based Singing Voice Synthesis).
Proceedings of the 25th Conference on Computational Linguistics and Speech Processing, 2013

Emotion recognition of conversational affective speech using temporal course modeling.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Code-Switching event detection based on delta-BIC using phonetic eigenvoice models.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Interaction style detection based on Fused Cross-Correlation Model in spoken conversation.
Proceedings of the IEEE International Conference on Acoustics, 2013

Facial action unit prediction under partial occlusion based on Error Weighted Cross-Correlation Model.
Proceedings of the IEEE International Conference on Acoustics, 2013

Personalized natural speech synthesis based on retrieval of pitch patterns using hierarchical Fujisaki model.
Proceedings of the IEEE International Conference on Acoustics, 2013

Automatic pronunciation clustering using a World English archive and pronunciation structure analysis.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Emotion recognition from multi-modal information.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

BFI-based speaker personality perception using acoustic-prosodic features.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Affective-cognitive dialogue act detection in an error-aware spoken dialogue system.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Candidate Scoring Using Web-Based Measure for Chinese Spelling Error Correction.
Proceedings of the Seventh SIGHAN Workshop on Chinese Language Processing, 2013

2012
MFTL: A Design and Implementation for MLC Flash Memory Storage Systems.
ACM Trans. Storage, 2012

Error Weighted Semi-Coupled Hidden Markov Model for Audio-Visual Emotion Recognition.
IEEE Trans. Multim., 2012

Error Diagnosis of Chinese Sentences Using Inductive Learning Algorithm and Decomposition-Based Testing Mechanism.
ACM Trans. Asian Lang. Inf. Process., 2012

Robust dialogue act detection based on partial sentence tree, derivation rule, and spectral clustering algorithm.
EURASIP J. Audio Speech Music. Process., 2012

Alternative hypothesis generation using a weighted kernel feature matrix for ASR substitution error correction.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Hierarchical prosodic pattern selection based on Fujisaki model for natural mandarin speech synthesis.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Phone set construction based on context-sensitive articulatory attributes for code-switching speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Cross-lingual frame selection method for polyglot speech synthesis.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Unsupervised Alignment of News Video and Text Using Visual Patterns and Textual Concepts.
IEEE Trans. Multim., 2011

Speaker Clustering Using Decision Tree-Based Phone Cluster Models With Multi-Space Probability Distributions.
IEEE Trans. Speech Audio Process., 2011

Articulation-Disordered Speech Recognition Using Speaker-Adaptive Acoustic Models and Personalized Articulation Patterns.
ACM Trans. Asian Lang. Inf. Process., 2011

Interruption Point Detection of Spontaneous Speech Using Inter-Syllable Boundary-Based Prosodic Features.
ACM Trans. Asian Lang. Inf. Process., 2011

Emotion Recognition of Affective Speech Based on Multiple Classifiers Using Acoustic-Prosodic Information and Semantic Labels.
IEEE Trans. Affect. Comput., 2011

Candidate Generation for ASR Output Error Correction Using a Context-Dependent Syllable Cluster-Based Confusion Matrix.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Interactional Style Detection for Versatile Dialogue Response Using Prosodic and Semantic Features.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

An Efficient Pre-Processing Scheme to Improve the Sound Source Localization System in Noisy Environment.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Speech Indexing Using Semantic Context Inference.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Emotion Detection Based on Concept Inference and Spoken Sentence Analysis for Customer Service.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Semantic Information and Derivation Rules for Robust Dialogue Act Detection in a Spoken Dialogue System.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

A Regression Approach to Affective Rating of Chinese Words from ANEW.
Proceedings of the Affective Computing and Intelligent Interaction, 2011

Semi-Coupled Hidden Markov Model with State-Based Alignment Strategy for Audio-Visual Emotion Recognition.
Proceedings of the Affective Computing and Intelligent Interaction, 2011

2010
Sentence Correction Incorporating Relative Position and Parse Template Language Models.
IEEE Trans. Speech Audio Process., 2010

Hierarchical Prosody Conversion Using Regression-Based Clustering for Emotional Speech Synthesis.
IEEE Trans. Speech Audio Process., 2010

Introduction to the Special Section on Voice Transformation.
IEEE Trans. Speech Audio Process., 2010

Exploiting Prosody Hierarchy and Dynamic Features for Pitch Modeling and Generation in HMM-Based Speech Synthesis.
IEEE Trans. Speech Audio Process., 2010

Annotation and verification of sense pools in OntoNotes.
Inf. Process. Manag., 2010

Word Sense Disambiguation Using Multiple Contextual Features.
Int. J. Comput. Linguistics Chin. Lang. Process., 2010

Scalable Audio-Content Analysis.
EURASIP J. Audio Speech Music. Process., 2010

Voice conversion using precise speech alignment based on spectral property and eigen-codeword distribution.
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

Design and Implementation for Multi-level Cell Flash Memory Storage Systems.
Proceedings of the 16th IEEE International Conference on Embedded and Real-Time Computing Systems and Applications, 2010

發音事件驗證於多語辨識發音變異模型之產生 (Pronunciation Variation Model Generation based on Pronunciation Event Verification for Multi-Lingual Speech Recognition) [In Chinese].
Proceedings of the 22th Conference on Computational Linguistics and Speech Processing, 2010

Sentence Decomplexification using holistic aspect-based clause detection for long sentence understanding.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Error diagnosis using penalized probabilistic FOIL for Chinese as a Second Language learner.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Extraction of robust visual phrases using graph mining for image retrieval.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2010), May 30, 2010

Prosodic word-based error correction in speech recognition using prosodic word expansion and contextual information.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Dialogue act detection in error-prone spoken dialogue systems using partial sentence tree and latent dialogue act matrix.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Pronunciation variation generation for spontaneous speech synthesis using state-based voice transformation.
Proceedings of the IEEE International Conference on Acoustics, 2010

Discriminative Training for Near-Synonym Substitution.
Proceedings of the COLING 2010, 2010

2009
Idiolect Extraction and Generation for Personalized Speaking Style Modeling.
IEEE Trans. Speech Audio Process., 2009

Story Segmentation and Topic Classification of Broadcast News via a Topic-Based Segmental Model and a Genetic Algorithm.
IEEE Trans. Speech Audio Process., 2009

Improving Structural Statistical Machine Translation for Sign Language With Small Corpus Using Thematic Role Templates as Translation Memory.
IEEE Trans. Speech Audio Process., 2009

Introduction to the Special Issue on Recent Advances in Asian Language Spoken Document Retrieval.
ACM Trans. Asian Lang. Inf. Process., 2009

Speech-Annotated Photo Retrieval Using Syllable-Transformed Patterns.
IEEE Signal Process. Lett., 2009

Psychiatric document retrieval using a discourse-aware model.
Artif. Intell., 2009

應用句型結構與部份樣本樹於對話行為之偵測 (Dialogue Act Detection Using Sentence Structure and Partial Pattern Trees) [In Chinese].
Proceedings of the 21st Conference on Computational Linguistics and Speech Processing, 2009

Extraction of Query Term-related Visual Phrases for News Video Retrieval using Mutual Information.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2009), 2009

Semantic role labeling with discriminative feature selection for spoken language understanding.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Regression-based clustering for hierarchical pitch conversion.
Proceedings of the IEEE International Conference on Acoustics, 2009

An Articulation Training System with Intelligent Interface and Multimode Feedbacks to Articulation Disorders.
Proceedings of the 2009 International Conference on Asian Language Processing, 2009

Mining Association Language Patterns for Negative Life Event Classification.
Proceedings of the ACL 2009, 2009

Emotion Perception and Recognition from Speech.
Proceedings of the Affective Information Processing, 2009

2008
Design and Implementation of Subspace-Based Speech Enhancement Under In-Car Noisy Environments.
IEEE Trans. Veh. Technol., 2008

Extended probabilistic HAL with close temporal association for psychiatric query document retrieval.
ACM Trans. Inf. Syst., 2008

HAL-Based Evolutionary Inference for Pattern Induction From Psychiatry Web Resources.
IEEE Trans. Evol. Comput., 2008

Interactive multimedia mirror system design.
IEEE Trans. Consumer Electron., 2008

Stochastic vector mapping-based feature enhancement using prior-models and model adaptation for noisy speech recognition.
Speech Commun., 2008

Ontology-based speech act identification in a bilingual dialog system using partial pattern trees.
J. Assoc. Inf. Sci. Technol., 2008

Corpus Cleanup of Mistaken Agreement Using Word Sense Disambiguation.
Int. J. Comput. Linguistics Chin. Lang. Process., 2008

Propositional Term Extraction over Short Text using Word Cohesiveness and Conditional Random Fields with Multi-Level Features.
Proceedings of the 20th Conference on Computational Linguistics and Speech Processing, 2008

Video News Retrieval Incorporating Relevant Terms Based on Distribution of Document Frequency.
Proceedings of the Advances in Multimedia Information Processing, 2008

Word Order Correction for Language Transfer Using Relative Position Language Modeling.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Recognition of Syllable-Contracted Words in Spontaneous Speech Using Word Expansion and Duration Information.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Robust speaker verification using short-time frequency with long-time window and fusion of multi-resolutions.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Adaptive decision tree-based phone cluster models for speaker clustering.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Interruption point detection of spontaneous speech using prior knowledge and multiple features.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Unsupervised pronunciation grammar growing using knowledge-based and data-driven approaches.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Automatic assessment of articulation disorders using confident unit-based model adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2008

OntoNotes: Corpus Cleanup of Mistaken Agreement Using Word Sense Disambiguation.
Proceedings of the COLING 2008, 2008

2007
Speech Sentence Compression Based on Speech Segment Extraction and Concatenation.
IEEE Trans. Multim., 2007

Psychiatric Consultation Record Retrieval Using Scenario-Based Representation and Multilevel Mixture Model.
IEEE Trans. Inf. Technol. Biomed., 2007

Generation of Phonetic Units for Mixed-Language Speech Recognition Based on Acoustic and Contextual Analysis.
IEEE Trans. Computers, 2007

Conversion Function Clustering and Selection Using Linguistic and Spectral Information for Emotional Voice Conversion.
IEEE Trans. Computers, 2007

Variable-Length Unit Selection in TTS Using Structural Syntactic Cost.
IEEE Trans. Speech Audio Process., 2007

Spoken Document Retrieval Using Multilevel Knowledge and Semantic Verification.
IEEE Trans. Speech Audio Process., 2007

Transfer-based statistical translation of Taiwanese sign language using PCFG.
ACM Trans. Asian Lang. Inf. Process., 2007

Joint Optimization of Word Alignment and Epenthesis Generation for Chinese to Taiwanese Sign Synthesis.
IEEE Trans. Pattern Anal. Mach. Intell., 2007

Emotion Recognition from Speech Using IG-Based Feature Compensation.
Int. J. Comput. Linguistics Chin. Lang. Process., 2007

Magic Mirror.
Proceedings of the Ninth IEEE International Symposium on Multimedia, 2007

Disfluency correction of spontaneous speech using conditional random fields with variable-length features.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Phone Set Generation Based on Acoustic and Contextual Analysis for Multilingual Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2007

Conversion Function Clustering and Selection for Expressive Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2007

Topic Analysis for Psychiatric Document Retrieval.
Proceedings of the ACL 2007, 2007

2006
Semantic Segment Extraction and Matching for Internet FAQ Retrieval.
IEEE Trans. Knowl. Data Eng., 2006

Movement Epenthesis Generation Using NURBS-Based Spatial Interpolation.
IEEE Trans. Circuits Syst. Video Technol., 2006

Edit disfluency detection and correction using a cleanup language model and an alignment model.
IEEE Trans. Speech Audio Process., 2006

Voice conversion using duration-embedded bi-HMMs for expressive speech synthesis.
IEEE Trans. Speech Audio Process., 2006

Multiple change-point audio segmentation and classification using an MDL-based Gaussian model.
IEEE Trans. Speech Audio Process., 2006

Automatic segmentation and identification of mixed-language speech using delta-BIC and LSA-based GMMs.
IEEE Trans. Speech Audio Process., 2006

Emotion recognition from text using semantic labels and separable mixture models.
ACM Trans. Asian Lang. Inf. Process., 2006

Multiband Subspace Tracking Speech Enhancement for In-Car Human Computer Speech Interaction.
J. Inf. Sci. Eng., 2006

MAP-Based Perceptual Modeling for Noisy Speech Recognition.
J. Inf. Sci. Eng., 2006

應用不定長度特徵之條件隨機域於口語不流暢語流修正 (Disfluency Correction of Spontaneous Speech using Conditional Random Fields with Variable Length Features) [In Chinese].
Proceedings of the 18th Conference on Computational Linguistics and Speech Processing, 2006

利用聲學與文脈分析於多語語音辨識單元之產生 (Generation of Phonetic Units for Multilingual Speech Recognition Based on Acoustic and Contextual Analysis) [In Chinese].
Proceedings of the 18th Conference on Computational Linguistics and Speech Processing, 2006

Contextual Maximum Entropy Model for Edit Disfluency Detection of Spontaneous Speech.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Robust Speech-Annotated Photo Retrieval Using Syllable-Transformed Patterns.
Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006

Map-based adaptation for speech conversion using adaptation data selection and non-parallel training.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Stochastic vector mapping-based feature enhancement using prior model and environment adaptation for noisy speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

HAL-Based Cascaded Model for Variable-Length Semantic Pattern Induction from Psychiatry Web Resources.
Proceedings of the ACL 2006, 2006

Stochastic Discourse Modeling in Spoken Dialogue Systems Using Semantic Dependency Graphs.
Proceedings of the ACL 2006, 2006

2005
Speech act modeling and verification of spontaneous speech with disfluency in a spoken dialogue system.
IEEE Trans. Speech Audio Process., 2005

Domain-specific FAQ retrieval using independent aspects.
ACM Trans. Asian Lang. Inf. Process., 2005

Automated Alignment and Extraction of a Bilingual Ontology for Cross-Language Domain-Specific Applications.
Int. J. Comput. Linguistics Chin. Lang. Process., 2005

TAICAR-The Collection and Annotation of an In-Car Speech Database Created in Taiwan.
Int. J. Comput. Linguistics Chin. Lang. Process., 2005

Using Semantic Dependencies to Mine Depressive Symptoms from Consultation Records.
IEEE Intell. Syst., 2005

應用錯誤型態分析於英語發音輔助學習 (English pronunciation assisted learning using error type analysis) [In Chinese].
Proceedings of the 17th Conference on Computational Linguistics and Speech Processing, 2005

Audio-video summarization of TV news using speech recognition and shot change detection.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Duration-embedded bi-HMM for expressive voice conversion.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Spoken document summarization using acoustic, prosodic and semantic information.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Facial Phoneme Extraction for Taiwanese Sign Language Recognition.
Proceedings of the Affective Computing and Intelligent Interaction, 2005

Vision-Based Recognition of Hand Shapes in Taiwanese Sign Language.
Proceedings of the Affective Computing and Intelligent Interaction, 2005

IG-Based Feature Extraction and Compensation for Emotion Recognition from Speech.
Proceedings of the Affective Computing and Intelligent Interaction, 2005

Hand Motion Recognition for the Vision-based Taiwanese Sign Language Interpretation.
Proceedings of the Affective Computing and Intelligent Interaction, 2005

2004
Acoustic Feature Analysis and Discriminative Modeling of Filled Pauses for Spontaneous Speech Recognition.
J. VLSI Signal Process., 2004

Recovery from false rejection using statistical partial pattern trees for sentence verification.
Speech Commun., 2004

Error-Tolerant Sign Retrieval Using Visual Features and Maximum A Posteriori Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., 2004

應用語料庫和語意相依法則於中文語音文件之摘要 (Spoken Document Summarization Using Topic-Related Corpus and Semantic Dependency Grammar) [In Chinese].
Proceedings of the 16th Conference on Computational Linguistics and Speech Processing, 2004

應用機率式句法結構與隱含式語意索引於情緒語音合成之單元選取 (Unit Selection for Corpus-Based Emotional Speech Synthesis Using PCFG and LSI) [In Chinese].
Proceedings of the 16th Conference on Computational Linguistics and Speech Processing, 2004

Unsupervised speaker segmentation of broadcast news using MDL-based Gaussian model.
Proceedings of the Odyssey 2004: The Speaker and Language Recognition Workshop, Toledo, Spain, May 31, 2004

Variable-length unit selection using LSA-based syntactic structure cost.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Spoken document summarization using topic-related corpus and semantic dependency grammar.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Speech act identification using an ontology-based partial pattern tree.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Emotion recognition using acoustic features and textual content.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Language boundary detection and identification of mixed-language speech based on MAP estimation.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Automated Alignment and Extraction of Bilingual Domain Ontology for Cross-Language Domain-Specific Applications.
Proceedings of the COLING 2004, 2004

Automatic Alignment and Extraction of Bilingual Domain Ontology for Medical Domain Web Search.
Proceedings of the Third Workshop on Chinese Language Processing, 2004

2003
以知識概念模型為基礎之多主題對話管理系統 (Ontology-Based Dialog Management for Multiple Service Integration) [In Chinese].
Proceedings of the 15th Conference on Computational Linguistics and Speech Processing, 2003

Flexible speech act identification of spontaneous speech with disfluency.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Meaningful term extraction and discriminative term selection in text categorization via unknown-word methodology.
ACM Trans. Asian Lang. Inf. Process., 2002

Speech act modeling in a spoken dialog system using a fuzzy fragment-class Markov model.
Speech Commun., 2002

Generation of robust phonetic set and decision tree for Mandarin using chi-square testing.
Speech Commun., 2002

VLSI Architecture and Implementation for Speech Recognizer Based on Discriminative Bayesian Neural Network.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2002

Text-to-Visual Speech Synthesis for General Objects Using Parameter-Based Lip Models.
Proceedings of the Advances in Multimedia Information Processing, 2002

Emotion recognition via acoustic features and semantic contents in speech.
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002

Sign language translation using an error tolerant retrieval algorithm.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Emotion recognition from textual input using an emotional semantic network.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Parameter-based lip modeling for facial animation of general objects.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Perceptual speech modeling for noisy speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Automatic generation of synthesis units and prosodic information for Chinese concatenative synthesis.
Speech Commun., 2001

Multi-keyword spotting of telephone speech using a fuzzy search algorithm and keyword-driven two-level CBSM.
Speech Commun., 2001

Using Chi-Square Testing in Modeling Confusion Characteristics for Robust Phonetic Set Generation.
Proceedings of the 14th Conference on Computational Linguistics and Speech Processing, 2001

Multi-modal Sign Icon Retrieval for Augmentative Communication.
Proceedings of the Advances in Multimedia Information Processing, 2001

Discriminative disfluency modeling for spontaneous speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Unknown Word and Phrase Extraction Using a Phrase-Like-Unit-Based Likelihood Ratio.
Int. J. Comput. Process. Orient. Lang., 2000

網際網路FAQ 檢索中意圖萃取與語意比對之研究 (Intention Extraction and Semantic Matching for Internet FAQ Retrieval Using Spoken Language Query) [In Chinese].
Proceedings of the 13th Conference on Computational Linguistics and Speech Processing, 2000

PC-Based 台灣手語轉語音溝通輔助系統 (PC-based Taiwanese Sign Language to Speech Communication Aided System)[In Chinese].
Proceedings of the 13th Conference on Computational Linguistics and Speech Processing, 2000

Error recovery and sentence verification using statistical partial pattern tree for conversational speech.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Natural language processing for Taiwanese sign language to speech conversion.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Intention extraction and semantic matching for internet FAQ retrieval using spoken language query.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
台語多聲調音節合成單元資料庫暨文字轉語音雛形系統之發展 (Establish Taiwanese 7-Tones Syllable-based Synthesis Units Database for the Prototype Development of Text-To-Speech System) [In Chinese].
Proceedings of the 12th Research on Computational Linguistics Conference, 1999

On Modeling Remote and Local Dependencies in Language.
Proceedings of the 12th Research on Computational Linguistics Conference, 1999

Automatic Selection of Synthesis Units from a Large Speech Database.
Proceedings of the 13th Pacific Asia Conference on Language, 1999

Speech act modeling in a spoken dialogue system using fuzzy hidden Markov model and bayes' decision criterion.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Utterance verification using prosodic information for Mandarin telephone speech keyword spotting.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
應用隱藏式馬可夫模型與口述對話系統之研究 (Spoken Dialogue System Using Hidden Markov Model) [In Chinese].
Proceedings of the 11th Research on Computational Linguistics Conference, 1998

Spoken dialogue system using corpus-based hidden Markov model.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Telephone speech multi-keyword spotting using fuzzy search algorithm and prosodic verification.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1997
A novel two-level method for the computation of the LSP frequencies using a decimation-in-degree algorithm.
IEEE Trans. Speech Audio Process., 1997

1996
A Vowel-Driven Connected Mandarin Digit Recognition System.
J. Inf. Sci. Eng., 1996

中英文文句翻語音系統中連音處理之研究 (A Study on the Coarticulation Generation in Chinese-English Text to Speech Synthesis) [In Chinese].
Proceedings of 9th Computational Linguistics Conference, 1996

A Mandarin Voice Organizer Based on a Template-Matching Speech Recognizer.
Proceedings of the 11th Pacific Asia Conference on Language, Information and Computation, 1996

1995
Speaker Identification Based on Continuous Mandarin Digits Using Small Training Data and Fast Algorithms.
J. Inf. Sci. Eng., 1995

以CELP為基礎之文句翻語音中韻律訊息之產生與調整 (Prosodic Information Generation and Adjustment in Text to Speech Synthesis Based on CELP) [In Chinese].
Proceedings of 8th Computational Linguistics Conference, 1995

A Bayesian neural network chip design for speech recognition system.
Proceedings of International Conference on Neural Networks (ICNN'95), Perth, WA, Australia, November 27, 1995

Computer-aided analysis and classification of heart sounds based on neural networks and time analysis.
Proceedings of the 1995 International Conference on Acoustics, 1995

1991
A hierarchical neural network model based on a C/V segmentation algorithm for isolated Mandarin speech recognition.
IEEE Trans. Signal Process., 1991

A shunting multilayer perceptron network for confusing/composite pattern recognition.
Pattern Recognit., 1991

Speaker-Independent Recognition of isolated Words using concatenated Neural Networks.
Int. J. Pattern Recognit. Artif. Intell., 1991

Integrating neural nets and one-stage dynamic programming for speaker independent continuous Mandarin digit recognition.
Proceedings of the 1991 International Conference on Acoustics, 1991

1990
On Segmentation and Recognition of Connected Spoken Digits Based on a Neural Network Model.
J. Inf. Sci. Eng., 1990


  Loading...