Yonghong Yan

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

Acoustic Echo Control with Frequency-Domain Stage-Wise Regression.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2014

Coalescence Type based Confidence Warping for Agglutinative Language Keyword Spotting.

[BibT_eX]

[DOI]

J. Softw., 2014

Voice biometrics using linear Gaussian model.

[BibT_eX]

[DOI]

IET Biom., 2014

Smoothing Method for Improved Minimum Phone Error Linear Regression.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2014

Markovian discriminative modeling for cross-domain dialog state tracking.

[BibT_eX]

[DOI]

Hang Ren

Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Markovian Discriminative Modeling for Dialog State Tracking.

[BibT_eX]

[DOI]

Hang Ren

Proceedings of the SIGDIAL 2014 Conference, 2014

The role of auditory feedback in speech production: Implications for speech perception in the hearing impaired.

[BibT_eX]

[DOI]

Proceedings of the 2014 International Symposium on Integrated Circuits (ISIC), 2014

Direction-of-arrival estimation of multiple speakers using a planar array.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

A robust step-size control algorithm for frequency domain acoustic echo cancellation.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

On the Performance and Robustness of Crosstalk Cancelation with Multiple Loudspeakers.

[BibT_eX]

[DOI]

Proceedings of the 2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2014

Enhanced Out of Vocabulary Word Detection Using Local Acoustic Information.

[BibT_eX]

[DOI]

Proceedings of the 2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2014

Melody Extraction for Vocal Polyphonic Music Based on Bayesian Framework.

[BibT_eX]

[DOI]

Liming Song

Proceedings of the 2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2014

Boosted Hybrid DNN/HMM System Based on Correlation-Generated Targets.

[BibT_eX]

[DOI]

Proceedings of the 2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2014

Speeding up deep neural networks for speech recognition on ARM Cortex-A series processors.

[BibT_eX]

[DOI]

Proceedings of the 10th International Conference on Natural Computation, 2014

Improved mandarin spoken term detection by using deep neural network for keyword verification.

[BibT_eX]

[DOI]

Proceedings of the 10th International Conference on Natural Computation, 2014

Language recognition system using language branch discriminative information.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Reverberation robust two-microphone Target Signal Detection algorithm with coherent interference.

[BibT_eX]

[DOI]

Proceedings of the IEEE China Summit & International Conference on Signal and Information Processing, 2014

An efficient time varying hybrid reverberator for room acoustic simulation.

[BibT_eX]

[DOI]

Proceedings of the IEEE China Summit & International Conference on Signal and Information Processing, 2014

2013

Noise Estimation Using a Constrained Sequential Hidden Markov Model in the Log-Spectral Domain.

[BibT_eX]

[DOI]

Dongwen Ying

IEEE Trans. Speech Audio Process., 2013

Robust and Fast Localization of Single Speech Source Using a Planar Array.

[BibT_eX]

[DOI]

Dongwen Ying

IEEE Signal Process. Lett., 2013

Spoken Term Detection Based on Improved Index Structure.

[BibT_eX]

[DOI]

J. Softw., 2013

Mixing-attack-proof Randomized Embedding Audio Watermarking System.

[BibT_eX]

[DOI]

J. Comput., 2013

A Novel Discriminative Method for Pronunciation Quality Assessment.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2013

Speaker Recognition Using Sparse Probabilistic Linear Discriminant Analysis.

[BibT_eX]

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2013

Fuzzy Matching of Semantic Class in Chinese Spoken Language Understanding.

[BibT_eX]

[DOI]

Yanling Li

IEICE Trans. Inf. Syst., 2013

Discriminative Approach to Build Hybrid Vocabulary for Conversational Telephone Speech Recognition of Agglutinative Languages.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2013

Dialog State Tracking using Conditional Random Fields.

[BibT_eX]

[DOI]

Proceedings of the SIGDIAL 2013 Conference, 2013

Discriminative pronunciation modeling based on minimum phone error training.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Prefix tree based n-best list re-scoring for recurrent neural network language model used in speech recognition system.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Comparative investigation of objective speech intelligibility prediction measures for noise-reduced signals in Mandarin and Japanese.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Effect of linguistic masker on the intelligibility of Mandarin sentences.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Head-Related Transfer Function Modeling Based on Finite-Impulse Response.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2013

Hybrid Reverberator Using Multiple Impulse Responses for Audio Rendering Improvement.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2013

A novel discriminative method for pronunciation quality assessment.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

A Computer-Assist Algorithm to Detect Repetitive Stuttering Automatically.

[BibT_eX]

[DOI]

Junbo Zhang

Proceedings of the 2013 International Conference on Asian Language Processing, 2013

Automatic Allophone Deriving for Korean Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Computational Intelligence and Security, 2013

Automatic Vocal Segments Detection in Popular Music.

[BibT_eX]

[DOI]

Liming Song

Proceedings of the Ninth International Conference on Computational Intelligence and Security, 2013

Web-Based Language Model Domain Adaptation for Real World Voice Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Computational Intelligence and Security, 2013

Direction of arrival estimation based on weighted minimum mean square error.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

Objective Japanese intelligibility prediction for noisy speech signals before and after noise-reduction processing.

[BibT_eX]

[DOI]

Junfeng Li

Masato Akagi

Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

2012

A Novel Similarity Measure to Induce Semantic Classes and Its Application for Language Model Adaptation in a Dialogue System.

[BibT_eX]

[DOI]

Yali Li

J. Comput. Sci. Technol., 2012

Logarithmic Adaptive Quantization Projection for Audio Watermarking.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2012

A Forced Alignment Based Approach for English Passage Reading Assessment.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2012

Factor Analysis of Neighborhood-Preserving Embedding for Speaker Verification.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2012

Two-Microphone Noise Reduction Using Spatial Information-Based Spectral Amplitude Estimation.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2012

Noise Robust Feature Scheme for Automatic Speech Recognition Based on Auditory Perceptual Mechanisms.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2012

Maximum A Posteriori Linear Regression for language recognition.

[BibT_eX]

[DOI]

Expert Syst. Appl., 2012

Low-dimensional representation of Gaussian mixture model supervector for language recognition.

[BibT_eX]

[DOI]

EURASIP J. Adv. Signal Process., 2012

Automatic Scoring on English Passage Reading Quality.

[BibT_eX]

[DOI]

Junbo Zhang

Proceedings of the Advances in Swarm Intelligence - Third International Conference, 2012

A fast two-microphone noise reduction algorithm based on power level ratio for mobile phone.

[BibT_eX]

[DOI]

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Sparse Probabilistic Linear Discriminant Analysis for Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

A Initial Attempt on Task-Specific Adaptation for Deep Neural Network-based Large Vocabulary Continuous Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Discriminative Decision Function Based Scoring Method in Joint Factor Analysis for Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Speaker Verification Using Neighborhood Preserving Embedding.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Recurrent neural network language model in mandarin voice input system.

[BibT_eX]

[DOI]

Proceedings of the Eighth International Conference on Natural Computation, 2012

A two microphone-based approach for speech enhancement in adverse environments.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Consumer Electronics, 2012

Target speech detection based on microphone array using inter-channel phase differences.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Consumer Electronics, 2012

Noise estimation using a constrained sequential HMM IN log-spectral domain.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Factor analysis of Laplacian approach for speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Evaluation of objective intelligibility prediction measures for noise-reduced signals in mandarin.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A two-microphone based voice activity detection for distant-talking speech in wide range of direction of arrival.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Improved acoustic models for Conversational Telephone Speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Fuzzy Systems and Knowledge Discovery, 2012

Optimized large vocabulary WFST speech recognition system.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Fuzzy Systems and Knowledge Discovery, 2012

An Improved Mandarin Voice Input System Using Recurrent Neural Network Language Model.

[BibT_eX]

[DOI]

Proceedings of the Eighth International Conference on Computational Intelligence and Security, 2012

Parallel implementation of neural networks training on graphic processing unit.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on BioMedical Engineering and Informatics, 2012

2011

Voice Activity Detection Based on an Unsupervised Learning Framework.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2011

Towards precise and robust automatic synchronization of live speech and its transcripts.

[BibT_eX]

[DOI]

Jie Gao

Speech Commun., 2011

Speaker Verification Using Sparse Representations on Total Variability i-vectors.

[BibT_eX]

[DOI]

Xiang Zhang

Shrikanth S. Narayanan

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

A Spread Spectrum Audio Watermarking System with High Perceptual Quality.

[BibT_eX]

[DOI]

Proceedings of the Third International Conference on Communications and Mobile Computing, 2011

Development of a Chinese song name recognition system.

[BibT_eX]

[DOI]

Proceedings of the Seventh International Conference on Natural Computation, 2011

Robust understanding of spoken Chinese through character-based tagging and prior knowledge exploitation.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

Language recognition with language total variability.

[BibT_eX]

[DOI]

Proceedings of the 2011 International Conference on Innovative Computing and Cloud Computing, 2011

Quantization Index Modulation audio watermarking system using a psychoacoustic model.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Information, 2011

2010

Development of a Mandarin-English Bilingual Speech Recognition System with Unified Acoustic Models.

[BibT_eX]

[DOI]

Qingqing Zhang

J. Inf. Sci. Eng., 2010

A Hybrid Speech Emotion Recognition System Based on Spectral and Prosodic Features.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2010

Enhancing the Robustness of the Posterior-Based Confidence Measures Using Entropy Information for Speech Recognition.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2010

Acoustic Feature Optimization Based on <i>F</i>-Ratio for Robust Speech Recognition.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2010

A bayesian logistic regression approach to spoken language identification.

[BibT_eX]

[DOI]

IEICE Electron. Express, 2010

A new linguistic feature for Automated Essay Scoring.

[BibT_eX]

[DOI]

Proceedings of the 4th International Universal Communication Symposium, 2010

Forward optimal measures for automatic mispronunciation detection.

[BibT_eX]

[DOI]

Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Intelligibility investigation of single-channel noise reduction algorithms for Chinese and Japanese.

[BibT_eX]

[DOI]

Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Large vocabulary Uyghur continuous speech recognition based on stems and suffixes.

[BibT_eX]

[DOI]

Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Speaker recognition using the resynthesized speech via spectrum modeling.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Speech enhancement using improved generalized sidelobe canceller in frequency domain with multi-channel postfiltering.

[BibT_eX]

[DOI]

Kai Li

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Robust character based tagging with domain lexical features for Chinese spoken language understanding.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Natural Computation, 2010

Maximum a posteriori linear regression for speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Improved modeling for F0 generation and V/U decision in HMM-based TTS.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Automatic Synchronization of live speech and its Transcripts based on a frame-synchronous likelihood ratio test.

[BibT_eX]

[DOI]

Jie Gao

Proceedings of the IEEE International Conference on Acoustics, 2010

Subset selection for articulatory feature based confidence measures.

[BibT_eX]

[DOI]

Proceedings of the Third International Workshop on Advanced Computational Intelligence, 2010

TBNR: the ThinkIT Broadcast News speech Recognition system.

[BibT_eX]

[DOI]

Proceedings of the Third International Workshop on Advanced Computational Intelligence, 2010

Semantic class induction and its application for a Chinese voice search system.

[BibT_eX]

[DOI]

Yali Li

Proceedings of the CIPS-SIGHAN Joint Conference on Chinese Language Processing, 2010

2009

Using a Kind of Novel Phonotactic Information for SVM Based Speaker Recognition.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2009

Approximate Decision Function and Optimization for GMM-UBM Based Speaker Verification.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2009

An LVCSR Based Reading Miscue Detection System Using Knowledge of Reference and Error Patterns.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2009

Automatic Singing Performance Evaluation for Untrained Singers.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2009

WAPS: An Audio Program Surveillance System for Large Scale Web Data Stream.

[BibT_eX]

[DOI]

Proceedings of the Web Information Systems and Mining, International Conference, 2009

Nonnative Speech Recognition Based on Bilingual Model Modification at State Level.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Symposium on Neural Networks, 2009

A Novel Fuzzy-Based Automatic Speaker Clustering Algorithm.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Networks, 2009

Dynamic Multiple Pronunciation Incorporation in a Refined Search Space for Reading Miscue Detection.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Symposium on Neural Networks, 2009

Improving Voice Search Using Forward-Backward LVCSR System Combination.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Symposium on Neural Networks, 2009

An SVM-Based Mandarin Pronunciation Quality Assessment System.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Symposium on Neural Networks, 2009

Simultaneous Synchronization of Text and Speech for Broadcast News Subtitling.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Networks, 2009

Physiologically-inspired feature extraction for emotion recognition.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Tonal articulatory feature for Mandarin and its application to conversational LVCSR.

[BibT_eX]

[DOI]

Qingqing Zhang

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

A one-step tone recognition approach using MSD-HMM for continuous speech.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Online detecting end times of spoken utterances for synchronization of live speech and its transcripts.

[BibT_eX]

[DOI]

Jie Gao

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Chinese Prosody Structure Prediction Based on Conditional Random Fields.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference on Natural Computation, 2009

Nonnative speech recognition based on bilingual model modification.

[BibT_eX]

[DOI]

Proceedings of the FUZZ-IEEE 2009, 2009

Emotion Recognition and Conversion for Mandarin Speech.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Fuzzy Systems and Knowledge Discovery, 2009

Investigations to Minimum Phone Error Training in Bilingual Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Fuzzy Systems and Knowledge Discovery, 2009

Sample-Based Automatic Dictionary Generation for Keyword Spotting System.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Fuzzy Systems and Knowledge Discovery, 2009

Improving Automatic Speech Recognizer of Voice Search Using System Combination.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Fuzzy Systems and Knowledge Discovery, 2009

Improved Lattice-Based Confidence Measure for Speech Recognition via a Lattice Cutoff Procedure.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Fuzzy Systems and Knowledge Discovery, 2009

Automatic Detection of Pathological Voices Using GMM-SVM Method.

[BibT_eX]

[DOI]

Xiang Wang

Proceedings of the 2nd International Conference on BioMedical Engineering and Informatics, 2009

Automatic Detection of Pathological Voices Using GMM-MLLR Approach.

[BibT_eX]

[DOI]

Xiang Wang

Proceedings of the 2nd International Conference on BioMedical Engineering and Informatics, 2009

2008

Development of a Mandarin-English Bilingual Speech Recognition System for Real World Music Retrieval.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2008

Robust Speaker Clustering Using Affinity Propagation.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2008

Speech Enhancement Using Improved Adaptive Null-Forming in Frequency Domain with Postfilter.

[BibT_eX]

[DOI]

Heng Zhang

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2008

Effects of the Temporal Fine Structure in Different Frequency Bands on Mandarin Tone Perception.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2008

Melody Track Selection Using Discriminative Language Model.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2008

Automatic Language Identification with Discriminative Language Characterization Based on SVM.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2008

A One-Pass Real-Time Decoder Using Memory-Efficient State Network.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2008

Effective Acoustic Modeling for Pronunciation Quality Scoring of Strongly Accented Mandarin Speech.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2008

Using SVM as Back-End Classifier for Language Identification.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2008

Speaker Recognition using a Kind of Novel Phonotactic Information.

[BibT_eX]

[DOI]

Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Improved Semi-Parametric Mean Trajectory Model Using Discriminatively Trained Centroids.

[BibT_eX]

[DOI]

Ran Xu

Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Using Reference to Tune Language Model for Detection of Reading Miscues.

[BibT_eX]

[DOI]

Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Efficient System Combination for Syllable-Confusion-Network-Based Chinese Spoken Term Detection.

[BibT_eX]

[DOI]

Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

A Synchronous Method for Automatic Scoring of Language Learning.

[BibT_eX]

[DOI]

Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Nonnative speech recognition based on state-candidate bilingual model modification.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

A frequency domain approach for speech enhancement with directionality using compact microphone array.

[BibT_eX]

[DOI]

Heng Zhang

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Towards vocabulary-independent speech indexing for large-scale repositories.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Cochannel speech separation using multi-pitch estimation and model based voiced sequential grouping.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Forward optimal modeling of acoustic confusions in Mandarin CALL system.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Robust speaker change detection using Kernel-Gaussian model.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

An objective singing evaluation approach by relating acoustic measurements to perceptual ratings.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Recognizing named entities in spoken Chinese dialogues with a character-level maximum entropy tagger.

[BibT_eX]

[DOI]

Changchun Bao

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Wide-Band Low-Noise Quadrature VCO Design.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2008), 2008

Using Discriminative Training Techniques in Practical Intelligent Music Retrieval System.

[BibT_eX]

[DOI]

Ran Xu

Proceedings of the Fourth International Conference on Natural Computation, 2008

Application of LVCSR to the Detection of Chinese Mandarin Reading Miscues.

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference on Natural Computation, 2008

Spoken Term Detection Using Dynamic Match Subword Confusion Network.

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference on Natural Computation, 2008

Mandarin-English bilingual Speech Recognition for real world music retrieval.

[BibT_eX]

[DOI]

Qingqing Zhang

Proceedings of the IEEE International Conference on Acoustics, 2008

A novel speaker clustering algorithm via supervised affinity propagation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

Mandarin vowel pronunciation quality evaluation by a novel formant classification method and its combination with traditional algorithms.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

New Machine Scores and Their Combinations for Automatic Mandarin Phonetic Pronunciation Quality Assessment.

[BibT_eX]

[DOI]

Proceedings of the Knowledge-Based Intelligent Information and Engineering Systems, 2007

Singing Melody Extraction in Polyphonic Music by Harmonic Tracking.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Music Information Retrieval, 2007

Contributions of temporal fine structure cues to Chinese speech recognition in cochlear implant simulation.

[BibT_eX]

[DOI]

Lin Yang

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

A fast fuzzy keyword spotting algorithm based on syllable confusion network.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Mandarin vowel pronunciation quality evaluation by using formant pattern recognition.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Spoken language identification using score vector modeling and support vector machine.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Robust voice activity detection based on adaptive sub-band energy sequence analysis and harmonic detection.

[BibT_eX]

[DOI]

Yanmeng Guo

Qian Qian

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Authentication and Quality Monitoring based on Audio Watermark for Analog AM Shortwave Broadcasting.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2007), 2007

Large Vocabulary Mandarin Continuous Speech Recognition under Noisy Environment.

[BibT_eX]

[DOI]

Proceedings of the Third International Conference on Natural Computation, 2007

Keyword Spotting Based on Syllable Confusion Network.

[BibT_eX]

[DOI]

Proceedings of the Third International Conference on Natural Computation, 2007

The Design of Backend Classifiers in PPRLM System for Language Identification.

[BibT_eX]

[DOI]

Proceedings of the Third International Conference on Natural Computation, 2007

Real Context Model for Tone Recognition in Mandarin Conversational Telephone Speech.

[BibT_eX]

[DOI]

Proceedings of the Third International Conference on Natural Computation, 2007

Mandarin Accent Analysis Based on Formant Frequencies.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

Audio Segmentation via Tri-Model Bayesian Information Criterion.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

A Decision-Tree-Based Online Speaker Clustering.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Image Analysis, Third Iberian Conference, 2007

A Spoken Dialogue System Based on Keyword Spotting Technology.

[BibT_eX]

[DOI]

Pengyuan Zhang

Proceedings of the Human-Computer Interaction. HCI Intelligent Multimodal Interaction Environments, 2007

High Quality Voice Conversion through Phoneme-Based Linear Mapping Functions with STRAIGHT for Mandarin.

[BibT_eX]

[DOI]

Kun Liu

Proceedings of the Fourth International Conference on Fuzzy Systems and Knowledge Discovery, 2007

2006

Keyword Spotting Based on Phoneme Confusion Matrix.

[BibT_eX]

[DOI]

Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006

Adaptive Null-Forming Algorithm with Auditory Sub-bands.

[BibT_eX]

[DOI]

Heng Zhang

Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

A Top-down Approach to Melody Match in Pitch Contour for Query by Humming.

[BibT_eX]

[DOI]

Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006

Syllable Based Audio Search Using Confusion Network Arc as Indexing Unit.

[BibT_eX]

[DOI]

Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006

Improvements in Tone Pronunciation Scoring for Strongly Accented Mandarin Speech.

[BibT_eX]

[DOI]

Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006

Speaker Diarization System Based on GMM and BIC.

[BibT_eX]

[DOI]

Tantan Liu

Xiaoxing Liu

Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006

Speech Endpoint Detection Based on Sub-band Energy and Harmonic Structure of Voice.

[BibT_eX]

[DOI]

Yanmeng Guo

Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006

An Efficient and Robust Approach to Audio ID Identification.

[BibT_eX]

[DOI]

Jian Liu

Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006

Automatic Scoring of Flat Tongue and Raised Tongue in Computer-assisted Mandarin Learning.

[BibT_eX]

[DOI]

Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006

A Novel Audio Watermarking in Wavelet Domain.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2006), 2006

2005

Fast confidence measure algorithm for continuous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004

Speaker adaptation using constrained transformation.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2004

Robust state clustering using phonetic decision trees.

[BibT_eX]

[DOI]

Chaojun Liu

Speech Commun., 2004

Automatic assessment of pronunciation quality.

[BibT_eX]

[DOI]

Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Fusion based speech segmentation in DARPA SPINE2 task.

[BibT_eX]

[DOI]

Chengyi Zheng

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

A dynamic cross-reference pruning strategy for multiple feature fusion at decoder run time.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002

Run time information fusion in speech recognition.

[BibT_eX]

[DOI]

Chengyi Zheng

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001

A context adaptation approach for building context dependent models in LVCSR.

[BibT_eX]

[DOI]

Xiaoxing Liu

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000

Develop Telephony Speech Recognition Systems for Real-world Application.

[BibT_eX]

[DOI]

Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000

Word Error Rate Reduction by Bottom-Up Tone Integration to Chinese Continuous Speech Recognition System.

[BibT_eX]

[DOI]

Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000

Toward Making Speech Part of People's Daily Life.

[BibT_eX]

[DOI]

Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000

Efficiently using speaker adaptation data.

[BibT_eX]

[DOI]

Chengyi Zheng

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Improvements in search algorithm for large vocabulary continuous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Effective vector quantization for a highly compact acoustic model for LVCSR.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

An orthogonal GMM based speaker verification system.

[BibT_eX]

[DOI]

Xiaoxing Liu

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Speaker change detection using minimum message length criterion.

[BibT_eX]

[DOI]

Chaojun Liu

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Dynamic threshold setting via Bayesian information criterion (BIC) in HMM training.

[BibT_eX]

[DOI]

Ying Jia

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Office message center - a spoken dialogue system.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Vocabulary-based acoustic model trim down and task adaptation.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Keyword spotting in auto-attendant system.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Linear regression under maximum a posteriori criterion with Markov random field prior.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2000

Markov Random Field Linear Regression.

[BibT_eX]

[DOI]

Proceedings of the 10th European Signal Processing Conference, 2000

1999

Understanding speech recognition using correlation-generated neural network targets.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 1999

Development of the 1998 OGI-FONIX broadcast news transcription system.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

High accuracy acoustic modeling using two-level decision-tree based state-tying.

[BibT_eX]

[DOI]

Chaojun Liu

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

High accuracy acoustic modeling based on multi-stage decision tree.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998

Universal speech tools: the CSLU toolkit.

[BibT_eX]

[DOI]

Pieter J. E. Vermeulen

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Accessible technology for interactive systems: a new approach to spoken language research.

[BibT_eX]

[DOI]

Ronald A. Cole

Stephen Sutton