Jia Liu

Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2019

Large Margin Softmax Loss for Speaker Verification.

Yi Liu

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Multi-objective Optimization Training of PLDA for Speaker Verification.

Proceedings of the IEEE International Conference on Acoustics, 2019

Geometric Discriminant Analysis for I-vector Based Speaker Verification.

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Dilated-Gated Convolutional Neural Network with A New Loss Function on Sound Event Detection.

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018

Lattice Based Transcription Loss for End-to-End Speech Recognition.

J. Signal Process. Syst., 2018

Local Pairwise Linear Discriminant Analysis for Speaker Verification.

IEEE Signal Process. Lett., 2018

Advanced recurrent network-based hybrid acoustic models for low resource speech recognition.

EURASIP J. Audio Speech Music. Process., 2018

Multiobjective Optimization Training of PLDA for Speaker Verification.

CoRR, 2018

VB-HMM Speaker Diarization with Enhanced and Refined Segment Representation.

Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Latent Class Model for Single Channel Speaker Diarization.

Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Exploring a Unified Attention-Based Pooling Framework for Speaker Verification.

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Speaker Embedding Extraction with Phonetic Information.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Text Prompted Speaker Verification Based on Phoneme Clustering with Earth Mover's Distance and Cauchy-Schwarz Divergence.

Zhuzi Chen

Yi Liu

Proceedings of the 2018 2nd International Conference on Algorithms, Computing and Systems, 2018

Improved Phonotactic Language Recognition Using Collaborated Language Model.

Proceedings of the 5th IEEE International Conference on Cloud Computing and Intelligence Systems, 2018

Investigation of Frame Alignments for GMM-based Digit-prompted Speaker Verification.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017

Investigation of Frame Alignments for GMM-based Text-prompted Speaker Verification.

CoRR, 2017

Ivec-PLDA-AHC priors for VB-HMM speaker diarization system.

Proceedings of the 2017 IEEE International Workshop on Signal Processing Systems, 2017

Deep neural networks based speaker modeling at different levels of phonetic granularity.

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

An LSTM-CTC based verification system for proxy-word based OOV keyword search.

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Comparison of multiple features and modeling methods for text-dependent speaker verification.

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Gated convolutional networks based hybrid acoustic models for low resource speech recognition.

Jian Kang

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016

Discriminative Boosting Algorithm for Diversified Front-End Phonotactic Language Recognition.

J. Signal Process. Syst., 2016

Maxout neurons for deep convolutional and LSTM neural networks in speech recognition.

Speech Commun., 2016

Investigation of Senone-based Long-Short Term Memory RNNs for Spoken Language Recognition.

Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Application of i-vector in speech and music classification.

Proceedings of the 2016 IEEE International Symposium on Signal Processing and Information Technology, 2016

A speech enhancement algorithm using computational auditory scene analysis with spectral subtraction.

Proceedings of the 2016 IEEE International Symposium on Signal Processing and Information Technology, 2016

Gated recurrent units based hybrid acoustic models for robust speech recognition.

Jian Kang

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Lattice based transcription loss for end-to-end speech recognition.

Jian Kang

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

A study of variational method for text-independent speaker recognition.

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Improving Deep Neural Networks Based Speaker Verification Using Unlabeled Data.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

A Novel Discriminative Score Calibration Method for Keyword Search.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Investigating Various Diarization Algorithms for Speaker in the Wild (SITW) Speaker Recognition Challenge.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

THU-EE System Description for NIST LRE 2015.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015

Multi-resolution time frequency feature and complementary combination for short utterance speaker recognition.

Zhiyi Li

Multim. Tools Appl., 2015

Regularized minimum class variance extreme learning machine for language recognition.

EURASIP J. Audio Speech Music. Process., 2015

THUEE language modeling method for the OpenKWS 2015 evaluation.

Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2015

Convolutional maxout neural networks for speech separation.

Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2015

Dialog state tracking using long short-term memory neural networks.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Using word confusion networks for slot filling in spoken language understanding.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Investigation of bottleneck features and multilingual deep neural networks for speaker verification.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Simultaneous utilization of spectral magnitude and phase information to extract supervectors for speaker verification anti-spoofing.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Neuron sparseness versus connection sparseness in deep neural network for large vocabulary speech recognition.

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

The THUEE system for the openKWS14 keyword search evaluation.

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Stacked bottleneck features for speaker verification.

Yao Tian

Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

PRISM: A statistical modeling framework for text-independent speaker verification.

Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Improved system fusion for keyword search.

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

High-performance Swahili keyword search with very limited language pack: The THUEE system for the OpenKWS15 evaluation.

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Calibration of word posterior estimation in confusion networks for keyword search.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014

Efficient One-Pass Decoding with NNLM for Speech Recognition.

IEEE Signal Process. Lett., 2014

Spoken language recognition based on gap-weighted subsequence kernels.

Speech Commun., 2014

Empirically combining unnormalized NNLM and back-off N-gram for fast N-best rescoring in speech recognition.

EURASIP J. Audio Speech Music. Process., 2014

Homogenous ensemble phonotactic language recognition based on SVM supervector reconstruction.

EURASIP J. Audio Speech Music. Process., 2014

Text-Independent Speaker Verification via State Alignment.

Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

Multi-scale kernels for short utterance speaker recognition.

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Deep belief network based CRF for spoken language understanding.

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Word embeddings: A semi-supervised learning method for slot-filling in spoken dialog systems.

Zhenfeng Chen

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Speaker verification using Fisher vector.

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Discriminative boosting regression backend for phonotactic language recognition.

Wei-Wei Liu

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Improved multitaper PNCC feature for robust speaker verification.

Yi Liu

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Phonotactic language recognition based on DNN-HMM acoustic model.

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

A new fast and memory effective i-vector extraction based on factor analysis of KLD derived GMM supervector.

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Convolutional maxout neural networks for low-resource speech recognition.

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Phonotactic language recognition based on time-gap-weighted lattice kernels.

Wei-Wei Liu

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Variance regularization of RNNLM for speech recognition.

Proceedings of the IEEE International Conference on Acoustics, 2014

Improved phonotactic language recognition based on RNN feature reconstruction.

Proceedings of the IEEE International Conference on Acoustics, 2014

Stochastic pooling maxout networks for low-resource speech recognition.

Yongzhe Shi

Proceedings of the IEEE International Conference on Acoustics, 2014

Semi-supervised learning of dialogue acts using sentence similarity based on word embeddings.

Proceedings of the International Conference on Audio, 2014

Dereverberation for Speaker Identification in Meeting.

Yi Yang

Proceedings of the HCI International 2014 - Posters' Extended Abstracts, 2014

Exploring the Large-Scale TDOA Feature Space for Speaker Diarization.

Yi Yang

Proceedings of the HCI International 2014 - Posters' Extended Abstracts, 2014

2013

Exploiting articulatory features for pitch accent detection.

J. Zhejiang Univ. Sci. C, 2013

Fast Approximate Matching Algorithm for Phone-based Keyword Spotting.

Xin Zhang

J. Networks, 2013

Exploiting contextual information for prosodic event detection using auto-context.

EURASIP J. Audio Speech Music. Process., 2013

RNN language model with word clustering and class-based output layer.

EURASIP J. Audio Speech Music. Process., 2013

THU-EE system fusion for the NIST 2012 speaker recognition evaluation.

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

MLP-HMM two-stage unsupervised training for low-resource languages on conversational telephone speech recognition.

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Parallel absolute-relative feature based phonotactic language recognition.

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Audiovisual synthesis of exaggerated speech for corrective feedback in computer-assisted pronunciation training.

Proceedings of the IEEE International Conference on Acoustics, 2013

Simplified domain transfer multiple kernel learning for language recognition.

Jiaming Xu

Shanhong Xia

Proceedings of the IEEE International Conference on Acoustics, 2013

Temporal kernel neural network language model.

Proceedings of the IEEE International Conference on Acoustics, 2013

I-matrix for text-independent speaker recognition.

Proceedings of the IEEE International Conference on Acoustics, 2013

Improve low-resource non-native mispronunciation detection with native speech by articulatory-based tandem feature.

Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

THUEE system for the Albayzin 2012 language recognition evaluation.

Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

Improving deep neural network acoustic models using unlabeled data.

Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

Combination of data borrowing strategies for low-resource LVCSR.

Kai Yu

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Deep maxout neural networks for speech recognition.

Yongzhe Shi

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012

Phone lattice reconstruction for embedded language recognition in LVCSR.

EURASIP J. Audio Speech Music. Process., 2012

Complementary combination in i-vector level for language recognition.

Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Automatic pitch accent detection using auto-context with acoustic features.

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Improve mispronunciation detection with Tandem feature.

Hua Yuan

Junhong Zhao

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Articulatory Feature based Multilingual MLPs for Low-Resource Speech Recognition.

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Cross-Lingual and Ensemble MLPs Strategies for Low-Resource Speech Recognition.

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Orthogonal Subspace Combination Based on the Joint Factor Analysis for Text-Independent Speaker Recognition.

Proceedings of the Biometric Recognition - 7th Chinese Conference, 2012

2011

Automatic player labeling, tracking and field registration and trajectory mapping in broadcast soccer video.

ACM Trans. Intell. Syst. Technol., 2011

Time-Frequency Cepstral Features and Heteroscedastic Linear Discriminant Analysis for Language Recognition.

IEEE Trans. Speech Audio Process., 2011

Robust speaker recognition in cross-channel condition based on Gaussian mixture model.

Yuxiang Shan

Multim. Tools Appl., 2011

Time-Frequency Cepstral Features and Combining Discriminative Training for Phonotactic Language Recognition.

J. Comput., 2011

Language Recognition Based on Acoustic Diversified Phone Recognizers and Phonotactic Feature Fusion.

IEICE Trans. Inf. Syst., 2011

Speaker segmentation and clustering based on the improved spectral clustering.

Yong Ma

Chang-chun Bao

Proceedings of the 2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011

Robust Audio Fingerprinting Based on Local Spectral Luminance Maxima Scheme.

Yongzhe Shi

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Combining Lattice-Based Language Dependent and Independent Approaches for Out-of-Language Detection in LVCSR.

Yuxiang Shan

Yan Deng

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

State-Level Data Borrowing for Low-Resource Speech Recognition Based on Subspace GMMs.

Daniel Povey

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Strategies for using MLP based features with limited target-language training data.

Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010

Multiple Background Models for Speaker Verification.

Yuxiang Shan

Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010

Mandarin-English bilingual phone modeling and combining MPE based Discriminative training for cross-language speech recognition.

Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Multi-feature combination for speaker recognition.

Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Variant time-frequency cepstral features for speaker recognition.

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

A fast query by humming system based on notes.

Jingzhou Yang

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Combining Chinese spoken term detection systems via side-information conditioned linear logistic regression.

Sha Meng

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Integration of Complementary Phone Recognizers for Phonotactic Language Recognition.

Proceedings of the Information Computing and Applications - First International Conference, 2010

A CMLLR supervector kernel for SVM language recognition.

Shan Zhong

Proceedings of the IEEE International Conference on Acoustics, 2010

Phone modeling and combining discriminative training for Mandarin-English bilingual speech recognition.

Proceedings of the IEEE International Conference on Acoustics, 2010

A modified Subband post-filtering approach for MVDR beamformer.

Proceedings of the 9th IEEE International Conference on Cognitive Informatics, 2010

2009

Efficient embedded speech recognition for very large vocabulary Mandarin car-navigation systems.

Michael T. Johnson

IEEE Trans. Consumer Electron., 2009

Automatic player detection, labeling and tracking in broadcast soccer video.

Pattern Recognit. Lett., 2009

Research on Intersession Variability Compensation for MLLR-SVM Speaker Recognition.

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2009

Two-layer network coordinate system for Internet distance prediction.

Proceedings of the International Conference on Ultra Modern Telecommunications, 2009

A Novel Embedded Speaker Verification on System on Chip.

Pengfei Mao

Proceedings of the Sixth International Conference on Fuzzy Systems and Knowledge Discovery, 2009

A Combined De-correlation Method for Acoustic Feedback Cancellation in Hearing Aids.

Hong Cao

Weiwei Zhang

Proceedings of the CSIE 2009, 2009 WRI World Congress on Computer Science and Information Engineering, March 31, 2009

2008

An Equalized Heteroscedastic Linear Discriminant Analysis Algorithm.

IEEE Signal Process. Lett., 2008

Speaker Clustering Aided by Visual Dialogue Analysis.

Proceedings of the Advances in Multimedia Information Processing, 2008

Addressing the out-of-vocabulary problem for large-scale Chinese spoken term detection.

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Fusing multiple systems into a compact lattice index for Chinese spoken term detection.

Proceedings of the IEEE International Conference on Acoustics, 2008

Fractional Fourier transform based auditory feature for language identification.

Proceedings of the IEEE Asia Pacific Conference on Circuits and Systems, 2008

Channel compensation technology in differential GSV-SVM speaker verification system.

Proceedings of the IEEE Asia Pacific Conference on Circuits and Systems, 2008

2007

English sentence stress detection system based on HMM framework.

Chao-Lei Li

Shanhong Xia

Appl. Math. Comput., 2007

Subband Energy distance measure applied in multi-pass speech/non-speech discrimination.

Wei Chu

Proceedings of the 9th International Symposium on Signal Processing and Its Applications, 2007

Using confidence measures to evaluate the speaker turns in speaker segmentation.

Wei Chu

Proceedings of the 9th International Symposium on Signal Processing and Its Applications, 2007

Two-Stage Method for Specific Audio Retrieval.

Proceedings of the IEEE International Conference on Acoustics, 2007

Confidence Measure Based Incremental Adaptation for Online Language Identification.

Proceedings of the Human-Computer Interaction. HCI Intelligent Multimodal Interaction Environments, 2007

Automatic Player Detection, Labeling and Tracking in Broadcast Soccer Video.

Proceedings of the British Machine Vision Conference 2007, 2007

A study of lattice-based spoken term detection for Chinese spontaneous speech.

Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006

Confidence Score Based Unsupervised Incremental Adaptation for OOV Words Detection.

Wei Chu

Xi Xiao

Proceedings of the Structural, 2006

A Robust Acoustic Echo Canceller for Noisy Environment.

Shenghao Qin

Sha Meng

Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006

Perceptual Evaluation of Pronunciation Quality for Computer Assisted Language Learning.

Chao-Lei Li

Shanhong Xia

Proceedings of the Technologies for E-Learning and Digital Entertainment, 2006

2004

Language identification using discriminative weighted language models.

Shizhen Wang

Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Comparison of Pronunciation Scores in Spoken Language Learning System.

Weiqian Liang

Proceedings of the Advances in Web-Based Learning, 2004

Embedded speech recognition system on 8-bit MCU core.

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Towards Robustness to Speech Rate in Mandarin All-Syllable Recognition.

J. Comput. Sci. Technol., 2003

Voice conversion with smoothed GMM and MAP adaptation.

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

A novel efficient decoding algorithm for CDHMM-based speech recognizer on chip.

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002

A Rejection Model Based on Multi-Layer Perceptrons for Mandarin Digit Recognition.

Zhong Lin

J. Comput. Sci. Technol., 2002

Acoustic model comparison for an embedded phoneme-based Mandarin name dialing system.

Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002

Real-time Viterbi searching for practical telephone speech recognition systems.

Jin Zhang

Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002

Fast likelihood computation method using block-diagonal covariance matrices in hidden Markov model.

Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002

Comparative study of linear feature transformation techniques for Mandarin digit string recognition.

Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002

2001

Single-chip speech recognition system based on 8051 microcontroller core.

Yuanyuan Shi

IEEE Trans. Consumer Electron., 2001

2000

Confidence measure based unsupervised speaker adaptation.

Husheng Li

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Rejection based on a posteriori probability estimated by MLP with application for Mandarin voice dialer on ASIC.

Lin Zhong