Tan Lee
Orcid: 0000-0002-7089-3436Affiliations:
- Chinese University of Hong Kong, Department of Electronic Engineering, Hong Kong
According to our database1,
Tan Lee
authored at least 249 papers
between 1992 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Automatic Detection of Speech Sound Disorder in Cantonese-Speaking Pre-School Children.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
CoRR, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Creating Personalized Synthetic Voices from Articulation Impaired Speech Using Augmented Reconstruction Loss.
Proceedings of the IEEE International Conference on Acoustics, 2024
Modeling Intrapersonal and Interpersonal Influences for Automatic Estimation of Therapist Empathy in Counseling Conversation.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Efficient Black-Box Speaker Verification Model Adaptation With Reprogramming And Backend Learning.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis Based on Disentanglement Between Prosody and Timbre.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
A Study on Prosodic Entrainment in Relation to Therapist Empathy in Counseling Conversation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
A Study on Using Duration and Formant Features in Automatic Detection of Speech Sound Disorder in Children.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Leveraging Phone-Level Linguistic-Acoustic Similarity For Utterance-Level Pronunciation Scoring.
Proceedings of the IEEE International Conference on Acoustics, 2023
Convolution-Based Channel-Frequency Attention for Text-Independent Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2023
Functional Connectivity Analysis in Multi-channel EEG for Emotion Detection with Phase Locking Value and 3D CNN.
Proceedings of the 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2023
Diffusion-Based Mel-Spectrogram Enhancement for Personalized Speech Synthesis with Found Data.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
IEEE ACM Trans. Audio Speech Lang. Process., 2022
iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and Recognition.
CoRR, 2022
An Investigation on Applying Acoustic Feature Conversion to ASR of Adult and Child Speech.
CoRR, 2022
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022
Aphasia Detection for Cantonese-Speaking and Mandarin-Speaking Patients Using Pre-Trained Language Models.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022
Label-free Knowledge Distillation with Contrastive Loss for Light-weight Speaker Recognition.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022
Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Hierarchical Attention Network for Evaluating Therapist Empathy in Counseling Session.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Automatic Detection of Speech Sound Disorder in Child Speech Using Posterior-based Speaker Representations.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
EDITnet: A Lightweight Network for Unsupervised Domain Adaptation in Speaker Verification.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Durational Patterning at Discourse Boundaries in Relation to Therapist Empathy in Psychotherapy.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
A Study on the Efficacy of Model Pre-Training In Developing Neural Text-to-Speech System.
Proceedings of the IEEE International Conference on Acoustics, 2022
Multivariate Empirical Mode Decomposition of EEG for Mental State Detection at Localized Brain Lobes.
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022
2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Acoustical Analysis of Speech Under Physical Stress in Relation to Physical Activities and Physical Literacy.
CoRR, 2021
Data Augmentation with Locally-time Reversed Speech for Automatic Speech Recognition.
CoRR, 2021
CoRR, 2021
Proceedings of the NLPIR 2021: 5th International Conference on Natural Language Processing and Information Retrieval, Sanya, China, December 17, 2021
Estimating Mutual Information in Prosody Representation for Emotional Prosody Transfer in Speech Synthesis.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021
Automatic Detection of Word-Level Reading Errors in Non-native English Speech Based on ASR Output.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Fine-Grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Pairing Weak with Strong: Twin Models for Defending Against Adversarial Attack on Speaker Verification.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Detection of Consonant Errors in Disordered Speech Based on Consonant-Vowel Segment Embedding.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional Fusion.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
Utterance-Level Neural Confidence Measure for End-to-End Children Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
2020
An End-to-End Approach to Automatic Speech Assessment for Cantonese-speaking People with Aphasia.
J. Signal Process. Syst., 2020
IEEE J. Sel. Top. Signal Process., 2020
Introduction to the Issue on Automatic Assessment of Health Disorders Based on Voice, Speech, and Language Processing.
IEEE J. Sel. Top. Signal Process., 2020
Unsupervised Spoken Term Discovery Based on Re-clustering of Hypothesized Speech Segments with Siamese and Triplet Networks.
CoRR, 2020
CoRR, 2020
Fine-grained style modelling and transfer in text-to-speech synthesis via content-style disentanglement.
CoRR, 2020
Learning Syllable-Level Discrete Prosodic Representation for Expressive Speech Generation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
CUCHILD: A Large-Scale Cantonese Corpus of Child Speech for Phonology and Articulation Assessment.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Automatic Detection of Phonological Errors in Child Speech Using Siamese Recurrent Autoencoder.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
EigenEmo: Spectral Utterance Representation Using Dynamic Mode Decomposition for Speech Emotion Classification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Advancing Multiple Instance Learning with Attention Modeling for Categorical Speech Emotion Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Time-Frequency Feature Decomposition Based on Sound Duration for Acoustic Scene Classification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Mixture Factorized Auto-Encoder for Unsupervised Hierarchical Deep Factorization of Speech Signal.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Resting-State EEG-Based Biometrics with Signals Features Extracted by Multivariate Empirical Mode Decomposition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020
2019
Acoustical Assessment of Voice Disorder With Continuous Speech Using ASR Posterior Features.
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Exploiting Cross-Lingual Speaker and Phonetic Diversity for Unsupervised Subword Modeling.
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Proceedings of the 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2019
Fast DNN Acoustic Model Speaker Adaptation by Learning Hidden Unit Contribution Features.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Child Speech Disorder Detection with Siamese Recurrent Network Using Speech Attribute Features.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Deep Learning of Segment-Level Feature Representation with Multiple Instance Learning for Utterance-Level Speech Emotion Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Improving Unsupervised Subword Modeling via Disentangled Speech Representation Learning and Transformation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
BLHUC: Bayesian Learning of Hidden Unit Contributions for Deep Neural Network Speaker Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Combining Phone Posteriorgrams from Strong and Weak Recognizers for Automatic Speech Assessment of People with Aphasia.
Proceedings of the IEEE International Conference on Acoustics, 2019
Adversarial Multi-task Deep Features and Unsupervised Back-end Adaptation for Language Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
2018
J. Signal Process. Syst., 2018
Investigation of Stacked Deep Neural Networks and Mixture Density Networks for Acoustic-to-Articulatory Inversion.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Disordered Speech Assessment Using Kullback-Leibler Divergence Features with Multi-Task Acoustic Modeling.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Prediction of Voice Disorder Severity: Contributions from Sustained Vowels and Continuous Speech.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Automatic Speech Assessment for People with Aphasia Using TDNN-BLSTM with Multi-Task Learning.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Exploiting Speaker and Phonetic Diversity of Mismatched Language Resources for Unsupervised Subword Modeling.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Improving Cross-Lingual Knowledge Transferability Using Multilingual TDNN-BLSTM with Language-Dependent Pre-Final Layer.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Automatic Speech Assessment for Aphasic Patients Based on Syllable-Level Embedding and Supra-Segmental Duration Features.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Unsupervised Pattern Discovery from Thematic Speech Archives Based on Multilingual Bottleneck Features.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
2017
Audio-visual expressions of attitude: How many different attitudes can perceivers decode?
Speech Commun., 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Acoustic Assessment of Disordered Voice with Continuous Speech Based on Utterance-Level ASR Posterior Features.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
On the Linguistic Relevance of Speech Units Learned by Unsupervised Acoustic Modeling.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Polyphonic piano note transcription with non-negative matrix factorization of differential spectrogram.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 14th International Conference on Auditory-Visual Speech Processing, 2017
2016
Surface Electromyographic Activity of Extrinsic Laryngeal Muscles in Cantonese Tone Production.
J. Signal Process. Syst., 2016
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016
Towards automatic assessment of aphasia speech using automatic speech recognition techniques.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Exploiting language-mismatched phoneme recognizers for unsupervised acoustic modeling.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Automatic speech recognition for acoustical analysis and assessment of cantonese pathological voice and speech.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
Supervised Single-Microphone Multi-Talker Speech Separation with Conditional Random Fields.
IEEE ACM Trans. Audio Speech Lang. Process., 2015
IEEE ACM Trans. Audio Speech Lang. Process., 2015
A method of speech periodicity enhancement using transform-domain signal decomposition.
Speech Commun., 2015
Speech Commun., 2015
Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015
Modeling temporal dependency for robust estimation of LP model parameters in speech enhancement.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
2014
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014
Correcting Chord Classification Errors Based on Tonal Organization Information of Classical Music.
Proceedings of the 2014 IEEE International Symposium on Multimedia, 2014
Surface electromyographic activity of non-laryngeal neck muscles in Cantonese tone production.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Multipitch tracking based on linear programming relaxation and sparsity-based pitch candidate estimation.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
A graph-based Gaussian component clustering approach to unsupervised acoustic modeling.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
2013
IEEE Trans. Speech Audio Process., 2013
Pitch Estimation in Noisy Speech Using Accumulated Peak Spectrum and Sparse Estimation Technique.
IEEE Trans. Speech Audio Process., 2013
IEEE Signal Process. Lett., 2013
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013
Unsupervised mining of acoustic subword units with segment-level Gaussian posteriorgrams.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Using parallel tokenizers with DTW matrix combination for low-resource spoken term detection.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2013
Structured mean field method for single-microphone speech separation with factorial Hidden Markov Model.
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013
Chord classification of multi-instrumental music using exemplar-based sparse representation.
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013
2012
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012
Two objective measures for speech distortion and noise reduction evaluation of enhanced speech signals.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Integrating multiple observations for model-based single-microphone speech separation with conditional random fields.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Exploration of Phase and Vocal Excitation Modulation Features for Speaker Recognition.
Proceedings of the Biometric Recognition - 7th Chinese Conference, 2012
Classifying NMF components based on vector similarity for speech and music separation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
2011
IEEE Trans. Speech Audio Process., 2011
Proceedings of the International Symposium on Intelligent Signal Processing and Communications Systems, 2011
Score fusion and calibration in multiple language detectors with large performance variation.
Proceedings of the IEEE International Conference on Acoustics, 2011
2010
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010
Similarity Measures for Chinese Pop Music Based on Low-level Audio Signal Attributes.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010
Spectral trajectory estimation using nonnegative matrix factorization for model-based monaural speech separation.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010
Semantics-based language modeling for Cantonese-English code-mixing speech recognition.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Improved Cantonese Tone Recognition with Approximated F0 Contour: Implications for Cochlear Implants.
Proceedings of the International Conference on Asian Language Processing, 2010
A method of speech periodicity enhancement based on transform-domain signal decomposition.
Proceedings of the 18th European Signal Processing Conference, 2010
2009
Int. J. Asian Lang. Process., 2009
Int. J. Comput. Linguistics Chin. Lang. Process., 2009
EURASIP J. Adv. Signal Process., 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Effects of language mixing for automatic recognition of Cantonese-English code-mixing utterances.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 2009 International Conference on Asian Language Processing, 2009
2008
Tone-enhanced generalized character posterior probability (GCPP) for Cantonese LVCSR.
Comput. Speech Lang., 2008
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Mandarin Tone Perception with Temporal Envelope and Periodicity Cues from Different Frequency Regions.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Prosody for Mandarin speech recognition: a comparative study of read and spontaneous speech.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
2007
Static and Dynamic Spectral Features: Their Noise Robustness and Optimal Weights for ASR.
IEEE Trans. Speech Audio Process., 2007
Discrimination Power of Vocal Source and Vocal Tract Related Features for Speaker Segmentation.
IEEE Trans. Speech Audio Process., 2007
IEEE Signal Process. Lett., 2007
Integrating Complementary Features from Vocal Source and Vocal Tract for Speaker Identification.
Int. J. Comput. Linguistics Chin. Lang. Process., 2007
Digit. Signal Process., 2007
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
2006
Speech recognition on DSP: issues on computational efficiency and performance analysis.
Microprocess. Microsystems, 2006
Int. J. Comput. Linguistics Chin. Lang. Process., 2006
Modeling Cantonese Pronunciation Variations for Large-Vocabulary Continuous Speech Recognition.
Int. J. Comput. Linguistics Chin. Lang. Process., 2006
Speaker Verification Using Complementary Information from Vocal Source and Vocal Tract.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006
Integrating Complementary Features with a Confidence Measure for Speaker Identification.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Feature Extraction From Talking Mouths for Video-Based Bi-Modal Speaker Verification.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
2004
ACM Trans. Asian Lang. Inf. Process., 2004
On noise robustness of dynamic and static features for continuous Cantonese digit recognition.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004
Detection of language boundary in code-switching utterances by bi-phone probabilities.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004
Noise-robust automatic speech recognition using mainlobe-resilient time-frequency quantile-based noise estimation.
Proceedings of the 2004 International Symposium on Circuits and Systems, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
2003
Proceedings of the 2003 International Symposium on Circuits and Systems, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
2002
ACM Trans. Asian Lang. Inf. Process., 2002
A new approach to generating Pitch Cycle Waveform (PCW) for Waveform Interpolation codec.
Microprocess. Microsystems, 2002
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
2001
Design, Compilation and Processing of CUCall: A Set of Cantonese Spoken Language Corpora Collected Over Telephone Networks.
Proceedings of the 14th Conference on Computational Linguistics and Speech Processing, 2001
Proceedings of the Advances in Multimedia Information Processing, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
2000
Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000
Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Incorporating tone information into Cantonese large-vocabulary continuous speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Lexical tree decoding with a class-based language model for Chinese speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Acoustic modeling for Chinese speech recognition: a comparative study of Mandarin and Cantonese.
Proceedings of the IEEE International Conference on Acoustics, 2000
1999
IEEE Trans. Speech Audio Process., 1999
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
Two-dimensional multi-resolution analysis of speech signals and its application to speech recognition.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999
1998
Pattern Recognit., 1998
Proceedings of the 1998 International Symposium on Chinese Spoken Language Processing, 1998
Proceedings of the 1998 International Symposium on Chinese Spoken Language Processing, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
1997
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997
1996
Automatic recognition of isolated Cantonese syllables using neural networks =: 利用神經網絡識別粤語單音節.
PhD thesis, 1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
1995
IEEE Trans. Speech Audio Process., 1995
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995
Proceedings of the 1995 International Conference on Acoustics, 1995
1992