2025
Phone-purity Guided Discrete Tokens for Dysarthric Speech Recognition.
CoRR, January, 2025
Emotionally Challenging Games Can Satisfy Older Adults' Psychological Needs: From Empirical Study to Design Guidelines.
Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems, 2025
"Imitating at the Last Minute": Exploring Approaches to Enhance Public Speaking Performance in Limited Time.
Proceedings of the Extended Abstracts of the CHI Conference on Human Factors in Computing Systems, 2025
2024
Survey of neurocognitive disorder detection methods based on speech, visual, and virtual reality technologies.
Virtual Real. Intell. Hardw., 2024
Self-Supervised ASR Models and Features for Dysarthric and Elderly Speech Recognition.
,
,
,
,
,
,
,
,
,
,
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition.
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Structured Dialogue System for Mental Health: An LLM Chatbot Leveraging the PM+ Guidelines.
CoRR, 2024
Homogeneous Speaker Features for On-the-Fly Dysarthric and Elderly Speaker Adaptation.
CoRR, 2024
Towards Effective and Efficient Non-autoregressive Decoding Using Block-based Attention Mask.
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Alzheimer's Disease Detection Based on Large Language Model Prompt Engineering.
Proceedings of the Social Robotics - 16th International Conference, 2024
Structured Dialogue System for Mental Health: An LLM Chatbot Leveraging the PM<sup>+</sup> Guidelines.
Proceedings of the Social Robotics - 16th International Conference, 2024
Investigation of Cross Modality Feature Fusion for Audio-Visual Dysarthric Speech Assessment.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024
Towards Effective and Efficient Non-autoregressive Decoding Using Block-based Attention Mask.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition.
,
,
,
,
,
,
,
,
,
,
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
Perceiver-Prompt: Flexible Speaker Adaptation in Whisper for Chinese Disordered Speech Recognition.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
Towards Automatic Data Augmentation for Disordered Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024
Towards High-Performance and Low-Latency Feature-Based Speaker Adaptation of Conformer Speech Recognition Systems.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Probing Lexical Ambiguity in Chinese Characters via Their Word Formations: Convergence of Perceived and Computed Metrics.
Cogn. Sci., November, 2023
Confidence Score Based Speaker Adaptation of Conformer Speech Recognition Systems.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Exploiting Cross-Domain And Cross-Lingual Ultrasound Tongue Imaging Features For Elderly And Dysarthric Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
On-the-Fly Feature Based Rapid Speaker Adaptation for Dysarthric and Elderly Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Use of Speech Impairment Severity for Dysarthric Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Unsupervised Model-Based Speaker Adaptation of End-To-End Lattice-Free MMI Model for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023
Adversarial Data Augmentation Using VAE-GAN for Disordered Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023
Exploring Self-Supervised Pre-Trained ASR Models for Dysarthric and Elderly Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023
ChallengeDetect: Investigating the Potential of Detecting In-Game Challenge Experience from Physiological Measures.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023
2022
Neural Architecture Search for LF-MMI Trained Time Delay Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Speaker Adaptation Using Spectro-Temporal Deep Features for Dysarthric and Elderly Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Exploiting Cross-domain And Cross-Lingual Ultrasound Tongue Imaging Features For Elderly And Dysarthric Speech Recognition.
CoRR, 2022
On-the-fly Feature Based Speaker Adaptation for Dysarthric and Elderly Speech Recognition.
CoRR, 2022
Investigation of Deep Neural Network Acoustic Modelling Approaches for Low Resource Accented Mandarin Speech Recognition.
CoRR, 2022
A Multi-level Acoustic Feature Extraction Framework for Transformer Based End-to-End Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Confidence Score Based Conformer Speaker Adaptation for Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Two-pass Decoding and Cross-adaptation Based System Combination of End-to-end Conformer and Hybrid TDNN ASR Systems.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Exploiting Cross Domain Acoustic-to-Articulatory Inverted Features for Disordered Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022
Detecting challenge from physiological signals: A primary study with a typical game scenario.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022
2021
Bayesian Learning for Deep Neural Network Adaptation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Recent Progress in the CUHK Dysarthric Speech Recognition System.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
RE-RLTuner: A topic-based music generation method.
Proceedings of the IEEE International Conference on Real-time Computing and Robotics, 2021
Variational Auto-Encoder Based Variability Encoding for Dysarthric Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Adversarial Data Augmentation for Disordered Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Bayesian Parametric and Architectural Domain Adaptation of LF-MMI Trained TDNNs for Elderly and Dysarthric Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Development of the Cuhk Elderly Speech Recognition System for Neurocognitive Disorder Detection Using the Dementiabank Corpus.
,
,
,
,
,
,
,
,
,
,
Proceedings of the IEEE International Conference on Acoustics, 2021
Neural Architecture Search for LF-MMI Trained Time Delay Neural Networks.
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
Neural Architecture Search for Speech Recognition.
CoRR, 2020
Exploiting Cross-Domain Visual Feature Generation for Disordered Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Investigation of Data Augmentation Techniques for Disordered Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
2019
Fast DNN Acoustic Model Speaker Adaptation by Learning Hidden Unit Contribution Features.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
LF-MMI Training of Bayesian and Gaussian Process Time Delay Neural Networks for Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
BLHUC: Bayesian Learning of Hidden Unit Contributions for Deep Neural Network Speaker Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2019
Bayesian and Gaussian Process Neural Networks for Large Vocabulary Continuous Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019
2018
Investigation of Stacked Deep Neural Networks and Mixture Density Networks for Acoustic-to-Articulatory Inversion.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Gaussian Process Neural Networks for Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
2017
RNN-LDA Clustering for Feature Based DNN Adaptation.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
2016
Deep Neural Network Based Acoustic-to-Articulatory Inversion Using Phone Sequence Information.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
2015
Generalized variable parameter HMMs based acoustic-to-articulatory inversion.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Efficient use of DNN bottleneck features in generalized variable parameter HMMs for noise robust speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
2014
Deep neural network bottleneck features for generalized variable parameter HMMs.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014