2025

Phone-purity Guided Discrete Tokens for Dysarthric Speech Recognition.

Huimeng Wang

Xurong Xie

CoRR, January, 2025

Emotionally Challenging Games Can Satisfy Older Adults' Psychological Needs: From Empirical Study to Design Guidelines.

Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems, 2025

"Imitating at the Last Minute": Exploring Approaches to Enhance Public Speaking Performance in Limited Time.

Proceedings of the Extended Abstracts of the CHI Conference on Human Factors in Computing Systems, 2025

2024

Survey of neurocognitive disorder detection methods based on speech, visual, and virtual reality technologies.

Virtual Real. Intell. Hardw., 2024

Self-Supervised ASR Models and Features for Dysarthric and Elderly Speech Recognition.

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition.

CoRR, 2024

Structured Dialogue System for Mental Health: An LLM Chatbot Leveraging the PM+ Guidelines.

CoRR, 2024

Homogeneous Speaker Features for On-the-Fly Dysarthric and Elderly Speaker Adaptation.

CoRR, 2024

Towards Effective and Efficient Non-autoregressive Decoding Using Block-based Attention Mask.

CoRR, 2024

Alzheimer's Disease Detection Based on Large Language Model Prompt Engineering.

Proceedings of the Social Robotics - 16th International Conference, 2024

Structured Dialogue System for Mental Health: An LLM Chatbot Leveraging the PM+ Guidelines.

Proceedings of the Social Robotics - 16th International Conference, 2024

Investigation of Cross Modality Feature Fusion for Audio-Visual Dysarthric Speech Assessment.

Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

Towards Effective and Efficient Non-autoregressive Decoding Using Block-based Attention Mask.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Perceiver-Prompt: Flexible Speaker Adaptation in Whisper for Chinese Disordered Speech Recognition.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Towards Automatic Data Augmentation for Disordered Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

Towards High-Performance and Low-Latency Feature-Based Speaker Adaptation of Conformer Speech Recognition Systems.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

2023

Probing Lexical Ambiguity in Chinese Characters via Their Word Formations: Convergence of Perceived and Computed Metrics.

Cogn. Sci., November, 2023

Confidence Score Based Speaker Adaptation of Conformer Speech Recognition Systems.

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Exploiting Cross-Domain And Cross-Lingual Ultrasound Tongue Imaging Features For Elderly And Dysarthric Speech Recognition.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

On-the-Fly Feature Based Rapid Speaker Adaptation for Dysarthric and Elderly Speech Recognition.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Use of Speech Impairment Severity for Dysarthric Speech Recognition.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Unsupervised Model-Based Speaker Adaptation of End-To-End Lattice-Free MMI Model for Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Adversarial Data Augmentation Using VAE-GAN for Disordered Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Exploring Self-Supervised Pre-Trained ASR Models for Dysarthric and Elderly Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

ChallengeDetect: Investigating the Potential of Detecting In-Game Challenge Experience from Physiological Measures.

Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

2022

Neural Architecture Search for LF-MMI Trained Time Delay Neural Networks.

IEEE ACM Trans. Audio Speech Lang. Process., 2022

Speaker Adaptation Using Spectro-Temporal Deep Features for Dysarthric and Elderly Speech Recognition.

IEEE ACM Trans. Audio Speech Lang. Process., 2022

Exploiting Cross-domain And Cross-Lingual Ultrasound Tongue Imaging Features For Elderly And Dysarthric Speech Recognition.

CoRR, 2022

On-the-fly Feature Based Speaker Adaptation for Dysarthric and Elderly Speech Recognition.

CoRR, 2022

Investigation of Deep Neural Network Acoustic Modelling Approaches for Low Resource Accented Mandarin Speech Recognition.

CoRR, 2022

A Multi-level Acoustic Feature Extraction Framework for Transformer Based End-to-End Speech Recognition.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Confidence Score Based Conformer Speaker Adaptation for Speech Recognition.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Two-pass Decoding and Cross-adaptation Based System Combination of End-to-end Conformer and Hybrid TDNN ASR Systems.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Exploiting Cross Domain Acoustic-to-Articulatory Inverted Features for Disordered Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

Detecting challenge from physiological signals: A primary study with a typical game scenario.

Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

2021

Bayesian Learning for Deep Neural Network Adaptation.

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Recent Progress in the CUHK Dysarthric Speech Recognition System.

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for Speech Recognition.

IEEE ACM Trans. Audio Speech Lang. Process., 2021

RE-RLTuner: A topic-based music generation method.

Proceedings of the IEEE International Conference on Real-time Computing and Robotics, 2021

Variational Auto-Encoder Based Variability Encoding for Dysarthric Speech Recognition.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Adversarial Data Augmentation for Disordered Speech Recognition.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Bayesian Parametric and Architectural Domain Adaptation of LF-MMI Trained TDNNs for Elderly and Dysarthric Speech Recognition.

Jiajun Deng

Fabian Ritter Gutierrez

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Development of the CUHK Elderly Speech Recognition System for Neurocognitive Disorder Detection Using the DementiaBank Corpus.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

Neural Architecture Search for LF-MMI Trained Time Delay Neural Networks.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

2020

Neural Architecture Search for Speech Recognition.

CoRR, 2020

Exploiting Cross-Domain Visual Feature Generation for Disordered Speech Recognition.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Investigation of Data Augmentation Techniques for Disordered Speech Recognition.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019

Fast DNN Acoustic Model Speaker Adaptation by Learning Hidden Unit Contribution Features.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

LF-MMI Training of Bayesian and Gaussian Process Time Delay Neural Networks for Speech Recognition.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

BLHUC: Bayesian Learning of Hidden Unit Contributions for Deep Neural Network Speaker Adaptation.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2019

Bayesian and Gaussian Process Neural Networks for Large Vocabulary Continuous Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2019

2018

Investigation of Stacked Deep Neural Networks and Mixture Density Networks for Acoustic-to-Articulatory Inversion.

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Gaussian Process Neural Networks for Speech Recognition.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017

RNN-LDA Clustering for Feature Based DNN Adaptation.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016

Deep Neural Network Based Acoustic-to-Articulatory Inversion Using Phone Sequence Information.

Xurong Xie

Xunying Liu

Lan Wang

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015

Generalized variable parameter HMMs based acoustic-to-articulatory inversion.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Efficient use of DNN bottleneck features in generalized variable parameter HMMs for noise robust speech recognition.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014

Deep neural network bottleneck features for generalized variable parameter HMMs.

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014