Kai Yu
Orcid: 0000-0002-7102-9826Affiliations:
- Shanghai Jiao Tong University, Computer Science and Engineering Department, China
- Cambridge University, Engineering Department, UK (PhD 2006)
According to our database1,
Kai Yu
authored at least 323 papers
between 2004 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
Converging to a Lingua Franca: Evolution of Linguistic Regions and Semantics Alignment in Multilingual Large Language Models.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
Proceedings of the 31st International Conference on Computational Linguistics, 2025
Proceedings of the 31st International Conference on Computational Linguistics, 2025
2024
Beyond the Status Quo: A Contemporary Survey of Advances and Challenges in Audio Captioning.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Unsupervised Speech Enhancement Using Optimal Transport and Speech Presence Probability.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
AdaEAGLE: Optimizing Speculative Decoding via Explicit Modeling of Adaptive Draft Structures.
CoRR, 2024
Neural Directed Speech Enhancement with Dual Microphone Array in High Noise Scenario.
CoRR, 2024
Why Do Speech Language Models Fail to Generate Semantically Coherent Outputs? A Modality Evolving Perspective.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity.
CoRR, 2024
CoRR, 2024
SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs.
CoRR, 2024
CoRR, 2024
DiveSound: LLM-Assisted Automatic Taxonomy Construction for Diverse Audio Generation.
CoRR, 2024
Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
CoRR, 2024
GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement.
CoRR, 2024
The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge.
CoRR, 2024
Rejection Improves Reliability: Training LLMs to Refuse Unknown Questions Using RL from Knowledge Feedback.
CoRR, 2024
Is Cognition and Action Consistent or Not: Investigating Large Language Model's Personality.
CoRR, 2024
VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech.
CoRR, 2024
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
A Birgat Model for Multi-Intent Spoken Language Understanding with Hierarchical Semantic Frames.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations.
Proceedings of the IEEE International Conference on Acoustics, 2024
DiffDub: Person-Generic Visual Dubbing Using Inpainting Renderer with Diffusion Auto-Encoder.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
AlignSum: Data Pyramid Hierarchical Fine-tuning for Aligning with Human Summarization Preference.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Is LLM a Reliable Reviewer? A Comprehensive Evaluation of LLM on Automatic Paper Reviewing Tasks.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Multilingual Brain Surgeon: Large Language Models Can Be Compressed Leaving No Language behind.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script Generation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
SciEval: A Multi-Level Large Language Model Evaluation Benchmark for Scientific Research.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023
Speech Enhancement With Integration of Neural Homomorphic Synthesis and Spectral Masking.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
IEEE ACM Trans. Audio Speech Lang. Process., 2023
OPAL: Ontology-Aware Pretrained Language Model for End-to-End Task-Oriented Dialogue.
Trans. Assoc. Comput. Linguistics, 2023
CoRR, 2023
Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation.
CoRR, 2023
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
ReCLR: Reference-Enhanced Contrastive Learning of Audio Representation for Depression Detection.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Improving Code-Switching and Name Entity Recognition in ASR with Speech Editing based Data Augmentation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Investigating Pooling Strategies and Loss Functions for Weakly-Supervised Text-to-Audio Grounding via Contrastive Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Multi-Speaker End-to-End Multi-Modal Speaker Diarization System for the MISP 2022 Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the The 61st Annual Meeting of the Association for Computational Linguistics: Industry Track, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023
2022
Phone-Level Prosody Modelling With GMM-Based MDN for Diverse and Controllable Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Data augmentation based non-parallel voice conversion with frame-level speaker disentangler.
Speech Commun., 2022
Proceedings of the Seventh Conference on Machine Translation, 2022
Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2022
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Proceedings of the 19th International Conference on Spoken Language Translation, 2022
The X-Lance Speaker Diarization System for the Conversational Short-phrase Speaker Diarization Challenge 2022.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
2021
Voice Activity Detection in the Wild: A Data-Driven Approach Using Teacher-Student Training.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Diverse and Controllable Speech Synthesis with GMM-Based Phone-Level Prosody Modelling.
CoRR, 2021
CoRR, 2021
Proceedings of the Natural Language Processing and Chinese Computing, 2021
Proceedings of the Natural Language Processing and Chinese Computing, 2021
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
LGESQL: Line Graph Enhanced Text-to-SQL Model with Mixed Local and Non-Local Relations.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
LET: Linguistic Knowledge Enhanced Graph Transformer for Chinese Short Text Matching.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
Prior Knowledge Driven Label Embedding for Slot Filling in Natural Language Understanding.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Neural Network Language Model Compression With Product Quantization and Soft Binarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Data Augmentation Using Deep Generative Models for Embedding Based Speaker Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Modular End-to-End Automatic Speech Recognition Framework for Acoustic-to-Word Model.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Distributed Structured Actor-Critic Reinforcement Learning for Universal Dialogue Management.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Nat. Mach. Intell., 2020
Vector Projection Network for Few-shot Slot Tagging in Natural Language Understanding.
CoRR, 2020
GPVAD: Towards noise robust voice activity detection via weakly supervised sound event detection.
CoRR, 2020
An Investigation on Different Underlying Quantization Schemes for Pre-trained Language Models.
Proceedings of the Natural Language Processing and Chinese Computing, 2020
Proceedings of the Natural Language Processing and Chinese Computing, 2020
Proceedings of the Natural Language Processing and Chinese Computing, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Jointly Encoding Word Confusion Network and Dialogue Context with BERT for Spoken Language Understanding.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 40th IEEE International Conference on Distributed Computing Systems, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Addressing the Polysemy Problem in Language Modeling with Attentional Multi-Sense Embeddings.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Channel Invariant Speaker Embedding Learning with Joint Multi-Task and Adversarial Training.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Efficient Context and Schema Fusion Networks for Multi-Domain Dialogue State Tracking.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Semi-Supervised Text Simplification with Back-Translation and Asymmetric Denoising Autoencoders.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Schema-Guided Multi-Domain Dialogue State Tracking with Graph Attention Neural Networks.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Discriminative Neural Embedding Learning for Short-Duration Text-Independent Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2019
AgentGraph: Toward Universal Dialogue Management With Structured Deep Reinforcement Learning.
IEEE ACM Trans. Audio Speech Lang. Process., 2019
AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning.
CoRR, 2019
Proceedings of the Natural Language Processing and Chinese Computing, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Data Augmentation Using Variational Autoencoder for Embedding Based Speaker Verification.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
On the Usage of Phonetic Information for Text-Independent Speaker Embedding Extraction.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the International Conference on Multimodal Interaction, 2019
Proceedings of the International Conference on Multimodal Interaction, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Proceedings of the Blizzard Challenge 2019, Vienna, Austria, September 23, 2019, 2019
Highly Efficient Neural Network Language Model Compression Using Soft Binarization Training.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
2018
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Adaptive Very Deep Convolutional Residual Network for Noise Robust Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Investigating Raw Wave Deep Neural Networks for End-to-End Speaker Spoofing Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Speech Commun., 2018
Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue, 2018
Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue, 2018
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
Generative Adversarial Networks based X-vector Augmentation for Robust Probabilistic Linear Discriminant Analysis in Speaker Verification.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Proceedings of the Intelligence Science and Big Data Engineering, 2018
Proceedings of the Intelligence Science and Big Data Engineering, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 2nd International Conference on Video and Image Processing, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Focal Kl-Divergence Based Dilated Convolutional Neural Networks for Co-Channel Speaker Identification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Semi-Supervised Training Using Adversarial Multi-Task Learning for Spoken Language Understanding.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
Proceedings of the 27th International Conference on Computational Linguistics, 2018
2017
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Proceedings of the Intelligence Science and Big Data Engineering, 2017
Proceedings of the Intelligence Science and Big Data Engineering, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017
Encoder-decoder with focus-mechanism for sequence labelling based spoken language understanding.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
2016
IEEE ACM Trans. Audio Speech Lang. Process., 2016
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Frontiers Comput. Sci., 2016
Proceedings of the 12th NTCIR Conference on Evaluation of Information Access Technologies, 2016
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
On training bi-directional neural network language model with noise contrastive estimation.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Discriminatively trained joint speaker and environment representations for adaptation of deep neural network acoustic models.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Recurrent Polynomial Network for Dialogue State Tracking with Mismatched Semantic Parsers.
Proceedings of the SIGDIAL 2015 Conference, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Robust deep feature for spoofing detection - the SJTU system for ASVspoof 2015 challenge.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
An investigation of context clustering for statistical speech synthesis with deep neural network.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Automatic model redundancy reduction for fast back-propagation for deep neural networks in speech recognition.
Proceedings of the 2015 International Joint Conference on Neural Networks, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Recurrent neural network language model with structured word embeddings for speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Local trajectory based speech enhancement for robust speech recognition with deep neural network.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015
An investigation on DNN-derived bottleneck features for GMM-HMM based robust speech recognition.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
2014
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Proceedings of the SIGDIAL 2014 Conference, 2014
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2013
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
2012
Introduction to the Issue on Advances in Spoken Dialogue Systems and Mobile Interface.
IEEE J. Sel. Top. Signal Process., 2012
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Proceedings of the SIGDIAL 2012 Conference, 2012
Proceedings of the International Conference on Multimodal Interaction, 2012
Proceedings of the International Conference on Multimodal Interaction, 2012
2011
IEEE Trans. Speech Audio Process., 2011
Context adaptive training with factorized decision trees for HMM-based statistical parametric speech synthesis.
Speech Commun., 2011
Proceedings of the SIGDIAL 2011 Conference, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
On-line policy optimisation of spoken dialogue systems via live interaction with human subjects.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011
2010
Speech Commun., 2010
The Hidden Information State model: A practical framework for POMDP-based spoken dialogue management.
Comput. Speech Lang., 2010
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010
Proceedings of the SIGDIAL 2010 Conference, 2010
Proceedings of the SIGDIAL 2010 Conference, 2010
Context adaptive training with factorized decision trees for HMM-based speech synthesis.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Natural belief-critic: a reinforcement algorithm for parameter estimation in statistical spoken dialogue systems.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Phrase-Based Statistical Language Generation Using Graphical Models and Active Learning.
Proceedings of the ACL 2010, 2010
2009
IEEE Trans. Speech Audio Process., 2009
Proceedings of the SIGDIAL 2009 Conference, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Spoken language understanding from unaligned data using discriminative classification models.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
2008
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008
Proceedings of the SIGDIAL 2008 Workshop, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
2007
IEEE Trans. Speech Audio Process., 2007
Unsupervised training with directed manual transcription for recognising Mandarin broadcast audio.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Discriminative language model adaptation for Mandarin broadcast speech transcription and translation.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Development of the CUHTK 2004 Mandarin Conversational Telephone Speech Transcription System.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
2004
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
Generating and evaluating segmentations for automatic speech recognition of conversational telephone speech.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004