2025
PreMind: Multi-Agent Video Understanding for Advanced Indexing of Presentation-style Videos.
CoRR, March, 2025
2024
Special issue on integrated sensing and communications (ISAC).
J. Commun. Networks, 2024
A Boosting-Type Convergence Result for AdaBoost.MH with Factorized Multi-Class Classifiers.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Sequential Kernel Goodness-of-fit Testing.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
DRF: Improving Certified Robustness via Distributional Robustness Framework.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Sample Complexity for Distributionally Robust Learning under chi-square divergence.
J. Mach. Learn. Res., 2023
EyeClick: A Robust Two-Step Eye-Hand Interaction for Text Entry in Augmented Reality Glasses.
Adjunct Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology, 2023
DelucionQA: Detecting Hallucinations in Domain-specific Question Answering.
Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Knowledge-Grounded Natural Language Recommendation Explanation.
Proceedings of the 6th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP, 2023
CoAug: Combining Augmentation of Labels and Labelling Rules.
Findings of the Association for Computational Linguistics: ACL 2023, 2023
2021
Using Paralinguistic Information to Disambiguate User Intentions for Distinguishing Phrase Structure and Sarcasm in Spoken Dialog Systems.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
2019
A Neural Network Based Ranking Framework to Improve ASR with NLU Related Knowledge Deployed.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2019
2018
HDAA: High-Speed Data Acquisition Algorithm of IoT.
Proceedings of the Cognitive Systems and Signal Processing - 4th International Conference, 2018
2016
Sign Transition Modeling and a Scalable Solution to Continuous Sign Language Recognition for Real-World Applications.
ACM Trans. Access. Comput., 2016
2010
Pseudo-Conventional N-Gram Representation of the Discriminative N-Gram Model for LVCSR.
IEEE J. Sel. Top. Signal Process., 2010
2008
Recasting the discriminative n-gram model as a pseudo-conventional n-gram model for LVCSR.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2008
2007
Complementarity and redundancy in multimodal user inputs with speech and pen gestures.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
2006
A multi-pass error detection and correction framework for Mandarin LVCSR.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
A Comparative Study of Discriminative Methods for Reranking LVCSR N-Best Hypotheses in Domain Adaptation and Generalization.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2004
Error identification for large vocabulary speech recognition.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004
A two-level schema for detecting recognition errors.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
2002
Improving language modeling by combining heterogeneous corpora.
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002