Chen Chen
Orcid: 0000-0003-4181-9285Affiliations:
- Nanyang Technological University, School of Computer Science and Engineering, Singapore
According to our database1,
Chen Chen
authored at least 43 papers
between 2021 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on linkedin.com
-
on orcid.org
On csauthors.net:
Bibliography
2024
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models.
CoRR, 2024
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators.
CoRR, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Cross-Modality and Within-Modality Regularization for Audio-Visual Deepfake Detection.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Overcoming Catastrophic Forgetting by Exemplar Selection in Task-oriented Dialogue System.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
Generative error correction for code-switching speech recognition using large language models.
CoRR, 2023
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Study of Generative Adversarial Networks for Noisy Speech Simulation from Clean Speech.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Leveraging Modality-Specific Representations for Audio-Visual Speech Recognition via Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
Interactive Audio-text Representation for Automated Audio Captioning with Contrastive Learning.
CoRR, 2022
DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Interactive Auido-text Representation for Automated Audio Captioning with Contrastive Learning.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021