Chen Chen

Orcid: 0000-0003-4181-9285

Affiliations:

Nanyang Technological University, School of Computer Science and Engineering, Singapore

According to our database¹, Chen Chen authored at least 43 papers between 2021 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2024

Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR.

IEEE ACM Trans. Audio Speech Lang. Process., 2024

An Investigation on the Potential of KAN in Speech Enhancement.

CoRR, 2024

Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition.

CoRR, 2024

Robust Zero-Shot Text-to-Speech Synthesis with Reverse Inference Optimization.

CoRR, 2024

Enhancing Zero-shot Text-to-Speech Synthesis with Human Feedback.

CoRR, 2024

Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models.

Chao-Han Huck Yang

CoRR, 2024

Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models.

CoRR, 2024

GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators.

Chao-Han Huck Yang

CoRR, 2024

Large Language Models are Efficient Learners of Noise-Robust Speech Recognition.

Chao-Han Huck Yang

Proceedings of the Twelfth International Conference on Learning Representations, 2024

It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition.

Sabato Marco Siniscalchi, Chao-Han Huck Yang

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Cross-Modality and Within-Modality Regularization for Audio-Visual Deepfake Detection.

Proceedings of the IEEE International Conference on Acoustics, 2024

Noise-Aware Speech Separation with Contrastive Learning.

Proceedings of the IEEE International Conference on Acoustics, 2024

In-Context Learning with Iterative Demonstration Selection.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators.

Chao-Han Huck Yang

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Overcoming Catastrophic Forgetting by Exemplar Selection in Task-oriented Dialogue System.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

Generative error correction for code-switching speech recognition using large language models.

Chao-Han Huck Yang, Sabato Marco Siniscalchi

CoRR, 2023

Noise-aware Speech Enhancement using Diffusion Probabilistic Model.

CoRR, 2023

A Neural State-Space Model Approach to Efficient Speech Separation.

Chao-Han Huck Yang

CoRR, 2023

Study of GANs for Noisy Speech Simulation from Clean Speech.

Leander Melroy Maben, Utkarsh Chudiwal

CoRR, 2023

Noise-aware Speech Separation with Contrastive Learning.

CoRR, 2023

HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models.

Chao-Han Huck Yang, Sabato Marco Siniscalchi

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

A Neural State-Space Modeling Approach to Efficient Speech Separation.

Chao-Han Huck Yang

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition.

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation.

Proceedings of the IEEE International Conference on Acoustics, 2023

Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, 2023

Unsupervised Noise Adaptation Using Data Simulation.

Proceedings of the IEEE International Conference on Acoustics, 2023

Metric-Oriented Speech Enhancement Using Diffusion Probabilistic Model.

Proceedings of the IEEE International Conference on Acoustics, 2023

Lifelong Sequence Generation with Dynamic Module Expansion and Adaptation.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Study of Generative Adversarial Networks for Noisy Speech Simulation from Clean Speech.

Leander Melroy Maben, Utkarsh Chudiwal

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning.

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Leveraging Modality-Specific Representations for Audio-Visual Speech Recognition via Reinforcement Learning.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Interactive Audio-text Representation for Automated Audio Captioning with Contrastive Learning.

CoRR, 2022

DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Interactive Audio-text Representation for Automated Audio Captioning with Contrastive Learning.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Speech Emotion Recognition with Co-Attention Based Multi-Level Acoustic Information.

Proceedings of the IEEE International Conference on Acoustics, 2022

Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, 2022

Noise-Robust Speech Recognition With 10 Minutes Unparalleled In-Domain Data.

Shashank Shirol

Proceedings of the IEEE International Conference on Acoustics, 2022

Self-Critical Sequence Training for Automatic Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Time Domain Speech Enhancement With Attentive Multi-scale Approach.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
