Chenda Li

Orcid: 0000-0003-0299-9914

According to our database1, Chenda Li authored at least 33 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Unified Cross-Modal Attention: Robust Audio-Visual Speech Recognition and Beyond.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Diffusion-based Generative Modeling with Discriminative Guidance for Streamable Speech Enhancement.
CoRR, 2024

URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement.
CoRR, 2024

Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement.
CoRR, 2024

SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition.
CoRR, 2024

2023
Software Design and User Interface of ESPnet-SE++: Speech Enhancement for Robust Speech Processing.
J. Open Source Softw., November, 2023

Software Design and User Interface of ESPnet-SE++: Speech Enhancement for Robust Speech Processing (espnet-v.202310).
Dataset, October, 2023

Overlap Aware Continuous Speech Separation without Permutation Invariant Training.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Adapting Multi-Lingual ASR Models for Handling Multiple Talkers.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Light-Weight Visualvoice: Neural Network Quantization On Audio Visual Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Predictive Skim: Contrastive Predictive Coding for Low-Latency Online Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Target Sound Extraction with Variable Cross-Modality Clues.
Proceedings of the IEEE International Conference on Acoustics, 2023

Robust Audio-Visual ASR with Unified Cross-Modal Attention.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Dual-Path Modeling With Memory Embedding Model for Continuous Speech Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Improving Speech Separation with Knowledge Distilled from Self-supervised Pre-trained Models.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Time-Domain Audio-Visual Speech Separation on Low Quality Videos.
Proceedings of the IEEE International Conference on Acoustics, 2022

The Sjtu System For Multimodal Information Based Speech Processing Challenge 2021.
Proceedings of the IEEE International Conference on Acoustics, 2022

Towards Low-Distortion Multi-Channel Speech Enhancement: The ESPNET-Se Submission to the L3DAS22 Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2022

Skim: Skipping Memory Lstm for Low-Latency Real-Time Continuous Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Dual-Path RNN for Long Recording Speech Separation.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Audio-Visual Multi-Talker Speech Recognition in a Cocktail Party.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Continuous Speech Separation Using Speaker Inventory for Long Recording.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Rethinking The Separation Layers In Speech Separation Networks.
Proceedings of the IEEE International Conference on Acoustics, 2021

Dual-Path Modeling for Long Recording Speech Separation in Meetings.
Proceedings of the IEEE International Conference on Acoustics, 2021

Recent Developments on Espnet Toolkit Boosted By Conformer.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans.
CoRR, 2020

Continuous Speech Separation Using Speaker Inventory for Long Multi-talker Recording.
CoRR, 2020

Listen, Watch and Understand at the Cocktail Party: Audio-Visual-Contextual Speech Separation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Deep Audio-Visual Speech Separation with Attention Mechanism.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Prosody Usage Optimization for Children Speech Recognition with Zero Resource Children Speech.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019


  Loading...