Heqing Zou

Orcid: 0000-0003-0038-2822

According to our database1, Heqing Zou authored at least 15 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long Video Understanding.
CoRR, 2024

Text-based Talking Video Editing with Cascaded Conditional Diffusion.
CoRR, 2024

Cross-Modality and Within-Modality Regularization for Audio-Visual Deepfake Detection.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Towards Balanced Active Learning for Multimodal Classification.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Unsupervised Noise Adaptation Using Data Simulation.
Proceedings of the IEEE International Conference on Acoustics, 2023

UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Leveraging Modality-Specific Representations for Audio-Visual Speech Recognition via Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Interactive Audio-text Representation for Automated Audio Captioning with Contrastive Learning.
CoRR, 2022

Interactive Auido-text Representation for Automated Audio Captioning with Contrastive Learning.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Speech Emotion Recognition with Co-Attention Based Multi-Level Acoustic Information.
Proceedings of the IEEE International Conference on Acoustics, 2022

Self-Critical Sequence Training for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

2020
Edge-Gan: Edge Conditioned Multi-View Face Image Generation.
Proceedings of the IEEE International Conference on Image Processing, 2020


  Loading...