2025
Semi-Supervised Audio-Visual Video Action Recognition with Audio Source Localization Guided Mixup.
CoRR, March, 2025

2023
Sound of Story: Multi-modal Storytelling with Audio.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023