2025
Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation.
CoRR, February, 2025

2024
Zero-resource Speech Translation and Recognition with LLMs.
CoRR, 2024

Speech Retrieval-Augmented Generation without Automatic Speech Recognition.
CoRR, 2024

SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models.
CoRR, 2024

SpeechVerse: A Large-scale Generalizable Audio Language Model.
CoRR, 2024

PEAVS: Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers' Opinion Scores.
CoRR, 2024

Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers' Opinion Scores.
Proceedings of the Computer Vision - ECCV 2024, 2024

SpeechGuard: Exploring the Adversarial Robustness of Multi-modal Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Utility-Preserving Privacy-Enabled Speech Embeddings for Emotion Detection.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023