Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation.
CoRR, February, 2025
Zero-resource Speech Translation and Recognition with LLMs.
CoRR, 2024
Speech Retrieval-Augmented Generation without Automatic Speech Recognition.
CoRR, 2024
SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models.
CoRR, 2024
PEAVS: Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers' Opinion Scores.
CoRR, 2024
Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers' Opinion Scores.
Proceedings of the Computer Vision - ECCV 2024, 2024
SpeechGuard: Exploring the Adversarial Robustness of Multi-modal Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Utility-Preserving Privacy-Enabled Speech Embeddings for Emotion Detection.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023