2025

Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation.

Mahnaz Koupaee

Jake W. Vincent

CoRR, February, 2025

2024

Zero-resource Speech Translation and Recognition with LLMs.

Veera Raghavendra Elluru

CoRR, 2024

Speech Retrieval-Augmented Generation without Automatic Speech Recognition.

Do June Min

Karel Mundnich

Andy Lapastora

Erfan Soltanmohammadi

Srikanth Ronanki

Kyu J. Han

CoRR, 2024

SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models.

Raghuveer Peri

Sai Muralidhar Jayanthi

Srikanth Vishnubhotla

Daniel Garcia-Romero

Sundararajan Srinivasan

Kyu J. Han

Katrin Kirchhoff

CoRR, 2024

SpeechVerse: A Large-scale Generalizable Audio Language Model.

Sai Muralidhar Jayanthi

Xilai Li

Karel Mundnich

Monica Sunkara

Sundararajan Srinivasan

Kyu J. Han

Katrin Kirchhoff

CoRR, 2024

PEAVS: Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers' Opinion Scores.

Lucas Goncalves

Prashant Mathur

Chandrashekhar Lavania

Metehan Cekic

Marcello Federico

Kyu J. Han

CoRR, 2024

Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers' Opinion Scores.

Lucas Goncalves

Prashant Mathur

Chandrashekhar Lavania

Metehan Cekic

Marcello Federico

Kyu J. Han

Proceedings of the Computer Vision - ECCV 2024, 2024

SpeechGuard: Exploring the Adversarial Robustness of Multi-modal Large Language Models.

Raghuveer Peri

Sai Muralidhar Jayanthi

Srikanth Vishnubhotla

Daniel Garcia-Romero

Sundararajan Srinivasan

Kyu J. Han

Katrin Kirchhoff

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

Utility-Preserving Privacy-Enabled Speech Embeddings for Emotion Detection.

Chandrashekhar Lavania

Sanjiv Das

Xin Huang

Kyu J. Han

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023