Sungwoong Kim

CoRR, June, 2025

Can Video LLMs Refuse to Answer? Alignment for Answerability in Video Large Language Models.

[DOI]

Mark A. Hasegawa-Johnson

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Physics Informed Distillation for Diffusion Models.

[DOI]

Trans. Mach. Learn. Res., 2024

LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition.

[DOI]

John B. Harvill

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion.

[DOI]

Joshua Tian Jin Tee

Mark A. Hasegawa-Johnson

Yingzhen Li

Proceedings of the Twelfth International Conference on Learning Representations, 2024

AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation.

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback.

[DOI]

Sungwoong Kim

Chang Dong Yoo

Proceedings of the Findings of the Association for Computational Linguistics, 2024

SimPSI: A Simple Strategy to Preserve Spectral Information in Time Series Data Augmentation.

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

One-Shot Exemplification Modeling via Latent Sense Representations.

[DOI]

John B. Harvill

Proceedings of the 8th Workshop on Representation Learning for NLP, 2023

Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction.

[DOI]

Chanwoo Kim

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

ESD: Expected Squared Difference as a Tuning-Free Trainable Calibration Measure.

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Counterfactual Two-Stage Debiasing For Video Corpus Moment Retrieval.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

HEAR: Hearing Enhanced Audio Response for Video-grounded Dialogue.

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition.

[DOI]

John B. Harvill

Chang Dong Yoo

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

Selective Query-guided Debiasing Network for Video Corpus Moment Retrieval.

[DOI]

CoRR, 2022

Information-Theoretic Text Hallucination Reduction for Video-grounded Dialogue.

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

SMSMix: Sense-Maintained Sentence Mixup for Word Sense Disambiguation.

[DOI]