Game-MUG: Multimodal Oriented Game Situation Understanding and Commentary Generation Dataset.
CoRR, 2024
MCAD: Multi-teacher Cross-modal Alignment Distillation for efficient image-text retrieval.
Findings of the Association for Computational Linguistics: NAACL 2024, 2024