2024
Exploiting Instance-level Relationships in Weakly Supervised Text-to-Video Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., October, 2024

T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs.
CoRR, 2024

MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs.
CoRR, 2024

I-AM-G: Interest Augmented Multimodal Generator for Item Personalization.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
Woodpecker: Hallucination Correction for Multimodal Large Language Models.
CoRR, 2023

A Survey on Multimodal Large Language Models.
CoRR, 2023

AU-aware graph convolutional network for Macro- and Micro-expression spotting.
CoRR, 2023

AU-aware graph convolutional network for Macroand Micro-expression spotting.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

2022
Fine-grained Micro-Expression Generation based on Thin-Plate Spline and Relative AU Constraint.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022