2025
Removal of Hallucination on Hallucination: Debate-Augmented RAG.
CoRR, May, 2025

MolGround: A Benchmark for Molecular Grounding.
CoRR, March, 2025

Mean of Means: Human Localization with Calibration-free and Unconstrained Camera Settings (extended version).
CoRR, February, 2025

2024
PolySmart @ TRECVid 2024 Video-To-Text.
CoRR, 2024

Mean of Means: A 10-dollar Solution for Human Localization with Calibration-free and Unconstrained Camera Settings.
CoRR, 2024

A Survey on Personalized Content Synthesis with Diffusion Models.
CoRR, 2024

A Picture Is Worth a Graph: Blueprint Debate on Graph for Multimodal Reasoning.
CoRR, 2024

A Picture Is Worth a Graph: A Blueprint Debate Paradigm for Multimodal Reasoning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Generative Active Learning for Image Synthesis Personalization.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024