Removal of Hallucination on Hallucination: Debate-Augmented RAG.
CoRR, May, 2025
MolGround: A Benchmark for Molecular Grounding.
CoRR, March, 2025
Mean of Means: Human Localization with Calibration-free and Unconstrained Camera Settings (extended version).
CoRR, February, 2025
PolySmart @ TRECVid 2024 Video-To-Text.
CoRR, 2024
Mean of Means: A 10-dollar Solution for Human Localization with Calibration-free and Unconstrained Camera Settings.
CoRR, 2024
A Survey on Personalized Content Synthesis with Diffusion Models.
CoRR, 2024
A Picture Is Worth a Graph: Blueprint Debate on Graph for Multimodal Reasoning.
CoRR, 2024
A Picture Is Worth a Graph: A Blueprint Debate Paradigm for Multimodal Reasoning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Generative Active Learning for Image Synthesis Personalization.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024