2025
MolGround: A Benchmark for Molecular Grounding.
CoRR, March, 2025

Mean of Means: Human Localization with Calibration-free and Unconstrained Camera Settings (extended version).
CoRR, February, 2025

Removal of Hallucination on Hallucination: Debate-Augmented RAG.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
PolySmart @ TRECVid 2024 Video-To-Text.
CoRR, 2024

Mean of Means: A 10-dollar Solution for Human Localization with Calibration-free and Unconstrained Camera Settings.
CoRR, 2024

A Survey on Personalized Content Synthesis with Diffusion Models.
CoRR, 2024

A Picture Is Worth a Graph: Blueprint Debate on Graph for Multimodal Reasoning.
CoRR, 2024

GraphATC: advancing multilevel and multi-label anatomical therapeutic chemical classification via atom-level graph learning.
Briefings Bioinform., 2024

A Picture Is Worth a Graph: A Blueprint Debate Paradigm for Multimodal Reasoning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Generative Active Learning for Image Synthesis Personalization.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024