Disentangling Instruction Influence in Diffusion Transformers for Parallel Multi-Instruction-Guided Image Editing.
CoRR, April, 2025
Test-time Adaptation for Foundation Medical Segmentation Model without Parametric Updates.
CoRR, April, 2025
Enhancing Zero-Shot Image Recognition in Vision-Language Models through Human-like Concept Guidance.
CoRR, March, 2025
Q-PART: Quasi-Periodic Adaptive Regression with Test-time Training for Pediatric Left Ventricular Ejection Fraction Regression.
CoRR, March, 2025
Robust Domain Misinformation Detection via Multi-Modal Feature Alignment.
IEEE Trans. Inf. Forensics Secur., 2024
Large Language Models for Lossless Image Compression: Next-Pixel Prediction in Language Space is All You Need.
CoRR, 2024
Unraveling the Mechanics of Learning-Based Demonstration Selection for In-Context Learning.
CoRR, 2024
TELLER: A Trustworthy Framework for Explainable, Generalizable and Controllable Fake News Detection.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2024, 2024
Interpretable Multimodal Misinformation Detection with Logic Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Towards Multi-Modal Sarcasm Detection via Hierarchical Congruity Modeling with Knowledge Enhancement.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022