2025
Optimizing Singular Spectrum for Large Language Model Compression.
CoRR, February, 2025

HiMix: Reducing Computational Complexity in Large Vision-Language Models.
CoRR, January, 2025

2024
Manga Generation via Layout-controllable Diffusion.
CoRR, 2024

LinVT: Empower Your Image-level Large Language Model to Understand Videos.
CoRR, 2024

TASR: Timestep-Aware Diffusion Model for Image Super-Resolution.
CoRR, 2024

RFSR: Improving ISR Diffusion Models via Reward Feedback Learning.
CoRR, 2024

2023
Zero-Shot Semantic Segmentation with Decoupled One-Pass Network.
CoRR, 2023

Open-Vocabulary Semantic Segmentation with Decoupled One-Pass Network.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
DiP: Learning Discriminative Implicit Parts for Person Re-Identification.
CoRR, 2022

SoccerNet 2022 Challenges Results.
Proceedings of the MMSports@MM 2022: Proceedings of the 5th International ACM Workshop on Multimedia Content Analysis in Sports, 2022

2021
Video Temporal Relationship Mining for Data-Efficient Person Re-identification.
CoRR, 2021