Non-autoregressive Sequence-to-Sequence Vision-Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Enhancing Vision-Language Pre-Training with Rich Supervisions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Musketeer (All for One, and One for All): A Generalist Vision-Language Model with Task Explanation Prompts.
CoRR, 2023
Learning Instance Occlusion for Panoptic Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020