2024
VLDadaptor: Domain Adaptive Object Detection With Vision-Language Model Distillation.
IEEE Trans. Multim., 2024

ProFPN: Progressive feature pyramid network with soft proposal assignment for object detection.
Knowl. Based Syst., 2024

Weighted parallel decoupled feature pyramid network for object detection.
Neurocomputing, 2024

Cropper: Vision-Language Model for Image Cropping through In-Context Learning.
CoRR, 2024

Optical Diffusion Models for Image Generation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling.
Proceedings of the Computer Vision - ECCV 2024, 2024

Parrot: Pareto-Optimal Multi-reward Reinforcement Learning Framework for Text-to-Image Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Rich Human Feedback for Text-to-Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Forward-Forward Training of an Optical Neural Network.
CoRR, 2023

MRET: Multi-resolution Transformer for Video Quality Assessment.
CoRR, 2023

VILA: Learning Image Aesthetics from User Comments with Vision-Language Pretraining.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Revisiting the Efficiency of UGC Video Quality Assessment.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Identifying Document Images with Glare Using Global and Localized Feature Fusion.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

2021
Multi-path Neural Networks for On-device Multi-domain Visual Classification.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

MUSIQ: Multi-scale Image Quality Transformer.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Rich Features for Perceptual Quality Assessment of UGC Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Adversarially Adaptive Normalization for Single Domain Generalization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021