ROICtrl: Boosting Instance Control for Visual Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Universal Pyramid Adversarial Training for Improved ViT Performance.
CoRR, 2023
VoxelFormer: Bird's-Eye-View Feature Generation based on Dual-view Attention for Multi-view 3D Object Detection.
CoRR, 2023
A Unified Model for Tracking and Image-Video Detection Has More Power.
CoRR, 2022
Few-Shot Fast-Adaptive Anomaly Detection.
Advances in Neural Information Processing Systems 35 (NeurIPS 2022), 2022
Self-appearance-aided Differential Evolution for Motion Transfer.
CoRR, 2021
Joint Audio-Visual Deepfake Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Learning Beyond-pixel Mappings from Internet Videos.
PhD thesis, 2019
Dance Dance Generation: Motion Transfer for Internet Videos.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019
Image2GIF: Generating Cinemagraphs Using Recurrent Deep Q-Networks.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018
Visual to Sound: Generating Natural Sound for Videos in the Wild.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018
Learning Temporal Transformations from Time-Lapse Videos.
Computer Vision - ECCV 2016, 2016
Temporal Perception and Prediction in Ego-Centric Video.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015
A MAP-Estimation Framework for Blind Deblurring Using High-Level Edge Priors.
Computer Vision - ECCV 2014, 2014