2025
Controllable Human-centric Keyframe Interpolation with Generative Prior.
CoRR, June, 2025

OpenUni: A Simple Baseline for Unified Multimodal Understanding and Generation.
CoRR, May, 2025

MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation.
Int. J. Comput. Vis., April, 2025

Harmonizing Visual Representations for Unified Multimodal Understanding and Generation.
CoRR, March, 2025

Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency.
CoRR, March, 2025

Contextual Object Detection with Multimodal Large Language Models.
Int. J. Comput. Vis., February, 2025

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning.
CoRR, January, 2025

LLaVA-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

F-LMM: Grounding Frozen Large Multimodal Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

DoF-Gaussian: Controllable Depth-of-Field for 3D Gaussian Splatting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

WildAvatar: Learning In-the-wild 3D Avatars from the Web.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Visual Evidence Prompting Mitigates Hallucinations in Large Vision-Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
UniGrad-FS: Unified Gradient Projection With Flatter Sharpness for Continual Learning.
IEEE Trans. Ind. Informatics, December, 2024

Efficient Masked Autoencoders With Self-Consistency.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Learning to cluster person via graph convolution networks for video-based person re-identification.
Concurr. Comput. Pract. Exp., August, 2024

A Three-Coil Wear Particle Sensor Based on Radial Magnetic Field.
IEEE Trans. Instrum. Meas., 2024

Investigation of the Effect of Debris Position on the Detection Stability of a Magnetic Plug Sensor Based on Alternating Current Bridge.
Sensors, 2024

Video Instruction Tuning With Synthetic Data.
CoRR, 2024

LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models.
CoRR, 2024

WildAvatar: Web-scale In-the-wild Video Dataset for 3D Avatar Creation.
CoRR, 2024

StyleMaster: Towards Flexible Stylized Image Generation with Diffusion Models.
CoRR, 2024

Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo.
CoRR, 2024

Make Continual Learning Stronger via C-Flat.
CoRR, 2024

OMG-Seg: Is One Model Good Enough For All Segmentation?
CoRR, 2024

A Generalist FaceX via Learning Unified Facial Representation.
CoRR, 2024

Generalizable Implicit Motion Modeling for Video Frame Interpolation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Make Continual Learning Stronger via C-Flat.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Interpretable Composition Attribution Enhancement for Visio-linguistic Compositional Understanding.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo.
Proceedings of the Computer Vision - ECCV 2024, 2024

Face-Adapter for Pre-trained Diffusion Models with Fine-Grained ID and Attribute Control.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Dynamic Condition Adversarial Adaptation for Fault Diagnosis of Wind Turbine Gearbox.
Sensors, December, 2023

Bi-Level Implicit Semantic Data Augmentation for Vehicle Re-Identification.
IEEE Trans. Intell. Transp. Syst., April, 2023

A Deep Learning System to Predict Recurrence and Disability Outcomes in Patients with Transient Ischemic Attack or Ischemic Stroke.
Adv. Intell. Syst., April, 2023

A Method for Distinguishing Mixed Metal Abrasives Based on Frequency Conversion.
IEEE Trans. Instrum. Meas., 2023

Flexible Nonlinear Error Correction Method Based on Support Vector Regression in Fringe Projection Profilometry.
IEEE Trans. Instrum. Meas., 2023

Efficient Masked Autoencoders with Self-Consistency.
CoRR, 2023

Research on Intelligent Vehicle Traffic Scenes Mapping Algorithm Considering Dynamic Objects.
Proceedings of the 26th IEEE International Conference on Intelligent Transportation Systems, 2023

Masked Frequency Modeling for Self-Supervised Visual Pre-Training.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Balancing Logit Variation for Long-Tailed Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Correlational Image Modeling for Self-Supervised Visual Pre-Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Siamese DETR.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SeqCo-DETR: Sequence Consistency Training for Self-Supervised Object Detection with Transformers.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

Exploring Stochastic Autoregressive Image Modeling for Visual Representation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
An Oil Multipollutant Detection Sensor With High Sensitivity and High Throughput.
IEEE Trans. Instrum. Meas., 2022

Capacitive-Inductive Magnetic Plug Sensor With High Adaptability for Online Debris Monitoring.
IEEE Trans. Instrum. Meas., 2022

An Ultrasensitive Microsensor Based on Impedance Analysis for Oil Condition Monitoring.
IEEE Trans. Ind. Electron., 2022

Research on High Sensitivity Oil Debris Detection Sensor Using High Magnetic Permeability Material and Coil Mutual Inductance.
Sensors, 2022

Analysis of the Effect of Velocity on the Eddy Current Effect of Metal Particles of Different Materials in Inductive Bridges.
Sensors, 2022

Beyond Classifiers: Remote Sensing Change Detection with Metric Learning.
Remote. Sens., 2022

MIAD: A Maintenance Inspection Dataset for Unsupervised Anomaly Detection.
CoRR, 2022

Unified Vision and Language Prompt Learning.
CoRR, 2022

Toward Real-world Single Image Deraining: A New Benchmark and Beyond.
CoRR, 2022

Learning from Future: A Novel Self-Training Framework for Semantic Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Open-Vocabulary DETR with Conditional Matching.
Proceedings of the Computer Vision - ECCV 2022, 2022

Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

UniVIP: A Unified Framework for Self-Supervised Visual Pre-training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
An Ultrasensitive Debris Microsensor for Oil Health Monitoring Based on Resistance-Inductance Parameter.
IEEE Trans. Instrum. Meas., 2021

FastFlow: Unsupervised Anomaly Detection and Localization via 2D Normalizing Flows.
CoRR, 2021

MST: Masked Self-Supervised Transformer for Visual Representation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

An On-Chip Inductive-Capacitive Sensor for the Detection of Wear Debris and Air Bubbles in Hydraulic Oil.
Proceedings of the 16th IEEE International Conference on Nano/Micro Engineered and Molecular Systems, 2021

CodeVIO: Visual-Inertial Odometry with Learned Optimizable Dense Depth.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021