2025
Controllable Human-centric Keyframe Interpolation with Generative Prior.
CoRR, June, 2025
OpenUni: A Simple Baseline for Unified Multimodal Understanding and Generation.
CoRR, May, 2025
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation.
Int. J. Comput. Vis., April, 2025
Harmonizing Visual Representations for Unified Multimodal Understanding and Generation.
CoRR, March, 2025
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency.
CoRR, March, 2025
Contextual Object Detection with Multimodal Large Language Models.
Int. J. Comput. Vis., February, 2025
O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning.
CoRR, January, 2025
LLaVA-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
F-LMM: Grounding Frozen Large Multimodal Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
DoF-Gaussian: Controllable Depth-of-Field for 3D Gaussian Splatting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
WildAvatar: Learning In-the-wild 3D Avatars from the Web.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Visual Evidence Prompting Mitigates Hallucinations in Large Vision-Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
2024
UniGrad-FS: Unified Gradient Projection With Flatter Sharpness for Continual Learning.
IEEE Trans. Ind. Informatics, December, 2024
Efficient Masked Autoencoders With Self-Consistency.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024
Learning to cluster person via graph convolution networks for video-based person re-identification.
Concurr. Comput. Pract. Exp., August, 2024
A Three-Coil Wear Particle Sensor Based on Radial Magnetic Field.
IEEE Trans. Instrum. Meas., 2024
Investigation of the Effect of Debris Position on the Detection Stability of a Magnetic Plug Sensor Based on Alternating Current Bridge.
Sensors, 2024
Video Instruction Tuning With Synthetic Data.
CoRR, 2024
LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models.
CoRR, 2024
WildAvatar: Web-scale In-the-wild Video Dataset for 3D Avatar Creation.
CoRR, 2024
StyleMaster: Towards Flexible Stylized Image Generation with Diffusion Models.
CoRR, 2024
Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo.
CoRR, 2024
Make Continual Learning Stronger via C-Flat.
CoRR, 2024
OMG-Seg: Is One Model Good Enough For All Segmentation?
CoRR, 2024
A Generalist FaceX via Learning Unified Facial Representation.
CoRR, 2024
Generalizable Implicit Motion Modeling for Video Frame Interpolation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Make Continual Learning Stronger via C-Flat.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Interpretable Composition Attribution Enhancement for Visio-linguistic Compositional Understanding.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo.
Proceedings of the Computer Vision - ECCV 2024, 2024
Face-Adapter for Pre-trained Diffusion Models with Fine-Grained ID and Attribute Control.
Proceedings of the Computer Vision - ECCV 2024, 2024
2023
Dynamic Condition Adversarial Adaptation for Fault Diagnosis of Wind Turbine Gearbox.
Sensors, December, 2023
Bi-Level Implicit Semantic Data Augmentation for Vehicle Re-Identification.
IEEE Trans. Intell. Transp. Syst., April, 2023
A Deep Learning System to Predict Recurrence and Disability Outcomes in Patients with Transient Ischemic Attack or Ischemic Stroke.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Adv. Intell. Syst., April, 2023
A Method for Distinguishing Mixed Metal Abrasives Based on Frequency Conversion.
IEEE Trans. Instrum. Meas., 2023
Flexible Nonlinear Error Correction Method Based on Support Vector Regression in Fringe Projection Profilometry.
IEEE Trans. Instrum. Meas., 2023
Efficient Masked Autoencoders with Self-Consistency.
CoRR, 2023
Research on Intelligent Vehicle Traffic Scenes Mapping Algorithm Considering Dynamic Objects.
Proceedings of the 26th IEEE International Conference on Intelligent Transportation Systems, 2023
Masked Frequency Modeling for Self-Supervised Visual Pre-Training.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Balancing Logit Variation for Long-Tailed Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Correlational Image Modeling for Self-Supervised Visual Pre-Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
SeqCo-DETR: Sequence Consistency Training for Self-Supervised Object Detection with Transformers.
Proceedings of the 34th British Machine Vision Conference 2023, 2023
Exploring Stochastic Autoregressive Image Modeling for Visual Representation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
An Oil Multipollutant Detection Sensor With High Sensitivity and High Throughput.
IEEE Trans. Instrum. Meas., 2022
Capacitive-Inductive Magnetic Plug Sensor With High Adaptability for Online Debris Monitoring.
,
,
,
,
,
,
,
,
,
,
,
,
IEEE Trans. Instrum. Meas., 2022
An Ultrasensitive Microsensor Based on Impedance Analysis for Oil Condition Monitoring.
IEEE Trans. Ind. Electron., 2022
Research on High Sensitivity Oil Debris Detection Sensor Using High Magnetic Permeability Material and Coil Mutual Inductance.
Sensors, 2022
Analysis of the Effect of Velocity on the Eddy Current Effect of Metal Particles of Different Materials in Inductive Bridges.
,
,
,
,
,
,
,
,
,
,
Sensors, 2022
Beyond Classifiers: Remote Sensing Change Detection with Metric Learning.
Remote. Sens., 2022
MIAD: A Maintenance Inspection Dataset for Unsupervised Anomaly Detection.
CoRR, 2022
Unified Vision and Language Prompt Learning.
CoRR, 2022
Toward Real-world Single Image Deraining: A New Benchmark and Beyond.
CoRR, 2022
Learning from Future: A Novel Self-Training Framework for Semantic Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks.
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Open-Vocabulary DETR with Conditional Matching.
Proceedings of the Computer Vision - ECCV 2022, 2022
Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
UniVIP: A Unified Framework for Self-Supervised Visual Pre-training.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
2021
An Ultrasensitive Debris Microsensor for Oil Health Monitoring Based on Resistance-Inductance Parameter.
IEEE Trans. Instrum. Meas., 2021
FastFlow: Unsupervised Anomaly Detection and Localization via 2D Normalizing Flows.
CoRR, 2021
MST: Masked Self-Supervised Transformer for Visual Representation.
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
An On-Chip Inductive-Capacitive Sensor for the Detection of Wear Debris and Air Bubbles in Hydraulic Oil.
Proceedings of the 16th IEEE International Conference on Nano/Micro Engineered and Molecular Systems, 2021
CodeVIO: Visual-Inertial Odometry with Learned Optimizable Dense Depth.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021