2025
Separate to Collaborate: Dual-Stream Diffusion Model for Coordinated Piano Hand Motion Synthesis.
CoRR, April, 2025

General retinal image enhancement via reconstruction: Bridging distribution shifts using latent diffusion adaptors.
Medical Image Anal., 2025

Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
AToM: Aligning Text-to-Motion Model at Event-Level with GPT-4Vision Reward.
CoRR, 2024

REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment.
CoRR, 2024

Absolute-Unified Multi-Class Anomaly Detection via Class-Agnostic Distribution Alignment.
CoRR, 2024

BATON: Aligning Text-to-Audio Model with Human Preference Feedback.
CoRR, 2024

MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Consistent123: One Image to Highly Consistent 3D Asset Using Case-Aware Diffusion Priors.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

BATON: Aligning Text-to-Audio Model Using Human Preference Feedback.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

2023
SSGD: A Smartphone Screen Glass Dataset for Defect Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Single-image dehazing using scene radiance constraint and color gradient guided filter.
Signal Image Video Process., 2022

Research on D&A System Based on K-means++ Clustering and Machine Learning.
Proceedings of the 5th IEEE International Conference on Information Systems and Computer Aided Education, 2022