Separate to Collaborate: Dual-Stream Diffusion Model for Coordinated Piano Hand Motion Synthesis.
CoRR, April, 2025
General retinal image enhancement via reconstruction: Bridging distribution shifts using latent diffusion adaptors.
Medical Image Anal., 2025
Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
AToM: Aligning Text-to-Motion Model at Event-Level with GPT-4Vision Reward.
CoRR, 2024
REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment.
CoRR, 2024
Absolute-Unified Multi-Class Anomaly Detection via Class-Agnostic Distribution Alignment.
CoRR, 2024
BATON: Aligning Text-to-Audio Model with Human Preference Feedback.
CoRR, 2024
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Consistent123: One Image to Highly Consistent 3D Asset Using Case-Aware Diffusion Priors.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
BATON: Aligning Text-to-Audio Model Using Human Preference Feedback.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
SSGD: A Smartphone Screen Glass Dataset for Defect Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023
Single-image dehazing using scene radiance constraint and color gradient guided filter.
Signal Image Video Process., 2022
Research on D&A System Based on K-means++ Clustering and Machine Learning.
Proceedings of the 5th IEEE International Conference on Information Systems and Computer Aided Education, 2022