2025

DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation.

[DOI]

Weijie He

Mushui Liu

Yunlong Yu

Zhao Wang

Chao Wu

CoRR, April, 2025

CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation.

[DOI]

CoRR, March, 2025

RectifiedHR: Enable Efficient High-Resolution Image Generation via Energy Rectification.

[DOI]

CoRR, March, 2025

MINT: Multi-modal Chain of Thought in Unified Generative Models for Enhanced Image Generation.

[DOI]

CoRR, March, 2025

Synth-CLIP: Synthetic data make CLIP generalize better in data-limited scenarios.

[DOI]

Neural Networks, 2025

Envisioning Class Entity Reasoning by Large Language Models for Few-shot Learning.

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation.

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Frame Order Matters: A Temporal Sequence-Aware Model for Few-Shot Action Recognition.

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis.

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Tolerant Self-Distillation for image classification.

[DOI]

Neural Networks, 2024

Similar norm more transferable: Rethinking feature norms discrepancy in adversarial domain adaptation.

[DOI]

Knowl. Based Syst., 2024

RestorerID: Towards Tuning-Free Face Restoration with ID Preservation.

[DOI]

CoRR, 2024

Hybrid Mask Generation for Infrared Small Target Detection with Single-Point Supervision.

[DOI]

CoRR, 2024

Fully Fine-tuned CLIP Models are Efficient Few-Shot Learners.

[DOI]

Mushui Liu

Bozheng Li

Yunlong Yu

CoRR, 2024

LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation.

[DOI]

CoRR, 2024

CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation.

[DOI]

CoRR, 2024

HOGDA: Boosting Semi-supervised Graph Domain Adaptation via High-Order Structure-Guided Adaptive Feature Alignment.

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Improving Zero-Shot Generalization for CLIP with Variational Adapter.

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

OmniCLIP: Adapting CLIP for Video Recognition with Spatial-Temporal Omni-Scale Feature Learning.

[DOI]

Mushui Liu

Bozheng Li

Yunlong Yu

Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

2023

Lightweight MIMO-WNet for single image deblurring.

[DOI]

Neurocomputing, 2023

SYNC-CLIP: Synthetic Data Make CLIP Generalize Better in Data-Limited Scenarios.

[DOI]

CoRR, 2023