2025
LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy.
CoRR, 2024

ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval.
CoRR, 2024

Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application.
CoRR, 2024

Knowledge Condensation and Reasoning for Knowledge-based VQA.
CoRR, 2024

Spatiotemporal Fine-grained Video Description for Short Videos.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Spatiotemporal Graph Guided Multi-modal Network for Livestreaming Product Retrieval.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Learning Multi-Dimensional Human Preference for Text-to-Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
KwaiYiiMath: Technical Report.
CoRR, 2023

Cross-view Semantic Alignment for Livestreaming Product Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Cross-Domain Product Representation Learning for Rich-Content E-Commerce.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2020
TEA: Temporal Excitation and Aggregation for Action Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Mixed Supervised Object Detection with Robust Objectness Transfer.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Transductive Zero-Shot Learning with Visual Structure Constraint.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Transductive Zero-Shot Learning via Visual Center Adaptation.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Discriminative Learning of Latent Features for Zero-Shot Recognition.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Deep Semantic Structural Constraints for Zero-Shot Learning.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018