2025
ProtoCLIP: Prototypical Contrastive Language Image Pretraining.
IEEE Trans. Neural Networks Learn. Syst., January, 2025
High-Dimension Human Value Representation in Large Language Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025
Linguistic Minimal Pairs Elicit Linguistic Similarity in Large Language Models.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
Making Large Vision Language Models to Be Good Few-Shot Learners.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
Few-Shot Classification Model Compression via School Learning.
IEEE Trans. Circuits Syst. Video Technol., December, 2024
Few-shot adaptation of multi-modal foundation models: a survey.
Artif. Intell. Rev., October, 2024
Few-shot classification guided by generalization error bound.
Pattern Recognit., January, 2024
RemoteCLIP: A Vision Language Foundation Model for Remote Sensing.
IEEE Trans. Geosci. Remote. Sens., 2024
Prompting DirectSAM for Semantic Contour Extraction in Remote Sensing Images.
CoRR, 2024
Making Large Vision Language Models to be Good Few-shot Learners.
CoRR, 2024
LLM Internal States Reveal Hallucination Risk Faced With a Query.
CoRR, 2024
Subobject-level Image Tokenization.
CoRR, 2024
Few-shot Adaptation of Multi-modal Foundation Models: A Survey.
CoRR, 2024
Measuring Political Bias in Large Language Models: What Is Said and How It Is Said.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Visual Instruction Tuning with Polite Flamingo.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
MEP-3M: A large-scale multi-modal E-commerce product dataset.
Pattern Recognit., August, 2023
Asymmetric exponential loss function for crack segmentation.
Multim. Syst., April, 2023
Deep learning based single sample face recognition: a survey.
Artif. Intell. Rev., March, 2023
RemoteCLIP: A Vision Language Foundation Model for Remote Sensing.
CoRR, 2023
Taming Diffusion Models for Music-driven Conducting Motion Generation.
CoRR, 2023
Few-shot Classification via Ensemble Learning with Multi-Order Statistics.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
Towards Joint Modeling of Dialogue Response and Speech Synthesis based on Large Language Model.
Proceedings of the 6th International Conference on Natural Language and Speech Processing (ICNLSP 2023), 2023
Single Sample Face Recognition Based on Identity-Attribute Disentanglement and Adversarial Feature Augmentation.
Proceedings of the Biometric Recognition - 17th Chinese Conference, 2023
2022
Self-Supervised Music Motion Synchronization Learning for Music-Driven Conducting Motion Generation.
J. Comput. Sci. Technol., 2022
A review of driver fatigue detection and its advances on the use of RGB-D camera and deep learning.
Eng. Appl. Artif. Intell., 2022
Prototypical Contrastive Language Image Pretraining.
CoRR, 2022
A Simple Baseline for Adversarial Domain Adaptation-based Unsupervised Flood Forecasting.
CoRR, 2022
Knowledge Graph Based Chicken Disease Diagnosis Question Answering System.
Proceedings of the Data Mining and Big Data - 7th International Conference, 2022
MDF-Net: Multimodal Deep Fusion for Large-Scale Product Recognition.
Proceedings of the Biometric Recognition - 16th Chinese Conference, 2022
2021
VirtualConductor: Music-driven Conducting Video Generation System.
CoRR, 2021
Significant Wave Height Prediction based on Wavelet Graph Neural Network.
CoRR, 2021
2020
Deep Learning Based Single Sample Per Person Face Recognition: A Survey.
CoRR, 2020
A Review of Automatically Diagnosing COVID-19 based on Scanning Image.
CoRR, 2020
A Review of Automated Diagnosis of COVID-19 Based on Scanning Images.
Proceedings of the ICRAI 2020: 6th International Conference on Robotics and Artificial Intelligence, 2020