2025

ProtoCLIP: Prototypical Contrastive Language Image Pretraining.

[DOI]

,

,

,

,

,

,

IEEE Trans. Neural Networks Learn. Syst., January, 2025

High-Dimension Human Value Representation in Large Language Models.

[DOI]

Samuel Cahyawijaya

,

,

,

Leila Khalatbari

,

,

,

,

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Linguistic Minimal Pairs Elicit Linguistic Similarity in Large Language Models.

[DOI]

,

,

Samuel Cahyawijaya

,

,

Zhenguang G. Cai

Proceedings of the 31st International Conference on Computational Linguistics, 2025

Making Large Vision Language Models to Be Good Few-Shot Learners.

[DOI]

,

,

,

,

,

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Few-Shot Classification Model Compression via School Learning.

[DOI]

,

,

,

,

IEEE Trans. Circuits Syst. Video Technol., December, 2024

Few-shot adaptation of multi-modal foundation models: a survey.

[DOI]

,

,

,

,

,

,

Artif. Intell. Rev., October, 2024

Few-shot classification guided by generalization error bound.

[DOI]

,

,

,

,

Pattern Recognit., January, 2024

RemoteCLIP: A Vision Language Foundation Model for Remote Sensing.

[DOI]

,

,

Zhangqingyun Guan

,

,

,

,

,

IEEE Trans. Geosci. Remote. Sens., 2024

Prompting DirectSAM for Semantic Contour Extraction in Remote Sensing Images.

[DOI]

,

,

,

,

,

,

CoRR, 2024

Making Large Vision Language Models to be Good Few-shot Learners.

[DOI]

,

,

,

,

,

CoRR, 2024

LLM Internal States Reveal Hallucination Risk Faced With a Query.

[DOI]

,

,

,

Samuel Cahyawijaya

,

,

,

CoRR, 2024

The Pyramid of Captions.

[DOI]

,

Samuel Cahyawijaya

,

,

,

,

CoRR, 2024

Subobject-level Image Tokenization.

[DOI]

,

Samuel Cahyawijaya

,

,

,

CoRR, 2024

Few-shot Adaptation of Multi-modal Foundation Models: A Survey.

[DOI]

,

,

,

,

,

CoRR, 2024

Measuring Political Bias in Large Language Models: What Is Said and How It Is Said.

[DOI]

,

,

,

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Visual Instruction Tuning with Polite Flamingo.

[DOI]

,

,

,

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

MEP-3M: A large-scale multi-modal E-commerce product dataset.

[DOI]

,

,

,

,

Pattern Recognit., August, 2023

Asymmetric exponential loss function for crack segmentation.

[DOI]

,

,

,

,

Multim. Syst., April, 2023

Deep learning based single sample face recognition: a survey.

[DOI]

,

,

,

,

Artif. Intell. Rev., March, 2023

RemoteCLIP: A Vision Language Foundation Model for Remote Sensing.

[DOI]

,

,

Zhangqingyun Guan

,

,

,

CoRR, 2023

Taming Diffusion Models for Music-driven Conducting Motion Generation.

[DOI]

,

,

,

,

CoRR, 2023

Few-shot Classification via Ensemble Learning with Multi-Order Statistics.

[DOI]

,

,

,

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Towards Joint Modeling of Dialogue Response and Speech Synthesis based on Large Language Model.

[DOI]

,

,

Proceedings of the 6th International Conference on Natural Language and Speech Processing (ICNLSP 2023), 2023

Single Sample Face Recognition Based on Identity-Attribute Disentanglement and Adversarial Feature Augmentation.

[DOI]

,

,

,

,

Proceedings of the Biometric Recognition - 17th Chinese Conference, 2023

2022

Self-Supervised Music Motion Synchronization Learning for Music-Driven Conducting Motion Generation.

[DOI]

,

,

,

,

J. Comput. Sci. Technol., 2022

A review of driver fatigue detection and its advances on the use of RGB-D camera and deep learning.

[DOI]

,

,

,

Eng. Appl. Artif. Intell., 2022

Prototypical Contrastive Language Image Pretraining.

[DOI]

,

,

,

,

,

,

CoRR, 2022

A Simple Baseline for Adversarial Domain Adaptation-based Unsupervised Flood Forecasting.

[DOI]

,

,

,

CoRR, 2022

Knowledge Graph Based Chicken Disease Diagnosis Question Answering System.

[DOI]

,

,

,

,

,

,

,

Proceedings of the Data Mining and Big Data - 7th International Conference, 2022

MDF-Net: Multimodal Deep Fusion for Large-Scale Product Recognition.

[DOI]

,

,

,

,

,

Proceedings of the Biometric Recognition - 16th Chinese Conference, 2022

2021

VirtualConductor: Music-driven Conducting Video Generation System.

[DOI]

,

,

,

CoRR, 2021

Significant Wave Height Prediction based on Wavelet Graph Neural Network.

[DOI]

,

,

,

,

CoRR, 2021

2020

Deep Learning Based Single Sample Per Person Face Recognition: A Survey.

[DOI]

,

,

CoRR, 2020

A Review of Automatically Diagnosing COVID-19 based on Scanning Image.

[DOI]

,

,

CoRR, 2020

A Review of Automated Diagnosis of COVID-19 Based on Scanning Images.

[DOI]

,

,

,

,

Proceedings of the ICRAI 2020: 6th International Conference on Robotics and Artificial Intelligence, 2020