Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Mastering Robot Manipulation with Multimodal Prompts through Pretraining and Multi-task Fine-tuning.

[BibT_eX]

[DOI]

Jiachen Li

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens.

[BibT_eX]

[DOI]

Kaizhi Zheng

Xuehai He

Xin Eric Wang

CoRR, 2023

Discriminative Diffusion Models as Few-shot Vision and Language Learners.

[BibT_eX]

[DOI]

CoRR, 2023

LayoutGPT: Compositional Visual Planning and Generation with Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Multimodal Graph Transformer for Multimodal Question Answering.

[BibT_eX]

[DOI]

Xuehai He

Xin Eric Wang

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Parameter-Efficient Model Adaptation for Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

ComCLIP: Training-Free Compositional Image and Text Matching.

[BibT_eX]

[DOI]

CoRR, 2022

JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents.

[BibT_eX]

[DOI]

CoRR, 2022

Parameter-efficient Fine-tuning for Vision Transformers.

[BibT_eX]

[DOI]

CoRR, 2022

CPL: Counterfactual Prompt Learning for Vision and Language Models.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021

On the Generation of Medical Dialogs for COVID-19.

[BibT_eX]

[DOI]

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Towards Visual Question Answering on Pathology Images.

[BibT_eX]

[DOI]

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

Pathological Visual Question Answering.

[BibT_eX]

[DOI]

CoRR, 2020