Liunian Harold Li

According to our database1, Liunian Harold Li authored at least 23 papers between 2019 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Matryoshka Query Transformer for Large Vision-Language Models.
CoRR, 2024

Tailoring Self-Rationalizers with Multi-Reward Distillation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
DesCo: Learning Object Recognition with Rich Language Descriptions.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On the Paradox of Learning to Reason from Data.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

MetaVL: Transferring In-Context Learning Ability From Language Models to Vision-Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

Symbolic Chain-of-Thought Distillation: Small Models Can Also "Think" Step-by-Step.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
DisinfoMeme: A Multimodal Dataset for Detecting Meme Intentionally Spreading Out Disinformation.
CoRR, 2022

GLIPv2: Unifying Localization and Vision-Language Understanding.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

BERTHop: An Effective Vision-and-Language Model for Chest X-ray Disease Diagnosis.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

How Much Can CLIP Benefit Vision-and-Language Tasks?
Proceedings of the Tenth International Conference on Learning Representations, 2022

GeoMLAMA: Geo-Diverse Commonsense Probing on Multilingual Pre-Trained Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

RegionCLIP: Region-based Language-Image Pretraining.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Grounded Language-Image Pre-training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Unsupervised Vision-and-Language Pre-training Without Parallel Images and Captions.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

BERTHop: An Effective Vision-and-Language Model for Chest X-ray Disease Diagnosis.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Weakly-supervised VisualBERT: Pre-training without Parallel Images and Captions.
CoRR, 2020

What Does BERT with Vision Look At?
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Efficient Contextual Representation Learning With Continuous Outputs.
Trans. Assoc. Comput. Linguistics, 2019

VisualBERT: A Simple and Performant Baseline for Vision and Language.
CoRR, 2019

Efficient Contextual Representation Learning Without Softmax Layer.
CoRR, 2019


  Loading...