Jiasen Lu

According to our database, Jiasen Lu authored at least 34 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
MM-Ego: Towards Building Egocentric Multimodal LLMs.
CoRR, 2024

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models.
CoRR, 2024

SoupLM: Model Integration in Large Language and Multi-Modal Models.
CoRR, 2024

Preserving Identity with Variational Score for General-purpose 3D Editing.
CoRR, 2024

Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Cross-Domain Graph Convolutions for Adversarial Unsupervised Domain Adaptation.
IEEE Trans. Neural Networks Learn. Syst., August, 2023

UNIFIED-IO: A Unified Model for Vision, Language, and Multi-modal Tasks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
ASC me to Do Anything: Multi-task Training for Embodied AI.
CoRR, 2022

MERLOT RESERVE: Neural Script Knowledge through Vision and Language and Sound.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Multi-Modal Answer Validation for Knowledge-Based VQA.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
A Simple Long-Tailed Recognition Baseline via Vision-Language Model.
CoRR, 2021

Container: Context Aggregation Network.
CoRR, 2021

Container: Context Aggregation Networks.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Transferable Feature Learning on Graphs Across Visual Domains.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

2020
Visually Grounded Language Understanding and Generation.
PhD thesis, 2020

Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Spatially Aware Multimodal Transformers for TextVQA.
Proceedings of the Computer Vision - ECCV 2020, 2020

12-in-1: Multi-Task Vision and Language Representation Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Emergence of Compositional Language with Deep Generational Transmission.
CoRR, 2019

ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Self-Monitoring Navigation Agent via Auxiliary Progress Estimation.
Proceedings of the 7th International Conference on Learning Representations, 2019

2018
Graph R-CNN for Scene Graph Generation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Neural Baby Talk.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Visual Curiosity: Learning to Ask Questions to Learn Visual Recognition.
Proceedings of the 2nd Annual Conference on Robot Learning, 2018

2017
VQA: Visual Question Answering - www.visualqa.org.
Int. J. Comput. Vis., 2017

Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

ParlAI: A Dialog Research Software Platform.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Hierarchical Question-Image Co-Attention for Visual Question Answering.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

2015
VQA: Visual Question Answering.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Human action segmentation with hierarchical supervoxel consistency.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2012
Palmprint and Face Multi-Modal Biometric Recognition Based on SDA-GSVD and Its Kernelization.
Sensors, 2012

2011
Supervised local sparsity preserving projection for face feature extraction.
Proceedings of the First Asian Conference on Pattern Recognition, 2011

