2025
Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation.
CoRR, February, 2025

2024
From an Image to a Scene: Learning to Imagine the World from a Million 360 Videos.
CoRR, 2024

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models.
CoRR, 2024

From an Image to a Scene: Learning to Imagine the World from a Million 360° Videos.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

2023
Objaverse-XL: A Universe of 10M+ 3D Objects.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Objaverse: A Universe of Annotated 3D Objects.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Phone2Proc: Bringing Robust Robots into Our Chaotic World.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Phone2Proc: Bringing Robust Robots Into Our Chaotic World.
CoRR, 2022

Retrospectives on the Embodied AI Workshop.
CoRR, 2022

ProcTHOR: Large-Scale Embodied AI Using Procedural Generation.
CoRR, 2022

🏘️ ProcTHOR: Large-Scale Embodied AI Using Procedural Generation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Visual Room Rearrangement.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
RoboTHOR: An Open Simulation-to-Real Embodied AI Platform.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020