Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation.
,
,
,
,
,
,
,
,
,
,
CoRR, February, 2025
From an Image to a Scene: Learning to Imagine the World from a Million 360 Videos.
CoRR, 2024
From an Image to a Scene: Learning to Imagine the World from a Million 360° Videos.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Objaverse-XL: A Universe of 10M+ 3D Objects.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Objaverse: A Universe of Annotated 3D Objects.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Phone2Proc: Bringing Robust Robots into Our Chaotic World.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Phone2Proc: Bringing Robust Robots Into Our Chaotic World.
CoRR, 2022
ProcTHOR: Large-Scale Embodied AI Using Procedural Generation.
,
,
,
,
,
,
,
,
,
,
CoRR, 2022
🏘️ ProcTHOR: Large-Scale Embodied AI Using Procedural Generation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Visual Room Rearrangement.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
RoboTHOR: An Open Simulation-to-Real Embodied AI Platform.
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020