2024
Why Would You Suggest That? Human Trust in Language Model Responses.
CoRR, 2024

BEHAVIOR-1K: A Human-Centered, Embodied AI Benchmark with 1, 000 Everyday Activities and Realistic Simulation.
CoRR, 2024

2023
Exploring and Improving the Spatial Reasoning Abilities of Large Language Models.
CoRR, 2023

2022
BEHAVIOR-1K: A Benchmark for Embodied AI with 1, 000 Everyday Activities and Realistic Simulation.
Proceedings of the Conference on Robot Learning, 2022