2024

Why Would You Suggest That? Human Trust in Language Model Responses.

[DOI]

CoRR, 2024

BEHAVIOR-1K: A Human-Centered, Embodied AI Benchmark with 1, 000 Everyday Activities and Realistic Simulation.

[DOI]

CoRR, 2024

2023

Exploring and Improving the Spatial Reasoning Abilities of Large Language Models.

[DOI]

CoRR, 2023

2022

BEHAVIOR-1K: A Benchmark for Embodied AI with 1, 000 Everyday Activities and Realistic Simulation.

[DOI]

Proceedings of the Conference on Robot Learning, 2022