2025
Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents.
CoRR, February, 2025

2024
Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving.
CoRR, 2024

Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents.
CoRR, 2024

Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents.
CoRR, 2024

GPT-4V(ision) is a Generalist Web Agent, if Grounded.
Proceedings of the Forty-first International Conference on Machine Learning, 2024