Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents.
CoRR, February, 2025
Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving.
CoRR, 2024
Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents.
CoRR, 2024
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents.
CoRR, 2024
GPT-4V(ision) is a Generalist Web Agent, if Grounded.
Proceedings of the Forty-first International Conference on Machine Learning, 2024