2025
Learning API Functionality from Demonstrations for Tool-based Agents.
CoRR, May, 2025

2024
AIME: AI System Optimization via Multiple LLM Evaluators.
CoRR, 2024

Embodied Question Answering via Multi-LLM Systems.
CoRR, 2024

Global Optimality without Mixing Time Oracles in Average-reward RL via Multi-level Actor-Critic.
CoRR, 2024

Right Place, Right Time! Towards ObjectNav for Non-Stationary Goals.
CoRR, 2024

Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Ada-NAV: Adaptive Trajectory-Based Sample Efficient Policy Learning for Robotic Navigation.
CoRR, 2023

Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic.
Proceedings of the International Conference on Machine Learning, 2023

2020
In Pursuit of Interpretable, Fair and Accurate Machine Learning for Criminal Recidivism Prediction.
CoRR, 2020

NTIRE 2020 Challenge on Image and Video Deblurring.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020