Learning API Functionality from Demonstrations for Tool-based Agents.
CoRR, May, 2025
AIME: AI System Optimization via Multiple LLM Evaluators.
CoRR, 2024
Embodied Question Answering via Multi-LLM Systems.
CoRR, 2024
Global Optimality without Mixing Time Oracles in Average-reward RL via Multi-level Actor-Critic.
CoRR, 2024
Right Place, Right Time! Towards ObjectNav for Non-Stationary Goals.
CoRR, 2024
Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Ada-NAV: Adaptive Trajectory-Based Sample Efficient Policy Learning for Robotic Navigation.
CoRR, 2023
Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic.
Proceedings of the International Conference on Machine Learning, 2023
In Pursuit of Interpretable, Fair and Accurate Machine Learning for Criminal Recidivism Prediction.
CoRR, 2020
NTIRE 2020 Challenge on Image and Video Deblurring.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020