Wesley Suttle

Aamodh Suresh

Carlos Nieto-Granda

CoRR, February, 2025

2024

Occupancy Information Ratio: Infinite-Horizon, Information-Directed, Parameterized Policy Search.

[BibT_eX]

[DOI]

Alec Koppel

SIAM J. Control. Optim., 2024

Hierarchical Preference Optimization: Learning to achieve goals via feasible subgoals prediction.

[BibT_eX]

[DOI]

CoRR, 2024

AIME: AI System Optimization via Multiple LLM Evaluators.

[BibT_eX]

[DOI]

CoRR, 2024

Global Optimality without Mixing Time Oracles in Average-reward RL via Multi-level Actor-Critic.

[BibT_eX]

[DOI]

CoRR, 2024

Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems.

[BibT_eX]

[DOI]

Vipul K. Sharma

CoRR, 2024

LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Deceptive Path Planning via Reinforcement Learning with Graph Neural Networks.

[BibT_eX]

[DOI]

Michael Y. Fatemi

Brian M. Sadler

Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems.

[BibT_eX]

[DOI]

Wesley Suttle

Vipul Kumar Sharma

Seetharaman Sivaranjani

Vijay Gupta

Brian M. Sadler

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023

Ada-NAV: Adaptive Trajectory-Based Sample Efficient Policy Learning for Robotic Navigation.

[BibT_eX]

[DOI]

CoRR, 2023

Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Information-Directed Policy Search in Sparse-Reward Settings via the Occupancy Information Ratio.

[BibT_eX]

[DOI]

Alec Koppel

Proceedings of the 57th Annual Conference on Information Sciences and Systems, 2023

2022

Reinforcement Learning Based Distributed Control of Dissipative Networked Systems.

[BibT_eX]

[DOI]

IEEE Trans. Control. Netw. Syst., 2022

Policy Gradient for Ratio Optimization: A Case Study.

[BibT_eX]

[DOI]

Alec Koppel

Proceedings of the 56th Annual Conference on Information Sciences and Systems, 2022

2021

Reinforcement Learning for Cost-Aware Markov Decision Processes.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

2020

Reinforcement Learning based Distributed Control of Dissipative Networked Systems.

[BibT_eX]

[DOI]