Wesley Suttle

Orcid: 0000-0003-1234-7151

According to our database1, Wesley Suttle authored at least 18 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
AIME: AI System Optimization via Multiple LLM Evaluators.
CoRR, 2024

DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning.
CoRR, 2024

Global Optimality without Mixing Time Oracles in Average-reward RL via Multi-level Actor-Critic.
CoRR, 2024

Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems.
CoRR, 2024

PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Deceptive Path Planning via Reinforcement Learning with Graph Neural Networks.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023
Ada-NAV: Adaptive Trajectory-Based Sample Efficient Policy Learning for Robotic Navigation.
CoRR, 2023

Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic.
Proceedings of the International Conference on Machine Learning, 2023

Information-Directed Policy Search in Sparse-Reward Settings via the Occupancy Information Ratio.
Proceedings of the 57th Annual Conference on Information Sciences and Systems, 2023

2022
Reinforcement Learning Based Distributed Control of Dissipative Networked Systems.
IEEE Trans. Control. Netw. Syst., 2022

Occupancy Information Ratio: Infinite-Horizon, Information-Directed, Parameterized Policy Search.
CoRR, 2022

Policy Gradient for Ratio Optimization: A Case Study.
Proceedings of the 56th Annual Conference on Information Sciences and Systems, 2022

2021
Reinforcement Learning for Cost-Aware Markov Decision Processes.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Reinforcement Learning based Distributed Control of Dissipative Networked Systems.
CoRR, 2020

2019
Stochastic Convergence Results for Regularized Actor-Critic Methods.
CoRR, 2019

A Multi-Agent Off-Policy Actor-Critic Algorithm for Distributed Reinforcement Learning.
CoRR, 2019


  Loading...