Aviv Rosenberg

Affiliations:
  • Tel Aviv University, Israel


According to our database1, Aviv Rosenberg authored at least 22 papers between 2019 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Building Math Agents with Multi-Turn Iterative Preference Learning.
CoRR, 2024

Warm-up Free Policy Optimization: Improved Regret in Linear Markov Decision Processes.
CoRR, 2024

Multi-turn Reinforcement Learning from Preference Human Feedback.
CoRR, 2024

Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback.
CoRR, 2024

Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback.
Proceedings of the International Conference on Machine Learning, 2023

A Unified Analysis of Nonstochastic Delayed Feedback for Combinatorial Semi-Bandits, Linear Bandits, and MDPs.
Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023

Planning and Learning with Adaptive Lookahead.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Regret minimization in reinforcement learning
PhD thesis, 2022

Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Cooperative Online Learning in Stochastic and Adversarial MDPs.
Proceedings of the International Conference on Machine Learning, 2022

Policy Optimization for Stochastic Shortest Path.
Proceedings of the Conference on Learning Theory, 2-5 July 2022, London, UK., 2022

Learning Adversarial Markov Decision Processes with Delayed Feedback.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Oracle-Efficient Regret Minimization in Factored MDPs with Unknown Structure.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Minimax Regret for Stochastic Shortest Path.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Stochastic Shortest Path with Adversarially Changing Costs.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

2020
Oracle-Efficient Reinforcement Learning in Factored MDPs with Unknown Structure.
CoRR, 2020

Adversarial Stochastic Shortest Path.
CoRR, 2020

Optimistic Policy Optimization with Bandit Feedback.
Proceedings of the 37th International Conference on Machine Learning, 2020

Near-optimal Regret Bounds for Stochastic Shortest Path.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Online Stochastic Shortest Path with Bandit Feedback and Unknown Transition Function.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Online Convex Optimization in Adversarial Markov Decision Processes.
Proceedings of the 36th International Conference on Machine Learning, 2019


  Loading...