Aviv Rosenberg

Affiliations:

Tel Aviv University, Israel

According to our database, Aviv Rosenberg authored at least 24 papers between 2019 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2024

Building Math Agents with Multi-Turn Iterative Preference Learning.

CoRR, 2024

Warm-up Free Policy Optimization: Improved Regret in Linear Markov Decision Processes.

Asaf B. Cassel

Aviv Rosenberg

CoRR, 2024

Multi-turn Reinforcement Learning from Preference Human Feedback.

CoRR, 2024

Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback.

CoRR, 2024

Multi-turn Reinforcement Learning with Preference Human Feedback.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Warm-up Free Policy Optimization: Improved Regret in Linear Markov Decision Processes.

Asaf Cassel

Aviv Rosenberg

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback.

Tal Lancewicki

Aviv Rosenberg

Dmitry Sotnikov

Proceedings of the International Conference on Machine Learning, 2023

A Unified Analysis of Nonstochastic Delayed Feedback for Combinatorial Semi-Bandits, Linear Bandits, and MDPs.

Proceedings of the Thirty-Sixth Annual Conference on Learning Theory, 2023

Planning and Learning with Adaptive Lookahead.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Regret minimization in reinforcement learning.

Aviv Rosenberg

PhD thesis, 2022

Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Cooperative Online Learning in Stochastic and Adversarial MDPs.

Tal Lancewicki

Aviv Rosenberg

Yishay Mansour

Proceedings of the International Conference on Machine Learning, 2022

Policy Optimization for Stochastic Shortest Path.

Liyu Chen

Haipeng Luo

Aviv Rosenberg

Proceedings of the Conference on Learning Theory, 2-5 July 2022, London, UK, 2022

Learning Adversarial Markov Decision Processes with Delayed Feedback.

Tal Lancewicki

Aviv Rosenberg

Yishay Mansour

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Oracle-Efficient Regret Minimization in Factored MDPs with Unknown Structure.

Aviv Rosenberg

Yishay Mansour

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Minimax Regret for Stochastic Shortest Path.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Stochastic Shortest Path with Adversarially Changing Costs.

Aviv Rosenberg

Yishay Mansour

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

2020

Oracle-Efficient Reinforcement Learning in Factored MDPs with Unknown Structure.

Aviv Rosenberg

Yishay Mansour

CoRR, 2020

Adversarial Stochastic Shortest Path.

Aviv Rosenberg

Yishay Mansour

CoRR, 2020

Optimistic Policy Optimization with Bandit Feedback.

Proceedings of the 37th International Conference on Machine Learning, 2020

Near-optimal Regret Bounds for Stochastic Shortest Path.

Proceedings of the 37th International Conference on Machine Learning, 2020

2019

Online Stochastic Shortest Path with Bandit Feedback and Unknown Transition Function.

Aviv Rosenberg

Yishay Mansour

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Online Convex Optimization in Adversarial Markov Decision Processes.

Aviv Rosenberg

Yishay Mansour

Proceedings of the 36th International Conference on Machine Learning, 2019
