Guy Tennenholtz

According to our database1, Guy Tennenholtz authored at least 31 papers between 2017 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Benchmarks for Reinforcement Learning with Biased Offline Data and Imperfect Simulators.
CoRR, 2024

Embedding-Aligned Language Models.
CoRR, 2024

DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning.
CoRR, 2024

Bayesian Regret Minimization in Offline Bandits.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Demystifying Embedding Spaces using Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Recommender Ecosystems: A Mechanism Design Perspective on Holistic Modeling and Optimization.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Ever Evolving Evaluator (EV3): Towards Flexible and Reliable Meta-Optimization for Knowledge Distillation.
CoRR, 2023

Factual and Personalized Recommendations using Language Models and Reinforcement Learning.
CoRR, 2023

Modeling Recommender Ecosystems: Research Challenges at the Intersection of Mechanism Design, Reinforcement Learning and Generative Models.
CoRR, 2023

A Convex Relaxation Approach to Bayesian Regret Minimization in Offline Bandits.
CoRR, 2023

Ranking with Popularity Bias: User Welfare under Self-Amplification Dynamics.
CoRR, 2023

Reinforcement Learning with History Dependent Dynamic Contexts.
Proceedings of the International Conference on Machine Learning, 2023

Representation-Driven Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

Never Worse, Mostly Better: Stable Policy Improvement in Deep Reinforcement Learning.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022
Reinforcement Learning with a Terminator.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Uncertainty Estimation Using Riemannian Model Dynamics for Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Locality Matters: A Scalable Value Decomposition Approach for Cooperative Multi-Agent Reinforcement Learning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Maximum Entropy Reinforcement Learning with Mixture Policies.
CoRR, 2021

GELATO: Geometrically Enriched Latent Model for Offline Reinforcement Learning.
CoRR, 2021

Bandits with partially observable confounded data.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Action redundancy in reinforcement learning.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

2020
The Pendulum Arrangement: Maximizing the Escape Time of Heterogeneous Random Walks.
CoRR, 2020

Bandits with Partially Observable Offline Data.
CoRR, 2020

Off-Policy Evaluation in Partially Observable Environments.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Natural Language State Representation for Reinforcement Learning.
CoRR, 2019

Distributional Policy Optimization: An Alternative Approach for Continuous Control.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

The Natural Language of Actions.
Proceedings of the 36th International Conference on Machine Learning, 2019

2018
Train on Validation: Squeezing the Data Lemon.
CoRR, 2018

2017
The Stochastic Firefighter Problem.
CoRR, 2017


  Loading...