Mark Rowland

Affiliations:
  • Google DeepMind, London, UK


According to our database1, Mark Rowland authored at least 69 papers between 2016 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
An Analysis of Quantile Temporal-Difference Learning.
J. Mach. Learn. Res., 2024

Foundations of Multivariate Distributional Reinforcement Learning.
CoRR, 2024

A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning.
CoRR, 2024

Human Alignment of Large Language Models through Online Preference Optimisation.
CoRR, 2024

Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative Model.
CoRR, 2024

Off-policy Distributional Q(λ): Distributional RL without Importance Sampling.
CoRR, 2024

A Distributional Analogue to the Successor Representation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Distributional Bellman Operators over Mean Embeddings.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Generalized Preference Optimization: A Unified Approach to Offline Alignment.
Proceedings of the Forty-first International Conference on Machine Learning, 2024


Human Alignment of Large Language Models through Online Preference Optimisation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

A General Theoretical Paradigm to Understand Learning from Human Preferences.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023
A Kernel Perspective on Behavioural Metrics for Markov Decision Processes.
Trans. Mach. Learn. Res., 2023

Nash Learning from Human Feedback.
CoRR, 2023

A General Theoretical Paradigm to Understand Learning from Human Preferences.
CoRR, 2023

VA-learning as a more efficient alternative to Q-learning.
Proceedings of the International Conference on Machine Learning, 2023

DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm.
Proceedings of the International Conference on Machine Learning, 2023

Understanding Self-Predictive Learning for Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation.
Proceedings of the International Conference on Machine Learning, 2023

Quantile Credit Assignment.
Proceedings of the International Conference on Machine Learning, 2023

Bootstrapped Representations in Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022

Evolutionary Dynamics and Phi-Regret Minimization in Games.
J. Artif. Intell. Res., 2022

Learning Correlated Equilibria in Mean-Field Games.
CoRR, 2022

Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning.
CoRR, 2022

Learning Dynamics and Generalization in Reinforcement Learning.
CoRR, 2022

Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Generalised Policy Improvement with Geometric Policy Composition.
Proceedings of the International Conference on Machine Learning, 2022

Learning Dynamics and Generalization in Deep Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022

Understanding and Preventing Capacity Loss in Reinforcement Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Learning Equilibria in Mean-Field Games: Introducing Mean-Field PSRO.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

Marginalized Operators for Off-policy Reinforcement Learning.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
Game Plan: What AI can do for Football, and What Football can do for AI.
J. Artif. Intell. Res., 2021

Evolutionary Dynamics and Φ-Regret Minimization in Games.
CoRR, 2021

MICo: Learning improved representations via sampling-based state similarity for Markov decision processes.
CoRR, 2021

Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

MICo: Improved representations via sampling-based state similarity for Markov decision processes.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Taylor Expansion of Discount Factors.
Proceedings of the 38th International Conference on Machine Learning, 2021

From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization.
Proceedings of the 38th International Conference on Machine Learning, 2021

Revisiting Peng's Q(λ) for Modern Reinforcement Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

On the Effect of Auxiliary Tasks on Representation Dynamics.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

The Value-Improvement Path: Towards Better Representations for Reinforcement Learning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Navigating the Landscape of Games.
CoRR, 2020

Fast computation of Nash Equilibria in Imperfect Information Games.
Proceedings of the 37th International Conference on Machine Learning, 2020

Revisiting Fundamentals of Experience Replay.
Proceedings of the 37th International Conference on Machine Learning, 2020

A Generalized Training Approach for Multiagent Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

Conditional Importance Sampling for Off-Policy Learning.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

Adaptive Trade-Offs in Off-Policy Learning.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

2019
Antithetic and Monte Carlo kernel estimators for partial rankings.
Stat. Comput., 2019

Meta-learning of Sequential Strategies.
CoRR, 2019

α-Rank: Multi-Agent Evaluation by Evolution.
CoRR, 2019

Multiagent Evaluation under Incomplete Information.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Statistics and Samples in Distributional Reinforcement Learning.
Proceedings of the 36th International Conference on Machine Learning, 2019

Unifying Orthogonal Monte Carlo Methods.
Proceedings of the 36th International Conference on Machine Learning, 2019

Orthogonal Estimation of Wasserstein Distances.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018
Geometrically Coupled Monte Carlo Sampling.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Structured Evolution with Compact Architectures for Scalable Policy Optimization.
Proceedings of the 35th International Conference on Machine Learning, 2018

Gaussian Process Behaviour in Wide Deep Neural Networks.
Proceedings of the 6th International Conference on Learning Representations, 2018

An Analysis of Categorical Distributional Reinforcement Learning.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

The Geometry of Random Features.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

Distributional Reinforcement Learning With Quantile Regression.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Uprooting and Rerooting Higher-Order Graphical Models.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

The Unreasonable Effectiveness of Structured Random Orthogonal Embeddings.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Magnetic Hamiltonian Monte Carlo.
Proceedings of the 34th International Conference on Machine Learning, 2017

Conditions beyond treewidth for tightness of higher-order LP relaxations.
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

2016
Black-Box Alpha Divergence Minimization.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Tightness of LP Relaxations for Almost Balanced Models.
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016


  Loading...