Rémi Munos
According to our database1,
Rémi Munos
authored at least 225 papers
between 1996 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024
2023
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation.
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice.
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition.
Proceedings of the International Conference on Machine Learning, 2023
2022
Figure Data for the paper "Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning".
Dataset, October, 2022
CoRR, 2022
Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022
2021
Concentration bounds for temporal difference learning with linear function approximation: the case of batch data and uniform sampling.
Mach. Learn., 2021
J. Artif. Intell. Res., 2021
Model-Free Learning for Two-Player Zero-Sum Partially Observable Markov Games with Perfect Recall.
CoRR, 2021
Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Learning in two-player zero-sum partially observable Markov games with perfect recall.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization.
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 8th International Conference on Learning Representations, 2020
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020
2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 7th International Conference on Learning Representations, 2019
Proceedings of the 7th International Conference on Learning Representations, 2019
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019
2018
Optimistic planning with an adaptive number of action switches for near-optimal nonlinear control.
Eng. Appl. Artif. Intell., 2018
Low-pass Recurrent Neural Networks - A memory architecture for longer-term correlation discovery.
CoRR, 2018
Continuous-action planning for discounted infinite-horizon nonlinear optimal control with Lipschitz values.
Autom., 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Proceedings of the 35th International Conference on Machine Learning, 2018
Proceedings of the 35th International Conference on Machine Learning, 2018
Proceedings of the 35th International Conference on Machine Learning, 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.
Proceedings of the 35th International Conference on Machine Learning, 2018
Proceedings of the 35th International Conference on Machine Learning, 2018
Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement.
Proceedings of the 35th International Conference on Machine Learning, 2018
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning.
Proceedings of the 6th International Conference on Learning Representations, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
Proceedings of the 5th International Conference on Learning Representations, 2017
Proceedings of the 5th International Conference on Learning Representations, 2017
Proceedings of the 39th Annual Meeting of the Cognitive Science Society, 2017
2016
J. Mach. Learn. Res., 2016
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016
Discounted near-optimal control of general continuous-action nonlinear systems using optimistic planning.
Proceedings of the 2016 American Control Conference, 2016
Proceedings of the Algorithmic Learning Theory - 27th International Conference, 2016
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016
2015
CoRR, 2015
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Proceedings of the 32nd International Conference on Machine Learning, 2015
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015
Fast Gradient Descent for Drifting Least Squares Regression, with Application to Bandits.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
2014
Theor. Comput. Sci., 2014
From Bandits to Monte-Carlo Tree Search: The Optimistic Principle Applied to Optimization and Planning.
Found. Trends Mach. Learn., 2014
Proceedings of the Seventh ACM International Conference on Web Search and Data Mining, 2014
Fast LSTD Using Stochastic Approximation: Finite Time Analysis and Application to Traffic Control.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014
Efficient learning by implicit exploration in bandit problems with side observations.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014
Proceedings of the 31th International Conference on Machine Learning, 2014
Proceedings of the 31th International Conference on Machine Learning, 2014
Proceedings of the IEEE Congress on Evolutionary Computation, 2014
Optimistic planning with a limited number of action switches for near-optimal nonlinear control.
Proceedings of the 53rd IEEE Conference on Decision and Control, 2014
Proceedings of the 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2014
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014
2013
Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model.
Mach. Learn., 2013
Analysis of stochastic approximation for efficient least squares regression and LSTD.
CoRR, 2013
Online gradient descent for least squares regression: Non-asymptotic bounds and application to bandits.
CoRR, 2013
Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence, 2013
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013
Proceedings of the 30th International Conference on Machine Learning, 2013
Proceedings of the 30th International Conference on Machine Learning, 2013
Proceedings of the Algorithmic Learning Theory - 24th International Conference, 2013
Proceedings of the 2013 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2013
Proceedings of the 2013 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2013
2012
Bandit Theory meets Compressed Sensing for high dimensional Stochastic Linear Bandit.
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012
Bandit Algorithms boost Brain Computer Interfaces for motor-task selection of a brain-controlled button.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012
Adaptive Stratified Sampling for Monte-Carlo integration of Differentiable functions.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012
Proceedings of the 29th International Conference on Machine Learning, 2012
Proceedings of the Algorithmic Learning Theory - 23rd International Conference, 2012
Proceedings of the Algorithmic Learning Theory - 23rd International Conference, 2012
Proceedings of the Reinforcement Learning, 2012
2011
Theor. Comput. Sci., 2011
A Finite-Time Analysis of Multi-armed Bandits Problems with Kullback-Leibler Divergences.
Proceedings of the COLT 2011, 2011
Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011
Optimistic Optimization of a Deterministic Function without the Knowledge of its Smoothness.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011
Proceedings of the 28th International Conference on Machine Learning, 2011
Regularized Least Squares Temporal Difference Learning with Nested ℓ2 and ℓ1 Penalization.
Proceedings of the Recent Advances in Reinforcement Learning - 9th European Workshop, 2011
Proceedings of the Algorithmic Learning Theory - 22nd International Conference, 2011
Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, 2011
2010
Proceedings of the 2nd Asian Conference on Machine Learning, 2010
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2010
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010
2009
Theor. Comput. Sci., 2009
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009
Proceedings of the 26th Annual International Conference on Machine Learning, 2009
Proceedings of the Algorithmic Learning Theory, 20th International Conference, 2009
2008
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path.
Mach. Learn., 2008
Proceedings of the Advances in Neural Information Processing Systems 21, 2008
Proceedings of the Advances in Neural Information Processing Systems 21, 2008
Proceedings of the Advances in Neural Information Processing Systems 21, 2008
Proceedings of the Recent Advances in Reinforcement Learning, 8th European Workshop, 2008
2007
SIAM J. Control. Optim., 2007
Analyse en norme Lp de l'algorithme d'itérations sur les valeurs avec approximations.
Rev. d'Intelligence Artif., 2007
Proceedings of the Advances in Neural Information Processing Systems 20, 2007
Proceedings of the Algorithmic Learning Theory, 18th International Conference, 2007
2006
Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation.
J. Mach. Learn. Res., 2006
2005
Sensitivity Analysis Using It[o-circumflex]--Malliavin Calculus and Martingales, and Application to Stochastic Optimal Control.
SIAM J. Control. Optim., 2005
Proceedings of the Machine Learning, 2005
2003
Proceedings of the Machine Learning, 2003
2002
2001
Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001
2000
A Study of Reinforcement Learning in the Continuous Case by the Means of Viscosity Solutions.
Mach. Learn., 2000
Rates of Convergence for Variable Resolution Schemes in Optimal Control.
Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000
1999
Gradient descent approaches to neural-net-based solutions of the Hamilton-Jacobi-Bellman equation.
Proceedings of the International Joint Conference Neural Networks, 1999
Variable Resolution Discretization for High-Accuracy Solutions of Optimal Control Problems.
Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence, 1999
1998
Proceedings of the Advances in Neural Information Processing Systems 11, [NIPS Conference, Denver, Colorado, USA, November 30, 1998
Proceedings of the Machine Learning: ECML-98, 1998
1997
Proceedings of the Advances in Neural Information Processing Systems 10, 1997
A Convergent Reinforcement Learning Algorithm in the Continuous Case Based on a Finite Difference Method.
Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, 1997
Finite-Element Methods with Local Triangulation Refinement for Continuous Reimforcement Learning Problems.
Proceedings of the Machine Learning: ECML-97, 1997
1996
A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning.
Proceedings of the Machine Learning, 1996