Shimon Whiteson

Anne Schuth

SIGWEB Newsl., 2014

Efficient Abstraction Selection in Reinforcement Learning.

[BibT_eX]

[DOI]

Leon J. H. M. Kester

Comput. Intell., 2014

Learning potential functions and their representations for multi-task reinforcement learning.

[BibT_eX]

[DOI]

Auton. Agents Multi Agent Syst., 2014

Relative confidence sampling for efficient on-line ranker evaluation.

[BibT_eX]

[DOI]

Proceedings of the Seventh ACM International Conference on Web Search and Data Mining, 2014

Queued Pareto Local Search for Multi-Objective Optimization.

[BibT_eX]

[DOI]

Proceedings of the Parallel Problem Solving from Nature - PPSN XIII, 2014

Relative Upper Confidence Bound for the K-Armed Dueling Bandit Problem.

[BibT_eX]

[DOI]

Proceedings of the 31th International Conference on Machine Learning, 2014

Learning from human reward benefits from socio-competitive feedback.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Development and Learning and on Epigenetic Robotics, 2014

Challenge balancing for personalised game spaces.

[BibT_eX]

[DOI]

Sander Bakkes

Guangliang Li

George Viorel Visniuc

Efstathios Charitos

Norbert Heijne

Arjen Swellengrebel

Proceedings of the 2014 IEEE Games Media Entertainment, 2014

Design criteria for challenge balancing of personalised game spaces.

[BibT_eX]

[DOI]

Sander Bakkes

Proceedings of the 9th International Conference on the Foundations of Digital Games, 2014

Optimizing Base Rankers Using Clicks - A Case Study Using BM25.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2014

Multileaved Comparisons for Fast Online Evaluation.

[BibT_eX]

[DOI]

Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

Linear support for multi-objective coordination graphs.

[BibT_eX]

[DOI]

Diederik M. Roijers

Paris Mavromoustakos Blom

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Leveraging social networks to motivate humans to train agents.

[BibT_eX]

[DOI]

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Bounded Approximations for Linear Multi-Objective Planning Under Uncertainty.

[BibT_eX]

[DOI]

Diederik Marijn Roijers

Proceedings of the Twenty-Fourth International Conference on Automated Planning and Scheduling, 2014

Towards Personalised Gaming via Facial Expression Recognition.

[BibT_eX]

[DOI]

Proceedings of the Tenth AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, 2014

2013

Fidelity, Soundness, and Efficiency of Interleaved Comparison Methods.

[BibT_eX]

[DOI]

ACM Trans. Inf. Syst., 2013

A Survey of Multi-Objective Sequential Decision-Making.

[BibT_eX]

[DOI]

J. Artif. Intell. Res., 2013

Incremental Clustering and Expansion for Faster Optimal Planning in Dec-POMDPs.

[BibT_eX]

[DOI]

J. Artif. Intell. Res., 2013

Balancing exploration and exploitation in listwise and pairwise online learning to rank for information retrieval.

[BibT_eX]

[DOI]

Inf. Retr., 2013

Efficient Abstraction Selection in Reinforcement Learning (Extended Abstract).

[BibT_eX]

[DOI]

Leon J. H. M. Kester

Proceedings of the Tenth Symposium on Abstraction, Reformulation, and Approximation, 2013

Critical factors in the performance of hyperNEAT.

[BibT_eX]

[DOI]

Thomas G. van den Berg

Proceedings of the Genetic and Evolutionary Computation Conference, 2013

Reusing Historical Interaction Data for Faster Online Learning to Rank for IR.

[BibT_eX]

[DOI]

Proceedings of the 13th Dutch-Belgian Workshop on Information Retrieval, 2013

Lerot: an online learning to rank framework.

[BibT_eX]

[DOI]

Proceedings of the 2013 workshop on Living labs for information retrieval evaluation, 2013

Multi-objective variable elimination for collaborative graphical games.

[BibT_eX]

[DOI]

Diederik M. Roijers

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Approximate solutions for factored Dec-POMDPs with many agents.

[BibT_eX]

[DOI]

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Using informative behavior to increase engagement in the tamer framework.

[BibT_eX]

[DOI]

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Computing Convex Coverage Sets for Multi-objective Coordination Graphs.

[BibT_eX]

[DOI]

Diederik M. Roijers

Proceedings of the Algorithmic Decision Theory - Third International Conference, 2013

2012

Exploiting Structure in Cooperative Bayesian Games.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, 2012

Estimating interleaved comparison outcomes from historical click data.

[BibT_eX]

[DOI]

Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

V-MAX: tempered optimism for better PAC reinforcement learning.

[BibT_eX]

[DOI]

Karun Rao

Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Evolutionary Computation for Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Reinforcement Learning, 2012

2011

Introduction to the special issue on empirical evaluations in reinforcement learning.

[BibT_eX]

[DOI]

Michael L. Littman

Mach. Learn., 2011

Exploiting Best-Match Equations for Efficient Reinforcement Learning.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2011

Neuroevolutionary reinforcement learning for generalized control of simulated helicopters.

[BibT_eX]

[DOI]

Rogier Koppejan

Evol. Intell., 2011

Exploiting Agent and Type Independence in Collaborative Graphical Bayesian Games

[BibT_eX]

[DOI]

CoRR, 2011

Adapting Rankers Online.

[BibT_eX]

[DOI]

Proceedings of the Multidisciplinary Information Retrieval, 2011

Robust central pattern generators for embodied hierarchical reinforcement learning.

[BibT_eX]

[DOI]

Yasuo Kuniyoshi

Proceedings of the 1st International Conference on Development and Learning and on Epigenetic Robotics, 2011

Critical factors in the performance of novelty search.

[BibT_eX]

[DOI]

Steijn Kistemaker

Proceedings of the 13th Annual Genetic and Evolutionary Computation Conference, 2011

Multi-Task Reinforcement Learning: Shaping and Feature Selection.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Reinforcement Learning - 9th European Workshop, 2011

Balancing Exploration and Exploitation in Learning to Rank Online.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2011

A probabilistic method for inferring preferences from clicks.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Protecting against evaluation overfitting in empirical reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, 2011

2010

Switching between Representations in Reinforcement Learning.

[BibT_eX]

[DOI]

Leon J. H. M. Kester

Proceedings of the Interactive Collaborative Information Systems, 2010

Traffic Light Control by Multiagent Reinforcement Learning Systems.

[BibT_eX]

[DOI]

Proceedings of the Interactive Collaborative Information Systems, 2010

Adaptive Representations for Reinforcement Learning

[BibT_eX]

[DOI]

Studies in Computational Intelligence 291, Springer, ISBN: 978-3-642-13931-4, 2010

Report on the 2008 Reinforcement Learning Competition.

[BibT_eX]

[DOI]

Brian Tanner

Adam White

AI Mag., 2010

Critical factors in the empirical performance of temporal difference and evolutionary methods for reinforcement learning.

[BibT_eX]

[DOI]

Auton. Agents Multi Agent Syst., 2010

Multi-task evolutionary shaping without pre-specified representations.

[BibT_eX]

[DOI]

Proceedings of the Genetic and Evolutionary Computation Conference, 2010

2009

Machine learning for event selection in high energy physics.

[BibT_eX]

[DOI]

Daniel Whiteson

Eng. Appl. Artif. Intell., 2009

Postponed Updates for Temporal-Difference Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Intelligent Systems Design and Applications, 2009

Automatic Feature Selection for Model-Based Reinforcement Learning in Factored MDPs.

[BibT_eX]

[DOI]

Mark Kroon

Proceedings of the International Conference on Machine Learning and Applications, 2009

Neuroevolutionary reinforcement learning for generalized helicopter control.

[BibT_eX]

[DOI]

Rogier Koppejan

Proceedings of the Genetic and Evolutionary Computation Conference, 2009

Integrating distributed Bayesian inference and reinforcement learning for sensor management.

[BibT_eX]

[DOI]

Proceedings of the 12th International Conference on Information Fusion, 2009

Lossless clustering of histories in decentralized POMDPs.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009

A theoretical and empirical analysis of Expected Sarsa.

[BibT_eX]

[DOI]

Proceedings of the IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2009

2008

Multiagent Reinforcement Learning for Urban Traffic Control Using Coordination Graphs.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2008

Exploiting locality of interaction in factored Dec-POMDPs.

[BibT_eX]

[DOI]

Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

2007

Empirical Studies in Action Selection with Reinforcement Learning.

[BibT_eX]

[DOI]

Adapt. Behav., 2007

Transfer via inter-task mappings in policy search reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

Stochastic Optimization for Collision Selection in High Energy Physics.

[BibT_eX]

[DOI]

Daniel Whiteson

Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

Temporal Difference and Policy Search Methods for Reinforcement Learning: An Empirical Comparison.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

2006

Evolutionary Function Approximation for Reinforcement Learning.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2006

On-line evolutionary computation for reinforcement learning in stochastic domains.

[BibT_eX]

[DOI]

Proceedings of the Genetic and Evolutionary Computation Conference, 2006

Comparing evolutionary and temporal difference methods in a reinforcement learning domain.

[BibT_eX]

[DOI]

Proceedings of the Genetic and Evolutionary Computation Conference, 2006

Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Proceedings, 2006

2005

Evolving Soccer Keepaway Players Through Task Decomposition.

[BibT_eX]

[DOI]

Mach. Learn., 2005

Automatic feature selection in neuroevolution.

[BibT_eX]

[DOI]

Proceedings of the Genetic and Evolutionary Computation Conference, 2005

Improving Reinforcement Learning Function Approximators via Neuroevolution.

[BibT_eX]

[DOI]

Proceedings of the Proceedings, 2005

2004

Adaptive job routing and scheduling.

[BibT_eX]

[DOI]

Eng. Appl. Artif. Intell., 2004

Towards Autonomic Computing: Adaptive Network Routing and Scheduling.

[BibT_eX]

[DOI]

Proceedings of the 1st International Conference on Autonomic Computing (ICAC 2004), 2004

Towards Autonomic Computing: Adaptive Job Routing and Scheduling.

[BibT_eX]

Proceedings of the Nineteenth National Conference on Artificial Intelligence, 2004

2003

Evolving Keepaway Soccer Players through Task Decomposition.

[BibT_eX]

[DOI]

Proceedings of the Genetic and Evolutionary Computation, 2003

Concurrent layered learning.

[BibT_eX]

[DOI]