Sam Devlin

Orcid: 0000-0002-7769-3090

Affiliations:
  • Microsoft Research, Cambridge, UK


According to our database, Sam Devlin authored at least 81 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Efficient Offline Reinforcement Learning: The Critic is Critical.
CoRR, 2024

Aligning Agents like Large Language Models.
CoRR, 2024

2023
Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games.
CoRR, 2023

Trust-Region-Free Policy Optimization for Stochastic Policies.
CoRR, 2023

Imitating Human Behaviour with Diffusion Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Contrastive Meta-Learning for Partially Observable Few-Shot Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Navigates Like Me: Understanding How People Evaluate Human-Like AI in Video Games.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

Trust Region Bounds for Decentralized PPO Under Non-stationarity.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022
A Comparison of Self-Play Algorithms Under a Generalized Framework.
IEEE Trans. Games, 2022

Rolling Horizon Evolutionary Algorithms for General Video Game Playing.
IEEE Trans. Games, 2022

UniMASK: Unified Inference in Sequential Decision Problems.
CoRR, 2022

Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers.
CoRR, 2022

Monotonic Improvement Guarantees under Non-stationarity for Decentralized PPO.
CoRR, 2022

You May Not Need Ratio Clipping in PPO.
CoRR, 2022

Uni[MASK]: Unified Inference in Sequential Decision Problems.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Imitating Playstyle with Dynamic Time Warping Imitation.
Proceedings of the FDG '22: Proceedings of the 17th International Conference on the Foundations of Digital Games, 2022

Turning Zeroes into Non-Zeroes: Sample Efficient Exploration with Monte Carlo Graph Search.
Proceedings of the IEEE Conference on Games, CoG 2022, Beijing, 2022

How Humans Perceive Human-like Behavior in Video Game Navigation.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

Adaptive Scaffolding in Block-Based Programming via Synthesizing New Tasks as Pop Quizzes.
Proceedings of the Artificial Intelligence in Education - 23rd International Conference, 2022

Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Win Prediction in Multiplayer Esports: Live Professional Match Prediction.
IEEE Trans. Games, 2021

The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors.
CoRR, 2021

Strategically efficient exploration in competitive multi-agent reinforcement learning.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation.
Proceedings of the 38th International Conference on Machine Learning, 2021

Deep Interactive Bayesian Reinforcement Learning via Meta-Learning.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Evaluating the Robustness of Collaborative Agents.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Difference Rewards Policy Gradients.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Meta-Learning Divergences for Variational Inference.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020
Meta-Learning for Variational Inference.
CoRR, 2020

Analytic Manifold Learning: Unifying and Evaluating Representations for Continuous Control.
CoRR, 2020

AMRL: Aggregated Memory For Reinforcement Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

Automatic Similarity Detection in LEGO Ducks.
Proceedings of the Eleventh International Conference on Computational Creativity, 2020

Player Style Clustering without Game Variables.
Proceedings of the FDG '20: International Conference on the Foundations of Digital Games, 2020

"It's Unwieldy and It Takes a Lot of Time" - Challenges and Opportunities for Creating Agents in Commercial Games.
Proceedings of the Sixteenth AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, 2020

2019
How the Business Model of Customizable Card Games Influences Player Engagement.
IEEE Trans. Games, 2019

Emulating Human Play in a Leading Mobile Card Game.
IEEE Trans. Games, 2019

The Text-Based Adventure AI Competition.
IEEE Trans. Games, 2019

Type inference in flexible model-driven engineering using classification algorithms.
Softw. Syst. Model., 2019

The Multi-Agent Reinforcement Learning in MalmÖ (MARLÖ) Competition.
CoRR, 2019

Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Win or Learn Fast Proximal Policy Optimisation.
Proceedings of the IEEE Conference on Games, 2019

A Generalized Framework for Self-Play Training.
Proceedings of the IEEE Conference on Games, 2019

MazeExplorer: A Customisable 3D Benchmark for Assessing Generalisation in Reinforcement Learning.
Proceedings of the IEEE Conference on Games, 2019

2018
Reward shaping for knowledge-based multi-objective multi-agent reinforcement learning.
Knowl. Eng. Rev., 2018

Narrative Bytes: Data-Driven Content Production in Esports.
Proceedings of the 2018 ACM International Conference on Interactive Experiences for TV and Online Video, 2018

2017
Multi-agent credit assignment in stochastic resource management games.
Knowl. Eng. Rev., 2017

Policy invariance under reward transformations for multi-objective reinforcement learning.
Neurocomputing, 2017

Win Prediction in Esports: Mixed-Rank Match Prediction in Multi-player Online Battle Arena Games.
CoRR, 2017

Exploration and Skill Acquisition in a Major Online Game.
Proceedings of the 39th Annual Meeting of the Cognitive Science Society, 2017

Clyde: A Deep Reinforcement Learning DOOM Playing Agent.
Proceedings of the Workshops of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Context-sensitive reward shaping for sparse interaction multi-agent systems.
Knowl. Eng. Rev., 2016

Overcoming incorrect knowledge in plan-based reward shaping.
Knowl. Eng. Rev., 2016

Plan-based reward shaping for multi-agent reinforcement learning.
Knowl. Eng. Rev., 2016

Potential-based reward shaping for finite horizon online POMDP planning.
Auton. Agents Multi Agent Syst., 2016

Using association rule mining to predict opponent deck content in Android: Netrunner.
Proceedings of the IEEE Conference on Computational Intelligence and Games, 2016

Multi-Objective Dynamic Dispatch Optimisation using Multi-Agent Reinforcement Learning: (Extended Abstract).
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Resource Abstraction for Reinforcement Learning in Multiagent Congestion Problems.
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Combining Gameplay Data with Monte Carlo Tree Search to Emulate Human Play.
Proceedings of the Twelfth AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, 2016

2015
Player Preference and Style in a Leading Mobile Card Game.
IEEE Trans. Comput. Intell. AI Games, 2015

Distributed reinforcement learning for adaptive and robust network intrusion response.
Connect. Sci., 2015

Preface to the special issue: Adaptive Learning Agents Part 3.
Connect. Sci., 2015

Type Inference Using Concrete Syntax Properties in Flexible Model-Driven Engineering.
Proceedings of the Workshop on Flexible Model Driven Engineering co-located with ACM/IEEE 18th International Conference on Model Driven Engineering Languages & Systems (MoDELS 2015), 2015

Type Inference in Flexible Model-Driven Engineering.
Proceedings of the Modelling Foundations and Applications - 11th European Conference, 2015

Predicting player disengagement and first purchase with event-frequency based data representation.
Proceedings of the 2015 IEEE Conference on Computational Intelligence and Games, 2015

Expressing Arbitrary Reward Functions as Potential-Based Advice.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Preface to the special issue: Adaptive Learning Agents Part 2.
Connect. Sci., 2014

Preface to the special issue: Adaptive Learning Agents, Part 1.
Connect. Sci., 2014

A Phylogenetic Classification of the Video-Game Industry's Business Model Ecosystem.
Proceedings of the Collaborative Systems for Smart Networked Environments, 2014

Predicting Player Disengagement in Online Games.
Proceedings of the Computer Games - Third Workshop on Computer Games, 2014

Coordinated Team Learning and Difference Rewards for Distributed Intrusion Response.
Proceedings of the ECAI 2014 - 21st European Conference on Artificial Intelligence, 18-22 August 2014, Prague, Czech Republic, 2014

Game intelligence.
Proceedings of the 2014 IEEE Conference on Computational Intelligence and Games, 2014

Knowledge revision for reinforcement learning with abstract MDPs.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Potential-based difference rewards for multiagent reinforcement learning.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

2013
Potential-based reward shaping for knowledge-based, multi-agent reinforcement learning.
PhD thesis, 2013

Overcoming erroneous domain knowledge in plan-based reward shaping.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Potential-based reward shaping for POMDPs.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

2012
Dynamic potential-based reward shaping.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

2011
An Empirical Study of Potential-Based Reward Shaping and Advice in Complex, Multi-Agent Systems.
Adv. Complex Syst., 2011

Theoretical considerations of potential-based reward shaping for multi-agent systems.
Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

Multi-agent, reward shaping for RoboCup KeepAway.
Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

2009
Reinforcement Learning in RoboCup KeepAway with Partial Observability.
Proceedings of the 2009 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, 2009

