2024
Scaling Laws for Pre-training Agents and World Models.
CoRR, 2024
Toward Human-AI Alignment in Large-Scale Multi-Player Games.
CoRR, 2024
Transformer Neural Autoregressive Flows.
CoRR, 2024
Learning Safety Constraints from Demonstrations with Unknown Rewards.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024
2023
Trust-Region-Free Policy Optimization for Stochastic Policies.
CoRR, 2023
Imitating Human Behaviour with Diffusion Models.
,
,
,
,
,
,
,
,
,
,
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation.
Proceedings of the Conference on Lifelong Learning Agents, 2023
Navigates Like Me: Understanding How People Evaluate Human-Like AI in Video Games.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023
Trust Region Bounds for Decentralized PPO Under Non-stationarity.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023
2022
UniMASK: Unified Inference in Sequential Decision Problems.
,
,
,
,
,
,
,
,
,
,
CoRR, 2022
Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers.
,
,
,
,
,
,
,
,
,
,
CoRR, 2022
Monotonic Improvement Guarantees under Non-stationarity for Decentralized PPO.
CoRR, 2022
You May Not Need Ratio Clipping in PPO.
CoRR, 2022
Contextual Squeeze-and-Excitation for Efficient Few-Shot Image Classification.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Uni[MASK]: Unified Inference in Sequential Decision Problems.
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Interactively Learning Preference Constraints in Linear Bandits.
Proceedings of the International Conference on Machine Learning, 2022
How Humans Perceive Human-like Behavior in Video Game Navigation.
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022
Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
VariBAD: Variational Bayes-Adaptive Deep RL via Meta-Learning.
J. Mach. Learn. Res., 2021
SpatialSim: Recognizing Spatial Configurations of Objects With Graph Neural Networks.
Frontiers Artif. Intell., 2021
NeurIPS 2021 Competition IGLU: Interactive Grounded Language Understanding in a Collaborative Environment.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2021
SocialAI: Benchmarking Socio-Cognitive Abilities in Deep Reinforcement Learning Agents.
CoRR, 2021
SoftDICE for Imitation Learning: Rethinking Off-policy Distribution Matching.
CoRR, 2021
Learning to Win, Lose and Cooperate through Reward Signal Evolution.
CoRR, 2021
SocialAI 0.1: Towards a Benchmark to Stimulate Research on Socio-Cognitive Abilities in Deep Reinforcement Learning Agents.
CoRR, 2021
Strategically efficient exploration in competitive multi-agent reinforcement learning.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021
Interactive Grounded Language Understanding in a Collaborative Environment: IGLU 2021.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track, 2021
Grounding Spatio-Temporal Language with Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Memory Efficient Meta-Learning with Large Images.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021
TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL.
Proceedings of the 38th International Conference on Machine Learning, 2021
Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation.
Proceedings of the 38th International Conference on Machine Learning, 2021
ORBIT: A Real-World Few-Shot Dataset for Teachable Object Recognition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Deep Interactive Bayesian Reinforcement Learning via Meta-Learning.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021
Evaluating the Robustness of Collaborative Agents.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021
Disability-first Dataset Creation: Lessons from Constructing a Dataset for Teachable Object Recognition with Blind and Low Vision Data Collectors.
Proceedings of the ASSETS '21: The 23rd International ACM SIGACCESS Conference on Computers and Accessibility, 2021
2020
Meta Automatic Curriculum Learning.
CoRR, 2020
Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning.
CoRR, 2020
Analytic Manifold Learning: Unifying and Evaluating Representations for Continuous Control.
CoRR, 2020
Guaranteeing Reproducibility in Deep Learning Competitions.
CoRR, 2020
Recognizing Spatial Configurations of Objects with Graph Neural Networks.
CoRR, 2020
Trying AGAIN instead of Trying Longer: Prior Learning for Automatic Curriculum Learning.
CoRR, 2020
NeurIPS 2020 Competition and Demonstration Track: Revised selected papers.
Proceedings of the NeurIPS 2020 Competition and Demonstration Track, 2020
Automatic Curriculum Learning For Deep RL: A Short Survey.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020
Conservative Uncertainty Estimation By Fitting Prior Networks.
Proceedings of the 8th International Conference on Learning Representations, 2020
AMRL: Aggregated Memory For Reinforcement Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020
A Novel Individually Rational Objective In Multi-Agent Multi-Armed Bandits: Algorithms and Regret Bounds.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020
Combining No-regret and Q-learning.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020
Variational Integrator Networks for Physically Structured Embeddings.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020
"It's Unwieldy and It Takes a Lot of Time" - Challenges and Opportunities for Creating Agents in Commercial Games.
Proceedings of the Sixteenth AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, 2020
2019
Variational Integrator Networks for Physically Meaningful Embeddings.
CoRR, 2019
Near-Optimal Online Egalitarian learning in General Sum Repeated Matrix Games.
CoRR, 2019
The MineRL Competition on Sample Efficient Reinforcement Learning using Human Priors.
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2019
The Multi-Agent Reinforcement Learning in MalmÖ (MARLÖ) Competition.
CoRR, 2019
Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Better Exploration with Optimistic Actor Critic.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Fast Context Adaptation via Meta-Learning.
Proceedings of the 36th International Conference on Machine Learning, 2019
Teacher algorithms for curriculum learning of Deep RL in continuously parameterized environments.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019
Win or Learn Fast Proximal Policy Optimisation.
Proceedings of the IEEE Conference on Games, 2019
MazeExplorer: A Customisable 3D Benchmark for Assessing Generalisation in Reinforcement Learning.
Proceedings of the IEEE Conference on Games, 2019
Minecraft as AI Playground and Laboratory.
Proceedings of the Annual Symposium on Computer-Human Interaction in Play, 2019
2018
How Players Speak to an Intelligent Game Character Using Natural Language Messages.
Trans. Digit. Games Res. Assoc., 2018
Successor Uncertainties: exploration and uncertainty in temporal difference learning.
CoRR, 2018
CAML: Fast Context Adaptation via Meta-Learning.
CoRR, 2018
Depth and nonlinearity induce implicit exploration for RL.
CoRR, 2018
Variational Inference for Data-Efficient Model Learning in POMDPs.
CoRR, 2018
Meta Reinforcement Learning with Latent Variable Gaussian Processes.
Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018
Cross Domain Regularization for Neural Ranking Models using Adversarial Learning.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018
Advancements in Dueling Bandits.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
2017
The Atari Grand Challenge Dataset.
CoRR, 2017
A New AI Evaluation Cosmos: Ready to Play the Game?
AI Mag., 2017
Spontaneous Interactions with a Virtually Embodied Intelligent Assistant in Minecraft.
Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, 2017
Asynchronous Data Aggregation for Training End to End Visual Control Networks.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017
2016
Online Evaluation for Information Retrieval.
Found. Trends Inf. Retr., 2016
A Deep Learning Approach for Joint Video Frame and Reward Prediction in Atari Games.
CoRR, 2016
Experimental and causal view on information integration in autonomous agents.
CoRR, 2016
Memory Lens: How Much Memory Does an Agent Use?
CoRR, 2016
Towards Conversational Recommender Systems.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016
The Malmo Platform for Artificial Intelligence Experimentation.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
Collective Noise Contrastive Estimation for Policy Transfer Learning.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016
2015
Online Learning to Rank: Absolute vs. Relative.
Proceedings of the 24th International Conference on World Wide Web Companion, 2015
Predicting Search Satisfaction Metrics with Interleaved Comparisons.
Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2015
Contextual Dueling Bandits.
Proceedings of The 28th Conference on Learning Theory, 2015
2014
"Learning to rank for information retrieval from user interactions" by K. Hofmann, S. Whiteson, A. Schuth, and M. de Rijke with Martin Vesely as coordinator.
SIGWEB Newsl., 2014
On user interactions with query auto-completion.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014
Online Experimentation for Information Retrieval.
Proceedings of the Information Retrieval, 2014
Effects of Position Bias on Click-Based Recommender Evaluation.
Proceedings of the Advances in Information Retrieval, 2014
An Eye-tracking Study of User Interactions with Query Auto Completion.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014
2013
Cornetto: A Combinatorial Lexical Semantic Database for Dutch.
Proceedings of the Essential Speech and Language Technology for Dutch, 2013
Fidelity, Soundness, and Efficiency of Interleaved Comparison Methods.
ACM Trans. Inf. Syst., 2013
Fast and reliable online learning to rank for information retrieval.
SIGIR Forum, 2013
Balancing exploration and exploitation in listwise and pairwise online learning to rank for information retrieval.
Inf. Retr., 2013
Practical Online Retrieval Evaluation.
Proceedings of the Advances in Information Retrieval, 2013
Reusing Historical Interaction Data for Faster Online Learning to Rank for IR.
Proceedings of the 13th Dutch-Belgian Workshop on Information Retrieval, 2013
Lerot: an online learning to rank framework.
Proceedings of the 2013 workshop on Living labs for information retrieval evaluation, 2013
Evaluating aggregated search using interleaving.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013
2012
Learning to Rank from Relevance Feedback for e-Discovery.
Proceedings of the Advances in Information Retrieval, 2012
Estimating interleaved comparison outcomes from historical click data.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012
On caption bias in interleaving experiments.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012
2011
Extraction of Hypernymy Information from Text<sup>∗</sup>.
Proceedings of the Interactive Multi-modal Question-Answering, 2011
DIR 2011: the eleventh Dutch-Belgian information retrieval workshop.
SIGIR Forum, 2011
The University of Amsterdam at the TREC 2011 Session Track.
Proceedings of The Twentieth Text REtrieval Conference, 2011
Search engines that learn online.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011
Proceedings of the Multidisciplinary Information Retrieval, 2011
Balancing Exploration and Exploitation in Learning to Rank Online.
Proceedings of the Advances in Information Retrieval, 2011
A probabilistic method for inferring preferences from clicks.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011
2010
Podcast search: user goals and retrieval technologies.
Online Inf. Rev., 2010
Contextual factors for finding similar experts.
J. Assoc. Inf. Sci. Technol., 2010
The University of Amsterdam at TREC 2010: Session, Entity and Relevance Feedback.
Proceedings of The Nineteenth Text REtrieval Conference, 2010
Comparing click-through data to purchase decisions for retrieval evaluation.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010
Validating Query Simulators: An Experiment Using Commercial Searches and Purchases.
Proceedings of the Multilingual and Multimodal Information Access Evaluation, 2010
2009
Heuristic Ranking and Diversification of Web Documents.
Proceedings of The Eighteenth Text REtrieval Conference, 2009
Generating a Non-English Subjectivity Lexicon: Relations That Matter.
Proceedings of the EACL 2009, 12th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, Athens, Greece, March 30, 2009
Lexical Patterns or Dependency Patterns: Which Is Better for Hypernym Extraction?
Proceedings of the Thirteenth Conference on Computational Natural Language Learning, 2009
A Semantic Perspective on Query Log Analysis.
Proceedings of the Working Notes for CLEF 2009 Workshop co-located with the 13th European Conference on Digital Libraries (ECDL 2009) , Corfù, Greece, September 30, 2009
The impact of document structure on keyphrase extraction.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009
2008
Unobtrusive User Modeling For Adaptive Hypermedia.
Proceedings of the Personalization Techniques and Recommender Systems, 2008
Assessing concept selection for video retrieval.
Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, 2008
An Exploratory Study of User Goals and Strategies in Podcast Search.
Proceedings of the LWA 2008, 2008
2007
Unobtrusive User Modeling for Adaptive Hypermedia.
Int. J. Pattern Recognit. Artif. Intell., 2007
The University of Amsterdam at the TREC 2007 QA Track.
Proceedings of The Sixteenth Text REtrieval Conference, 2007
Query and Document Models for Enterprise Search.
Proceedings of The Sixteenth Text REtrieval Conference, 2007
Automatic extension of non-english wordnets.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007
The University of Amsterdam at CLEF@QA 2007.
Proceedings of the Working Notes for CLEF 2007 Workshop co-located with the 11th European Conference on Digital Libraries (ECDL 2007), 2007
The University of Amsterdam's Question Answering System at QA@CLEF 2007.
Proceedings of the Advances in Multilingual and Multimodal Information Retrieval, 2007
Modeling Engagement in Educational Adaptive Hypermedia.
Proceedings of the Artificial Intelligence in Education, 2007
2005
Subsymbolic User Modeling in Adaptive Hypermedia.
Proceedings of the Artificial Intelligence in Education, 2005