Peter Stone

Zifan Xu

IEEE Robotics Autom. Mag., 2022

Bottom-Up Skill Discovery From Unsegmented Demonstrations for Long-Horizon Robot Manipulation.

[BibT_eX]

[DOI]

Yifeng Zhu

Yuke Zhu

IEEE Robotics Autom. Lett., 2022

Socially CompliAnt Navigation Dataset (SCAND): A Large-Scale Dataset of Demonstrations for Social Navigation.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., 2022

Lucid dreaming for experience replay: refreshing past states with the current policy.

[BibT_eX]

[DOI]

Yunshu Du

Assefaw H. Gebremedhin

Neural Comput. Appl., 2022

Outracing champion Gran Turismo drivers with deep reinforcement learning.

[BibT_eX]

[DOI]

Nat., 2022

Mechanism Design for Correlated Valuations: Efficient Methods for Revenue Maximization.

[BibT_eX]

[DOI]

Oper. Res., 2022

Challenges and Opportunities of Applying Reinforcement Learning to Autonomous Racing.

[BibT_eX]

[DOI]

Peter R. Wurman

Michael Spranger

IEEE Intell. Syst., 2022

Safe Evaluation For Offline Learning: Are We Ready To Deploy?

[BibT_eX]

[DOI]

CoRR, 2022

Artificial Intelligence and Life in 2030: The One Hundred Year Study on Artificial Intelligence.

[BibT_eX]

[DOI]

CoRR, 2022

ABC: Adversarial Behavioral Cloning for Offline Mode-Seeking Imitation Learning.

[BibT_eX]

[DOI]

CoRR, 2022

VIOLA: Imitation Learning for Vision-Based Manipulation with Object Proposal Priors.

[BibT_eX]

[DOI]

CoRR, 2022

Learning Real-world Autonomous Navigation by Self-Supervised Environment Synthesis.

[BibT_eX]

[DOI]

CoRR, 2022

Autonomous Ground Navigation in Highly Constrained Spaces: Lessons learned from The BARN Challenge at ICRA 2022.

[BibT_eX]

[DOI]

CoRR, 2022

Metric Residual Networks for Sample Efficient Goal-conditioned Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2022

High-Speed Accurate Robot Control using Learned Forward Kinodynamics and Non-linear Least Squares Optimization.

[BibT_eX]

[DOI]

CoRR, 2022

DM<sup>2</sup>: Distributed Multi-Agent Reinforcement Learning for Distribution Matching.

[BibT_eX]

[DOI]

CoRR, 2022

A Survey of Ad Hoc Teamwork: Definitions, Methods, and Open Problems.

[BibT_eX]

[DOI]

CoRR, 2022

Learning a Shield from Catastrophic Action Effects: Never Repeat the Same Mistake.

[BibT_eX]

[DOI]

Shahaf S. Shperberg

Decebal Constantin Mocanu

CoRR, 2022

Motion planning and control for mobile robot navigation using machine learning: a survey.

[BibT_eX]

[DOI]

Auton. Robots, 2022

DynaBARN: Benchmarking Metric Ground Navigation in Dynamic Environments.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Safety, Security, and Rescue Robotics, 2022

Towards a Real-Time, Low-Resource, End-to-End Object Detection Pipeline for Robot Soccer.

[BibT_eX]

[DOI]

Sai Kiran Narayanaswami

Proceedings of the RoboCup 2022:, 2022

Value Function Decomposition for Iterative Design of Reinforcement Learning Agents.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

BOME! Bilevel Optimization Made Easy: A Simple First-Order Approach.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

VI-IKD: High-Speed Accurate Off-Road Navigation using Learned Visual-Inertial Inverse Kinodynamics.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Quantifying Changes in Kinematic Behavior of a Human-Exoskeleton Interactive System.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Dynamic Sparse Training for Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Ghada Sokar

Elena Mocanu

Mykola Pechenizkiy

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Visually Grounded Task and Motion Planning for Mobile Manipulation.

[BibT_eX]

[DOI]

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

VOILA: Visual-Observation-Only Imitation Learning for Autonomous Navigation.

[BibT_eX]

[DOI]

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Adversarial Imitation Learning from Video Using a State Observer.

[BibT_eX]

[DOI]

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Skeletal Feature Compensation for Imitation Learning with Embodiment Mismatch.

[BibT_eX]

[DOI]

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Causal Dynamics Learning for Task-Independent State Abstraction.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Effective mutation rate adaptation through group elite selection.

[BibT_eX]

[DOI]

Proceedings of the GECCO '22: Genetic and Evolutionary Computation Conference, Boston, Massachusetts, USA, July 9, 2022

A Survey of Ad Hoc Teamwork Research.

[BibT_eX]

[DOI]

Proceedings of the Multi-Agent Systems - 19th European Conference, 2022

Offline training of multi-agent reinforcement agents for grid-interactive buildings control.

[BibT_eX]

[DOI]

Proceedings of the e-Energy '22: The Thirteenth ACM International Conference on Future Energy Systems, Virtual Event, 28 June 2022, 2022

Coopernaut: End-to-End Driving with Cooperative Perception for Networked Vehicles.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

VIOLA: Object-Centric Imitation Learning for Vision-Based Robot Manipulation.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 2022

Learning to Correct Mistakes: Backjumping in Long-Horizon Task and Motion Planning.

[BibT_eX]

[DOI]

Yoonchang Sung

Zizhao Wang

Proceedings of the Conference on Robot Learning, 2022

A Rule-based Shield: Accumulating Safety Rules from Catastrophic Action Effects.

[BibT_eX]

[DOI]

Proceedings of the Conference on Lifelong Learning Agents, 2022

Continual Learning and Private Unlearning.

[BibT_eX]

[DOI]

Qiang Liu

Proceedings of the Conference on Lifelong Learning Agents, 2022

2021

APPLE: Adaptive Planner Parameter Learning From Evaluative Feedback.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., October, 2021

Policy Evaluation in Continuous MDPs With Efficient Kernelized Gradient Temporal Difference.

[BibT_eX]

[DOI]

IEEE Trans. Autom. Control., 2021

RoboCup 2021 Worldwide: A Successful Robotics Competition During a Pandemic [Competitions].

[BibT_eX]

[DOI]

IEEE Robotics Autom. Mag., 2021

Toward Agile Maneuvers in Highly Constrained Spaces: Learning From Hallucination.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., 2021

Learning Inverse Kinodynamics for Accurate High-Speed Off-Road Navigation on Unstructured Terrain.

[BibT_eX]

[DOI]

Joydeep Biswas

IEEE Robotics Autom. Lett., 2021

A Lifelong Learning Approach to Mobile Robot Navigation.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., 2021

Importance sampling in reinforcement learning with an estimated behavior policy.

[BibT_eX]

[DOI]

Mach. Learn., 2021

Grounded action transformation for sim-to-real reinforcement learning.

[BibT_eX]

[DOI]

Mach. Learn., 2021

Agent-Based Markov Modeling for Improved COVID-19 Mitigation Policies.

[BibT_eX]

[DOI]

J. Artif. Intell. Res., 2021

Learning a Robust Multiagent Driving Policy for Traffic Congestion Reduction.

[BibT_eX]

[DOI]

CoRR, 2021

Incorporating Gaze into Social Navigation.

[BibT_eX]

[DOI]

CoRR, 2021

Prevention and Resolution of Conflicts in Social Navigation - a Survey.

[BibT_eX]

[DOI]

CoRR, 2021

RAIL: A modular framework for Reinforcement-learning-based Adversarial Imitation Learning.

[BibT_eX]

[DOI]

Eddy Hudson

CoRR, 2021

Sequential Online Chore Division for Autonomous Vehicle Convoy Formation.

[BibT_eX]

[DOI]

Harel Yedidsion

CoRR, 2021

Recent advances in leveraging human guidance for sequential decision-making tasks.

[BibT_eX]

[DOI]

Auton. Agents Multi Agent Syst., 2021

Machine Learning Methods for Local Motion Planning: A Study of End-to-End vs. Parameter Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Safety, Security, and Rescue Robotics, 2021

UT Austin Villa: RoboCup 2021 3D Simulation League Competition Champions.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2021: Robot World Cup XXIV, 2021

Conflict-Averse Gradient Descent for Multi-task learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Machine versus Human Attention in Deep Reinforcement Learning Tasks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Adversarial Intrinsic Motivation for Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

From Agile Ground to Aerial Navigation: Learning from Learned Hallucination.

[BibT_eX]

[DOI]

Zizhao Wang

Alexander J. Nettekoven

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

DEALIO: Data-Efficient Adversarial Learning for Imitation from Observation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Team Orienteering Coverage Planning with Uncertain Reward.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Capturing Skill State in Curriculum Learning for Human Skill Acquisition.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

A Scavenger Hunt for Service Robots.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

APPLR: Adaptive Planner Parameter Learning from Reinforcement.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Agile Robot Navigation through Hallucinated Learning and Sober Deployment.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

APPLI: Adaptive Planner Parameter Learning From Interventions.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Towards Safe Motion Planning in Human Workspaces: A Robust Multi-agent Approach.

[BibT_eX]

[DOI]

Benito Fernandez

Andrea Lockerd Thomaz

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Efficient Real-Time Inference in Temporal Convolution Networks.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Watch Where You're Going! Gaze and Head Orientation as Predictors for Social Robot Navigation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Coach-Player Multi-agent Reinforcement Learning for Dynamic Team Composition.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Efficient Robot Skill Learning: Grounded Simulation Learning and Imitation Learning from Observation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Autonomous Robot Systems and Competitions, 2021

Multiagent Epidemiologic Inference through Realtime Contact Tracing.

[BibT_eX]

[DOI]

Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

The Seeing-Eye Robot Grand Challenge: Rethinking Automated Care.

[BibT_eX]

[DOI]

Reuth Mirsky

Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Scalable Multiagent Driving Policies for Reducing Traffic Congestion.

[BibT_eX]

[DOI]

Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Expected Value of Communication for Planning in Ad Hoc Teamwork.

[BibT_eX]

[DOI]

William Macke

Reuth Mirsky

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Goal Blending for Responsive Shared Autonomy in a Navigating Vehicle.

[BibT_eX]

[DOI]

Yu-Sian Jiang

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Demonstration of the EMPATHIC Framework for Task Learning from Implicit Human Feedback.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

APPLD: Adaptive Planner Parameter Learning From Demonstration.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., 2020

RIDM: Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., 2020

Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2020

Jointly Improving Parsing and Perception for Natural Language Commands through Human-Robot Dialog.

[BibT_eX]

[DOI]

J. Artif. Intell. Res., 2020

The PETLON Algorithm to Plan Efficiently for Task-Level-Optimal Navigation.

[BibT_eX]

[DOI]

J. Artif. Intell. Res., 2020

Special Issue "On Defining Artificial Intelligence" - Commentaries and Author's Response.

[BibT_eX]

[DOI]

J. Artif. Gen. Intell., 2020

Motion Control for Mobile Robot Navigation Using Machine Learning: a Survey.

[BibT_eX]

[DOI]

CoRR, 2020

Human versus Machine Attention in Deep Reinforcement Learning Tasks.

[BibT_eX]

[DOI]

CoRR, 2020

Extended Abstract: Motion Planners Learned from Geometric Hallucination.

[BibT_eX]

[DOI]

CoRR, 2020

An Imitation from Observation Approach to Sim-to-Real Transfer.

[BibT_eX]

[DOI]

CoRR, 2020

Lifelong Navigation.

[BibT_eX]

[DOI]

CoRR, 2020

Temporal-Logic-Based Reward Shaping for Continuing Learning Tasks.

[BibT_eX]

[DOI]

Yuqian Jiang

Sudarshanan Bharadwaj

CoRR, 2020

Artificial Musical Intelligence: A Survey.

[BibT_eX]

[DOI]

CoRR, 2020

iCORPP: Interleaved Commonsense Reasoning and Probabilistic Planning on Robots.

[BibT_eX]

[DOI]

CoRR, 2020

Special issue on autonomous agents modelling other agents: Guest editorial.

[BibT_eX]

[DOI]

Michael P. Wellman

Artif. Intell., 2020

Agents teaching agents: a survey on inter-agent transfer learning.

[BibT_eX]

[DOI]

Felipe Leno da Silva

Anna Helena Reali Costa

Auton. Agents Multi Agent Syst., 2020

Benchmarking Metric Ground Navigation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Safety, Security, and Rescue Robotics, 2020

Using Human-Inspired Signals to Disambiguate Navigational Intentions.

[BibT_eX]

[DOI]

Proceedings of the Social Robotics - 12th International Conference, 2020

Learning and Reasoning for Robot Dialog and Navigation Tasks.

[BibT_eX]

[DOI]

Proceedings of the 21th Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2020

Firefly Neural Architecture Descent: a General Approach for Growing Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Deep R-Learning for Continual Area Sweeping.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Reinforced Grounded Action Transformation for Sim-to-Real Transfer.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Stochastic Grounded Action Transformation for Robot Learning in Simulation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

A Penny for Your Thoughts: The Value of Communication in Ad Hoc Teamwork.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Balancing Individual Preferences and Shared Objectives in Multiagent Reinforcement Learning.

[BibT_eX]

[DOI]

Ishan Durugkar

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Reducing Sampling Error in Batch Temporal Difference Learning.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

Learning to Improve Multi-Robot Hallway Navigation.

[BibT_eX]

[DOI]

Proceedings of the 4th Conference on Robot Learning, 2020

The EMPATHIC Framework for Task Learning from Implicit Human Feedback.

[BibT_eX]

[DOI]

Proceedings of the 4th Conference on Robot Learning, 2020

The Sequential Online Chore Division Problem - Definition and Application.

[BibT_eX]

[DOI]

Harel Yedidsion

Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Reinforcement Learning for Optimization of COVID-19 Mitigation Policies.

[BibT_eX]

[DOI]

Proceedings of the AAAI Fall Symposium on AI for Social Good, 2020

2019

RoboCup: A Treasure Trove of Rich Diversity for Research Issues and Interdisciplinary Connections [TC Spotlight].

[BibT_eX]

[DOI]

IEEE Robotics Autom. Mag., 2019

The Right Music at the Right Time: Adaptive Personalized Playlists Based on Sequence Modeling.

[BibT_eX]

[DOI]

Gilberto Briscoe-Martinez

MIS Q., 2019

Task planning in robotics: an empirical comparison of PDDL- and ASP-based systems.

[BibT_eX]

[DOI]

Frontiers Inf. Technol. Electron. Eng., 2019

Unclogging Our Arteries: Using Human-Inspired Signals to Disambiguate Navigational Intentions.

[BibT_eX]

[DOI]

CoRR, 2019

Solving Service Robot Tasks: UT Austin Villa@Home 2019 Team Report.

[BibT_eX]

[DOI]

Rishi Shah

Yuqian Jiang

Haresh Karnan

CoRR, 2019

Desiderata for Planning Systems in General-Purpose Service Robots.

[BibT_eX]

[DOI]

CoRR, 2019

Sample-efficient Adversarial Imitation Learning from Observation.

[BibT_eX]

[DOI]

CoRR, 2019

Multi-robot planning with conflicts and synergies.

[BibT_eX]

[DOI]

Auton. Robots, 2019

Optimal Use of Verbal Instructions for Multi-robot Human Navigation Guidance.

[BibT_eX]

[DOI]

Proceedings of the Social Robotics - 11th International Conference, 2019

UT Austin Villa: RoboCup 2019 3D Simulation League Competition and Technical Challenge Champions.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2019: Robot World Cup XXIII [Sydney, 2019

Task-Motion Planning with Reinforcement Learning for Adaptable Mobile Service Robots.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Leveraging Human Guidance for Deep Reinforcement Learning Tasks.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Recent Advances in Imitation Learning from Observation.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Imitation Learning from Video by Leveraging Proprioception.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Ad Hoc Teamwork With Behavior Switching Agents.

[BibT_eX]

[DOI]

Manish Ravula

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Improving Grounded Natural Language Understanding through Human-Robot Dialog.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Robotics and Automation, 2019

Importance Sampling Policy Evaluation with an Estimated Behavior Policy.

[BibT_eX]

[DOI]

Josiah Hanna

Proceedings of the 36th International Conference on Machine Learning, 2019

Building Self-Play Curricula Online by Playing with Expert Agents in Adversarial Games.

[BibT_eX]

[DOI]

Felipe Leno da Silva

Anna Helena Reali Costa

Proceedings of the 8th Brazilian Conference on Intelligent Systems, 2019

Adversarial Imitation Learning from State-only Demonstrations.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Marginal Cost Pricing with a Fixed Error Factor in Traffic Networks.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Learning Curriculum Policies for Reinforcement Learning.

[BibT_eX]

[DOI]

Sanmit Narvekar

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Escape Room: A Configurable Testbed for Hierarchical Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Reducing Sampling Error in Policy Gradient Learning.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Teaching Social Behavior through Human Reinforcement for Ad hoc Teamwork - The STAR Framework: Extended Abstract.

[BibT_eX]

[DOI]

Avilash Rath

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Open-World Reasoning for Service Robots.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth International Conference on Automated Planning and Scheduling, 2019

Robust Motion Planning and Safety Benchmarking in Human Workspaces.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Artificial Intelligence Safety 2019 co-located with the Thirty-Third AAAI Conference on Artificial Intelligence 2019 (AAAI-19), 2019

Selecting Compliant Agents for Opt-in Micro-Tolling.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Variety Wins: Soccer-Playing Robots and Infant Walking.

[BibT_eX]

[DOI]

Frontiers Neurorobotics, 2018

Integrating Task-Motion Planning with Reinforcement Learning for Robust Decision Making in Mobile Robots.

[BibT_eX]

[DOI]

CoRR, 2018

LAAIR: A Layered Architecture for Autonomous Interactive Robots.

[BibT_eX]

[DOI]

CoRR, 2018

Interaction and Autonomy in RoboCup@Home and Building-Wide Intelligence.

[BibT_eX]

[DOI]

CoRR, 2018

Robot Representing and Reasoning with Knowledge from Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2018

An Architecture for Person-Following using Active Target Search.

[BibT_eX]

[DOI]

CoRR, 2018

Ad hoc Teamwork and Moral Feedback as a Framework for Safe Agent Behavior.

[BibT_eX]

[DOI]

Avilash Rath

CoRR, 2018

Deterministic Implementations for Reproducibility in Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Prabhat Nagarajan

CoRR, 2018

Generative Adversarial Imitation from Observation.

[BibT_eX]

[DOI]

CoRR, 2018

An Empirical Comparison of PDDL-based and ASP-based Task Planners.

[BibT_eX]

[DOI]

CoRR, 2018

A century-long commitment to assessing artificial intelligence and its impact on society.

[BibT_eX]

[DOI]

Barbara J. Grosz

Commun. ACM, 2018

Overlapping layered learning.

[BibT_eX]

[DOI]

Artif. Intell., 2018

Autonomous agents modelling other agents: A comprehensive survey and open problems.

[BibT_eX]

[DOI]

Artif. Intell., 2018

UT Austin Villa: RoboCup 2018 3D Simulation League Champions.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2018: Robot World Cup XXII [Montreal, 2018

A Study of Human-Robot Copilot Systems for En-route Destination Changing.

[BibT_eX]

[DOI]

Proceedings of the 27th IEEE International Symposium on Robot and Human Interactive Communication, 2018

Passive Demonstrations of Light-Based Robot Signals for Improved Human Interpretability.

[BibT_eX]

[DOI]

Proceedings of the 27th IEEE International Symposium on Robot and Human Interactive Communication, 2018

Enhanced Delta-tolling: Traffic Optimization via Policy Gradient Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 21st International Conference on Intelligent Transportation Systems, 2018

On the Impact of Music on Decision Making in Cooperative Tasks.

[BibT_eX]

[DOI]

Corey N. White

Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

Enhanced Delta-tolling: Traffic Optimization via Policy Gradient Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2018

PRISM: Pose Registration for Integrated Semantic Mapping.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Behavioral Cloning from Observation.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Multi-modal Predicate Identification using Dynamically Learned Robot Controllers.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Inferring User Intention using Gaze in Vehicles.

[BibT_eX]

[DOI]

Yu-Sian Jiang

Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018

Learning a Policy for Opportunistic Active Learning.

[BibT_eX]

[DOI]

Aishwarya Padmakumar

Raymond J. Mooney

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Link-based Parameterized Micro-tolling Scheme for Optimal Traffic Management.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

PETLON: Planning Efficiently for Task-Level-Optimal Navigation.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

A Stitch in Time - Autonomous Model Management via Reinforcement Learning.

[BibT_eX]

[DOI]

Eric Zavesky

Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

State Abstraction Synthesis for Discrete Models of Continuous Domains.

[BibT_eX]

[DOI]

Proceedings of the 2018 AAAI Spring Symposia, 2018

Towards a Data Efficient Off-Policy Policy Gradient.

[BibT_eX]

[DOI]

Proceedings of the 2018 AAAI Spring Symposia, 2018

Robot Behavioral Exploration and Multi-modal Perception using Dynamically Constructed Controllers.

[BibT_eX]

[DOI]

Proceedings of the 2018 AAAI Spring Symposia, 2018

Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces.

[BibT_eX]

[DOI]

Nicholas R. Waytowich

Vernon Lawhern

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Guiding Exploratory Behaviors for Multi-Modal Grounding of Linguistic Descriptions.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Traffic Optimization for a Mixture of Self-Interested and Compliant Agents.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Autonomous Model Management via Reinforcement Learning.

[BibT_eX]

[DOI]

Eric Zavesky

Proceedings of the Workshops of the The Thirty-Second AAAI Conference on Artificial Intelligence, 2018

DIPD: Gaze-Based Intention Inference in Dynamic Environments.

[BibT_eX]

[DOI]

Yu-Sian Jiang

Proceedings of the Workshops of the The Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Adversarial Goal Generation for Intrinsic Motivation.

[BibT_eX]

[DOI]

Ishan Durugkar

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

DyETC: Dynamic Electronic Toll Collection for Traffic Congestion Alleviation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Encyclopedia of Machine Learning and Data Mining, 2017

Q-Learning.

[BibT_eX]

[DOI]

Proceedings of the Encyclopedia of Machine Learning and Data Mining, 2017

Machine Learning Capabilities of a Simulated Cerebellum.

[BibT_eX]

[DOI]

Wen-Ke Li

Michael D. Mauk

Decebal Constantin Mocanu

IEEE Trans. Neural Networks Learn. Syst., 2017

BWIBots: A platform for bridging the gap between AI and human-robot interaction research.

[BibT_eX]

[DOI]

Int. J. Robotics Res., 2017

Multirobot Systems.

[BibT_eX]

[DOI]

IEEE Intell. Syst., 2017

Evolutionary Training of Sparse Artificial Neural Networks: A Network Science Perspective.

[BibT_eX]

[DOI]

CoRR, 2017

Intrinsically motivated model learning for developing curious robots.

[BibT_eX]

[DOI]

Artif. Intell., 2017

Making friends on the fly: Cooperating with new teammates.

[BibT_eX]

[DOI]

Artif. Intell., 2017

Three years of the RoboCup standard platform league drop-in player competition - Creating and maintaining a large scale ad hoc teamwork robotics competition.

[BibT_eX]

[DOI]

Tim Laue

Auton. Agents Multi Agent Syst., 2017

Special issue on multiagent interaction without prior coordination: guest editorial.

[BibT_eX]

[DOI]

Somchaya Liemhetcharat

Auton. Agents Multi Agent Syst., 2017

Fast and Precise Black and White Ball Detection for RoboCup Soccer.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2017: Robot World Cup XXI [Nagoya, Japan, July 27-31, 2017]., 2017

UT Austin Villa: RoboCup 2017 3D Simulation League Competition and Technical Challenges Champions.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2017: Robot World Cup XXI [Nagoya, Japan, July 27-31, 2017]., 2017

Leveraging commonsense reasoning and multimodal perception for robot spoken dialog systems.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Autonomous Task Sequencing for Customized Curriculum Design in Reinforcement Learning.

[BibT_eX]

[DOI]

Sanmit Narvekar

Jivko Sinapov

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Data-Efficient Policy Evaluation Through Behavior Policy Search.

[BibT_eX]

[DOI]

Proceedings of the 34th International Conference on Machine Learning, 2017

CC-Log: Drastically Reducing Storage Requirements for Robots Using Classification and Compression.

[BibT_eX]

[DOI]

Proceedings of the 9th USENIX Workshop on Hot Topics in Storage and File Systems, 2017

Multiagent Learning Paradigms.

[BibT_eX]

[DOI]

Karl Tuyls

Proceedings of the Multi-Agent Systems and Agreement Technologies, 2017

Opportunistic Active Learning for Grounding Natural Language Descriptions.

[BibT_eX]

[DOI]

Proceedings of the 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, 2017

Multirobot Symbolic Planning under Temporal Uncertainty.

[BibT_eX]

[DOI]

Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

A Protocol for Mixed Autonomous and Human-Operated Vehicles at Intersections.

[BibT_eX]

[DOI]

Guni Sharon

Proceedings of the Autonomous Agents and Multiagent Systems, 2017

Evaluating Ad Hoc Teamwork Performance in Drop-In Player Challenges.

[BibT_eX]

[DOI]

Proceedings of the Autonomous Agents and Multiagent Systems, 2017

Autonomous Model Management via Reinforcement Learning: Extended Abstract.

[BibT_eX]

[DOI]

Eric Zavesky

Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

Multi-Robot Human Guidance: Human Experiments and Multiple Concurrent Requests.

[BibT_eX]

[DOI]

Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

Agent Behaviors for Joining and Leaving a Flock.

[BibT_eX]

[DOI]

Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

Reasoning about Hypothetical Agent Behaviours and their Parameters.

[BibT_eX]

[DOI]

Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

Mechanism Design with Unknown Correlated Distributions: Can We Learn Optimal Mechanisms?

[BibT_eX]

[DOI]

Michael Albert

Vincent Conitzer

Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

Dynamically Constructed (PO)MDPs for Adaptive Robot Planning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Automatic Curriculum Graph Generation for Reinforcement Learning Agents.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Designing Better Playlists with Monte Carlo Tree Search.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Grounded Action Transformation for Robot Learning in Simulation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Automated Design of Robust Mechanisms.

[BibT_eX]

[DOI]

Michael Albert

Vincent Conitzer

Decebal Constantin Mocanu

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

UT Austin Villa: Project-Driven Research in AI and Robotics.

[BibT_eX]

[DOI]

IEEE Intell. Syst., 2016

Online Contrastive Divergence with Generative Replay: Experience Replay without Storing Data.

[BibT_eX]

[DOI]

CoRR, 2016

Deep Reinforcement Learning in Parameterized Action Space.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Learning Representations, 2016

High Confidence Off-Policy Evaluation with Models.

[BibT_eX]

[DOI]

CoRR, 2016

A synthesis of automated planning and reinforcement learning for efficient, robust decision-making.

[BibT_eX]

[DOI]

Matteo Leonetti

Luca Iocchi

Artif. Intell., 2016

UT Austin Villa: RoboCup 2016 3D Simulation League Competition and Technical Challenges Champions.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2016: Robot World Cup XX [Leipzig, Germany, June 30, 2016

Prioritized Role Assignment for Marking.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2016: Robot World Cup XX [Leipzig, Germany, June 30, 2016

UT Austin Villa RoboCup 3D Simulation Base Code Release.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2016: Robot World Cup XX [Leipzig, Germany, June 30, 2016

Impact of Music on Decision Making in Quantitative Tasks.

[BibT_eX]

[DOI]

Corey N. White

Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

Robot Scavenger Hunt: A Standardized Framework for Evaluating Intelligent Mobile Robots.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Learning Multi-Modal Grounded Linguistic Semantics by Playing "I Spy".

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Learning to Order Objects Using Haptic and Proprioceptive Exploratory Behaviors.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Delta-Tolling: Adaptive Tolling for Optimizing Traffic Throughput.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Workshop on Agents in Traffic and Transportation (ATT 2016) co-located with the 25th International Joint Conference On Artificial Intelligence (IJCAI 2016), 2016

On the Analysis of Complex Backup Strategies in Monte Carlo Tree Search.

[BibT_eX]

[DOI]

Proceedings of the 33nd International Conference on Machine Learning, 2016

Dynamic behaviors on the NAO robot with closed-loop whole body operational space control.

[BibT_eX]

[DOI]

Donghyun Kim

Steven Jens Jorgensen

Luis Sentis

Proceedings of the 16th IEEE-RAS International Conference on Humanoid Robots, 2016

Adaptation of Surrogate Tasks for Bipedal Walk Optimization.

[BibT_eX]

[DOI]

Proceedings of the Genetic and Evolutionary Computation Conference, 2016

Autonomous Learning Agents: Layered Learning and Ad Hoc Teamwork.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Source Task Creation for Curriculum Learning.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Adding Influencing Agents to a Flock.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

An MDP-Based Winning Approach to Autonomous Power Trading: Formalization and Empirical Analysis.

[BibT_eX]

[DOI]

Proceedings of the AI for Smart Grids and Smart Buildings, 2016

Autonomous Electricity Trading Using Time-of-Use Tariffs in a Competitive Market.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

What's Hot at RoboCup.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015

Representative Selection in Non Metric Datasets.

[BibT_eX]

[DOI]

Benny Chor

CoRR, 2015

Who speaks for AI?

[BibT_eX]

[DOI]

Charles L. Isbell Jr.

Michael J. Wooldridge

AI Matters, 2015

Framing reinforcement learning from human reward: Reward positivity, temporal discounting, episodicity, and performance.

[BibT_eX]

[DOI]

Artif. Intell., 2015

Representative Selection in Nonmetric Datasets.

[BibT_eX]

[DOI]

Benny Chor

Appl. Artif. Intell., 2015

Robot-Centric Activity Recognition 'in the Wild'.

[BibT_eX]

[DOI]

Proceedings of the Social Robotics - 7th International Conference, 2015

UT Austin Villa: RoboCup 2015 3D Simulation League Competition and Technical Challenges Champions.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2015: Robot World Cup XIX [papers from the 19th Annual RoboCup International Symposium, 2015

A Study of Layered Learning Strategies Applied to Individual Behaviors in Robot Soccer.

[BibT_eX]

[DOI]

David Leonardo Leottau

Javier Ruiz-del-Solar

Proceedings of the RoboCup 2015: Robot World Cup XIX [papers from the 19th Annual RoboCup International Symposium, 2015

Mobile Robot Planning Using Action Language <i>BC</i> with an Abstraction Hierarchy.

[BibT_eX]

[DOI]

Proceedings of the Logic Programming and Nonmonotonic Reasoning, 2015

How Music Alters Decision Making - Impact of Music Stimuli on Emotional Classification.

[BibT_eX]

[DOI]

Corey N. White

Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015

Benchmarking robot cooperation without pre-coordination in the RoboCup Standard Platform League drop-in player competition.

[BibT_eX]

[DOI]

Tim Laue

Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Learning to Interpret Natural Language Commands through Human-Robot Dialog.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

When Security Games Go Green: Designing Defender Strategies to Prevent Poaching and Illegal Fishing.

[BibT_eX]

[DOI]

Fei Fang

Milind Tambe

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Learning Inter-Task Transferability in the Absence of Target Task Samples.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Monte Carlo Hierarchical Model Learning: (Doctoral Consortium).

[BibT_eX]

[DOI]

Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Monte Carlo Hierarchical Model Learning.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

DJ-MC: A Reinforcement-Learning Agent for Music Playlist Recommendation.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Leading the Way: An Efficient Multi-robot Guidance System.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Determining Placements of Influencing Agents in a Flock.

[BibT_eX]

[DOI]

Shun Zhang

Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

The RoboCup 2014 SPL Drop-in Player Competition: Encouraging Teamwork without Pre-coordination.

[BibT_eX]

[DOI]

Tim Laue

Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Defender Strategies In Domains Involving Frequent Adversary Interaction.

[BibT_eX]

[DOI]

Fei Fang

Milind Tambe

Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Deep Recurrent Q-Learning for Partially Observable MDPs.

[BibT_eX]

[DOI]

Proceedings of the 2015 AAAI Fall Symposia, Arlington, Virginia, USA, November 12-14, 2015, 2015

CORPP: Commonsense Reasoning and Probabilistic Planning, as Applied to Dialog with a Mobile Robot.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

SCRAM: Scalable Collision-avoiding Role Assignment with Minimal-Makespan for Formational Positioning.

[BibT_eX]

[DOI]

Eric Price

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

UT Austin Villa 2014: RoboCup 3D Simulation League Champion via Overlapping Layered Learning.

[BibT_eX]

[DOI]

Mike Depinet

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

The Impact of Determinism on Learning Atari 2600 Games.

[BibT_eX]

[DOI]

Proceedings of the Learning for General Competency in Video Games, 2015

Placing Influencing Agents in a Flock.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Cooperating with Unknown Teammates in Complex Domains: A Robot Soccer Case Study of Ad Hoc Teamwork.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014

A Neuroevolution Approach to General Atari Game Playing.

[BibT_eX]

[DOI]

Joel Lehman

Risto Miikkulainen

IEEE Trans. Comput. Intell. AI Games, 2014

DJ-MC: A Reinforcement-Learning Agent for Music Playlist Recommendation.

[BibT_eX]

[DOI]

CoRR, 2014

Drop-in games at RoboCup.

[BibT_eX]

[DOI]

AI Matters, 2014

RoboCup Soccer Leagues.

[BibT_eX]

[DOI]

AI Mag., 2014

Multiagent learning in the presence of memory-bounded agents.

[BibT_eX]

[DOI]

Auton. Agents Multi Agent Syst., 2014

UT Austin Villa: RoboCup 2014 3D Simulation League Competition and Technical Challenge Champions.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2014: Robot World Cup XVIII [papers from the 18th Annual RoboCup International Symposium, 2014

Keyframe Sampling, Optimization, and Behavior Integration: Towards Long-Distance Kicking in the RoboCup 3D Simulation League.

[BibT_eX]

[DOI]

Mike Depinet

Proceedings of the RoboCup 2014: Robot World Cup XVIII [papers from the 18th Annual RoboCup International Symposium, 2014

The RoboCup 2013 drop-in player challenges: Experiments in ad hoc teamwork.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2014

Communicating with Unknown Teammates.

[BibT_eX]

[DOI]

Proceedings of the ECAI 2014 - 21st European Conference on Artificial Intelligence, 18-22 August 2014, Prague, Czech Republic, 2014

The RoboCup 2013 drop-in player challenges: a testbed for ad hoc teamwork.

[BibT_eX]

[DOI]

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Orienting a flock via ad hoc teamwork.

[BibT_eX]

[DOI]

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Semi-autonomous intersection management.

[BibT_eX]

[DOI]

Shun Zhang

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Modeling uncertainty in leading ad hoc teams.

[BibT_eX]

[DOI]

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Influencing a Flock via Ad Hoc Teamwork.

[BibT_eX]

[DOI]

Proceedings of the Swarm Intelligence - 9th International Conference, 2014

Planning in Action Language BC while Learning Action Costs for Mobile Robots.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fourth International Conference on Automated Planning and Scheduling, 2014

Planning in Answer Set Programming while Learning Action Costs for Mobile Robots.

[BibT_eX]

[DOI]

Proceedings of the 2014 AAAI Spring Symposia, 2014

Multi-Robot Human Guidance Using Topological Graphs.

[BibT_eX]

[DOI]

Proceedings of the 2014 AAAI Spring Symposia, 2014

Leading the Way: An Efficient Multi-Robot Guidance System.

[BibT_eX]

[DOI]

Proceedings of the 2014 AAAI Fall Symposia, Arlington, Virginia, USA, November 13-15, 2014, 2014

TacTex'13: A Champion Adaptive Power Trading Agent.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013

Using a million cell simulation of the cerebellum: Network scaling and task generality.

[BibT_eX]

[DOI]

Wen-Ke Li

Michael D. Mauk

Neural Networks, 2013

TEXPLORE: real-time sample-efficient reinforcement learning for robots.

[BibT_eX]

[DOI]

Mach. Learn., 2013

Teaching and leading an ad hoc teammate: Collaboration without pre-coordination.

[BibT_eX]

[DOI]

Gal A. Kaminka

Jeffrey S. Rosenschein

Artif. Intell., 2013

Training a Robot via Human Feedback: A Case Study.

[BibT_eX]

[DOI]

Cynthia Breazeal

Proceedings of the Social Robotics - 5th International Conference, 2013

The Open-Source TEXPLORE Code Release for Reinforcement Learning on Robots.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2013: Robot World Cup XVII [papers from the 17th Annual RoboCup International Symposium, 2013

The 2012 UT Austin Villa Code Release.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2013: Robot World Cup XVII [papers from the 17th Annual RoboCup International Symposium, 2013

Model-Selection for Non-parametric Function Approximation in Continuous Control Problems: A Case Study in a Smart Energy System.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2013

Teaching agents with human feedback: a demonstration of the TAMER framework.

[BibT_eX]

[DOI]

Cynthia Breazeal

Proceedings of the 18th International Conference on Intelligent User Interfaces, 2013

Learning non-myopically from human-generated reward.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Intelligent User Interfaces, 2013

Auction-based autonomous intersection management.

[BibT_eX]

[DOI]

Dustin Carlino

Stephen D. Boyles

Proceedings of the 16th International IEEE Conference on Intelligent Transportation Systems, 2013

A learning agent for heat-pump thermostat control.

[BibT_eX]

[DOI]

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Learning exploration strategies in model-based reinforcement learning.

[BibT_eX]

[DOI]

Manuel Lopes

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Ad hoc teamwork for leading a flock.

[BibT_eX]

[DOI]

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Humanoid robots learning to walk faster: from the real world to simulation and back.

[BibT_eX]

[DOI]

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Cooperating with a markovian ad hoc teammate.

[BibT_eX]

[DOI]

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Teamwork with Limited Knowledge of Teammates.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012

How Humans Teach Agents - A New Experimental Perspective.

[BibT_eX]

[DOI]

Int. J. Soc. Robotics, 2012

Ten Years of AAMAS: Introduction to the Special Issue.

[BibT_eX]

[DOI]

AI Mag., 2012

UT Austin Villa: RoboCup 2012 3D Simulation League Champion.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2012: Robot Soccer World Cup XVI [papers from the 16th Annual RoboCup International Symposium, 2012

Positioning to Win: A Dynamic Role Assignment and Formation Positioning System.

[BibT_eX]

[DOI]

Francisco Barrera

Proceedings of the RoboCup 2012: Robot Soccer World Cup XVI [papers from the 16th Annual RoboCup International Symposium, 2012

UT Austin Villa 2012: Standard Platform League World Champions.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2012: Robot Soccer World Cup XVI [papers from the 16th Annual RoboCup International Symposium, 2012

Reinforcement learning from human reward: Discounting in episodic tasks.

[BibT_eX]

[DOI]

Proceedings of the 21st IEEE International Symposium on Robot and Human Interactive Communication, 2012

Approximately Orchestrated Routing and Transportation Analyzer: Large-scale traffic simulation for autonomous vehicles.

[BibT_eX]

[DOI]

Proceedings of the 15th International IEEE Conference on Intelligent Transportation Systems, 2012

Video: RoboCup robot soccer history 1997 - 2011.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Evasion planning for autonomous vehicles at intersections.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

RTMBA: A Real-Time Model-Based Reinforcement Learning Architecture for robot control.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2012

Setpoint scheduling for autonomous vehicle controllers.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2012

On coordination in practical multi-robot patrol.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2012

PAC Subset Selection in Stochastic Multi-armed Bandits.

[BibT_eX]

[DOI]

Ambuj Tewari

Peter Auer

Proceedings of the 29th International Conference on Machine Learning, 2012

Intrinsically motivated model learning for a developing curious agent.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Development and Learning and Epigenetic Robotics, 2012

A Platform for Evaluating Autonomous Intersection Management Policies.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE/ACM Third International Conference on Cyber-Physical Systems, 2012

HyperNEAT-GGP: a hyperNEAT-based atari general game player.

[BibT_eX]

[DOI]

Risto Miikkulainen

Proceedings of the Genetic and Evolutionary Computation Conference, 2012

UT Austin Villa 2011: a champion agent in the RoboCup 3D soccer simulation competition.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Reinforcement learning from simultaneous human and MDP reward.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Role selection in ad hoc teamwork.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

An analysis framework for ad hoc teamwork tasks.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Leading ad hoc agents in joint action settings with multiple teammates.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Design and Optimization of an Omnidirectional Humanoid Walk: A Winning Approach at the RoboCup 2011 3D Simulation Competition.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

Learning and Using Models.

[BibT_eX]

[DOI]

Proceedings of the Reinforcement Learning, 2012

2011

Designing adaptive trading agents.

[BibT_eX]

[DOI]

SIGecom Exch., 2011

Characterizing reinforcement learning methods through parameterized learning problems.

[BibT_eX]

[DOI]

Mach. Learn., 2011

A Real-Time Model-Based Reinforcement Learning Architecture for Robot Control

[BibT_eX]

[DOI]

CoRR, 2011

An Introduction to Intertask Transfer for Reinforcement Learning.

[BibT_eX]

[DOI]

AI Mag., 2011

Empowerment for continuous agent - environment systems.

[BibT_eX]

[DOI]

Tobias Jung

Daniel Polani

Adapt. Behav., 2011

A Low Cost Ground Truth Detection System for RoboCup Using the Kinect.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2011: Robot Soccer World Cup XV [papers from the 15th Annual RoboCup International Symposium, 2011

WrightEagle and UT Austin Villa: RoboCup 2011 Simulation League Champions.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2011: Robot Soccer World Cup XV [papers from the 15th Annual RoboCup International Symposium, 2011

Dynamic lane reversal in traffic management.

[BibT_eX]

[DOI]

Proceedings of the 14th International IEEE Conference on Intelligent Transportation Systems, 2011

Autonomous Intersection Management: Multi-intersection optimization.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

Structure Learning in Ergodic Factored MDPs without Knowledge of the Transition Function's In-Degree.

[BibT_eX]

[DOI]

Proceedings of the 28th International Conference on Machine Learning, 2011

Invited Talk: PRISM - Practical RL: Representation, Interaction, Synthesis, and Mortality.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Reinforcement Learning - 9th European Workshop, 2011

On optimizing interdependent skills: a case study in simulated 3D humanoid robot soccer.

[BibT_eX]

[DOI]

Yinon Bentor

Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

Batch reservations in autonomous intersection management.

[BibT_eX]

[DOI]

Neda Shahidi

Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

Flood Disaster Mitigation: A Real-World Challenge Problem for Multi-agent Unmanned Surface Vehicles.

[BibT_eX]

[DOI]

Proceedings of the Advanced Agent Technology, 2011

A particle filter for bid estimation in ad auctions with periodic ranking observations.

[BibT_eX]

[DOI]

Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

Empirical evaluation of ad hoc teamwork in the pursuit domain.

[BibT_eX]

[DOI]

Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

Ship patrol: multiagent patrol under complex environmental conditions.

[BibT_eX]

[DOI]

Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

Protecting against evaluation overfitting in empirical reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, 2011

On learning with imperfect representations.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, 2011

Intersections of the Future: Using Fully Autonomous Vehicles.

[BibT_eX]

[DOI]

Proceedings of the Agents and Data Mining Interaction, 2011

Reinforcement Learning with Human Feedback in Mountain Car.

[BibT_eX]

[DOI]

Adam Bradley Setapen

Proceedings of the Help Me Help You: Bridging the Gaps in Human-Agent Collaboration, 2011

Comparing Agents' Success against People in Security Domains.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Role-Based Ad Hoc Teamwork.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Ad Hoc Teamwork in Variations of the Pursuit Domain.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Enforcing Liveness in Autonomous Traffic Management.

[BibT_eX]

[DOI]

Neda Shahidi

Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Multiagent Patrol Generalized to Complex Environmental Conditions.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Leading Multiple Ad Hoc Teammates in Joint Action Settings.

[BibT_eX]

[DOI]

Proceedings of the Interactive Decision Theory and Game Theory, 2011

2010

Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Encyclopedia of Machine Learning, 2010

Q-Learning.

[BibT_eX]

[DOI]

Proceedings of the Encyclopedia of Machine Learning, 2010

Adaptive Auction Mechanism Design and the Incorporation of Prior Knowledge.

[BibT_eX]

[DOI]

Tayfun Keskin

Kerem Tomak

INFORMS J. Comput., 2010

Autonomous return on investment analysis of additional processing resources.

[BibT_eX]

[DOI]

Jonathan Wildstrom

Emmett Witchel

Int. J. Auton. Comput., 2010

Critical factors in the empirical performance of temporal difference and evolutionary methods for reinforcement learning.

[BibT_eX]

[DOI]

Auton. Agents Multi Agent Syst., 2010

Learning Powerful Kicks on the Aibo ERS-7: The Quest for a Striker.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2010: Robot Soccer World Cup XIV [papers from the 14th annual RoboCup International Symposium, 2010

Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration.

[BibT_eX]

[DOI]

Tobias Jung

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2010

Bringing simulation to life: A mixed reality autonomous intersection.

[BibT_eX]

[DOI]

Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010

Generalized model learning for Reinforcement Learning on a humanoid robot.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2010

Boosting for Regression Transfer.

[BibT_eX]

[DOI]

Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Efficient Selection of Multiple Bandit Arms: Theory and Practice.

[BibT_eX]

[DOI]

Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Convergence, Targeted Optimality, and Safety in Multiagent Learning.

[BibT_eX]

[DOI]

Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Real time targeted exploration in large domains.

[BibT_eX]

[DOI]

Proceedings of the 2010 IEEE 9th International Conference on Development and Learning, 2010

To teach or not to teach?: decision making under uncertainty in ad hoc teams.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

MARIOnET: motion acquisition for robots through iterative online evaluative training.

[BibT_eX]

[DOI]

Adam Setapen

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

TacTex09: a champion bidding agent for ad auctions.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Training a Tetris agent via interactive shaping: a demonstration of the TAMER framework.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Combining manual feedback with subsequent MDP reward signals for reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Online model learning in adversarial Markov decision processes.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Ad Hoc Autonomous Agent Teams: Collaboration without Pre-Coordination.

[BibT_eX]

[DOI]

Gal A. Kaminka

Jeffrey S. Rosenschein

Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

Motion Planning Algorithms for Autonomous Intersection Management.

[BibT_eX]

[DOI]

Proceedings of the Bridging the Gap Between Task and Motion Planning, 2010

Multi-Agent Social Simulation.

[BibT_eX]

[DOI]

Proceedings of the Handbook of Ambient Intelligence and Smart Environments, 2010

2009

Color learning and illumination invariance on mobile robots: A survey.

[BibT_eX]

[DOI]

Robotics Auton. Syst., 2009

Transfer Learning for Reinforcement Learning Domains: A Survey.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2009

Learning Complementary Multiagent Behaviors: A Case Study.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2009: Robot Soccer World Cup XIII [papers from the 13th annual RoboCup International Symposium, Graz, Austria, June 29, 2009

Three Humanoid Soccer Platforms: Comparison and Synthesis.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2009: Robot Soccer World Cup XIII [papers from the 13th annual RoboCup International Symposium, Graz, Austria, June 29, 2009

Feature Selection for Value Function Approximation Using Bayesian Model Selection.

[BibT_eX]

[DOI]

Tobias Jung

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2009

Compositional Models for Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2009

Interactively shaping agents via human reinforcement: the TAMER framework.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Knowledge Capture (K-CAP 2009), 2009

Improving particle filter performance using SSE instructions.

[BibT_eX]

[DOI]

Peter Djeu

Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

An empirical analysis of value function-based and policy search reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009

Generalized model learning for reinforcement learning in factored domains.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009

Leading a Best-Response Teammate in an Ad Hoc Team.

[BibT_eX]

[DOI]

Gal A. Kaminka

Jeffrey S. Rosenschein

Proceedings of the Agent-Mediated Electronic Commerce. Designing Trading Strategies and Mechanisms for Electronic Markets, 2009

Design Principles for Creating Human-Shapable Agents.

[BibT_eX]

[DOI]

Ian R. Fasel

Proceedings of the Agents that Learn from Human Teachers, 2009

A Task Specification Language for Bootstrap Learning.

[BibT_eX]

[DOI]

Ian R. Fasel

Proceedings of the Agents that Learn from Human Teachers, 2009

An Unmanaged Intersection Protocol and Improved Intersection Safety for Autonomous Vehicles.

[BibT_eX]

[DOI]

Mark Van Middlesworth

Proceedings of the Multi-Agent Systems for Traffic and Transportation Engineering., 2009

2008

Book announcement: autonomous bidding agents.

[BibT_eX]

[DOI]

Michael P. Wellman

SIGecom Exch., 2008

A Multiagent Approach to Autonomous Intersection Management.

[BibT_eX]

[DOI]

J. Artif. Intell. Res., 2008

Polynomial Regression with Automated Degree: a Function Approximator for Autonomous Agents.

[BibT_eX]

[DOI]

Int. J. Artif. Intell. Tools, 2008

Comparing Two Action Planning Approaches for Color Learning on a Mobile Robot.

[BibT_eX]

Proceedings of the VISAPP International Workshop on Robotic Perception, 2008

Long-Term vs. Greedy Action Planning for Color Learning on a Mobile Robot.

[BibT_eX]

Proceedings of the VISAPP 2008: Proceedings of the Third International Conference on Computer Vision Theory and Applications, Funchal, Madeira, Portugal, January 22-25, 2008, 2008

Domestic Interaction on a Segway Base.

[BibT_eX]

[DOI]

Juhyun Lee

Proceedings of the RoboCup 2008: Robot Soccer World Cup XII [papers from the 12th annual RoboCup International Symposium, 2008

Transferring Instances for Model-Based Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2008

Online Multiagent Learning against Memory Bounded Adversaries.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2008

Maximum likelihood estimation of sensor and action model functions on a mobile robot.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

Person tracking on a mobile robot with heterogeneous inter-characteristic feedback.

[BibT_eX]

[DOI]

Juhyun Lee

Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

Person recognition on a Segway Robot: A video of UT Austin Villa Robocup@Home 2007 finals demonstration.

[BibT_eX]

[DOI]

Juhyun Lee

Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

Negative information and line observations for Monte Carlo localization.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

Online kernel selection for Bayesian reinforcement learning.

[BibT_eX]

[DOI]

Joseph Reisinger

Risto Miikkulainen

Proceedings of the Machine Learning, 2008

CARVE: A Cognitive Agent for Resource Value Estimation.

[BibT_eX]

[DOI]

Jonathan Wildstrom

Emmett Witchel

Proceedings of the 2008 International Conference on Autonomic Computing, 2008

Autonomous transfer for reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

Replacing the stop sign: unmanaged intersection control for autonomous vehicles.

[BibT_eX]

[DOI]

Mark Van Middlesworth

Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

The utility of temporal abstraction in reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

Mitigating catastrophic failure at intersections of autonomous vehicles.

[BibT_eX]

[DOI]

Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

The 2007 TAC SCM Prediction Challenge.

[BibT_eX]

[DOI]

Proceedings of the Agent-Mediated Electronic Commerce and Trading Agent Design and Analysis, 2008

Transfer Learning and Intelligence: an Argument and Approach.

[BibT_eX]

[DOI]

Proceedings of the Artificial General Intelligence 2008, 2008

2007

Intelligent Autonomous Robotics: A Robot Soccer Case Study

[BibT_eX]

[DOI]

Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan & Claypool Publishers, ISBN: 978-3-031-01544-1, 2007

Transfer Learning via Inter-Task Mappings for Temporal Difference Learning.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2007

Structure-based color learning on a mobile robot under changing illumination.

[BibT_eX]

[DOI]

Auton. Robots, 2007

Multiagent learning is not the answer. It is the question.

[BibT_eX]

[DOI]

Artif. Intell., 2007

Empirical Studies in Action Selection with Reinforcement Learning.

[BibT_eX]

[DOI]

Adapt. Behav., 2007

Model-Based Exploration in Continuous State Spaces.

[BibT_eX]

[DOI]

Proceedings of the Abstraction, 2007

Model-Based Reinforcement Learning in a Complex Domain.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2007: Robot Soccer World Cup XI, 2007

A Neural Network-Based Approach to Robot Motion Control.

[BibT_eX]

[DOI]

Uli Grasemann

Proceedings of the RoboCup 2007: Robot Soccer World Cup XI, 2007

Instance-Based Action Models for Fast Action Planning.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2007: Robot Soccer World Cup XI, 2007

Global action selection for illumination invariant color modeling.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007

Machine Learning for On-Line Hardware Reconfiguration.

[BibT_eX]

[DOI]

Proceedings of the IJCAI 2007, 2007

Learning and Multiagent Reasoning for Autonomous Agents.

[BibT_eX]

[DOI]

Proceedings of the IJCAI 2007, 2007

Color Learning on a Mobile Robot: Towards Full Autonomy under Changing Illumination.

[BibT_eX]

[DOI]

Proceedings of the IJCAI 2007, 2007

Sharing the Road: Autonomous Vehicles Meet Human Drivers.

[BibT_eX]

[DOI]

Proceedings of the IJCAI 2007, 2007

General Game Learning Using Knowledge Transfer.

[BibT_eX]

[DOI]

Bikramjit Banerjee

Proceedings of the IJCAI 2007, 2007

A Comparison of Two Approaches for Vision and Self-Localization on a Mobile Robot.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE International Conference on Robotics and Automation, 2007

Cross-domain transfer for reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning, 2007

Graph-Based Domain Mapping for Transfer Learning in General Games.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning: ECML 2007, 2007

Transfer via inter-task mappings in policy search reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

Towards reinforcement learning representation transfer.

[BibT_eX]

[DOI]

Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

Adapting Price Predictions in TAC SCM.

[BibT_eX]

[DOI]

Proceedings of the Agent-Mediated Electronic Commerce and Trading Agent Design and Analysis, 2007

Adapting in agent-based markets: a study from TAC SCM.

[BibT_eX]

[DOI]

Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

Batch reinforcement learning in a complex domain.

[BibT_eX]

[DOI]

Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

Model-based function approximation in reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

IFSA: incremental feature-set augmentation for reinforcement learning tasks.

[BibT_eX]

[DOI]

Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

Representation Transfer for Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Computational Approaches to Representation Change during Learning and Development, 2007

Temporal Difference and Policy Search Methods for Reinforcement Learning: An Empirical Comparison.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

Representation Transfer via Elaboration.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

Autonomous bidding agents - strategies and lessons from the trading agent competition.

[BibT_eX]

Michael P. Wellman

MIT Press, ISBN: 978-0-262-23260-9, 2007

2006

From pixels to multi-robot decision-making: A study in uncertainty.

[BibT_eX]

[DOI]

Robotics Auton. Syst., 2006

Evolutionary Function Approximation for Reinforcement Learning.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2006

Towards autonomous sensor and actuator model induction on a mobile robot.

[BibT_eX]

[DOI]

Connect. Sci., 2006

Cobot in LambdaMOO: An Adaptive Social Statistics Agent.

[BibT_eX]

[DOI]

Auton. Agents Multi Agent Syst., 2006

Selective Visual Attention for Object Detection on a Legged Robot.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2006: Robot Soccer World Cup X, 2006

Autonomous Planned Color Learning on a Legged Robot.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2006: Robot Soccer World Cup X, 2006

Autonomous Learning of Stable Quadruped Locomotion.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2006: Robot Soccer World Cup X, 2006

Half Field Offense in RoboCup Soccer: A Multiagent Reinforcement Learning Case Study.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2006: Robot Soccer World Cup X, 2006

The Chin Pinch: A Case Study in Skill Learning on a Legged Robot.

[BibT_eX]

[DOI]

Peggy Fidelman

Proceedings of the RoboCup 2006: Robot Soccer World Cup X, 2006

A Multi-robot System for Continuous Area Sweeping Tasks.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Robotics and Automation, 2006

Autonomous Planned Color Learning on a Mobile Robot Without Labeled Data.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Control, 2006

On-line evolutionary computation for reinforcement learning in stochastic domains.

[BibT_eX]

[DOI]

Proceedings of the Genetic and Evolutionary Computation Conference, 2006

Comparing evolutionary and temporal difference methods in a reinforcement learning domain.

[BibT_eX]

[DOI]

Proceedings of the Genetic and Evolutionary Computation Conference, 2006

Designing safe, profitable automated stock trading agents using evolutionary algorithms.

[BibT_eX]

[DOI]

Harish Subramanian

Subramanian Ramamoorthy

Benjamin Kuipers

Proceedings of the Genetic and Evolutionary Computation Conference, 2006

A Distributed Biconnectivity Check.

[BibT_eX]

[DOI]

Proceedings of the Distributed Autonomous Robotic Systems 7, 2006

TacTex-05: An Adaptive Agent for TAC SCM.

[BibT_eX]

[DOI]

Mark Van Middlesworth

Proceedings of the Agent-Mediated Electronic Commerce. Automated Negotiation and Strategy Design for Electronic Markets, 2006

Predictive Planning for Supply Chain Management.

[BibT_eX]

[DOI]

Proceedings of the Sixteenth International Conference on Automated Planning and Scheduling, 2006

Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Proceedings, 2006

Inter-Task Action Correlation for Reinforcement Learning Tasks.

[BibT_eX]

[DOI]

Proceedings of the Proceedings, 2006

Expectation-Based Vision for Self-Localization on a Legged Robot.

[BibT_eX]

[DOI]

Proceedings of the Proceedings, 2006

TacTex-05: A Champion Supply Chain Management Agent.

[BibT_eX]

[DOI]

Proceedings of the Proceedings, 2006

Value-Function-Based Transfer for Reinforcement Learning Using Structure Mapping.

[BibT_eX]

[DOI]

Proceedings of the Proceedings, 2006

Automatic Heuristic Construction for General Game Playing.

[BibT_eX]

[DOI]

Proceedings of the Proceedings, 2006

Automatic Heuristic Construction in a Complete General Game Player.

[BibT_eX]

[DOI]

Proceedings of the Proceedings, 2006

Know Thine Enemy: A Champion RoboCup Coach Agent.

[BibT_eX]

[DOI]

William B. Knox

Proceedings of the Proceedings, 2006

Making Autonomous Intersection Management Backwards-Compatible.

[BibT_eX]

[DOI]

Proceedings of the Proceedings, 2006

Traffic Intersections of the Future.

[BibT_eX]

[DOI]

Proceedings of the Proceedings, 2006

Biconnected Structure for Multi-Robot Systems.

[BibT_eX]

[DOI]

Proceedings of the Proceedings, 2006

Keeping in Touch: Maintaining Biconnected Structure by Homogeneous Robots.

[BibT_eX]

[DOI]

Proceedings of the Proceedings, 2006

Adaptive mechanism design: a metalearning approach.

[BibT_eX]

[DOI]

Kerem Tomak

Proceedings of the 8th International Conference on Electronic Commerce: The new e-commerce, 2006

2005

Developing adaptive auction mechanisms.

[BibT_eX]

[DOI]

SIGecom Exch., 2005

Evolving Soccer Keepaway Players Through Task Decomposition.

[BibT_eX]

[DOI]

Mach. Learn., 2005

The First International Trading Agent Competition: Autonomous Bidding Agents.

[BibT_eX]

[DOI]

Electron. Commer. Res., 2005

A polynomial-time Nash equilibrium algorithm for repeated games.

[BibT_eX]

[DOI]

Michael L. Littman

Decis. Support Syst., 2005

Reinforcement Learning for RoboCup Soccer Keepaway.

[BibT_eX]

[DOI]

Adapt. Behav., 2005

Function Approximation via Tile Coding: Automating Parameter Choice.

[BibT_eX]

[DOI]

Alexander A. Sherstov

Proceedings of the Abstraction, 2005

Keepaway Soccer: From Machine Learning Testbed to Benchmark.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2005: Robot Soccer World Cup IX, 2005

Towards Eliminating Manual Color Calibration at RoboCup.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2005: Robot Soccer World Cup IX, 2005

Multiagent Traffic Management: Opportunities for Multiagent Learning.

[BibT_eX]

[DOI]

Proceedings of the Learning and Adaption in Multi-Agent Systems, 2005

Multi-robot Learning for Continuous Area Sweeping.

[BibT_eX]

[DOI]

Proceedings of the Learning and Adaption in Multi-Agent Systems, 2005

Real-time vision on a mobile robot platform.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2005

State Abstraction Discovery from Irrelevant State Variables.

[BibT_eX]

[DOI]

Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005

Simultaneous Calibration of Action and Sensor Models on a Mobile Robot.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Robotics and Automation, 2005

Practical Vision-Based Monte Carlo Localization on a Legged Robot.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Robotics and Automation, 2005

Towards Self-Configuring Hardware for Distributed Computer Systems.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Autonomic Computing (ICAC 2005), 2005

Automatic feature selection in neuroevolution.

[BibT_eX]

[DOI]

Proceedings of the Genetic and Evolutionary Computation Conference, 2005

Behavior transfer for value-function-based reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the 4th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2005), 2005

Multiagent traffic management: an improved intersection control mechanism.

[BibT_eX]

[DOI]

Proceedings of the 4th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2005), 2005

Value Functions for RL-Based Behavior Transfer: A Comparative Study.

[BibT_eX]

[DOI]

Proceedings of the Proceedings, 2005

Autonomous Color Learning on a Mobile Robot.

[BibT_eX]

[DOI]

Proceedings of the Proceedings, 2005

Improving Action Selection in MDP's via Knowledge Transfer.

[BibT_eX]

[DOI]

Alexander A. Sherstov

Proceedings of the Proceedings, 2005

2004

TacTex-03: a supply chain management agent.

[BibT_eX]

[DOI]

SIGecom Exch., 2004

Using RoboCup in university-level computer science education.

[BibT_eX]

[DOI]

Elizabeth Sklar

Simon Parsons

ACM J. Educ. Resour. Comput., 2004

Adaptive job routing and scheduling.

[BibT_eX]

[DOI]

Eng. Appl. Artif. Intell., 2004

A Model-Based Approach to Robot Joint Control.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2004: Robot Soccer World Cup VIII, 2004

Towards Illumination Invariance in the Legged League.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2004: Robot Soccer World Cup VIII, 2004

The UT Austin Villa 2003 Champion Simulator Coach: A Machine Learning Approach.

[BibT_eX]

[DOI]

Justin Lallinger

Proceedings of the RoboCup 2004: Robot Soccer World Cup VIII, 2004

Policy Gradient Reinforcement Learning for Fast Quadrupedal Locomotion.

[BibT_eX]

[DOI]

Nate Kohl

Proceedings of the 2004 IEEE International Conference on Robotics and Automation, 2004

Towards Autonomic Computing: Adaptive Network Routing and Scheduling.

[BibT_eX]

[DOI]

Proceedings of the 1st International Conference on Autonomic Computing (ICAC 2004), 2004

Towards On-Board Color Constancy on Mobile Robots.

[BibT_eX]

[DOI]

Proceedings of the 1st Canadian Conference on Computer and Robot Vision (CRV 2004) 17-19 May 2004, 2004

Agent-Based Supply Chain Management: Bidding for Customer Orders.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2004), 2004

Multiagent Traffic Management: A Reservation-Based Intersection Control Mechanism.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2004), 2004

Three Automated Stock-Trading Agents: A Comparative Study.

[BibT_eX]

[DOI]

Alexander A. Sherstov

Proceedings of the Agent-Mediated Electronic Commerce VI, 2004

Bidding for Customer Orders in TAC SCM.

[BibT_eX]

[DOI]

Proceedings of the Agent-Mediated Electronic Commerce VI, 2004

Towards Autonomic Computing: Adaptive Job Routing and Scheduling.

[BibT_eX]

Proceedings of the Nineteenth National Conference on Artificial Intelligence, 2004

Machine Learning for Fast Quadrupedal Locomotion.

[BibT_eX]

[DOI]

Nate Kohl

Proceedings of the Nineteenth National Conference on Artificial Intelligence, 2004

2003

Decision-Theoretic Bidding Based on Learned Density Models in Simultaneous, Interacting Auctions.

[BibT_eX]

[DOI]

J. Artif. Intell. Res., 2003

Guest Editors' Introduction: Agents and Markets.

[BibT_eX]

[DOI]

Nicholas R. Jennings

IEEE Intell. Syst., 2003

The 2001 Trading Agent Competition.

[BibT_eX]

[DOI]

Electron. Mark., 2003

The RoboCup Soccer Server and CMUnited Clients: Implemented Infrastructure for MAS Research.

[BibT_eX]

[DOI]

Itsuki Noda

Auton. Agents Multi Agent Syst., 2003

RoboCup as an Introduction to CS Research.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2003: Robot Soccer World Cup VII, 2003

RoboCup in Higher Education: A Preliminary Report.

[BibT_eX]

[DOI]

Elizabeth Sklar

Simon Parsons

Proceedings of the RoboCup 2003: Robot Soccer World Cup VII, 2003

Progress in Learning 3 vs. 2 Keepaway

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2003: Robot Soccer World Cup VII, 2003

Learning Predictive State Representations.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning, 2003

Evolving Keepaway Soccer Players through Task Decomposition.

[BibT_eX]

[DOI]

Proceedings of the Genetic and Evolutionary Computation, 2003

Concurrent layered learning.

[BibT_eX]

[DOI]

Proceedings of the Second International Joint Conference on Autonomous Agents & Multiagent Systems, 2003

Two Stock-Trading Agents: Market Making and Technical Analysis.

[BibT_eX]

[DOI]

Y. Feng

Rong Yu

Proceedings of the Agent-Mediated Electronic Commerce V, 2003

Performance analysis of a counter-intuitive automated stock-trading agent.

[BibT_eX]

[DOI]

Ronggang Yu

Mohammad Taghi Hajiaghayi

Proceedings of the 5th International Conference on Electronic Commerce, 2003

2002

RoboCup-2001: The Fifth Robotic Soccer World Championships.

[BibT_eX]

[DOI]

Ehsan Chiniforooshan

AI Mag., 2002

The 2002 AAAI Spring Symposium Series.

[BibT_eX]

[DOI]

AI Mag., 2002

Multiagent Competitions and Research: Lessons from RoboCup and TAC.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2002: Robot Soccer World Cup VI, 2002

Modeling Auction Price Uncertainty Using Boosting-based Conditional Density Estimation.

[BibT_eX]

Proceedings of the Machine Learning, 2002

Randomized strategic demand reduction: getting more by asking for less.

[BibT_eX]

[DOI]

Proceedings of the First International Joint Conference on Autonomous Agents & Multiagent Systems, 2002

ATTac-2001: A Learning, Autonomous Bidding Agent.

[BibT_eX]

[DOI]

Proceedings of the Agent-Mediated Electronic Commerce IV, 2002

Self-Enforcing Strategic Demand Reduction.

[BibT_eX]

[DOI]

Proceedings of the Agent-Mediated Electronic Commerce IV, 2002

2001

ATTac-2000: An Adaptive Autonomous Bidding Agent.

[BibT_eX]

[DOI]

J. Artif. Intell. Res., 2001

Autonomous Bidding Agents in the Trading Agent Competition.

[BibT_eX]

[DOI]

IEEE Internet Comput., 2001

RoboCup-2000: The Fourth Robotic Soccer World Championships.

[BibT_eX]

[DOI]

AI Mag., 2001

FAucS : An FCC Spectrum Auction Simulator for Autonomous Bidding Agents.

[BibT_eX]

[DOI]

Proceedings of the Electronic Commerce, Second International Workshop, 2001

Keepaway Soccer: A Machine Learning Testbed.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2001: Robot Soccer World Cup V, 2001

ATTUnited-2001: Using Heterogeneous Players.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2001: Robot Soccer World Cup V, 2001

Cobot: A Social Reinforcement Learning Agent.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001

Scaling Reinforcement Learning toward RoboCup Soccer.

[BibT_eX]

Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28, 2001

Implicit Negotiation in Repeated Games.

[BibT_eX]

[DOI]

Michael L. Littman

Proceedings of the Intelligent Agents VIII, 8th International Workshop, 2001

An architecture for action selection in robotic soccer.

[BibT_eX]

[DOI]

David A. McAllester

Proceedings of the Fifth International Conference on Autonomous Agents, 2001

A social reinforcement learning agent.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference on Autonomous Agents, 2001

2000

Multiagent Systems: A Survey from a Machine Learning Perspective.

[BibT_eX]

[DOI]

Auton. Robots, 2000

CMUNITED-98: RoboCup-98 Small-Robot World Champion Team.

[BibT_eX]

[DOI]

AI Mag., 2000

CMUNITED-98 Simulator Team.

[BibT_eX]

[DOI]

AI Mag., 2000

The CMUnited-99 Champion Simulator Team.

[BibT_eX]

[DOI]

AI Mag., 2000

Overview of RoboCup-99.

[BibT_eX]

[DOI]

Gerhard K. Kraetzschmar

Minoru Asada

AI Mag., 2000

Reinforcement Learning for 3 vs. 2 Keepaway

[BibT_eX]

[DOI]

Satinder Singh

Proceedings of the RoboCup 2000: Robot Soccer World Cup IV, 2000

Overview of RoboCup-2000.

[BibT_eX]

[DOI]

Gerhard K. Kraetzschmar

Proceedings of the RoboCup 2000: Robot Soccer World Cup IV, 2000

ATT-CMUnited-2000: Third Place Finisher in the RoboCup-2000 Simulator League.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2000: Robot Soccer World Cup IV, 2000

Keeping the Ball from CMUnited-99.

[BibT_eX]

[DOI]

David A. McAllester

Proceedings of the RoboCup 2000: Robot Soccer World Cup IV, 2000

Progress in RoboCup Soccer Research in 2000.

[BibT_eX]

[DOI]

Proceedings of the Experimental Robotics VII [ISER 2000, 2000

TPOT-RL Applied to Network Routing.

[BibT_eX]

Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

Defining and Using Ideal Teammate and Opponent Agent Models: A Case Study in Robotic Soccer.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Multi-Agent Systems, 2000

Layered Learning.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning: ECML 2000, 11th European Conference on Machine Learning, Barcelona, Catalonia, Spain, May 31, 2000

Layered Disclosure: Revealing Agents' Internals.

[BibT_eX]

[DOI]

Proceedings of the Intelligent Agents VII. Agent Theories Architectures and Languages, 2000

Layered disclosure: why is the agent doing what it's doing?

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference on Autonomous Agents, 2000

The RoboCup Soccer Server and CMUnited: Implemented Infrastructure for MAS Research.

[BibT_eX]

[DOI]

Itsuki Noda

Proceedings of the Infrastructure for Agents, 2000

Defining and Using Ideal Teammate and Opponent Agent Models.

[BibT_eX]

[DOI]

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on on Innovative Applications of Artificial Intelligence, July 30, 2000

Cobot in LambdaMOO: A Social Statistics Agent.

[BibT_eX]

[DOI]

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on on Innovative Applications of Artificial Intelligence, July 30, 2000

Layered learning in multiagent systems - a winning approach to robotic soccer.

[BibT_eX]

Intelligent robotics and autonomous agents, MIT Press, ISBN: 978-0-262-19438-9, 2000

1999

The CMUnited-97 robotic soccer team: Perception and multi-agent control.

[BibT_eX]

[DOI]

Kwun Han

Robotics Auton. Syst., 1999

Task Decomposition, Dynamic Role Assignment, and Low-Bandwidth Communication for Real-Time Strategic Teamwork.

[BibT_eX]

[DOI]

Artif. Intell., 1999

Overview of RoboCup-99.

[BibT_eX]

[DOI]

Hiroaki Kitano

Enrico Pagello

Gerhard K. Kraetzschmar

Proceedings of the RoboCup-99: Robot Soccer World Cup III, 1999

Layered Learning and Flexible Teamwork in RoboCup Simulation Agents.

[BibT_eX]

[DOI]

Proceedings of the RoboCup-99: Robot Soccer World Cup III, 1999

Team-Partitioned, Opaque-Transition Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Third Annual Conference on Autonomous Agents, 1999

CMUnited-98: A Team of Robotic Soccer Agents.

[BibT_eX]

[DOI]

Proceedings of the Sixteenth National Conference on Artificial Intelligence and Eleventh Conference on Innovative Applications of Artificial Intelligence, 1999

1998

Towards collaborative and adversarial learning: a case study in robotic soccer.

[BibT_eX]

[DOI]

Int. J. Hum. Comput. Stud., 1998

CMUnited: a team of robotics soccer agents collaborating in an adversarial environment.

[BibT_eX]

[DOI]

XRDS, 1998

The CMUnited-98 champion small-robot team.

[BibT_eX]

[DOI]

Michael H. Bowling

Adv. Robotics, 1998

CMUNITED-97: RoboCup-97 Small-Robot World Champion Team.

[BibT_eX]

[DOI]

Kwun Han

AI Mag., 1998

Layered Approach to Learning Client Behaviors in the Robocup Soccer Server.

[BibT_eX]

[DOI]

Appl. Artif. Intell., 1998

The Robocup Physical Agent Challenge: Phase I.

[BibT_eX]

[DOI]

Appl. Artif. Intell., 1998

The CMUnited-98 Small-Robot Team.

[BibT_eX]

[DOI]

Proceedings of the RoboCup-98: Robot Soccer World Cup II, 1998

The CMUnited-98 Champion Simulator Team.

[BibT_eX]

[DOI]

Proceedings of the RoboCup-98: Robot Soccer World Cup II, 1998

Team-Partitioned, Opaque-Transition Reinforced Learning.

[BibT_eX]

[DOI]

Proceedings of the RoboCup-98: Robot Soccer World Cup II, 1998

Individual and Collaborative Behaviors in a Team of Robotic Soccer Agents.

[BibT_eX]

[DOI]

Proceedings of the Third International Conference on Multiagent Systems, 1998

Communication in Domains with Unreliable, Single-Channel, Low-Bandwidth Communication.

[BibT_eX]

[DOI]

Proceedings of the Collective Robotics, First International Workshop, 1998

Task Decomposition and Dynamic Role Assignment for Real-Time Strategic Teamwork.

[BibT_eX]

[DOI]

Proceedings of the Intelligent Agents V, 1998

The CMUnited-97 Robotic Socccer Team: Perception and Multiagent Control.

[BibT_eX]

[DOI]

Kwun Han

Proceedings of the Second International Conference on Autonomous Agents, 1998

Using Decision Tree Confidence Factors for Multi-Agent Control.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Autonomous Agents, 1998

1997

The CMUnited-97 Small Robot Team.

[BibT_eX]

[DOI]

Proceedings of the RoboCup-97: Robot Soccer World Cup I, 1997

The CMUnited-97 Simulator Team.

[BibT_eX]

[DOI]

Proceedings of the RoboCup-97: Robot Soccer World Cup I, 1997

Using Decision Tree Confidence Factors for Multiagent Control.

[BibT_eX]

[DOI]

Proceedings of the RoboCup-97: Robot Soccer World Cup I, 1997

The RoboCup Synthetic Agent Challenge 97.

[BibT_eX]

[DOI]

Proceedings of the RoboCup-97: Robot Soccer World Cup I, 1997

The RoboCup Physical Agent Challenge: Goals and Protocols for Phase 1.

[BibT_eX]

[DOI]

Proceedings of the RoboCup-97: Robot Soccer World Cup I, 1997

A Layered Approach for an Autonomous Robotic Soccer System.

[BibT_eX]

[DOI]

Sorin Achim

Proceedings of the First International Conference on Autonomous Agents, 1997

Layered Learning in Multiagent Systems.

[BibT_eX]

[DOI]

Proceedings of the Fourteenth National Conference on Artificial Intelligence and Ninth Innovative Applications of Artificial Intelligence Conference, 1997

1995

FLECS: Planning with a Flexible Commitment Strategy.

[BibT_eX]

[DOI]

J. Artif. Intell. Res., 1995

Beating a Defender in Robotic Soccer: Memory-Based Learning of a Continuous Function.

[BibT_eX]

[DOI]