Modeling and reinforcement learning in partially observable many-agent systems.
Auton. Agents Multi Agent Syst., June, 2024

Reinforcement actor-critic learning as a rehearsal in MicroRTS.
Knowl. Eng. Rev., 2024

Quasimetric Value Functions with Dense Rewards.
CoRR, 2024

Robust Individualistic Learning in Many-Agent Systems.
Proceedings of the PRIMA 2024: Principles and Practice of Multi-Agent Systems, 2024

Latent Interactive A2C for Improved RL in Open Many-Agent Systems.
CoRR, 2023

Reinforcement learning as a rehearsal for swarm foraging.
Swarm Intell., 2022

Reinforcement learning in many-agent settings under partial observability.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Online Inverse Reinforcement Learning with Learned Observation Model.
Proceedings of the Conference on Robot Learning, 2022

Human-agent transfer from observations.
Knowl. Eng. Rev., 2021

PALO bounds for reinforcement learning in partially observable stochastic games.
Neurocomputing, 2021

Many Agent Reinforcement Learning Under Partial Observability.
CoRR, 2021

I2RL: online inverse reinforcement learning under occlusion.
Auton. Agents Multi Agent Syst., 2021

Min-Max Entropy Inverse RL of Multiple Tasks.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Cooperative-Competitive Reinforcement Learning with History-Dependent Rewards.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Maximum Entropy Multi-Task Inverse RL.
CoRR, 2020

Team learning from human demonstration with coordination confidence.
Knowl. Eng. Rev., 2019

Online Inverse Reinforcement Learning Under Occlusion.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Model-Free IRL Using Maximum Likelihood Estimation.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

A Framework and Method for Online Inverse Reinforcement Learning.
CoRR, 2018

Autonomous Acquisition of Behavior Trees for Robot Control.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Multiagent Path Finding With Persistence Conflicts.
IEEE Trans. Comput. Intell. AI Games, 2017

Multirobot Systems.
IEEE Intell. Syst., 2017

Exact and Heuristic Algorithms for Risk-Aware Stochastic Physical Search.
Comput. Intell., 2017

Multi-agent reinforcement learning as a rehearsal for decentralized planning.
Neurocomputing, 2016

Reinforcement Learning in Partially Observable Multiagent Settings: Monte Carlo Exploring Policies with PAC Bounds.
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Detection of Plan Deviation in Multi-Agent Systems.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Stackelberg Surveillance.
Informatica (Slovenia), 2015

The complexity of multi-agent plan recognition.
Auton. Agents Multi Agent Syst., 2015

Reinforcement Learning of Informed Initial Policies for Decentralized Planning.
ACM Trans. Auton. Adapt. Syst., 2014

Model AI Assignments 2014.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

Concurrent reinforcement learning as a rehearsal for decentralized planning under uncertainty.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Pruning for Monte Carlo Distributed Reinforcement Learning in Decentralized POMDPs.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

Strategic best-response learning in multiagent systems.
J. Exp. Theor. Artif. Intell., 2012

Efficient context free parsing of multi-agent activities for team and plan recognition.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Informed Initial Policies for Learning in Dec-POMDPs.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

Sample Bounded Distributed Reinforcement Learning for Decentralized POMDPs.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

Action Discovery for Single and Multi-Agent Reinforcement Learning.
Adv. Complex Syst., 2011

Adaptive Multi-robot Team Reconfiguration Using a Policy-Reuse Reinforcement Learning Approach.
Proceedings of the Advanced Agent Technology, 2011

Branch and Price for Multi-Agent Plan Recognition.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Fast A* with Iterative Resolution for Navigation.
Int. J. Artif. Intell. Tools, 2010

Action discovery for reinforcement learning.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Validation of agent based crowd egress simulation.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Coalition structure generation in multi-agent systems with mixed externalities.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Evaluation and Comparison of Multi-agent Based Crowd Simulation Systems.
Proceedings of the Agents for Games and Simulations II, 2010

Multi-Agent Plan Recognition: Formalization and Algorithms.
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

Search Performance of Multi-Agent Plan Recognition in a General Model.
Proceedings of the Plan, Activity, and Intent Recognition, 2010

Layered Intelligence for Agent-based Crowd Simulation.
Simul., 2009

Advancing the Layered Approach to Agent-Based Crowd Simulation.
Proceedings of the 22nd International Workshop on Principles of Advanced and Distributed Simulation, 2008

Congestion Avoidance in Multi-Agent-based Egress Simulation.
Proceedings of the 2008 International Conference on Artificial Intelligence, 2008

Generalized multiagent learning with performance bound.
Auton. Agents Multi Agent Syst., 2007

General Game Learning Using Knowledge Transfer.
Proceedings of the IJCAI 2007, 2007

Reactivity and Safe Learning in Multi-Agent Systems.
Adapt. Behav., 2006

RVσ(t): a unifying approach to performance and convergence in online multiagent learning.
Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2006), 2006

Unifying Convergence and No-Regret in Multiagent Learning.
Proceedings of the Learning and Adaption in Multi-Agent Systems, 2005

Efficient learning of multi-step best response.
Proceedings of the 4th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2005), 2005

On the performance of on-line concurrent reinforcement learners.
Proceedings of the 4th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2005), 2005

Efficient No-Regret Multiagent Learning.
Proceedings of the Twentieth National Conference on Artificial Intelligence, 2005

On-policy concurrent reinforcement learning.
J. Exp. Theor. Artif. Intell., 2004

The Role of Reactivity in Multiagent Learning.
Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2004), 2004

Performance Bounded Reinforcement Learning in Strategic Interactions.
Proceedings of the Nineteenth National Conference on Artificial Intelligence, 2004

Adaptive policy gradient in multiagent learning.
Proceedings of the Second International Joint Conference on Autonomous Agents & Multiagent Systems, 2003

Kernel Index for Relevance feedback Retrieval.
Proceedings of the FSDK'02, 2002

Convergent Gradient Ascent in General-Sum Games.
Proceedings of the Machine Learning: ECML 2002, 2002

Mining user session data to facilitate user interaction with a customer service knowledge base in RightNow Web.
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, 2001

Fast Concurrent Reinforcement Learners.
Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, 2001

Using Bayesian Networks to Model Agent Relationships.
Appl. Artif. Intell., 2000

Combining Multiple Perspectives.
Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

Learning Mutual Trust.
Proceedings of the Fourth International Conference on Autonomous Agents, 2000

Selecting partners.
Proceedings of the Fourth International Conference on Autonomous Agents, 2000