Bikramjit Banerjee

Auton. Agents Multi Agent Syst., June, 2024

Reinforcement actor-critic learning as a rehearsal in MicroRTS.

[BibT_eX]

[DOI]

Shiron Manandhar

Knowl. Eng. Rev., 2024

Quasimetric Value Functions with Dense Rewards.

[BibT_eX]

[DOI]

Khadichabonu Valieva

CoRR, 2024

Robust Individualistic Learning in Many-Agent Systems.

[BibT_eX]

[DOI]

Proceedings of the PRIMA 2024: Principles and Practice of Multi-Agent Systems, 2024

2023

Latent Interactive A2C for Improved RL in Open Many-Agent Systems.

[BibT_eX]

[DOI]

CoRR, 2023

2022

Reinforcement learning as a rehearsal for swarm foraging.

[BibT_eX]

[DOI]

Trung Nguyen

Swarm Intell., 2022

Reinforcement learning in many-agent settings under partial observability.

[BibT_eX]

[DOI]

Proceedings of the Uncertainty in Artificial Intelligence, 2022

Online Inverse Reinforcement Learning with Learned Observation Model.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 2022

2021

Human-agent transfer from observations.

[BibT_eX]

[DOI]

Sneha Racharla

Knowl. Eng. Rev., 2021

PALO bounds for reinforcement learning in partially observable stochastic games.

[BibT_eX]

[DOI]

Neurocomputing, 2021

Many Agent Reinforcement Learning Under Partial Observability.

[BibT_eX]

[DOI]

CoRR, 2021

I2RL: online inverse reinforcement learning under occlusion.

[BibT_eX]

[DOI]

Auton. Agents Multi Agent Syst., 2021

Min-Max Entropy Inverse RL of Multiple Tasks.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Cooperative-Competitive Reinforcement Learning with History-Dependent Rewards.

[BibT_eX]

[DOI]

Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

2020

Maximum Entropy Multi-Task Inverse RL.

[BibT_eX]

[DOI]

CoRR, 2020

2019

Team learning from human demonstration with coordination confidence.

[BibT_eX]

[DOI]

Syamala Vittanala

Matthew Edmund Taylor

Knowl. Eng. Rev., 2019

Online Inverse Reinforcement Learning Under Occlusion.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Model-Free IRL Using Maximum Likelihood Estimation.

[BibT_eX]

[DOI]

Vinamra Jain

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

A Framework and Method for Online Inverse Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2018

Autonomous Acquisition of Behavior Trees for Robot Control.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

2017

Multiagent Path Finding With Persistence Conflicts.

[BibT_eX]

[DOI]

Caleb E. Davis

IEEE Trans. Comput. Intell. AI Games, 2017

Multirobot Systems.

[BibT_eX]

[DOI]

IEEE Intell. Syst., 2017

Exact and Heuristic Algorithms for Risk-Aware Stochastic Physical Search.

[BibT_eX]

[DOI]

Comput. Intell., 2017

2016

Multi-agent reinforcement learning as a rehearsal for decentralized planning.

[BibT_eX]

[DOI]

Neurocomputing, 2016

Reinforcement Learning in Partially Observable Multiagent Settings: Monte Carlo Exploring Policies with PAC Bounds.

[BibT_eX]

[DOI]

Roi Ceren

Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Detection of Plan Deviation in Multi-Agent Systems.

[BibT_eX]

[DOI]

Steven Loscalzo

Daniel Lucas Thompson

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015

Stackelberg Surveillance.

[BibT_eX]

[DOI]

Informatica (Slovenia), 2015

The complexity of multi-agent plan recognition.

[BibT_eX]

[DOI]

Jeremy Lyle

Auton. Agents Multi Agent Syst., 2015

2014

Reinforcement Learning of Informed Initial Policies for Decentralized Planning.

[BibT_eX]

[DOI]

ACM Trans. Auton. Adapt. Syst., 2014

Model AI Assignments 2014.

[BibT_eX]

[DOI]

Daniel Lucas Thompson

Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013

Concurrent reinforcement learning as a rehearsal for decentralized planning under uncertainty.

[BibT_eX]

[DOI]

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Pruning for Monte Carlo Distributed Reinforcement Learning in Decentralized POMDPs.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012

Strategic best-response learning in multiagent systems.

[BibT_eX]

[DOI]

J. Exp. Theor. Artif. Intell., 2012

Efficient context free parsing of multi-agent activities for team and plan recognition.

[BibT_eX]

[DOI]

Jeremy Lyle

Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Informed Initial Policies for Learning in Dec-POMDPs.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

Sample Bounded Distributed Reinforcement Learning for Decentralized POMDPs.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011

Action Discovery for Single and Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

Adv. Complex Syst., 2011

Adaptive Multi-robot Team Reconfiguration Using a Policy-Reuse Reinforcement Learning Approach.

[BibT_eX]

[DOI]

Prithviraj Dasgupta

Ke Cheng

Proceedings of the Advanced Agent Technology, 2011

Branch and Price for Multi-Agent Plan Recognition.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010

Fast a* with Iterative Resolution for Navigation.

[BibT_eX]

[DOI]

Kyle Walsh

Int. J. Artif. Intell. Tools, 2010

Action discovery for reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Validation of agent based crowd egress simulation.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Coalition structure generation in multi-agent systems with mixed externalities.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Evaluation and Comparison of Multi-agent Based Crowd Simulation Systems.

[BibT_eX]

[DOI]

Proceedings of the Agents for Games and Simulations II, 2010

Multi-Agent Plan Recognition: Formalization and Algorithms.

[BibT_eX]

[DOI]

Jeremy Lyle

Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

Search Performance of Multi-Agent Plan Recognition in a General Model.

[BibT_eX]

[DOI]

Proceedings of the Plan, Activity, and Intent Recognition, 2010

2009

Layered Intelligence for Agent-based Crowd Simulation.

[BibT_eX]

[DOI]

Ahmed Abukmail

Simul., 2009

2008

Advancing the Layered Approach to Agent-Based Crowd Simulation.

[BibT_eX]

[DOI]

Ahmed Abukmail

Proceedings of the 22st International Workshop on Principles of Advanced and Distributed Simulation, 2008

Congestion Avoidance in Multi-Agent-based Egress Simulation.

[BibT_eX]

Proceedings of the 2008 International Conference on Artificial Intelligence, 2008

2007

Generalized multiagent learning with performance bound.

[BibT_eX]

[DOI]

Auton. Agents Multi Agent Syst., 2007

General Game Learning Using Knowledge Transfer.

[BibT_eX]

[DOI]

Peter Stone

Proceedings of the IJCAI 2007, 2007

2006

Reactivity and Safe Learning in Multi-Agent Systems.

[BibT_eX]

[DOI]

Adapt. Behav., 2006

RV<sub>sigma(t)</sub>: a unifying approach to performance and convergence in online multiagent learning.

[BibT_eX]

[DOI]

Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2006), 2006

2005

Unifying Convergence and No-Regret in Multiagent Learning.

[BibT_eX]

[DOI]

Proceedings of the Learning and Adaption in Multi-Agent Systems, 2005

Efficient learning of multi-step best response.

[BibT_eX]

[DOI]

Proceedings of the 4th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2005), 2005

On the performance of on-line concurrent reinforcement learners.

[BibT_eX]

[DOI]

Proceedings of the 4th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2005), 2005

Efficient No-Regret Multiagent Learning.

[BibT_eX]

[DOI]

Proceedings of the Proceedings, 2005

2004

On-policy concurrent reinforcement learning.

[BibT_eX]

[DOI]

J. Exp. Theor. Artif. Intell., 2004

The Role of Reactivity in Multiagent Learning.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2004), 2004

Performance Bounded Reinforcement Learning in Strategic Interactions.

[BibT_eX]

[DOI]

Proceedings of the Nineteenth National Conference on Artificial Intelligence, 2004

2003

Adaptive policy gradient in multiagent learning.

[BibT_eX]

[DOI]

Proceedings of the Second International Joint Conference on Autonomous Agents & Multiagent Systems, 2003

2002

Kernel Index for Relevance feedback Retrieval.

[BibT_eX]

Douglas R. Heisterkamp

Proceedings of the FSDK'02, 2002

Convergent Gradient Ascent in General-Sum Games.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning: ECML 2002, 2002

2001

Mining user session data to facilitate user interaction with a customer service knowledge base in RightNow Web.

[BibT_eX]

[DOI]

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, 2001

Fast Concurrent Reinforcement Learners.

[BibT_eX]

Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, 2001

2000

Using Bayesian Networks to Model Agent Relationships.

[BibT_eX]

[DOI]

Appl. Artif. Intell., 2000

Combining Multiple Perspectives.

[BibT_eX]

Sandip Debnath

Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

Learning Mutual Trust.

[BibT_eX]

[DOI]

Rajatish Mukherjee

Proceedings of the Fourth International Conference on Autonomous Agents, 2000

Selecting partners.

[BibT_eX]

[DOI]