Prashant Doshi

ORCID: 0000-0001-9042-9131

  • University of Georgia, Athens, USA

According to our database, Prashant Doshi authored at least 166 papers between 2003 and 2025.

FRESHR-GSI: A Generalized Safety Model and Evaluation Framework for Mobile Robots in Multi-Human Environments.
CoRR, January, 2025

Inversely Learning Transferable Rewards via Abstracted States.
CoRR, January, 2025

Modeling and reinforcement learning in partially observable many-agent systems.
Auton. Agents Multi Agent Syst., June, 2024

Visual IRL for Human-Like Robotic Manipulation.
CoRR, 2024

Active Legibility in Multiagent Reinforcement Learning.
CoRR, 2024

Analyzing Human Perceptions of a MEDEVAC Robot in a Simulated Evacuation Scenario.
CoRR, 2024

Robust Individualistic Learning in Many-Agent Systems.
Proceedings of the PRIMA 2024: Principles and Practice of Multi-Agent Systems, 2024

IRL for Restless Multi-armed Bandits with Applications in Maternal and Child Health.
Proceedings of the PRICAI 2024: Trends in Artificial Intelligence, 2024

An Autoencoder-Like Nonnegative Matrix Co-Factorization for Improved Student Cognitive Modeling.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Open Human-Robot Collaboration using Decentralized Inverse Reinforcement Learning.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

Open Human-Robot Collaboration Systems (OHRCS): A Research Perspective.
Proceedings of the 6th IEEE International Conference on Cognitive Machine Intelligence, 2024

Trust and Collaboration Testing in Controlled Human-Robot Environments.
Proceedings of the 6th IEEE International Conference on Cognitive Machine Intelligence, 2024

Modeling Cognitive Biases in Decision-theoretic Planning for Active Cyber Deception.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Decision making in open agent systems.
AI Mag., December, 2023

MVSA-Net: Multi-View State-Action Recognition for Robust and Deployable Trajectory Generation.
CoRR, 2023

A Novel Variational Lower Bound for Inverse Reinforcement Learning.
CoRR, 2023

Latent Interactive A2C for Improved RL in Open Many-Agent Systems.
CoRR, 2023

Dec-AIRL: Decentralized Adversarial IRL for Human-Robot Teaming.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

IRL with Partial Observations using the Principle of Uncertain Maximum Entropy.
CoRR, 2022

SIPOMDPLite-Net: Lightweight, Self-Interested Learning and Planning in POSGs with Sparse Interactions.
CoRR, 2022

Marginal MAP estimation for inverse RL under occlusion with observer noise.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Decision-theoretic planning with communication in open multiagent systems.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Reinforcement learning in many-agent settings under partial observability.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Anytime Learning of Sum-Product and Sum-Product-Max Networks.
Proceedings of the International Conference on Probabilistic Graphical Models, 2022

Online Inverse Reinforcement Learning with Learned Observation Model.
Proceedings of the Conference on Robot Learning, 2022

A Hierarchical Bayesian Process for Inverse RL in Partially-Controlled Environments.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

GrAALF: Supporting graphical analysis of audit logs for forensics.
Softw. Impacts, 2021

PALO bounds for reinforcement learning in partially observable stochastic games.
Neurocomputing, 2021

A Hierarchical Bayesian model for Inverse RL in Partially-Controlled Environments.
CoRR, 2021

Many Agent Reinforcement Learning Under Partial Observability.
CoRR, 2021

A survey of inverse reinforcement learning: Challenges, methods and progress.
Artif. Intell., 2021

I2RL: online inverse reinforcement learning under occlusion.
Auton. Agents Multi Agent Syst., 2021

State-Based Recurrent SPMNs for Decision-Theoretic Planning under Partial Observability.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Min-Max Entropy Inverse RL of Multiple Tasks.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Cyber Attack Intent Recognition and Active Deception using Factored Interactive POMDPs.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Cooperative-Competitive Reinforcement Learning with History-Dependent Rewards.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Data-Driven Decision-Theoretic Planning using Recurrent Sum-Product-Max Networks.
Proceedings of the Thirty-First International Conference on Automated Planning and Scheduling, 2021

A Novel AI-based Methodology for Identifying Cyber Attacks in Honey Pots.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Active Deception using Factored Interactive POMDPs to Recognize Cyber Attacker's Intent.
CoRR, 2020

Recurrent Sum-Product-Max Networks for Decision Making in Perfectly-Observed Environments.
CoRR, 2020

Maximum Entropy Multi-Task Inverse RL.
CoRR, 2020

Recursively modeling other agents for decision making: A research perspective.
Artif. Intell., 2020

SA-Net: Robust State-Action Recognition for Learning from Observations.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Scalable Decision-Theoretic Planning in Open and Typed Multiagent Systems.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Merging in Congested Freeway Traffic Using Multipolicy Decision Making and Passive Actor-Critic Learning.
IEEE Trans. Intell. Veh., 2019

SA-Net: Deep Neural Network for Robot Trajectory Recognition from RGB-D Streams.
CoRR, 2019

Evacuate or Not? A POMDP Model of the Decision Making of Individuals in Hurricane Evacuation Zones.
Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

Online Inverse Reinforcement Learning Under Occlusion.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Model-Free IRL Using Maximum Likelihood Estimation.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Reinforcement Learning for Heterogeneous Teams with PALO Bounds.
CoRR, 2018

A Framework and Method for Online Inverse Reinforcement Learning.
CoRR, 2018

Multi-robot inverse reinforcement learning under occlusion with estimation of state transitions.
Artif. Intell., 2018

Online Structure Learning for Feed-Forward and Recurrent Sum-Product Networks.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Inverse Learning of Robot Behavior for Collaborative Planning.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Decision-Theoretic Planning Under Anonymity in Agent Populations.
J. Artif. Intell. Res., 2017

Freeway Merging in Congested Traffic based on Multipolicy Decision Making with Passive Actor Critic.
CoRR, 2017

Actor-Critic for Linearly-Solvable Continuous MDP with Partially Known Dynamics.
CoRR, 2017

On the role of fairness and limited backward induction in sequential bargaining games - New behavioral models and analyses.
Ann. Math. Artif. Intell., 2017

Can bounded and self-interested agents be teammates? Application to planning in ad hoc teams.
Auton. Agents Multi Agent Syst., 2017

Robust Model Equivalence using Stochastic Bisimulation for N-Agent Interactive DIDs.
Proceedings of the Thirty-Third Conference on Uncertainty in Artificial Intelligence, 2017

A layered HMM for predicting motion of a leader in multi-robot settings.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Inverse Reinforcement Learning Under Noisy Observations.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

Scaling Expectation-Maximization for Inverse Reinforcement Learning to Multiple Robots under Occlusion.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

On Markov Games Played by Bayesian and Boundedly-Rational Players.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Approximating behavioral equivalence for scaling solutions of I-DIDs.
Knowl. Inf. Syst., 2016

Individual Planning in Open and Typed Agent Systems.
Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016

Dynamic Sum Product Networks for Tractable Inference on Sequence Data.
Proceedings of the Probabilistic Graphical Models - Eighth International Conference, 2016

Sum-Product-Max Networks for Tractable Decision Making.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Sum-Product-Max Networks for Tractable Decision Making: (Extended Abstract).
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Reinforcement Learning in Partially Observable Multiagent Settings: Monte Carlo Exploring Policies with PAC Bounds.
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Expectation-Maximization for Inverse Reinforcement Learning with Hidden Data.
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Decision Sum-Product-Max Networks.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Bayesian Markov Games with Explicit Finite-Level Types.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Dynamic Sum Product Networks for Tractable Inference on Sequence Data.
CoRR, 2015

Robotics to promote elementary education pre-service teachers' STEM engagement, learning, and teaching.
Comput. Educ., 2015

A framework for ontology-based question answering with application to parasite immunology.
J. Biomed. Semant., 2015

Scalable solutions of interactive POMDPs using generalized and bounded policy iteration.
Auton. Agents Multi Agent Syst., 2015

Individual Planning in Infinite-Horizon Multiagent Settings: Inference, Structure and Scalability.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Localization and tracking under extreme and persistent sensory occlusion.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Toward Estimating Others' Transition Models Under Occlusion for Multi-Robot IRL.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Speeding up Planning in Multiagent Settings Using CPU-GPU Architectures.
Proceedings of the Agents and Artificial Intelligence - 7th International Conference, 2015

Fast Solving of Influence Diagrams for Multiagent Planning on GPU-enabled Architectures.
Proceedings of the ICAART 2015, 2015

Improved Planning for Infinite-Horizon Interactive POMDPs using Probabilistic Inference (Extended Abstract).
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Iterative Online Planning in Multiagent Settings with Limited Model Spaces and PAC Guarantees.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Aerial Robotic Simulations for Evaluation of Multi-Agent Planning in GaTAC.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Multi-Robot Inverse Reinforcement Learning Under Occlusion with State Transition Estimation.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Individual Planning in Agent Populations: Exploiting Anonymity and Frame-Action Hypergraphs.
Proceedings of the Twenty-Fifth International Conference on Automated Planning and Scheduling, 2015

Speeding Up Iterative Ontology Alignment using Block-Coordinate Descent.
J. Artif. Intell. Res., 2014

Behavioral Modeling of Sequential Bargaining Games: Fairness and Limited Backward Induction.
Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2014

A Portal Designed to Learn about Educational Robotics.
Proceedings of the 36th Annual Meeting of the Cognitive Science Society, 2014

Team behavior in interactive dynamic influence diagrams with applications to ad hoc teams.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Multi-robot inverse reinforcement learning under occlusion with interactions.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Canonical Forms and Similarity of Complex Concepts for Improved Ontology Alignment.
Proceedings of the 2013 IEEE/WIC/ACM International Conferences on Web Intelligence, 2013

On Modeling Human Learning in Sequential Games with Delayed Reinforcements.
Proceedings of the IEEE International Conference on Systems, 2013

Speeding Up Batch Alignment of Large Ontologies Using MapReduce.
Proceedings of the 2013 IEEE Seventh International Conference on Semantic Computing, 2013

Bimodal Switching for Online Planning in Multiagent Settings.
Proceedings of the IJCAI 2013, 2013

From Questions to Effective Answers: On the Utility of Knowledge-Driven Querying Systems for Life Sciences Data.
Proceedings of the Data Integration in the Life Sciences - 9th International Conference, 2013

Modeling Human Recursive Reasoning Using Empirically Informed Interactive Partially Observable Markov Decision Processes.
IEEE Trans. Syst. Man Cybern. Part A, 2012

Exploiting Model Equivalences for Solving Interactive Dynamic Influence Diagrams.
J. Artif. Intell. Res., 2012

Decision Making in Complex Multiagent Contexts: A Tale of Two Frameworks.
AI Mag., 2012

Reports of the AAAI 2011 Conference Workshops.
AI Mag., 2012

Optima+ results for OAEI 2012.
Proceedings of the 7th International Workshop on Ontology Matching, 2012

Generalized and Bounded Policy Iteration for Interactive POMDPs.
Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2012

Learning Communication in Interactive Dynamic Influence Diagrams.
Proceedings of the 2012 IEEE/WIC/ACM International Conferences on Intelligent Agent Technology, 2012

GaTAC: a scalable and realistic testbed for multiagent decision making (demonstration).
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Generalized and bounded policy iteration for finitely-nested interactive POMDPs: scaling up.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Modeling deep strategic reasoning by humans in competitive games.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Improved Convergence of Iterative Ontology Alignment using Block-Coordinate Descent.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

Optima results for OAEI 2011.
Proceedings of the 6th International Workshop on Ontology Matching, 2011

On the Utility of WordNet for Ontology Alignment: Is it Really Worth it?
Proceedings of the 5th IEEE International Conference on Semantic Computing (ICSC 2011), 2011

Approximating Model Equivalence in Interactive Dynamic Influence Diagrams Using Top K Policy Paths.
Proceedings of the 2011 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, 2011

Approximating behavioral equivalence of models using top-k policy paths.
Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

Identifying and exploiting weak-information inducing actions in solving POMDPs.
Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

Individual Localization and Tracking in Multi-robot Settings with Dynamic Landmarks - (Extended Abstract).
Proceedings of the Advanced Agent Technology, 2011

Utilizing Partial Policies for Identifying Equivalence of Behavioral Models.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Model identification in interactive influence diagrams using mutual information.
Web Intell. Agent Syst., 2010

Reports of the AAAI 2010 Conference Workshops.
AI Mag., 2010

Approximate solutions of interactive dynamic influence diagrams using ε-behavioral equivalence.
Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2010

Risk Sensitive Value of Changed Information for Selective Querying of Web Services.
Proceedings of the Service-Oriented Computing - 8th International Conference, 2010

Epsilon-Subjective Equivalence of Models for Interactive Dynamic Influence Diagrams.
Proceedings of the 2010 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, 2010

Modeling recursive reasoning by humans using empirically informed interactive POMDPs.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Inexact matching of ontology graphs using expectation-maximization.
J. Web Semant., 2009

A hierarchical framework for logical composition of web services.
Serv. Oriented Comput. Appl., 2009

Monte Carlo Sampling Methods for Approximating Interactive POMDPs.
J. Artif. Intell. Res., 2009

Graphical models for interactive POMDPs: representations and solutions.
Auton. Agents Multi Agent Syst., 2009

Speeding Up Exact Solutions of Interactive Dynamic Influence Diagrams Using Action Equivalence.
Proceedings of the IJCAI 2009, 2009

Towards Automated RESTful Web Service Composition.
Proceedings of the IEEE International Conference on Web Services, 2009

Integrating Behavioral Trust in Web Service Compositions.
Proceedings of the IEEE International Conference on Web Services, 2009

Selective Querying for Adapting Hierarchical Web Service Compositions Using Aggregate Volatility.
Proceedings of the IEEE International Conference on Web Services, 2009

Improved approximation of interactive dynamic influence diagrams using discriminative model updates.
Proceedings of the 8th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009

Compact approximations of mixture distributions for state estimation in multiagent settings.
Proceedings of the 8th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009

Toward Integrating Social Trust into Web Service Compositions.
Proceedings of the Social Semantic Web: Where Web 2.0 Meets Web 3.0, 2009

Selective Querying for Adapting Web Service Compositions Using the Value of Changed Information.
IEEE Trans. Serv. Comput., 2008

Making BPEL flexible: adapting in the context of coordination constraints using WS-BPEL.
Proceedings of the 17th International Conference on World Wide Web, 2008

Speeding up web service composition with volatile information.
Proceedings of the 17th International Conference on World Wide Web, 2008

Speeding up web service composition with volatile external information.
Proceedings of the 2008 International Workshop on Context Enabled Source and Service Selection, 2008

OPTIMA: Tool for Ontology Alignment with Application to Semantic Reconciliation of Sensor Metadata for Publication in SensorMap.
Proceedings of the 2nd IEEE International Conference on Semantic Computing (ICSC 2008), 2008

Approximate Solutions of Interactive POMDPs Using Point Based Value Iteration.
Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2008

Enhancing the Quality of Noisy Training Data Using a Genetic Algorithm and Prototype Selection.
Proceedings of the 2008 International Conference on Artificial Intelligence, 2008

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams.
Proceedings of the 2008 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, 2008

Distributed Private Constraint Optimization.
Proceedings of the 2008 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, 2008

Generalized Point Based Value Iteration for Interactive POMDPs.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

Speeding up adaptation of web service compositions using expiration times.
Proceedings of the 16th International Conference on World Wide Web, 2007

Haley: A Hierarchical Framework for Logical Composition of Web Services.
Proceedings of the 2007 IEEE International Conference on Web Services (ICWS 2007), 2007

Improved Adaptation of Web Service Compositions Using Value of Changed Information.
Proceedings of the 2007 IEEE International Conference on Web Services (ICWS 2007), 2007

Approximate state estimation in multiagent settings with continuous or large discrete state spaces.
Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

On the Role of Interactive Epistemology in Multiagent Planning.
Proceedings of the International Conference on Artificial Intelligence and Pattern Recognition, 2007

Graphical Models for Online Solutions to Interactive POMDPs.
Proceedings of the Game Theoretic and Decision Theoretic Agents, 2007

Approximate Solutions of Interactive Dynamic Influence Diagrams Using Model Clustering.
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

Improved State Estimation in Multiagent Settings with Continuous or Large Discrete State Spaces.
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

Regret-Based Decentralized Adaptation of Web Processes with Coordination Constraints.
Proceedings of the 2007 IEEE International Conference on Services Computing (SCC 2007), 2007

Reports on the Twenty-First National Conference on Artificial Intelligence (AAAI-06) Workshop Program.
AI Mag., 2006

Optimal Adaptation in Web Processes with Coordination Constraints.
Proceedings of the 2006 IEEE International Conference on Web Services (ICWS 2006), 2006

A Hierarchical Framework for Composing Nested Web Processes.
Proceedings of the Service-Oriented Computing, 2006

Adaptive Web Processes Using Value of Changed Information.
Proceedings of the Service-Oriented Computing, 2006

Exact solutions of interactive POMDPs using behavioral equivalence.
Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2006), 2006

Inexact Matching of Ontology Graphs Using Expectation-Maximization.
Proceedings of the Proceedings, 2006

On the Difficulty of Achieving Equilibrium in Interactive POMDPs.
Proceedings of the Proceedings, 2006

Dynamic Workflow Composition: Using Markov Decision Processes.
Int. J. Web Serv. Res., 2005

A Framework for Sequential Planning in Multi-Agent Settings.
J. Artif. Intell. Res., 2005

Approximating state estimation in multiagent settings using particle filters.
Proceedings of the 4th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2005), 2005

A Particle Filtering Based Approach to Approximating Interactive POMDPs.
Proceedings of the Proceedings, 2005

Interactive POMDPs: Properties and Preliminary Results.
Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2004), 2004

A Framework for Optimal Sequential Planning in Multiagent Settings.
Proceedings of the Nineteenth National Conference on Artificial Intelligence, 2004

A Method for Semantically Enhancing the Service Discovery Capabilities of UDDI.
Proceedings of IJCAI-03 Workshop on Information Integration on the Web (IIWeb-03), 2003

Using Bayesian Networks for Cleansing Trauma Data.
Proceedings of the Sixteenth International Florida Artificial Intelligence Research Society Conference, 2003
