Craig Boutilier

Orcid: 0000-0001-9330-4545

  • University of Toronto, Canada

According to our database, Craig Boutilier authored at least 250 papers between 1989 and 2024.

ACM Fellow

ACM Fellow 2012, "For contributions to knowledge representation and computational decision making."






Discovering Personalized Semantics for Soft Attributes in Recommender Systems Using Concept Activation Vectors.
Trans. Recomm. Syst., December, 2024

Building Human Values into Recommender Systems: An Interdisciplinary Synthesis.
Trans. Recomm. Syst., September, 2024

Embedding-Aligned Language Models.
CoRR, 2024

DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning.
CoRR, 2024

Minimizing Live Experiments in Recommender Systems: User Simulation to Evaluate Preference Elicitation Policies.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Model-Free Preference Elicitation.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Demystifying Embedding Spaces using Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Recommender Ecosystems: A Mechanism Design Perspective on Holistic Modeling and Optimization.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Preference Elicitation with Soft Attributes in Interactive Recommendation.
CoRR, 2023

Density-based User Representation through Gaussian Process Regression for Multi-interest Personalized Retrieval.
CoRR, 2023

Factual and Personalized Recommendations using Language Models and Reinforcement Learning.
CoRR, 2023

Modeling Recommender Ecosystems: Research Challenges at the Intersection of Mechanism Design, Reinforcement Learning and Generative Models.
CoRR, 2023

Content Prompting: Modeling Content Provider Dynamics to Improve User Welfare in Recommender Ecosystems.
CoRR, 2023

Ranking with Popularity Bias: User Welfare under Self-Amplification Dynamics.
CoRR, 2023

DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models.
CoRR, 2023

Aligning Text-to-Image Models using Human Feedback.
CoRR, 2023

Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management.
CoRR, 2023

Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Reinforcement Learning with History Dependent Dynamic Contexts.
Proceedings of the International Conference on Machine Learning, 2023

A Mixture-of-Expert Approach to RL-based Dialogue Management.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Gathering Strength, Gathering Storms: The One Hundred Year Study on Artificial Intelligence (AI100) 2021 Study Panel Report.
CoRR, 2022

Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning.
CoRR, 2022

Building Human Values into Recommender Systems: An Interdisciplinary Synthesis.
CoRR, 2022

A Mixture-of-Expert Approach to RL-based Dialogue Management.
CoRR, 2022

IMO^3: Interactive Multi-Objective Off-Policy Optimization.
CoRR, 2022

Discovering Personalized Semantics for Soft Attributes in Recommender Systems using Concept Activation Vectors.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

IMO^3: Interactive Multi-Objective Off-Policy Optimization.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

An adversarial variational inference approach for travel demand calibration of urban traffic simulators.
Proceedings of the 30th International Conference on Advances in Geographic Information Systems, 2022

Thompson Sampling with a Mixture Prior.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Subjective Attributes in Conversational Recommendation Systems: Challenges and Opportunities.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems.
CoRR, 2021

Towards Content Provider Aware Recommender Systems: A Simulation Study on the Interplay between User and Provider Utilities.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Meta-Thompson Sampling.
Proceedings of the 38th International Conference on Machine Learning, 2021

Non-Stationary Latent Bandits.
CoRR, 2020

Differentiable Meta-Learning in Contextual Bandits.
CoRR, 2020

ConQUR: Mitigating Delusional Bias in Deep Q-learning.
CoRR, 2020

Differentiable Bandit Exploration.
CoRR, 2020

Data Efficient Training for Reinforcement Learning with Adaptive Behavior Policy Sharing.
CoRR, 2020

On the equivalence of optimal recommendation sets and myopically optimal query sets.
Artif. Intell., 2020

Preference elicitation and robust winner determination for single- and multi-winner social choice.
Artif. Intell., 2020

Demonstrating Principled Uncertainty Modeling for Recommender Ecosystems with RecSim NG.
Proceedings of the RecSys 2020: Fourteenth ACM Conference on Recommender Systems, 2020

Latent Bandits Revisited.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Differentiable Meta-Learning of Bandit Policies.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

BRPO: Batch Residual Policy Optimization.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

ConQUR: Mitigating Delusional Bias in Deep Q-Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

Optimizing Long-term Social Welfare in Recommender Systems: A Constrained Matching Approach.
Proceedings of the 37th International Conference on Machine Learning, 2020

CAQL: Continuous Action Q-Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

Randomized Exploration in Generalized Linear Bandits.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

Gradient-Based Optimization for Bayesian Preference Elicitation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

RecSim: A Configurable Simulation Platform for Recommender Systems.
CoRR, 2019

Reinforcement Learning for Slate-based Recommender Systems: A Tractable Decomposition and Practical Methodology.
CoRR, 2019

Empathetic decision making in social networks.
Artif. Intell., 2019

Perturbed-History Exploration in Stochastic Linear Bandits.
Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

Advantage Amplification in Slowly Evolving Latent-State Environments.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Perturbed-History Exploration in Stochastic Multi-Armed Bandits.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

SlateQ: A Tractable Decomposition for Reinforcement Learning with Recommendation Sets.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Experiential Preference Elicitation for Autonomous Heating and Cooling Systems.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Seq2Slate: Re-ranking and Slate Optimization with RNNs.
CoRR, 2018

Planning and Learning with Stochastic Action Sets.
CoRR, 2018

Non-delusional Q-learning and value-iteration.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Data center cooling using model-predictive control.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Planning and Learning with Stochastic Action Sets.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Toward User-Centric Recommender Systems.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

Safe Exploration for Identifying Linear Systems via Robust Optimization.
CoRR, 2017

Multiple-Profile Prediction-of-Use Games.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Logistic Markov Decision Processes.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Incomplete Information and Communication in Voting.
Proceedings of the Handbook of Computational Social Choice, 2016

Budget Allocation using Weakly Coupled, Constrained Markov Decision Processes.
Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016

Computational Social Choice: Theory and Applications (Dagstuhl Seminar 15241).
Dagstuhl Reports, 2015

Optimal social choice functions: A utilitarian view.
Artif. Intell., 2015

Preference-oriented Social Networks: Group Recommendation and Inference.
Proceedings of the 9th ACM Conference on Recommender Systems, 2015

Approximately Stable Pricing for Coordinated Purchasing of Electricity.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Approximately Strategy-proof Mechanisms for (Constrained) Facility Location.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Optimal Group Manipulation in Facility Location Problems.
Proceedings of the Algorithmic Decision Theory - 4th International Conference, 2015

Value-Directed Compression of Large-Scale Assignment Problems.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

The Pricing War Continues: On Competitive Multi-Item Pricing.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Effective sampling and learning for mallows models with pairwise-preference data.
J. Mach. Learn. Res., 2014

On the value of using group discounts under price competition.
Artif. Intell., 2014

Empathetic social choice on social networks.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Efficient coordinated power distribution on private infrastructure.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

A Game-Theoretic Analysis of Catalog Optimization.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

Regret-Based Optimization and Preference Elicitation for Stackelberg Security Games with Uncertainty.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

Preference Elicitation and Interview Minimization in Stable Matchings.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

Robust Winners and Winner Determination Policies under Candidate Uncertainty.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (2000)
CoRR, 2013

Reasoning With Conditional Ceteris Paribus Preference Statements.
CoRR, 2013

Multi-Dimensional Single-Peaked Consistency and Its Approximations.
Proceedings of the IJCAI 2013, 2013

Analysis and Optimization of Multi-Dimensional Percentile Mechanisms.
Proceedings of the IJCAI 2013, 2013

Efficient Vote Elicitation under Candidate Uncertainty.
Proceedings of the IJCAI 2013, 2013

Multi-Winner Social Choice with Incomplete Preferences.
Proceedings of the IJCAI 2013, 2013

Elicitation and Approximately Stable Matching with Partial Preferences.
Proceedings of the IJCAI 2013, 2013

People, sensors, decisions: Customizable and adaptive technologies for assistance in healthcare.
ACM Trans. Interact. Intell. Syst., 2012

Sequentially optimal repeated coalition formation under uncertainty.
Auton. Agents Multi Agent Syst., 2012

Bayesian Vote Manipulation: Optimal Strategies and Impact on Welfare.
Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, 2012

Matching models for preference-sensitive group purchasing.
Proceedings of the 13th ACM Conference on Electronic Commerce, 2012

Active Learning for Matching Problems.
Proceedings of the 29th International Conference on Machine Learning, 2012

Eliciting forecasts from self-interested experts: scoring rules for decision makers.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

A Dynamic Rationalization of Distance Rationalizability.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

CP-nets: A Tool for Representing and Reasoning with Conditional Ceteris Paribus Preference Statements.
CoRR, 2011

A Framework for Optimizing Paper Matching.
Proceedings of the UAI 2011, 2011

Robust Online Optimization of Reward-Uncertain MDPs.
Proceedings of the IJCAI 2011, 2011

Eliciting Additive Reward Functions for Markov Decision Processes.
Proceedings of the IJCAI 2011, 2011

Robust Approximation and Incremental Elicitation in Voting Protocols.
Proceedings of the IJCAI 2011, 2011

Budgeted Social Choice: From Consensus to Personalized Decision Making.
Proceedings of the IJCAI 2011, 2011

Learning Mallows Models with Pairwise Preferences.
Proceedings of the 28th International Conference on Machine Learning, 2011

Preference Elicitation and Preference Learning in Social Choice.
Proceedings of the Integration of AI and OR Techniques in Constraint Programming for Combinatorial Optimization Problems, 2011

Learning Complex Concepts Using Crowdsourcing: A Bayesian Approach.
Proceedings of the Algorithmic Decision Theory - Second International Conference, 2011

Vote Elicitation with Probabilistic Preference Models: Empirical Estimation and Cost Tradeoffs.
Proceedings of the Algorithmic Decision Theory - Second International Conference, 2011

A Bayesian Concept Learning Approach to Crowdsourcing.
Proceedings of the Interactive Decision Theory and Game Theory, 2011

Recommendation Sets and Choice Queries: There Is No Exploration/Exploitation Tradeoff!
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Efficiency and Privacy Tradeoffs in Mechanism Design.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Automated handwashing assistance for persons with dementia using video and a partially observable Markov decision process.
Comput. Vis. Image Underst., 2010

The unavailable candidate model: a decision-theoretic view of social choice.
Proceedings of the 11th ACM Conference on Electronic Commerce (EC-2010), 2010

Assessing regret-based preference elicitation with the UTPREF recommendation system.
Proceedings of the 11th ACM Conference on Electronic Commerce (EC-2010), 2010

Optimal Bayesian Recommendation Sets and Myopically Optimal Choice Query Sets.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Automated Channel Abstraction for Advertising Auctions.
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

Robust Policy Computation in Reward-Uncertain MDPs Using Nondominated Policies.
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

Simultaneous Elicitation of Preference Features and Utility.
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

Practical solution techniques for first-order MDPs.
Artif. Intell., 2009

Regret-based Reward Elicitation for Markov Decision Processes.
Proceedings of the UAI 2009, 2009

Regret-based optimal recommendation sets in conversational recommender systems.
Proceedings of the 2009 ACM Conference on Recommender Systems, 2009

Preference elicitation with subjective features.
Proceedings of the 2009 ACM Conference on Recommender Systems, 2009

A probabilistic mental model for estimating disruption.
Proceedings of the 14th International Conference on Intelligent User Interfaces, 2009

Optimal Set Recommendations Based on Regret.
Proceedings of the 7th Workshop on Intelligent Techniques for Web Personalization & Recommender Systems (ITWP'09), Pasadena, California, USA, July 11-17, 2009 in conjunction with the 21st International Joint Conference on Artificial Intelligence, 2009

Online feature elicitation in interactive optimization.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Intelligent Decision Support in Medicine: Back to Bayes?
J. Univers. Comput. Sci., 2008

Elicitation of Factored Utilities.
AI Mag., 2008

Toward Experiential Utility Elicitation for Interface Customization.
Proceedings of the UAI 2008, 2008

The need for an interaction cost model in adaptive interfaces.
Proceedings of the working conference on Advanced Visual Interfaces, 2008

Sequential decision making in repeated coalition formation under uncertainty.
Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

Computing Reserve Prices and Identifying the Value Distribution in Real-world Auctions with Market Disruptions.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

Expressive Banner Ad Auctions and Model-Based Online Optimization for Clearing.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

Minimax regret based elicitation of generalized additive utilities.
Proceedings of the UAI 2007, 2007

Automated Design of Multistage Mechanisms.
Proceedings of the IJCAI 2007, 2007

Mechanism Design with Partial Revelation.
Proceedings of the IJCAI 2007, 2007

Coalitional Bargaining with Agent Type Uncertainty.
Proceedings of the IJCAI 2007, 2007

Coalition formation under uncertainty: bargaining equilibria and the Bayesian core stability concept.
Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

Approximate Solution Techniques for Factored First-Order MDPs.
Proceedings of the Seventeenth International Conference on Automated Planning and Scheduling, 2007

Partial Revelation Automated Mechanism Design.
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

Computing Optimal Subsets.
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

A Planning System Based on Markov Decision Processes to Guide People With Dementia Through Activities of Daily Living.
IEEE Trans. Inf. Technol. Biomed., 2006

Constraint-based optimization and utility elicitation using the minimax decision criterion.
Artif. Intell., 2006

Practical Linear Value-approximation Techniques for First-order MDPs.
Proceedings of the UAI '06, 2006

Who's asking for help?: a Bayesian approach to intelligent assistance.
Proceedings of the 11th International Conference on Intelligent User Interfaces, 2006

Regret-based Incremental Partial Revelation Mechanisms.
Proceedings of the Proceedings, 2006

Preference Elicitation and Generalized Additive Utility.
Proceedings of the Proceedings, 2006

The Influence of Influence Diagrams on Artificial Intelligence.
Decis. Anal., 2005

Approximate Linear Programming for First-order MDPs.
Proceedings of the UAI '05, 2005

Local Utility Elicitation in GAI Models.
Proceedings of the UAI '05, 2005

Regret-based Utility Elicitation in Constraint-based Decision Problems.
Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005

A Decision-Theoretic Approach to Task Assistance for Persons with Dementia.
Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005

POMDP Models for Assistive Technology.
Proceedings of the Caring Machines: AI in Eldercare, 2005

New Approaches to Optimization and Utility Elicitation in Autonomic Computing.
Proceedings of the Proceedings, 2005

CP-nets: A Tool for Representing and Reasoning with Conditional Ceteris Paribus Preference Statements.
J. Artif. Intell. Res., 2004

Preference-Based Constrained Optimization with CP-Nets.
Comput. Intell., 2004

Regret Minimizing Equilibria and Mechanisms for Games with Strict Type Uncertainty.
Proceedings of the UAI '04, 2004

VDCBPI: an Approximate Scalable Algorithm for Large POMDPs.
Proceedings of the Advances in Neural Information Processing Systems 17, 2004

A Study of Limited-Precision, Incremental Elicitation in Auctions.
Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2004), 2004

Bayesian Reinforcement Learning for Coalition Formation under Uncertainty.
Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2004), 2004

Stochastic Local Search for POMDP Controllers.
Proceedings of the Nineteenth National Conference on Artificial Intelligence, 2004

Eliciting Bid Taker Non-price Preferences in (Combinatorial) Auctions.
Proceedings of the Nineteenth National Conference on Artificial Intelligence, 2004

Accelerating Reinforcement Learning through Implicit Imitation.
J. Artif. Intell. Res., 2003

Active Collaborative Filtering.
Proceedings of the UAI '03, 2003

Cooperative Negotiation in Autonomic Systems using Incremental Utility Elicitation.
Proceedings of the UAI '03, 2003

Bounded Finite State Controllers.
Proceedings of the Advances in Neural Information Processing Systems 16, 2003

Incremental Utility Elicitation with the Minimax Regret Decision Criterion.
Proceedings of the IJCAI-03, 2003

A Bayesian Approach to Imitation in Reinforcement Learning.
Proceedings of the IJCAI-03, 2003

Towards Cooperative Negotiation for Decentralized Resource Allocation in Autonomic Computing Systems.
Proceedings of the IJCAI-03, 2003

On the Foundations of Expected Expected Utility.
Proceedings of the IJCAI-03, 2003

Constraint-Based Optimization with the Minimax Decision Criterion.
Proceedings of the Principles and Practice of Constraint Programming, 2003

Coordination in multiagent reinforcement learning: a Bayesian approach.
Proceedings of the Second International Joint Conference on Autonomous Agents & Multiagent Systems, 2003

An Active Approach to Collaborative Filtering.
Proceedings of the Ninth International Workshop on Artificial Intelligence and Statistics, 2003

Value-Directed Compression of POMDPs.
Proceedings of the Advances in Neural Information Processing Systems 15, 2002

Piecewise Linear Value Function Approximation for Factored MDPs.
Proceedings of the Eighteenth National Conference on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence, July 28, 2002

Greedy Linear Value-Approximation for Factored Markov Decision Processes.
Proceedings of the Eighteenth National Conference on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence, July 28, 2002

Solving Concisely Expressed Combinatorial Auction Problems.
Proceedings of the Eighteenth National Conference on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence, July 28, 2002

A POMDP Formulation of Preference Elicitation Problems.
Proceedings of the Eighteenth National Conference on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence, July 28, 2002

Partial-Order Planning with Concurrent Interacting Actions.
J. Artif. Intell. Res., 2001

Value-Directed Sampling Methods for POMDPs.
Proceedings of the UAI '01: Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence, 2001

Vector-space Analysis of Belief-state Approximation for POMDPs.
Proceedings of the UAI '01: Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence, 2001

UCP-Networks: A Directed Graphical Representation of Conditional Utilities.
Proceedings of the UAI '01: Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence, 2001

Symbolic Dynamic Programming for First-Order MDPs.
Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, 2001

Bidding Languages for Combinatorial Auctions.
Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, 2001

Imitation and Reinforcement Learning in Agents with Heterogeneous Actions.
Proceedings of the Advances in Artificial Intelligence, 2001

Stochastic dynamic programming with factored representations.
Artif. Intell., 2000

Value-Directed Belief State Approximation for POMDPs.
Proceedings of the UAI '00: Proceedings of the 16th Conference in Uncertainty in Artificial Intelligence, Stanford University, Stanford, California, USA, June 30, 2000

Approximately Optimal Monitoring of Plan Preconditions.
Proceedings of the UAI '00: Proceedings of the 16th Conference in Uncertainty in Artificial Intelligence, Stanford University, Stanford, California, USA, June 30, 2000

APRICODD: Approximate Policy Construction Using Decision Diagrams.
Proceedings of the Advances in Neural Information Processing Systems 13, 2000

Solving Combinatorial Auctions Using Stochastic Local Search.
Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence, July 30, 2000

Decision-Theoretic, High-Level Agent Programming in the Situation Calculus.
Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence, July 30, 2000

Decision Making under Uncertainty: Operations Research Meets AI (Again).
Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence, July 30, 2000

Decision-Theoretic Planning: Structural Assumptions and Computational Leverage.
J. Artif. Intell. Res., 1999

Multiagent Systems: Challenges and Opportunities for Decision-Theoretic Planning.
AI Mag., 1999

SPUDD: Stochastic Planning using Decision Diagrams.
Proceedings of the UAI '99: Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence, Stockholm, Sweden, July 30, 1999

Continuous Value Function Approximation for Sequential Bidding Policies.
Proceedings of the UAI '99: Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence, Stockholm, Sweden, July 30, 1999

Reasoning With Conditional Ceteris Paribus Preference Statements.
Proceedings of the UAI '99: Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence, Stockholm, Sweden, July 30, 1999

Sequential Auctions for the Allocation of Resources with Complementarities.
Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence, 1999

Resource Allocation Using Sequential Auctions.
Proceedings of the Agent Mediated Electronic Commerce II, 1999

Sequential Optimality and Coordination in Multiagent Systems.
Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence, 1999

Implicit Imitation in Multiagent Reinforcement Learning.
Proceedings of the Sixteenth International Conference on Machine Learning (ICML 1999), Bled, Slovenia, June 27, 1999

Knowledge Representation for Stochastic Decision Processes.
Proceedings of the Artificial Intelligence Today: Recent Trends and Developments, 1999

A Unified Model of Qualitative Belief Change: A Dynamical Systems Perspective.
Artif. Intell., 1998

Hierarchical Solution of Markov Decision Processes using Macro-actions.
Proceedings of the UAI '98: Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence, 1998

Structured Reachability Analysis for Markov Decision Processes.
Proceedings of the UAI '98: Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence, 1998

Solving Very Large Weakly Coupled Markov Decision Processes.
Proceedings of the Fifteenth National Conference on Artificial Intelligence and Tenth Innovative Applications of Artificial Intelligence Conference, 1998

The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems.
Proceedings of the Fifteenth National Conference on Artificial Intelligence and Tenth Innovative Applications of Artificial Intelligence Conference, 1998

Belief Revision with Unreliable Observations.
Proceedings of the Fifteenth National Conference on Artificial Intelligence and Tenth Innovative Applications of Artificial Intelligence Conference, 1998

Abstraction and Approximate Decision-Theoretic Planning.
Artif. Intell., 1997

Economic Principles of Multi-Agent Systems.
Artif. Intell., 1997

Structured Arc Reversal and Simulation of Dynamic Probabilistic Networks.
Proceedings of the UAI '97: Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence, 1997

Correlated Action Effects in Decision Theoretic Regression.
Proceedings of the UAI '97: Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence, 1997

Prioritized Goal Decomposition of Markov Decision Processes: Toward a Synthesis of Classical and Decision Theoretic Planning.
Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, 1997

Planning with Concurrent Interacting Actions.
Proceedings of the Fourteenth National Conference on Artificial Intelligence and Ninth Innovative Applications of Artificial Intelligence Conference, 1997

Structured Solution Methods for Non-Markovian Decision Processes.
Proceedings of the Fourteenth National Conference on Artificial Intelligence and Ninth Innovative Applications of Artificial Intelligence Conference, 1997

Iterated revision and minimal change of conditional beliefs.
J. Philos. Log., 1996

Abduction to Plausible Causes: An Event-Based model of Belief Update.
Artif. Intell., 1996

Context-Specific Independence in Bayesian Networks.
Proceedings of the UAI '96: Proceedings of the Twelfth Annual Conference on Uncertainty in Artificial Intelligence, 1996

Learning Conventions in Multiagent Stochastic Domains using Likelihood Estimates.
Proceedings of the UAI '96: Proceedings of the Twelfth Annual Conference on Uncertainty in Artificial Intelligence, 1996

Planning, Learning and Coordination in Multiagent Decision Processes.
Proceedings of the Sixth Conference on Theoretical Aspects of Rationality and Knowledge, 1996

Approximate Value Trees in Structured Dynamic Programming.
Proceedings of the Machine Learning, 1996

The Frame Problem and Bayesian Network Action Representation.
Proceedings of the Advances in Artificial Intelligence, 1996

Computing Optimal Policies for Partially Observable Decision Processes Using Compact Representations.
Proceedings of the Thirteenth National Conference on Artificial Intelligence and Eighth Innovative Applications of Artificial Intelligence Conference, 1996

Rewarding Behaviors.
Proceedings of the Thirteenth National Conference on Artificial Intelligence and Eighth Innovative Applications of Artificial Intelligence Conference, 1996

On the Revision of Probabilistic Belief States.
Notre Dame J. Formal Log., 1995

Abduction as Belief Revision.
Artif. Intell., 1995

Process-Oriented Planning and Average-Reward Optimality.
Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, 1995

Exploiting Structure in Policy Construction.
Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, 1995

Generalized Update: Belief Change in Dynamic Settings.
Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, 1995

Modal logics for qualitative possibility theory.
Int. J. Approx. Reason., 1994

Believing on the Basis of Qualitative Rules: Commentary on Kyburg.
Comput. Intell., 1994

Conditional Logics of Normality: A Modal Approach.
Artif. Intell., 1994

Unifying Default Reasoning and Belief Revision in a Modal Framework.
Artif. Intell., 1994

Integrating Planning and Execution in Stochastic Domains.
Proceedings of the UAI '94: Proceedings of the Tenth Annual Conference on Uncertainty in Artificial Intelligence, 1994

Toward a Logic for Qualitative Decision Theory.
Proceedings of the 4th International Conference on Principles of Knowledge Representation and Reasoning (KR'94). Bonn, 1994

Using Abstractions for Decision-Theoretic Planning with Time Constraints.
Proceedings of the 12th National Conference on Artificial Intelligence, Seattle, WA, USA, July 31, 1994

On the Semantics of Stable Inheritance Reasoning.
Comput. Intell., 1993

The Probability of a Possibility: Adding Uncertainty to Default Rules.
Proceedings of the UAI '93: Proceedings of the Ninth Annual Conference on Uncertainty in Artificial Intelligence, 1993

Revision Sequences and Nested Conditionals.
Proceedings of the 13th International Joint Conference on Artificial Intelligence. Chambéry, France, August 28, 1993

Revision by Conditional Beliefs.
Proceedings of the 11th National Conference on Artificial Intelligence. Washington, 1993

Abduction As Belief Revision: A Model of Preferred Explanations.
Proceedings of the 11th National Conference on Artificial Intelligence. Washington, 1993

Conditional logics for default reasoning and belief revision.
PhD thesis, 1992

Epistemic Entrenchment in autoepistemic logic.
Fundam. Informaticae, 1992

Modal Logics for Qualitative Possibility and Beliefs.
Proceedings of the UAI '92: Proceedings of the Eighth Annual Conference on Uncertainty in Artificial Intelligence, 1992

Normative, Subjunctive, and Autoepistemic Defaults: Adopting the Ramsey Test.
Proceedings of the 3rd International Conference on Principles of Knowledge Representation and Reasoning (KR'92). Cambridge, 1992

Normative, Subjunctive and Autoepistemic Defaults.
Proceedings of the Foundation of Knowledge Representation and Reasoning [the book grew out of an ECAI-92 workshop], 1992

A Logic for Revision and Subjunctive Queries.
Proceedings of the 10th National Conference on Artificial Intelligence, 1992

Inaccessible Worlds and Irrelevance: Preliminary Report.
Proceedings of the 12th International Joint Conference on Artificial Intelligence. Sydney, 1991

Conditional Logics of Normality as Modal Systems.
Proceedings of the 8th National Conference on Artificial Intelligence. Boston, Massachusetts, USA, July 29, 1990

A Semantical Approach to Stable Inheritance Reasoning.
Proceedings of the 11th International Joint Conference on Artificial Intelligence. Detroit, 1989
