Olivier Buffet

Orcid: 0000-0002-5072-5857

Affiliations:
  • University of Lorraine, Nancy, France
  • Henri Poincaré University, Nancy, France (PhD 2003)


According to our database1, Olivier Buffet authored at least 73 papers between 2001 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
HSVI Can Solve Zero-Sum Partially Observable Stochastic Games.
Dyn. Games Appl., September, 2024

How to Exhibit More Predictable Behaviors.
CoRR, 2024

Un cadre pour la planification consciente d'un observateur sous observabilité partielle.
Proceedings of the 18èmes Journées d'Intelligence Artificielle Fondamentale et 19èmes Journées Francophones sur la Planification, 2024

Solving Hierarchical Information-Sharing Dec-POMDPs: An Extensive-Form Game Approach.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Monte-Carlo Search for an Equilibrium in Dec-POMDPs.
Proceedings of the Uncertainty in Artificial Intelligence, 2023

Comment rendre des comportements plus prédictibles.
Proceedings of the 17èmes Journées d'Intelligence Artificielle Fondamentale, 2023

Global min-max Computation for α-Hölder Games.
Proceedings of the 35th IEEE International Conference on Tools with Artificial Intelligence, 2023

Robust Robot Planning for Human-Robot Collaboration.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

2021
Heuristic Search Value Iteration for Zero-Sum Stochastic Games.
IEEE Trans. Games, 2021

HSVI fo zs-POSGs using Concavity, Convexity and Lipschitz Properties.
CoRR, 2021

Solving infinite-horizon Dec-POMDPs using Finite State Controllers within JESP.
Proceedings of the 33rd IEEE International Conference on Tools with Artificial Intelligence, 2021

K-N-MOMDPs: Towards Interpretable Solutions for Adaptive Management.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
On Bellman's Optimality Principle for zs-POSGs.
CoRR, 2020

Optimally Solving Two-Agent Decentralized POMDPs Under One-Sided Information Sharing.
Proceedings of the 37th International Conference on Machine Learning, 2020

Monte Carlo Information-Oriented Planning.
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, 29 August-8 September 2020, Santiago de Compostela, Spain, August 29 - September 8, 2020, 2020

Solving K-MDPs.
Proceedings of the Thirtieth International Conference on Automated Planning and Scheduling, 2020

Reinforcement Learning.
Proceedings of the A Guided Tour of Artificial Intelligence Research: Volume I: Knowledge Representation, 2020

2018
rho-POMDPs have Lipschitz-Continuous epsilon-Optimal Value Functions.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Learning to Act in Continuous Dec-POMDPs.
Proceedings of the Journées Francophone Planification, 2018

Recherche heuristique pour jeux stochastiques (à somme nulle).
Proceedings of the Journées Francophone Planification, 2018

Learning to Act in Decentralized Partially Observable MDPs.
Proceedings of the 35th International Conference on Machine Learning, 2018

2017
Prise de décision séquentielle dans l'incertain : Exploiter la structure et rester dans le cadre.
, 2017

2016
Intersections intelligentes pour le contrôle de véhicules sans pilote. Coordination locale et optimisation globale.
Rev. d'Intelligence Artif., 2016

Goal Probability Analysis in Probabilistic Planning: Exploring and Enhancing the State of the Art.
J. Artif. Intell. Res., 2016

Optimally Solving Dec-POMDPs as Continuous-State MDPs.
J. Artif. Intell. Res., 2016

Revisiting Goal Probability Analysis in Probabilistic Planning.
Proceedings of the Twenty-Sixth International Conference on Automated Planning and Scheduling, 2016

2015
Structural Results for Cooperative Decentralized Control Models.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Exploiting Separability in Multiagent Planning with Continuous-State MDPs (Extended Abstract).
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

2014
Towards the Usage of Advanced Behavioral Simulations for Simultaneous Tracking and Activity Recognition.
Proceedings of the STAIRS 2014, 2014

Error-Bounded Approximations for Infinite-Horizon Discounted Decentralized POMDPs.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014

Decentralized traffic management: A synchronization-based intersection control.
Proceedings of the International Conference on Advanced Logistics and Transport, 2014

Tracking multiple interacting targets using a joint probabilistic Data Association filter.
Proceedings of the 17th International Conference on Information Fusion, 2014

Stop-Free Strategies for Traffic Networks: Decentralized On-line Optimization.
Proceedings of the ECAI 2014 - 21st European Conference on Artificial Intelligence, 18-22 August 2014, Prague, Czech Republic, 2014

Simultaneous Tracking and Activity Recognition (STAR) using Advanced Agent-Based Behavioral Simulations.
Proceedings of the ECAI 2014 - 21st European Conference on Artificial Intelligence, 18-22 August 2014, Prague, Czech Republic, 2014

Learning Pruning Rules for Heuristic Search Planning.
Proceedings of the ECAI 2014 - 21st European Conference on Artificial Intelligence, 18-22 August 2014, Prague, Czech Republic, 2014

Simulation-based behavior tracking of pedestrians in partially observed indoor environments.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Exploiting separability in multiagent planning with continuous-state MDPs.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

2013
Introduction.
Rev. d'Intelligence Artif., 2013

Les POMDP font de meilleurs hackers: Tenir compte de l'incertitude dans les tests de penetration.
CoRR, 2013

Penetration Testing == POMDP Solving?
CoRR, 2013

Reactive Coordination Rules for Traffic Optimization in Road Sharing Problems.
Proceedings of the Highlights on Practical Applications of Agents and Multi-Agent Systems, 2013

Adaptive Management of Migratory Birds Under Sea Level Rise.
Proceedings of the IJCAI 2013, 2013

2012
Cooperative Behaviors for the Self-Regulation of Autonomous Vehicles in Space Sharing Conflicts.
Proceedings of the IEEE 24th International Conference on Tools with Artificial Intelligence, 2012

Near-Optimal BRL using Optimistic Local Transitions.
Proceedings of the 29th International Conference on Machine Learning, 2012

POMDPs Make Better Hackers: Accounting for Uncertainty in Penetration Testing.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

MOMDPs: A Solution for Modelling Adaptive Management Problems.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011
Optimal Priority Assignment Algorithms for Probabilistic Real-Time Systems.
Proceedings of the 19th International Conference on Real-Time and Network Systems, 2011

Active Learning of MDP Models.
Proceedings of the Recent Advances in Reinforcement Learning - 9th European Workshop, 2011

2010
A POMDP Extension with Belief-dependent Rewards.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

From "I Like" to "I Prefer" in Collaborative Filtering.
Proceedings of the 22nd IEEE International Conference on Tools with Artificial Intelligence, 2010

A Closer Look at MOMDPs.
Proceedings of the 22nd IEEE International Conference on Tools with Artificial Intelligence, 2010

Influence of different execution models on patrolling ant behaviors: from agents to robots.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

2009
The factored policy-gradient planner.
Artif. Intell., 2009

Self-Organization of Patrolling-Ant Algorithms.
Proceedings of the Third IEEE International Conference on Self-Adaptive and Self-Organizing Systems, 2009

Global Multiprocessor Real-Time Scheduling as a Constraint Satisfaction Problem.
Proceedings of the ICPPW 2009, 2009

2008
Theoretical Study of Ant-based Algorithms for Multi-Agent Patrolling.
Proceedings of the ECAI 2008, 2008

2007
Policy-Gradients for PSRs and POMDPs.
Proceedings of the Eleventh International Conference on Artificial Intelligence and Statistics, 2007

Reachability Analysis for Uncertain SSPS.
Int. J. Artif. Intell. Tools, 2007

Shaping multi-agent systems with gradient reinforcement learning.
Auton. Agents Multi Agent Syst., 2007

Factored Planning Using Decomposition Trees.
Proceedings of the IJCAI 2007, 2007

FF + FPG: Guiding a Policy-Gradient Planner.
Proceedings of the Seventeenth International Conference on Automated Planning and Scheduling, 2007

Concurrent Probabilistic Temporal Planning with Policy-Gradients.
Proceedings of the Seventeenth International Conference on Automated Planning and Scheduling, 2007

2006
Étude de différentes combinaisons de comportements adaptatives.
Rev. d'Intelligence Artif., 2006

2005
Développement autonome des comportements de base d'un agent.
Rev. d'Intelligence Artif., 2005

Robust Planning with (L)RTDP.
Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005

Planification robuste avec (L)RTDP.
Proceedings of the Actes de CAP 05, Conférence francophone sur l'apprentissage automatique, 2005

2003
Une double approche modulaire de l'apprentissage par renforcement pour des agents intelligents adaptatifs. (A Twofold Modular Approach of Reinforcement Learning for Adaptive Intelligent Agents).
PhD thesis, 2003

Apprentissage par renforcement pour la conception de systèmes multi-agents réactifs.
Tech. Sci. Informatiques, 2003

Automatic generation of an agent's basic behaviors.
Proceedings of the Second International Joint Conference on Autonomous Agents & Multiagent Systems, 2003

2002
Adaptive Combination of Behaviors in an Agent.
Proceedings of the 15th European Conference on Artificial Intelligence, 2002

Learning to weigh basic behaviors in scalable agents.
Proceedings of the First International Joint Conference on Autonomous Agents & Multiagent Systems, 2002

2001
Multi-Agent Systems by Incremental Gradient Reinforcement Learning.
Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, 2001

Incremental reinforcement learning for designing multi-agent systems.
Proceedings of the Fifth International Conference on Autonomous Agents, 2001


  Loading...