Pablo Hernandez-Leal

Orcid: 0000-0002-8530-6775

According to our database1, Pablo Hernandez-Leal authored at least 44 papers between 2011 and 2021.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2021
Unobtrusive Stress Assessment Using Smartphones.
IEEE Trans. Mob. Comput., 2021

Robust Risk-Sensitive Reinforcement Learning Agents for Trading Markets.
CoRR, 2021

2020
CDT: Cascading Decision Trees for Explainable Reinforcement Learning.
CoRR, 2020

Work in Progress: Temporally Extended Auxiliary Tasks.
CoRR, 2020

Safe reinforcement learning using risk mapping by similarity.
Adapt. Behav., 2020

A Very Condensed Survey and Critique of Multiagent Deep Reinforcement Learning.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Uncertainty-Aware Action Advising for Deep Reinforcement Learning Agents.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Providing Uncertainty-Based Advice for Deep Reinforcement Learning Agents (Student Abstract).
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition.
CoRR, 2019

Safer Deep RL with Shallow MCTS: A Case Study in Pommerman.
CoRR, 2019

A survey and critique of multiagent deep reinforcement learning.
Auton. Agents Multi Agent Syst., 2019

Towers of Saliency: A Reinforcement Learning Visualization Using Immersive Environments.
Proceedings of the 2019 ACM International Conference on Interactive Surfaces and Spaces, 2019

An Exchange Mechanism to Coordinate Flexibility in Residential Energy Cooperatives.
Proceedings of the IEEE International Conference on Industrial Technology, 2019

Action Guidance with MCTS for Deep Reinforcement Learning.
Proceedings of the Fifteenth AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, 2019

Terminal Prediction as an Auxiliary Task for Deep Reinforcement Learning.
Proceedings of the Fifteenth AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, 2019

Agent Modeling as Auxiliary Task for Deep Reinforcement Learning.
Proceedings of the Fifteenth AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, 2019

On Hard Exploration for Reinforcement Learning: A Case Study in Pommerman.
Proceedings of the Fifteenth AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, 2019

2018
Using Monte Carlo Tree Search as a Demonstrator within Asynchronous Deep RL.
CoRR, 2018

Is multiagent deep reinforcement learning the answer or the question? A brief survey.
CoRR, 2018

Load Classification and Forecasting for Temporary Power Installations.
Proceedings of the 2018 IEEE PES Innovative Smart Grid Technologies Conference Europe, 2018

Coordinating Distributed and Flexible Resources: A Case-study of Residential Cooperatives.
Proceedings of the 2018 IEEE PES Innovative Smart Grid Technologies Conference Europe, 2018

2017
A Survey of Learning in Multiagent Environments: Dealing with Non-Stationarity.
CoRR, 2017

An exploration strategy for non-stationary opponents.
Auton. Agents Multi Agent Syst., 2017

Efficiently detecting switches against non-stationary opponents.
Auton. Agents Multi Agent Syst., 2017

An Exploration Strategy Facing Non-Stationary Agents.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

Detecting Switches Against Non-Stationary Opponents.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

Towards a Fast Detection of Opponents in Repeated Stochastic Games.
Proceedings of the Autonomous Agents and Multiagent Systems, 2017

2016
Stress modelling and prediction in presence of scarce data.
J. Biomed. Informatics, 2016

A Bayesian Approach for Learning and Tracking Switching, Non-Stationary Opponents: (Extended Abstract).
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Identifying and Tracking Switching, Non-Stationary Opponents: A Bayesian Approach.
Proceedings of the Multiagent Interaction without Prior Coordination, 2016

2015
Bidding in Non-Stationary Energy Markets.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Opponent Modeling against Non-stationary Strategies: (Doctoral Consortium).
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Stress Modelling Using Transfer Learning in Presence of Scarce Data.
Proceedings of the Ambient Intelligence for Health - First International Conference, 2015

2014
Multi-label classification with Bayesian network-based chain classifiers.
Pattern Recognit. Lett., 2014

A framework for learning and planning against switching strategies in repeated games.
Connect. Sci., 2014

Using a Priori Information for Fast Learning Against Non-stationary Opponents.
Proceedings of the Advances in Artificial Intelligence - IBERAMIA 2014, 2014

2013
InstanceRank based on borders for instance selection.
Pattern Recognit., 2013

Learning temporal nodes Bayesian networks.
Int. J. Approx. Reason., 2013

Discovering human immunodeficiency virus mutational pathways using temporal Bayesian networks.
Artif. Intell. Medicine, 2013

Strategic Interactions Among Agents with Bounded Rationality.
Proceedings of the IJCAI 2013, 2013

Modeling non-stationary opponents.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

2012
Contrasting Temporal Bayesian Network Models for Analyzing HIV Mutations.
Proceedings of the Ninth UAI Bayesian Modeling Applications Workshop, 2012

2011
Learning Temporal Bayesian Networks for Power Plant Diagnosis.
Proceedings of the Modern Approaches in Applied Intelligence, 2011

Learning Temporal Nodes Bayesian Networks.
Proceedings of the Twenty-Fourth International Florida Artificial Intelligence Research Society Conference, 2011


  Loading...