Philippe Preux

CoRR, 2023

Optimal Interpretability-Performance Trade-off of Classification Trees with Black-Box Reinforcement Learning.

Hector Kohler

Riad Akrour

CoRR, 2023

Augmentation de jeux de données RI pour la recherche conversationnelle à initiative mixte.

Proceedings of the Actes de CORIA-TALN 2023. Actes de la 18e Conférence en Recherche d'Information et Applications, 2023

Soft Action Priors: Towards Robust Policy Transfer.

Matheus Centa

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Entropy Regularized Reinforcement Learning with Cascading Networks.

Riccardo Della Vecchia

Alena Shilova

Riad Akrour

CoRR, 2022

gym-DSSAT: a crop model turned into a Reinforcement Learning environment.

David Emukpere

CoRR, 2022

Reinforcement learning for crop management support: Review, prospects and challenges.

Romain Gautron

Marc Corbeels

Régis Sabbadin

Comput. Electron. Agric., 2022

Automated Planning for Robotic Guidewire Navigation in the Coronary Arteries.

Proceedings of the 5th IEEE International Conference on Soft Robotics, 2022

2021

More Efficient Exploration with Symbolic Priors on Action Sequence Equivalences.

CoRR, 2021

Low-Rank Projections of GCNs Laplacian.

Nathan Grinsztajn

Edouard Oyallon

CoRR, 2021

Interferometric Graph Transform for Community Labeling.

CoRR, 2021

There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Don't Do What Doesn't Matter: Intrinsic Motivation with Action Usefulness.

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Learning Value Functions in Deep Policy Gradients using Residual Variance.

Reda Ouhamma

Proceedings of the 9th International Conference on Learning Representations, 2021

Adversarially Guided Actor-Critic.

Proceedings of the 9th International Conference on Learning Representations, 2021

READYS: A Reinforcement Learning Based Strategy for Heterogeneous Dynamic Scheduling.

Proceedings of the IEEE International Conference on Cluster Computing, 2021

2020

Is Standard Deviation the New Standard? Revisiting the Critic in Deep Policy Gradients.

Reda Ouhamma

CoRR, 2020

Geometric deep reinforcement learning for dynamic DAG scheduling.

Proceedings of the 2020 IEEE Symposium Series on Computational Intelligence, 2020

A Machine of Few Words: Interactive Speaker Recognition with Reinforcement Learning.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

"I'm Sorry Dave, I'm Afraid I Can't Do That" Deep Q-Learning from Forbidden Actions.

Mathieu Seurin

Olivier Pietquin

Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

Only Relevant Information Matters: Filtering Out Noisy Samples To Boost RL.

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

2019

"I'm sorry Dave, I'm afraid I can't do that" Deep Q-learning from forbidden action.

Mathieu Seurin

Olivier Pietquin

CoRR, 2019

High-Dimensional Control Using Generalized Auxiliary Tasks.

CoRR, 2019

Samples are not all useful: Denoising policy gradient updates using variance.

CoRR, 2019

Energy Management for Microgrids: a Reinforcement Learning Approach.

Proceedings of the 2019 IEEE PES Innovative Smart Grid Technologies Europe, 2019

2018

Correctness attraction: a study of stability of software behavior under runtime perturbation.

Empir. Softw. Eng., 2018

Recurrent Neural Networks for Long and Short-Term Sequential Recommendation.

CoRR, 2018

Visual Reasoning with Multi-hop Feature Modulation.

Proceedings of the Computer Vision - ECCV 2018, 2018

2017

A large-scale study of call graph-based impact prediction using mutation testing.

Softw. Qual. J., 2017

A Multi-Armed Bandit Model Selection for Cold-Start User Recommendation.

Crícia Z. Felício

Klérisson V. R. Paixão

Célia A. Zorzo Barcelos

Proceedings of the 25th Conference on User Modeling, Adaptation and Personalization, 2017

A Generative Model for Sparse, Evolving Digraphs.

Georgios Papoudakis

Proceedings of the Complex Networks & Their Applications VI, 2017

2016

Consistent Algorithms for Clustering Time Series.

J. Mach. Learn. Res., 2016

Operator-valued Kernels for Learning from Functional Response Data.

J. Mach. Learn. Res., 2016

Exploiting Social Information in Pairwise Preference Recommender System.

Crícia Z. Felício

Klérisson V. R. Paixão

Guilherme Alves

Sandra de Amo

J. Inf. Data Manag., 2016

Mutation-Based Graph Inference for Fault Localization.

Proceedings of the 16th IEEE International Working Conference on Source Code Analysis and Manipulation, 2016

Scalable Explore-Exploit Collaborative filtering.

Frédéric Guillou

Proceedings of the 20th Pacific Asia Conference on Information Systems, 2016

Large-Scale Bandit Recommender System.

Frédéric Guillou

Proceedings of the Machine Learning, Optimization, and Big Data, 2016

Preference-Like Score to Cope with Cold-Start User in Recommender Systems.

Crícia Z. Felício

Klérisson V. R. Paixão

Célia A. Z. Barcelos

Proceedings of the 28th IEEE International Conference on Tools with Artificial Intelligence, 2016

A learning algorithm for change impact prediction.

Proceedings of the 5th International Workshop on Realizing Artificial Intelligence Synergies in Software Engineering, 2016

Sequential Collaborative Ranking Using (No-)Click Implicit Feedback.

Frédéric Guillou

Proceedings of the Neural Information Processing - 23rd International Conference, 2016

2015

A Learning Algorithm for Change Impact Prediction: Experimentation on 7 Java Applications.

CoRR, 2015

Bandits and Recommender Systems.

Proceedings of the Machine Learning, Optimization, and Big Data, 2015

An Experimental Protocol for Analyzing the Accuracy of Software Error Impact Analysis.

Proceedings of the 10th IEEE/ACM International Workshop on Automation of Software Test, 2015

Simultaneous optimistic optimization on the noiseless BBOB testbed.

Bilel Derbel

Proceedings of the IEEE Congress on Evolutionary Computation, 2015

2014

Cold-start Problems in Recommendation Systems via Contextual-bandit Algorithms.

Hai Thanh Nguyen

CoRR, 2014

A Generative Model of Software Dependency Graphs to Better Understand Software Evolution.

CoRR, 2014

Bandits Warm-up Cold Recommender Systems.

CoRR, 2014

Understanding software evolution: the maisqual ant data set.

Boris Baldassari

Proceedings of the 11th Working Conference on Mining Software Repositories, 2014

Improving offline evaluation of contextual bandit algorithms via bootstrapping techniques.

Olivier Nicol

Proceedings of the 31st International Conference on Machine Learning, 2014

De l'ombre à la lumière : plus de visibilité sur l'Eclipse.

Boris Baldassari

Flavien Huynh

Proceedings of the 14èmes Journées Francophones Extraction et Gestion des Connaissances, 2014

Bandits attack function optimization.

Rémi Munos

Michal Valko

Proceedings of the IEEE Congress on Evolutionary Computation, 2014

2013

Multiple functional regression with both discrete and continuous covariates.

CoRR, 2013

Functional Regularized Least Squares Classification with Operator-valued Kernels.

CoRR, 2013

A Generalized Kernel Approach to Structured Output Learning.

Hachem Kadri

Mohammad Ghavamzadeh

Proceedings of the 30th International Conference on Machine Learning, 2013

2012

Sequential approaches for learning datum-wise sparse representations.

Mach. Learn., 2012

ICML Exploration & Exploitation Challenge: Keep it simple!

Olivier Nicol

Proceedings of the Workshop on On-line Trading of Exploration and Exploitation 2, 2012

Online Clustering of Processes.

Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

Managing advertising campaigns - an approximate planning approach.

Frontiers Comput. Sci., 2012

Fast Reinforcement Learning with Large Action Sets Using Error-Correcting Output Codes for MDP Factorization.

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2012

Multiple Operator-valued Kernel Learning.

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

2011

Datum-Wise Classification: A Sequential Approach to Sparsity.

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2011

Functional Regularized Least Squares Classification with Operator-valued Kernels.

Proceedings of the 28th International Conference on Machine Learning, 2011

Learning vocal tract variables with multi-task kernels.

Hachem Kadri

Emmanuel Duflos

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

Nonlinear functional regression: a functional RKHS approach.

Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010

The Iso-regularization Descent Algorithm for the LASSO.

Proceedings of the Neural Information Processing. Theory and Algorithms, 2010

Advertising Campaigns Management: Should We Be Greedy?

Proceedings of the ICDM 2010, 2010

Affichage de publicités sur des portails web.

Victor Gabillon

Proceedings of the Extraction et gestion des connaissances (EGC'2010), 2010

2009

ECON: A Kernel Basis Pursuit Algorithm with Automatic Feature Parameter Tuning, and its Application to Photometric Solids Approximation.

Proceedings of the International Conference on Machine Learning and Applications, 2009

Feature discovery in approximate dynamic programming.

Proceedings of the IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2009

2008

Basis Function Construction in Reinforcement Learning Using Cascade-Correlation Learning Architecture.

Proceedings of the Seventh International Conference on Machine Learning and Applications, 2008

Basis Expansion in Natural Actor Critic Methods.

Proceedings of the Recent Advances in Reinforcement Learning, 8th European Workshop, 2008

Feature Discovery in Reinforcement Learning Using Genetic Programming.

Proceedings of the Genetic Programming, 11th European Conference, 2008

2007

A unified view of TD algorithms, introducing Full-gradient TD and Equi-gradient descent TD.

Manuel Davy

Proceedings of the 15th European Symposium on Artificial Neural Networks, 2007

2006

A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD.

CoRR, 2006

2004

A generic architecture for adaptive agents based on reinforcement learning.

Inf. Sci., 2004

2003

"Virtual laboratory environment" (VLE): a software environment oriented agent and object for modeling and simulation of complex systems.

Éric Ramat

Simul. Model. Pract. Theory, 2003

2002

Propagation of Q-values in Tabular TD(lambda).

Proceedings of the Machine Learning: ECML 2002, 2002

2001

Selection of Behavior in Social Situations.

Proceedings of the Applications of Evolutionary Computing, 2001

Learning as a Consequence of Selection.

Proceedings of the Artificial Evolution, 2001

2000

Virtual Laboratory Environment (VLE) : un environnement multi-agents pour la modélisation et la simulation d'écosystèmes (démonstration).

Éric Ramat

Proceedings of the Systèmes multi-agents : Méthodologie, technologie et expériences - JFIADSMA 00, 2000

1999

Evolution of Cooperation within a Behavior-Based Perspective: Confronting Nature and Animats.

Proceedings of the Artificial Evolution, 4th European Conference, 1999

1998

The fitness function and its impact on local search methods.

Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, 1998

A Bit-Wise Epistasis Measure for Binary Search Spaces.

Cyril Fonlupt

Denis Robilliard

Proceedings of the Parallel Problem Solving from Nature, 1998

1996

Climbing Up NP-Hard Hills.

David Duvivier

El-Ghazali Talbi

Proceedings of the Parallel Problem Solving from Nature, 1996

1992

Performance improvement for vector pipeline multiprocessor systems using a disordered execution model.

Proceedings of the 19th Annual International Symposium on Computer Architecture. Gold Coast, 1992

1990

EVA: an explicit vector language.

Jean-Luc Dekeyser

Philippe Marquet

ACM SIGPLAN Notices, 1990

Vector addressing processor for direct and indirect accesses.

Jean-Luc Dekeyser

Philippe Marquet