Olivier Pietquin
ORCID: 0000-0002-5386-465X
Affiliations:
- Google DeepMind
- University Lille 1, France
According to our database, Olivier Pietquin authored at least 211 papers between 2002 and 2024.
Bibliography
2024
Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion.
CoRR, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Population-aware Online Mirror Descent for Mean-Field Games by Deep Reinforcement Learning.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024
Countering Reward Over-Optimization in LLM with Demonstration-Guided Reinforcement Learning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Back to Basics: Revisiting REINFORCE-Style Optimization for Learning from Human Feedback in LLMs.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Trans. Assoc. Comput. Linguistics, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice.
Proceedings of the International Conference on Machine Learning, 2023
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
CoRR, 2022
CoRR, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
CoRR, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021
2020
The Monte Carlo Transformer: a stochastic self-attention model for sequence prediction.
CoRR, 2020
CoRR, 2020
HIGhER: Improving instruction following with Hindsight Generation for Experience Replay.
Proceedings of the 2020 IEEE Symposium Series on Computational Intelligence, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020
Proceedings of The 12th Asian Conference on Machine Learning, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Proceedings of the A Guided Tour of Artificial Intelligence Research: Volume I: Knowledge Representation, 2020
2019
CoRR, 2019
CoRR, 2019
Targeted Attacks on Deep Reinforcement Learning Agents through Adversarial Observations.
CoRR, 2019
Self-Educated Language Agent with Hindsight Experience Replay for Instruction Following.
Proceedings of the Visually Grounded Interaction and Language (ViGIL), 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019
2018
Proceedings of the 6th International Conference on Learning Representations, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the Computer Vision - ECCV 2018, 2018
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Proceedings of the Handbook of Multimodal-Multisensor Interfaces: Foundations, User Modeling, and Common Modality Combinations, 2018
2017
IEEE Trans. Neural Networks Learn. Syst., 2017
Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards.
CoRR, 2017
Proceedings of the Second Conference on Machine Translation, 2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017
2016
Proceedings of the Toward Robotic Socially Believable Behaving Systems - Volume I, 2016
CoRR, 2016
CoRR, 2016
CoRR, 2016
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016
Compact and Interpretable Dialogue State Representation with Genetic Sparse Distributed Memory.
Proceedings of the Dialogues with Social Robots, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 33rd International Conference on Machine Learning, 2016
Proceedings of the 33rd International Conference on Machine Learning, 2016
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016
On the Use of Non-Stationary Strategies for Solving Two-Player Zero-Sum Markov Games.
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016
2015
Proceedings of the IEEE Symposium Series on Computational Intelligence, 2015
Proceedings of the SIGDIAL 2015 Conference, 2015
Learning of scanning strategies for electronic support using predictive state representations.
Proceedings of the 25th IEEE International Workshop on Machine Learning for Signal Processing, 2015
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015
Proceedings of the Neural Information Processing - 22nd International Conference, 2015
Proceedings of the Neural Information Processing - 22nd International Conference, 2015
Proceedings of the 4th Workshop on Machine Learning for Interactive Systems, 2015
Proceedings of the 32nd International Conference on Machine Learning, 2015
Proceedings of the 4th Workshop on Machine Learning for Interactive Systems (MLIS-2015).
Proceedings of the 4th Workshop on Machine Learning for Interactive Systems, 2015
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
2014
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014
Subspace identification for predictive state representation by nuclear norm minimization.
Proceedings of the 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2014
Proceedings of the 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2014
2013
IEEE Trans. Neural Networks Learn. Syst., 2013
Rev. d'Intelligence Artif., 2013
Proceedings of the Statistical Language and Speech Processing, 2013
Proceedings of the SIGDIAL 2013 Conference, 2013
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2013
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 2nd Workshop on Machine Learning for Interactive Systems, 2013
Proceedings of the Innovative and Creative Developments in Multimodal Interaction Systems, 2013
Random projections: A remedy for overfitting issues in time series prediction with echo state networks.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013
2012
Introduction to the Issue on Advances in Spoken Dialogue Systems and Mobile Interface.
IEEE J. Sel. Top. Signal Process., 2012
A Comprehensive Reinforcement Learning Framework for Dialogue Management Optimization.
IEEE J. Sel. Top. Signal Process., 2012
Optimisation d'un tuteur intelligent à partir d'un jeu de données fixé (Optimization of a tutoring system from a fixed set of data) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012
Proceedings of the STAIRS 2012, 2012
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012
Statistical User Simulation for Spoken Dialogue Systems: What for, Which Data, Which Future?
Proceedings of the Workshop on Future directions and needs in the Spoken Dialog Community: Tools and Data, 2012
Proceedings of the Natural Interaction with Robots, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
A Reinforcement Learning Approach to Optimize the longitudinal Behavior of a Partial Autonomous Driving Assistance System.
Proceedings of the ECAI 2012, 2012
Proceedings of the 10th ITG Conference on Speech Communication, 2012
2011
ACM Trans. Speech Lang. Process., 2011
Introduction to special issue on machine learning for adaptivity in spoken dialogue systems.
ACM Trans. Speech Lang. Process., 2011
Functional Segmentation of Renal DCE-MRI Sequences Using Vector Quantization Algorithms.
Neural Process. Lett., 2011
Proceedings of the Active Learning and Experimental Design workshop, 2011
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2011
Uncertainty Management for On-Line Optimisation of a POMDP-Based Large-Scale Spoken Dialogue System.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011
Sample Efficient On-Line Learning of Optimal Dialogue Policies with Kalman Temporal Differences.
Proceedings of the IJCAI 2011, 2011
Proceedings of the 10th International Conference on Machine Learning and Applications and Workshops, 2011
Automation Effects on Driver's Behaviour When Integrating a PADAS and a Distraction Classifier.
Proceedings of the Digital Human Modeling, 2011
Proceedings of the Recent Advances in Reinforcement Learning - 9th European Workshop, 2011
Proceedings of the 19th European Symposium on Artificial Neural Networks, 2011
Batch reinforcement learning for optimizing longitudinal driving assistance strategies.
Proceedings of the 2011 IEEE Symposium on Computational Intelligence in Vehicles and Transportation Systems, 2011
Proceedings of the 2011 IEEE Symposium on Computational Intelligence, 2011
Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, 2011
, 2011
2010
Nonlinear Bayesian Filtering for Denoising of Electrocardiograms Acquired in a Magnetic Resonance Environment.
IEEE Trans. Biomed. Eng., 2010
Rev. d'Intelligence Artif., 2010
Proceedings of the SIGDIAL 2010 Conference, 2010
Proceedings of the Modeling Decisions for Artificial Intelligence, 2010
Proceedings of the Spoken Dialogue Systems for Ambient Environments, 2010
Proceedings of the Spoken Dialogue Systems for Ambient Environments, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the International Conference on Ultra Modern Telecommunications, 2010
Proceedings of the International Conference on Ultra Modern Telecommunications, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the 18th European Signal Processing Conference, 2010
Proceedings of the 18th European Symposium on Artificial Neural Networks, 2010
2009
Proceedings of the Neural Information Processing, 16th International Conference, 2009
A specific QRS detector for electrocardiography during MRI: Using wavelets and local regularity characterization.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the 17th European Symposium on Artificial Neural Networks, 2009
Proceedings of the IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2009
2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the Recent Advances in Reinforcement Learning, 8th European Workshop, 2008
Functional semi-automated segmentation of renal DCE-MRI sequences using a Growing Neural Gas algorithm.
Proceedings of the 2008 16th European Signal Processing Conference, 2008
2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
2006
IEEE Trans. Speech Audio Process., 2006
Consistent Goal-Directed User Model for Realistic Man-Machine Task-Oriented Spoken Dialogue Simulation.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006
Dynamic Bayesian Networks for NLU Simulation with Applications to Dialog Optimal Strategy Learning.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Machine Learning for Spoken Dialogue Management: An Experiment with Speech-Based Database Querying.
Proceedings of the Artificial Intelligence: Methodology, 2006
2005
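Réseau bayesien pour un modèle d'utilisateur et un module de compréhension pour l'optimisation des systèmes de dialogues (Bayesian network for a user model and an understanding module for the optimization of dialogue systems) [in French].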
Proceedings of the Actes de la 12ème conférence sur le Traitement Automatique des Langues Naturelles. Articles courts, 2005
Comparing ASR modeling methods for spoken dialogue simulation and optimal strategy learning.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005
2004
Proceedings of the 16th conference on Association Francophone d'Interaction Homme-Machine, 2004
2003
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003
2002
Proceedings of the IEEE International Conference on Acoustics, 2002