Léonard Hussenot

According to our database¹, Léonard Hussenot authored at least 26 papers between 2019 and 2024.

Collaborative distances:

Dijkstra number² of three.
Erdős number³ of three.

Bibliography

2024

Gemma 2: Improving Open Language Models at a Practical Size.

Morgane Rivière, [...], Pier Giuseppe Sessa, [...], Surya Bhupatiraju, Léonard Hussenot, [...], Bobak Shahriari, Alexandre Ramé, [...], Michelle Casbon, [...], Charline Le Lan, [...], Anton Tsitsulin, [...], Shantanu Thakoor, Jean-Bastien Grill, Behnam Neyshabur, [...], Aliaksei Severyn, [...], Allen Hutchison, [...], Anthony Laforge, Antonia Paterson, [...], Christopher A. Choquette-Choo, Danila Sinopalnikov, David Weinberger, Dimple Vijaykumar, Dominika Rogozinska, Dustin Herbison, [...], Evgenii Eltyshev, Francesco Visin, Gabriel Rasskin, [...], Hanna Klimczak-Plucinska, [...], Joana Carrasqueira, [...], Joost van Amersfoort, [...], Josh Lipschultz, [...], Kartikeya Badola, [...], Keelin McDonell, [...], Kiranbir Sodhia, [...], Lars Lowe Sjösund, [...]

CoRR, 2024

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning.

CoRR, 2024

BOND: Aligning LLMs with Best-of-N Distillation.

CoRR, 2024

WARP: On the Benefits of Weight Averaged Rewarded Policies.

Alexandre Ramé, [...], Léonard Hussenot, Pierre-Louis Cedoz, Pier Giuseppe Sessa, [...], Arthur Douillard, [...]

CoRR, 2024

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models.

Aleksandar Botev, [...], Samuel L. Smith, Anushan Fernando, George-Cristian Muraru, [...], Leonard Berrada, [...], Pier Giuseppe Sessa, [...], Léonard Hussenot, [...], Kathleen Kenealy, [...], Surya Bhupatiraju, [...], Morgane Rivière, Mihir Sanjay Kale, [...], Srivatsan Srinivasan, Guillaume Desjardins, [...], Sebastian Borgeaud, [...], Antonia Paterson, [...], Nesh Devanathan, [...], Luiz Gustavo Martins, [...], David Huntsperger, [...], Zoubin Ghahramani, Clément Farabet, Koray Kavukcuoglu, [...], Nando de Freitas

CoRR, 2024

Gemma: Open Models Based on Gemini Research and Technology.

CoRR, 2024

WARM: On the Benefits of Weight Averaged Reward Models.

Alexandre Ramé, [...], Léonard Hussenot, [...], Geoffrey Cideron, [...]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

MusicRL: Aligning Music Generation to Human Preferences.

Geoffrey Cideron, [...], Brian McWilliams, Victor Ungureanu, [...], Olivier Pietquin, [...], Léonard Hussenot, [...], Andrea Agostinelli

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Conditional Language Policy: A General Framework For Steerable Multi-Objective Finetuning.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023

Get Back Here: Robust Imitation by Return-to-Distribution Planning.

Geoffrey Cideron, Baruch Tabanpour, [...], Léonard Hussenot, Gabriel Dulac-Arnold, [...], Olivier Pietquin, [...]

CoRR, 2023

Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback.

[...], Geoffrey Cideron, [...], Léonard Hussenot, [...], Sabela Ramos Garea, [...], Avinatan Hassidim, Olivier Pietquin, [...]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Apprentissage par démonstrations : transfert des motivations humaines aux algorithmes. (Apprenticeship learning : transferring human motivations to artificial agents).

Léonard Hussenot

PhD thesis, 2022

vec2text with Round-Trip Translations.

Geoffrey Cideron, [...], Olivier Pietquin, [...], Léonard Hussenot

CoRR, 2022

Learning Energy Networks with Generalized Fenchel-Young Losses.

Mathieu Blondel, Felipe Llinares-López, [...], Léonard Hussenot, [...]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Continuous Control with Action Quantization from Demonstrations.

[...], Léonard Hussenot, [...], Olivier Pietquin

Proceedings of the International Conference on Machine Learning, 2022

Offline Reinforcement Learning as Anti-exploration.

Shideh Rezaeifar, [...], Léonard Hussenot, [...], Olivier Pietquin, [...]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning.

[...], Léonard Hussenot, [...], Hanna Yakubovich, [...], Raphaël Marinier, Jeremiah Harmsen, Olivier Pietquin, [...]

CoRR, 2021

What Matters for Adversarial Imitation Learning?

[...], Léonard Hussenot, [...], Olivier Pietquin, Marcin Andrychowicz

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Hyperparameter Selection for Imitation Learning.

Léonard Hussenot, Marcin Andrychowicz, [...], Raphaël Marinier, Lukasz Stafiniak, [...], Olivier Pietquin

Proceedings of the 38th International Conference on Machine Learning, 2021

Offline Reinforcement Learning with Pseudometric Learning.

[...], Shideh Rezaeifar, [...], Léonard Hussenot, Olivier Pietquin, [...]

Proceedings of the 38th International Conference on Machine Learning, 2021

Primal Wasserstein Imitation Learning.

[...], Léonard Hussenot, [...], Olivier Pietquin

Proceedings of the 9th International Conference on Learning Representations, 2021

What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study.

Marcin Andrychowicz, [...], Raphaël Marinier, Léonard Hussenot, [...], Olivier Pietquin, Marcin Michalski, [...]

Proceedings of the 9th International Conference on Learning Representations, 2021

Show Me the Way: Intrinsic Motivation from Demonstrations.

Léonard Hussenot, [...], Olivier Pietquin

Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

2020

What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study.

Marcin Andrychowicz, [...], Raphaël Marinier, Léonard Hussenot, [...], Olivier Pietquin, Marcin Michalski, [...]

CoRR, 2020

CopyCAT: Taking Control of Neural Policies with Constant Attacks.

Léonard Hussenot, [...], Olivier Pietquin

Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

2019

Targeted Attacks on Deep Reinforcement Learning Agents through Adversarial Observations.

Léonard Hussenot, [...], Olivier Pietquin

CoRR, 2019
