Léonard Hussenot

According to our database1, Léonard Hussenot authored at least 26 papers between 2019 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Gemma 2: Improving Open Language Models at a Practical Size.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2024

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning.
CoRR, 2024

BOND: Aligning LLMs with Best-of-N Distillation.
CoRR, 2024

WARP: On the Benefits of Weight Averaged Rewarded Policies.
CoRR, 2024

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models.
CoRR, 2024

Gemma: Open Models Based on Gemini Research and Technology.
CoRR, 2024

WARM: On the Benefits of Weight Averaged Reward Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

MusicRL: Aligning Music Generation to Human Preferences.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Conditional Language Policy: A General Framework For Steerable Multi-Objective Finetuning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
Get Back Here: Robust Imitation by Return-to-Distribution Planning.
CoRR, 2023

Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Apprentissage par démonstrations : transfert des motivations humaines aux algorithmes. (Apprenticeship learning : transferring human motivations to artificial agents).
PhD thesis, 2022

vec2text with Round-Trip Translations.
CoRR, 2022

Learning Energy Networks with Generalized Fenchel-Young Losses.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Continuous Control with Action Quantization from Demonstrations.
Proceedings of the International Conference on Machine Learning, 2022

Offline Reinforcement Learning as Anti-exploration.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning.
CoRR, 2021

What Matters for Adversarial Imitation Learning?
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Hyperparameter Selection for Imitation Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

Offline Reinforcement Learning with Pseudometric Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

Primal Wasserstein Imitation Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study.
Proceedings of the 9th International Conference on Learning Representations, 2021

Show Me the Way: Intrinsic Motivation from Demonstrations.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

2020
What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study.
CoRR, 2020

CopyCAT: : Taking Control of Neural Policies with Constant Attacks.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

2019
Targeted Attacks on Deep Reinforcement Learning Agents through Adversarial Observations.
CoRR, 2019


  Loading...