Nino Vieillard

According to our database1, Nino Vieillard authored at least 19 papers between 2019 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Imitating Language via Scalable Inverse Reinforcement Learning.
CoRR, 2024

Gemma 2: Improving Open Language Models at a Practical Size.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2024

BOND: Aligning LLMs with Best-of-N Distillation.
CoRR, 2024

WARP: On the Benefits of Weight Averaged Rewarded Policies.
CoRR, 2024

WARM: On the Benefits of Weight Averaged Reward Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
GKD: Generalized Knowledge Distillation for Auto-regressive Sequence Models.
CoRR, 2023

Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice.
Proceedings of the International Conference on Machine Learning, 2023

Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal.
CoRR, 2022

Implicitly Regularized RL with Implicit Q-values.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Offline Reinforcement Learning as Anti-exploration.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Offline Reinforcement Learning with Pseudometric Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Leverage the Average: an Analysis of Regularization in RL.
CoRR, 2020

Munchausen Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Leverage the Average: an Analysis of KL Regularization in Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Momentum in Reinforcement Learning.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

Deep Conservative Policy Iteration.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
On Connections between Constrained Optimization and Reinforcement Learning.
CoRR, 2019


  Loading...