Geoffrey Cideron

According to our database1, Geoffrey Cideron authored at least 13 papers between 2019 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Diversity-Rewarded CFG Distillation.
CoRR, 2024

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning.
CoRR, 2024

BOND: Aligning LLMs with Best-of-N Distillation.
CoRR, 2024

WARM: On the Benefits of Weight Averaged Reward Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

MusicRL: Aligning Music Generation to Human Preferences.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Conditional Language Policy: A General Framework For Steerable Multi-Objective Finetuning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
Get Back Here: Robust Imitation by Return-to-Distribution Planning.
CoRR, 2023

Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
vec2text with Round-Trip Translations.
CoRR, 2022

Diversity policy gradient for sample efficient quality-diversity optimization.
Proceedings of the GECCO '22: Genetic and Evolutionary Computation Conference, Boston, Massachusetts, USA, July 9, 2022

2020
QD-RL: Efficient Mixing of Quality and Diversity in Reinforcement Learning.
CoRR, 2020

HIGhER: Improving instruction following with Hindsight Generation for Experience Replay.
Proceedings of the 2020 IEEE Symposium Series on Computational Intelligence, 2020

2019
Self-Educated Language Agent with Hindsight Experience Replay for Instruction Following.
Proceedings of the Visually Grounded Interaction and Language (ViGIL), 2019


  Loading...