Olivier Bachem

Affiliations:
  • ETH Zurich, Department of Computer Science, Switzerland


According to our database1, Olivier Bachem authored at least 61 papers between 2015 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Diversity-Rewarded CFG Distillation.
CoRR, 2024

Imitating Language via Scalable Inverse Reinforcement Learning.
CoRR, 2024

Gemma 2: Improving Open Language Models at a Practical Size.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2024

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning.
CoRR, 2024

BOND: Aligning LLMs with Best-of-N Distillation.
CoRR, 2024

WARP: On the Benefits of Weight Averaged Rewarded Policies.
CoRR, 2024

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models.
CoRR, 2024

WARM: On the Benefits of Weight Averaged Reward Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024


MusicRL: Aligning Music Generation to Human Preferences.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Conditional Language Policy: A General Framework For Steerable Multi-Objective Finetuning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
Nash Learning from Human Feedback.
CoRR, 2023

GKD: Generalized Knowledge Distillation for Auto-regressive Sequence Models.
CoRR, 2023

Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
C3PO: Learning to Achieve Arbitrary Goals via Massively Entropic Pretraining.
CoRR, 2022

vec2text with Round-Trip Translations.
CoRR, 2022

The Role of Pretrained Representations for the OOD Generalization of RL Agents.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Decoding a Neural Retriever's Latent Space for Query Suggestion.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Concave Utility Reinforcement Learning: The Mean-field Game Viewpoint.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

A general class of surrogate functions for stable and efficient reinforcement learning.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Offline Reinforcement Learning as Anti-exploration.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization.
CoRR, 2021

A functional mirror ascent view of policy gradient methods with function approximation.
CoRR, 2021

Representation Learning for Out-Of-Distribution Generalization in Reinforcement Learning.
CoRR, 2021

Scaling Hierarchical Agglomerative Clustering to Billion-sized Datasets.
CoRR, 2021

What Matters for Adversarial Imitation Learning?
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Brax - A Differentiable Physics Engine for Large Scale Rigid Body Simulation.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Hyperparameter Selection for Imitation Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
A Sober Look at the Unsupervised Learning of Disentangled Representations and their Evaluation.
J. Mach. Learn. Res., 2020

What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study.
CoRR, 2020

Automatic Shortcut Removal for Self-Supervised Representation Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

Weakly-Supervised Disentanglement Without Compromises.
Proceedings of the 37th International Conference on Machine Learning, 2020

Disentangling Factors of Variations Using Few Labels.
Proceedings of the 8th International Conference on Learning Representations, 2020

Precision-Recall Curves Using Information Divergence Frontiers.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

A Commentary on the Unsupervised Learning of Disentangled Representations.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Google Research Football: A Novel Reinforcement Learning Environment.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
The Visual Task Adaptation Benchmark.
CoRR, 2019

Evaluating Generative Models Using Divergence Frontiers.
CoRR, 2019

Disentangling Factors of Variation Using Few Labels.
CoRR, 2019

Are Disentangled Representations Helpful for Abstract Visual Reasoning?
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

On the Fairness of Disentangled Representations.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

On the Transfer of Inductive Bias from Simulation to the Real World: a New Disentanglement Dataset.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

High-Fidelity Image Generation With Fewer Labels.
Proceedings of the 36th International Conference on Machine Learning, 2019

Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations.
Proceedings of the 36th International Conference on Machine Learning, 2019

2018
Recent Advances in Autoencoder-Based Representation Learning.
CoRR, 2018

Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations.
CoRR, 2018

Assessing Generative Models via Precision and Recall.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Scalable k -Means Clustering via Lightweight Coresets.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

One-shot Coresets: The Case of k-Clustering.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

2017
Uniform Deviation Bounds for Unbounded Loss Functions like k-Means.
CoRR, 2017

Scalable and Distributed Clustering via Lightweight Coresets.
CoRR, 2017

Uniform Deviation Bounds for k-Means Clustering.
Proceedings of the 34th International Conference on Machine Learning, 2017

Distributed and Provably Good Seedings for k-Means in Constant Rounds.
Proceedings of the 34th International Conference on Machine Learning, 2017

2016
Fast and Provably Good Seedings for k-Means.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Linear-Time Outlier Detection via Sensitivity.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Horizontally Scalable Submodular Maximization.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Strong Coresets for Hard and Soft Bregman Clustering with Applications to Exponential Family Mixtures.
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

Approximate K-Means++ in Sublinear Time.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Coresets for Nonparametric Estimation - the Case of DP-Means.
Proceedings of the 32nd International Conference on Machine Learning, 2015


  Loading...