2024

Data Release for: Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning.

Dataset, April, 2024

Learning agile soccer skills for a bipedal robot with deep reinforcement learning.

Sci. Robotics, 2024

The Effect of Model Size on LLM Post-hoc Explainability via LIME.

CoRR, 2024

On scalable oversight with weak LLMs judging strong LLMs.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

The Probabilities Also Matter: A More Faithful Metric for Faithfulness of Free-Text Explanations in Large Language Models.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

2022

Figure Data for the paper "From Motor Control to Team Play in Simulated Humanoid Football".

Dataset, August, 2022

From motor control to team play in simulated humanoid football.

Sci. Robotics, 2022

Solving math word problems with process- and outcome-based feedback.

CoRR, 2022

Imitate and Repurpose: Learning Reusable Robot Movement Skills From Human and Animal Behaviors.

CoRR, 2022

2021

Data-efficient Hindsight Off-policy Option Learning.

Proceedings of the 38th International Conference on Machine Learning, 2021

Towards Real Robot Learning in the Wild: A Case Study in Bipedal Locomotion.

Saran Tunyasuvunakool

Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK, 2021

2020

"What, not how": Solving an under-actuated insertion task from scratch.

CoRR, 2020

Simple Sensor Intentions for Exploration.

Tim Hertweck

Martin A. Riedmiller

Michael Bloesch

Jost Tobias Springenberg

CoRR, 2020

Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning.

Noah Y. Siegel

Jost Tobias Springenberg

CoRR, 2020

Compositional Transfer in Hierarchical Reinforcement Learning.

Markus Wulfmeier

Abbas Abdolmaleki

Roland Hafner

Jost Tobias Springenberg

Proceedings of the Robotics: Science and Systems XVI, 2020

Critic Regularized Regression.

Jost Tobias Springenberg

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Keep Doing What Worked: Behavior Modelling Priors for Offline Reinforcement Learning.

Noah Y. Siegel

Jost Tobias Springenberg

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models.

Arunkumar Byravan

Jost Tobias Springenberg

CoRR, 2019

Regularized Hierarchical Policies for Compositional Transfer in Robotics.

Markus Wulfmeier

Abbas Abdolmaleki

Roland Hafner

Jost Tobias Springenberg

CoRR, 2019

Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models.

Arunkumar Byravan

Jost Tobias Springenberg

Proceedings of the 3rd Annual Conference on Robot Learning, 2019