Piotr Milos

According to our database1, Piotr Milos authored at least 37 papers between 2017 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Repurposing Language Models into Embedding Models: Finding the Compute-Optimal Recipe.
CoRR, 2024

What Matters in Hierarchical Search for Combinatorial Reasoning Problems?
CoRR, 2024

Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control.
CoRR, 2024

tsGT: Stochastic Time Series Modeling With Transformer.
CoRR, 2024

Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Magnushammer: A Transformer-Based Approach to Premise Selection.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Analysing The Impact of Sequence Composition on Language Model Pre-Training.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Structured Packing in LLM Training Improves Long Context Utilization.
CoRR, 2023

Exploring Continual Learning of Diffusion Models.
CoRR, 2023

Magnushammer: A Transformer-based Approach to Premise Selection.
CoRR, 2023

Focused Transformer: Contrastive Training for Context Scaling.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Trust Your 𝛁: Gradient-based Intervention Targeting for Causal Discovery.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

The Tunnel Effect: Building Data Representations in Deep Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

The Effectiveness of World Models for Continual Reinforcement Learning.
Proceedings of the Conference on Lifelong Learning Agents, 2023

2022
The Surprising Effectiveness of Latent World Models for Continual Reinforcement Learning.
CoRR, 2022

Disentangling Transfer in Continual Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Thor: Wielding Hammers to Integrate Language Models and Automated Theorem Provers.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Planning and Learning using Adaptive Entropy Tree Search.
Proceedings of the International Joint Conference on Neural Networks, 2022

Off-Policy Correction For Multi-Agent Reinforcement Learning.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

2021
Continuous Control With Ensemble Deep Deterministic Policy Gradients.
CoRR, 2021

Robust and Efficient Planning using Adaptive Entropy Tree Search.
CoRR, 2021

Continual World: A Robotic Benchmark For Continual Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Catalytic Role Of Noise And Necessity Of Inductive Biases In The Emergence Of Compositional Communication.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Subgoal Search For Complex Reasoning Tasks.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Trust, but Verify: Alleviating Pessimistic Errors in Model-Based Exploration.
Proceedings of the International Joint Conference on Neural Networks, 2021

Structure and Randomness in Planning and Reinforcement Learning.
Proceedings of the International Joint Conference on Neural Networks, 2021

2020
CARLA Real Traffic Scenarios - novel training ground and benchmark for autonomous driving.
CoRR, 2020

Simulation-Based Reinforcement Learning for Real-World Autonomous Driving.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Model Based Reinforcement Learning for Atari.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
Uncertainty-sensitive Learning and Planning with Ensembles.
CoRR, 2019

Developmentally motivated emergence of compositional communication via template transfer.
CoRR, 2019

Model-Based Reinforcement Learning for Atari.
CoRR, 2019

2018
Expert-augmented actor-critic for ViZDoom and Montezumas Revenge.
CoRR, 2018

Learning to Run challenge solutions: Adapting reinforcement learning methods for neuromusculoskeletal environments.
CoRR, 2018

2017
Hierarchical Reinforcement Learning with Parameters.
Proceedings of the 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, 2017


  Loading...