Johannes von Oswald

According to our database1, Johannes von Oswald authored at least 23 papers between 2020 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Interpretability of Learning Algorithms Encoded in Deep Neural Networks.
PhD thesis, 2024

Weight decay induces low-rank attention layers.
CoRR, 2024

Multi-agent cooperation through learning-aware policy gradients.
CoRR, 2024

Learning Randomized Algorithms with Transformers.
CoRR, 2024

When can transformers compositionally generalize in-context?
CoRR, 2024

State Soup: In-Context Skill Learning, Retrieval and Mixing.
CoRR, 2024

Linear Transformers are Versatile In-Context Learners.
CoRR, 2024

Discovering modular solutions that generalize compositionally.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Uncovering mesa-optimization algorithms in Transformers.
CoRR, 2023

Gated recurrent neural networks discover attention.
CoRR, 2023

Transformers Learn In-Context by Gradient Descent.
Proceedings of the International Conference on Machine Learning, 2023

2022
Random initialisations performing above chance and how to find them.
CoRR, 2022

The least-control principle for learning at equilibrium.
CoRR, 2022

A contrastive rule for meta-learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

The least-control principle for local learning at equilibrium.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Disentangling the Predictive Variance of Deep Ensembles through the Neural Tangent Kernel.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Learning where to learn: Gradient sparsity in meta and continual learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Posterior Meta-Replay for Continual Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Neural networks with late-phase weights.
Proceedings of the 9th International Conference on Learning Representations, 2021

Continual learning in recurrent neural networks.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Economical ensembles with hypernetworks.
CoRR, 2020

Continual Learning in Recurrent Neural Networks with Hypernetworks.
CoRR, 2020

Continual learning with hypernetworks.
Proceedings of the 8th International Conference on Learning Representations, 2020


  Loading...