Imanol Schlag

According to our database1, Imanol Schlag authored at least 18 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Understanding and Minimising Outlier Features in Neural Network Training.
CoRR, 2024

Language Imbalance Can Boost Cross-lingual Generalisation.
CoRR, 2024

Navigating Scaling Laws: Compute Optimality in Adaptive Model Training.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

On the Effect of (Near) Duplicate Subwords in Language Modelling.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute.
CoRR, 2023

Mindstorms in Natural Language-Based Societies of Mind.
CoRR, 2023

Large Language Model Programs.
CoRR, 2023

2022
Solving Quantitative Reasoning Problems with Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Block-Recurrent Transformers.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Modern Self-Referential Weight Matrix That Learns to Modify Itself.
Proceedings of the International Conference on Machine Learning, 2022

2021
Improving Baselines in the Wild.
CoRR, 2021

Linear Transformers Are Secretly Fast Weight Memory Systems.
CoRR, 2021

Going Beyond Linear Transformers with Recurrent Fast Weight Programmers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Linear Transformers Are Secretly Fast Weight Programmers.
Proceedings of the 38th International Conference on Machine Learning, 2021

Learning Associative Inference Using Fast Weight Memory.
Proceedings of the 9th International Conference on Learning Representations, 2021

2019
Enhancing the Transformer with Explicit Relational Encoding for Math Problem Solving.
CoRR, 2019

2018
Learning to Reason with Third Order Tensor Products.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2017
Ancient Roman Coin Recognition in the Wild Using Deep Learning Based Recognition of Artistically Depicted Face Profiles.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017


  Loading...