Yassir Akram

According to our database1, Yassir Akram authored at least 7 papers between 2022 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Weight decay induces low-rank attention layers.
CoRR, 2024

Learning Randomized Algorithms with Transformers.
CoRR, 2024

When can transformers compositionally generalize in-context?
CoRR, 2024

Attention as a Hypernetwork.
CoRR, 2024

Discovering modular solutions that generalize compositionally.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Gated recurrent neural networks discover attention.
CoRR, 2023

2022
Random initialisations performing above chance and how to find them.
CoRR, 2022


  Loading...