Michael E. Sander

According to our database1, Michael E. Sander authored at least 8 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Towards Understanding the Universality of Transformers for Next-Token Prediction.
CoRR, 2024

How do Transformers Perform In-Context Autoregressive Learning ?
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Implicit regularization of deep residual networks towards neural ODEs.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Fast, Differentiable and Sparse Top-k: a Convex Analysis Perspective.
Proceedings of the International Conference on Machine Learning, 2023

2022
Do Residual Neural Networks discretize Neural Ordinary Differential Equations?
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Vision Transformers provably learn spatial structure.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sinkformers: Transformers with Doubly Stochastic Attention.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
Momentum Residual Neural Networks.
Proceedings of the 38th International Conference on Machine Learning, 2021


  Loading...