Antonio Orvieto
According to our database1,
Antonio Orvieto
authored at least 39 papers
between 2019 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
Gradient Descent on Logistic Regression with Non-Separable Data and Large Step Sizes.
CoRR, 2024
Recurrent neural networks: vanishing and exploding gradients are not the end of the story.
CoRR, 2024
Understanding the differences in Foundation Models: Attention, State Space Models, and Recurrent Neural Networks.
CoRR, 2024
Why do Learning Rates Transfer? Reconciling Optimization and Scaling Limits for Deep Learning.
CoRR, 2024
Universality of Linear Recurrences Followed by Non-linear Projections: Finite-Width Guarantees and Benefits of Complex Eigenvalues.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024
2023
CoRR, 2023
CoRR, 2023
On the effectiveness of Randomized Signatures as Reservoir for Learning Rough Dynamics.
Proceedings of the International Joint Conference on Neural Networks, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Achieving a Better Stability-Plasticity Trade-off via Auxiliary Networks in Continual Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023
2022
Dynamics of SGD with Stochastic Polyak Stepsizes: Truly Adaptive Variants and Convergence to Exact Solution.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Signal Propagation in Transformers: Theoretical Perspectives and the Role of Rank Collapse.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022
2021
Vanishing Curvature and the Power of Adaptive Methods in Randomly Initialized Deep Networks.
CoRR, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Revisiting the Role of Euler Numerical Integration on Acceleration and Stability in Convex Optimization.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021
2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020
2019
Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019