Aleksandar Botev
Orcid: 0000-0001-9021-1124
According to our database1,
Aleksandar Botev
authored at least 19 papers
between 2017 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
CoRR, 2024
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models.
CoRR, 2024
CoRR, 2024
2023
CoRR, 2023
Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
2022
CoRR, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021
2020
PhD thesis, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the 8th International Conference on Learning Representations, 2020
2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
2017
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017
Nesterov's accelerated gradient and momentum as approximations to regularised update descent.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
Complementary Sum Sampling for Likelihood Approximation in Large Scale Classification.
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017