Libin Zhu
According to our database, Libin Zhu authored at least 15 papers between 2020 and 2024.
Bibliography
2024
PhD thesis, 2024
Emergence in non-neural models: grokking modular arithmetic via average gradient outer product.
CoRR, 2024
Catapults in SGD: spikes in the training loss and their impact on generalization through feature learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
2023
Proceedings of the Uncertainty in Artificial Intelligence, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
2022
CoRR, 2022
CoRR, 2022
Transition to Linearity of General Neural Networks with Directed Acyclic Graph Architecture.
CoRR, 2022
Transition to Linearity of Wide Neural Networks is an Emerging Property of Assembling Weak Models.
CoRR, 2022
Transition to Linearity of General Neural Networks with Directed Acyclic Graph Architecture.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Transition to Linearity of Wide Neural Networks is an Emerging Property of Assembling Weak Models.
Proceedings of the Tenth International Conference on Learning Representations, 2022
2020
Toward a theory of optimization for over-parameterized systems of non-linear equations: the lessons of deep learning.
CoRR, 2020
On the linearity of large non-linear models: when and why the tangent kernel is constant.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020