Darshil Doshi
According to our database1,
Darshil Doshi
authored at least 6 papers
between 2021 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks.
CoRR, 2024
To Grok or not to Grok: Disentangling Generalization and Memorization on Corrupted Algorithmic Datasets.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
2023
Critical Initialization of Wide and Deep Neural Networks using Partial Jacobians: General Theory and Applications.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
2022
2021
Critical initialization of wide and deep neural networks through partial Jacobians: general theory and applications to LayerNorm.
CoRR, 2021