Darshil Doshi

According to our database1, Darshil Doshi authored at least 6 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of five.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Grokking Modular Polynomials.
CoRR, 2024

Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks.
CoRR, 2024

To Grok or not to Grok: Disentangling Generalization and Memorization on Corrupted Algorithmic Datasets.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Critical Initialization of Wide and Deep Neural Networks using Partial Jacobians: General Theory and Applications.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
AutoInit: Automatic Initialization via Jacobian Tuning.
CoRR, 2022

2021
Critical initialization of wide and deep neural networks through partial Jacobians: general theory and applications to LayerNorm.
CoRR, 2021


  Loading...