Sidak Pal Singh

According to our database1, Sidak Pal Singh authored at least 24 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
What Does It Mean to Be a Transformer? Insights from a Theoretical Hessian Analysis.
CoRR, 2024

Local vs Global continual learning.
CoRR, 2024

Landscaping Linear Mode Connectivity.
CoRR, 2024

Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends.
CoRR, 2024

Towards Meta-Pruning via Optimal Transport.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Some Fundamental Aspects about Lipschitz Continuity of Neural Networks.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Transformer Fusion with Optimal Transport.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers (Student Abstract).
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers.
CoRR, 2023

Towards guarantees for parameter isolation in continual learning.
CoRR, 2023

On the curvature of the loss landscape.
CoRR, 2023

Some Fundamental Aspects about Lipschitz Continuity of Neural Network Functions.
CoRR, 2023

The Hessian perspective into the Nature of Convolutional Neural Networks.
Proceedings of the International Conference on Machine Learning, 2023

2022
Signal Propagation in Transformers: Theoretical Perspectives and the Role of Rank Collapse.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Phenomenology of Double Descent in Finite-Width Neural Networks.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Analytic Insights into Structure and Rank of Neural Network Hessian Maps.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
WoodFisher: Efficient second-order approximations for model compression.
CoRR, 2020

Model Fusion via Optimal Transport.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

WoodFisher: Efficient Second-Order Approximation for Neural Network Compression.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Context Mover's Distance & Barycenters: Optimal Transport of Contexts for Building Representations.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

2019
GLOSS: Generative Latent Optimization of Sentence Representations.
CoRR, 2019

2018
Wasserstein is all you need.
CoRR, 2018

2017
RaaS and Hierarchical Aggregation Revisited.
Proceedings of the 2017 IEEE International Conference on Web Services, 2017

2016
SL - FII: Syntactic and Lexical Constraints with Frequency based Iterative Improvement for Disease Mention Recognition in News Headlines.
Proceedings of the Workshop on Advances in Bioinformatics and Artificial Intelligence: Bridging the Gap co-located with 25th International Joint Conference on Artificial Intelligence (IJCAI 2016), 2016


  Loading...