Suvrit Sra
Orcid: 0000-0001-8516-4925Affiliations:
- Massachusetts Institute of Technology (MIT), Laboratory for Information and Decision Systems, Cambridge, MA, USA
- Max Planck Institute for Biological Cybernetics, Tübingen, Germany
- University of Texas at Austin, Department of Computer Sciences, Austin, TX, USA
According to our database1,
Suvrit Sra
authored at least 171 papers
between 2003 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on zbmath.org
-
on twitter.com
-
on orcid.org
-
on id.loc.gov
-
on d-nb.info
On csauthors.net:
Bibliography
2024
CoRR, 2024
Transformers Implement Functional Gradient Descent to Learn Non-Linear Functions In Context.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
2023
Sion's Minimax Theorem in Geodesic Metric Spaces and a Riemannian Extragradient Algorithm.
SIAM J. Optim., December, 2023
Transformers learn to implement preconditioned gradient descent for in-context learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Learning for Dynamics and Control Conference, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Toward Understanding State Representation Learning in MuZero: A Case Study in Linear Quadratic Gaussian Control.
Proceedings of the 62nd IEEE Conference on Decision and Control, 2023
2022
On a class of geodesically convex optimization problems solved via Euclidean MM methods.
CoRR, 2022
Proceedings of the 5th Symposium on Simplicity in Algorithms, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Learning for Dynamics and Control Conference, 2022
Neural Network Weights Do Not Converge to Stationary Points: An Invariant Measure Perspective.
Proceedings of the International Conference on Machine Learning, 2022
Beyond Worst-Case Analysis in Stochastic Approximation: Moment Estimation Improves Instance Complexity.
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the Conference on Learning Theory, 2-5 July 2022, London, UK., 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
CoRR, 2021
Three Operator Splitting with Subgradients, Stochastic Gradients, and Adaptive Learning Rates.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the Conference on Learning Theory, 2021
2020
An alternative to EM for Gaussian mixture models: batch and stochastic Riemannian optimization.
Math. Program., 2020
Frontiers Artif. Intell., 2020
CoRR, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Towards Minimax Optimal Reinforcement Learning in Factored Markov Decision Processes.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
SGD with shuffling: optimal rates without component convexity and large epoch requirements.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition.
Proceedings of the 37th International Conference on Machine Learning, 2020
Why Gradient Clipping Accelerates Training: A Theoretical Justification for Adaptivity.
Proceedings of the 8th International Conference on Learning Representations, 2020
Proceedings of the Conference on Learning Theory, 2020
Geodesically-convex optimization for averaging partially observed covariance matrices.
Proceedings of The 12th Asian Conference on Machine Learning, 2020
2019
CoRR, 2019
Analysis of Gradient Clipping and Adaptive Scaling with a Relaxed Smoothness Condition.
CoRR, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Small ReLU networks are powerful memorizers: a tight analysis of memorization capacity.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Small nonlinearities in activation functions create bad local minima in neural networks.
Proceedings of the 7th International Conference on Learning Representations, 2019
Proceedings of the 7th International Conference on Learning Representations, 2019
Acceleration in First Order Quasi-strongly Convex Optimization by ODE Discretization.
Proceedings of the 58th IEEE Conference on Decision and Control, 2019
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019
2018
J. Mach. Learn. Res., 2018
R-SPIDER: A Fast Riemannian Stochastic Optimization Algorithm with Curvature Independent Rate.
CoRR, 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
Proceedings of the Conference On Learning Theory, 2018
Proceedings of the Approximation, 2018
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018
2017
IEEE Trans. Neural Networks Learn. Syst., 2017
Frank-Wolfe methods for geodesically convex optimization with application to the matrix geometric mean.
CoRR, 2017
CoRR, 2017
CoRR, 2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017
2016
Comput. Stat. Data Anal., 2016
Proceedings of the 4th International Conference on Learning Representations, 2016
Fast Sampling for Strongly Rayleigh Measures with Application to Determinantal Point Processes.
CoRR, 2016
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016
Fast Mixing Markov Chains for Strongly Rayleigh Measures, DPPs, and Constrained Sampling.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016
Proceedings of the 33nd International Conference on Machine Learning, 2016
Proceedings of the 33nd International Conference on Machine Learning, 2016
Proceedings of the 33nd International Conference on Machine Learning, 2016
Proceedings of the 33nd International Conference on Machine Learning, 2016
Proceedings of the 33nd International Conference on Machine Learning, 2016
Proceedings of the 29th Conference on Learning Theory, 2016
Proceedings of the 55th IEEE Conference on Decision and Control, 2016
Proceedings of the 54th Annual Allerton Conference on Communication, 2016
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016
2015
SIAM J. Optim., 2015
CoRR, 2015
Large-scale randomized-coordinate descent methods with non-separable linear constraints.
Proceedings of the Thirty-First Conference on Uncertainty in Artificial Intelligence, 2015
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Proceedings of the 32nd International Conference on Machine Learning, 2015
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015
2014
IEEE Trans. Image Process., 2014
Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence, 2014
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014
Proceedings of the 31th International Conference on Machine Learning, 2014
Proceedings of the 31th International Conference on Machine Learning, 2014
Proceedings of the Computer Vision - ECCV 2014, 2014
Proceedings of the Tractability: Practical Approaches to Hard Problems, 2014
2013
Jensen-Bregman LogDet Divergence with Application to Efficient Similarity Search for Covariance Matrices.
IEEE Trans. Pattern Anal. Mach. Intell., 2013
Optim. Methods Softw., 2013
The multivariate Watson distribution: Maximum-likelihood estimation and other aspects.
J. Multivar. Anal., 2013
Geometric optimisation on positive definite matrices for elliptically contoured distributions.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013
2012
Data Min. Knowl. Discov., 2012
A short note on parameter approximation for von Mises-Fisher distributions: and a fast implementation of <i>I</i> <sub> <i>s</i> </sub>(<i>x</i>).
Comput. Stat., 2012
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012
A new metric on the manifold of kernel matrices with application to matrix geometric means.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012
2011
Generalized Dictionary Learning for Symmetric Positive Definite Matrices with Application to Nearest Neighbor Retrieval.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2011
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2011
Proceedings of the 28th International Conference on Machine Learning, 2011
Efficient similarity search for covariance matrices via the Jensen-Bregman LogDet Divergence.
Proceedings of the IEEE International Conference on Computer Vision, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
2010
SIAM J. Sci. Comput., 2010
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010
Multiframe blind deconvolution, super-resolution, and saturation correction via incremental EM.
Proceedings of the International Conference on Image Processing, 2010
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010
2009
Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, 2009
Proceedings of the 26th Annual International Conference on Machine Learning, 2009
Proceedings of the Algorithmic Learning Theory, 20th International Conference, 2009
2008
Fast Projection-Based Methods for the Least Squares Nonnegative Matrix Approximation Problem.
Stat. Anal. Data Min., 2008
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008
2007
Fast Newton-type Methods for the Least Squares Nonnegative Matrix Approximation Problem.
Proceedings of the Seventh SIAM International Conference on Data Mining, 2007
2006
Proceedings of the Knowledge Discovery in Databases: PKDD 2006, 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Proceedings of the Machine Learning: ECML 2006, 2006
2005
J. Mach. Learn. Res., 2005
Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005
2004
Proceedings of the Fourth SIAM International Conference on Data Mining, 2004
Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004
2003
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 24, 2003