Sanjeev Arora

Orcid: 0000-0002-8636-9970

Affiliations:
  • Princeton University, NJ, USA


According to our database1, Sanjeev Arora authored at least 166 papers between 1990 and 2024.

Collaborative distances:
  • Dijkstra number2 of three.
  • Erdős number3 of two.

Awards

ACM Fellow

ACM Fellow 2008, "For foundational work on probabilistically checkable proofs and approximate solutions to NP-hard optimization problems.".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Can Models Learn Skill Composition from Examples?
CoRR, 2024

Instruct-SkillMix: A Powerful Pipeline for LLM Instruction Tuning.
CoRR, 2024

ConceptMix: A Compositional Image Generation Benchmark with Controllable Difficulty.
CoRR, 2024

AI-Assisted Generation of Difficult Math Questions.
CoRR, 2024

CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs.
CoRR, 2024

Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving.
CoRR, 2024

Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates.
CoRR, 2024

Language Models as Science Tutors.
CoRR, 2024

From Word-prediction to Complex Skills: Compositional Thinking and Metacognition in LLMs.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

LESS: Selecting Influential Data for Targeted Instruction Tuning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Trainable Transformer in Transformer.
Proceedings of the Forty-first International Conference on Machine Learning, 2024


SKILL-MIX: a Flexible and Expandable Family of Evaluations for AI Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

A Quadratic Synchronization Rule for Distributed Deep Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Unlearning via Sparse Representations.
CoRR, 2023

A Theory for Emergence of Complex Skills in Language Models.
CoRR, 2023

Fine-Tuning Language Models with Just Forward Passes.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Task-Specific Skill Localization in Fine-tuned Language Models.
Proceedings of the International Conference on Machine Learning, 2023

A Kernel-Based View of Language Model Fine-Tuning.
Proceedings of the International Conference on Machine Learning, 2023

Understanding Influence Functions and Datamodels via Harmonic Analysis.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Why (and When) does Local SGD Generalize Better than SGD?
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Do Transformers Parse while Predicting the Masked Word?
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Adaptive Gradient Methods with Local Guarantees.
CoRR, 2022

On the SDEs and Scaling Rules for Adaptive Gradient Algorithms.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Understanding the Generalization Benefit of Normalization Layers: Sharpness Reduction.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

New Definitions and Evaluations for Saliency Methods: Staying Intrinsic, Complete and Sound.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Implicit Bias of Gradient Descent on Reparametrized Models: On Equivalence to Mirror Descent.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Understanding Contrastive Learning Requires Incorporating Inductive Biases.
Proceedings of the International Conference on Machine Learning, 2022

Understanding Gradient Descent on the Edge of Stability in Deep Learning.
Proceedings of the International Conference on Machine Learning, 2022

On Predicting Generalization using GANs.
Proceedings of the Tenth International Conference on Learning Representations, 2022

What Happens after SGD Reaches Zero Loss? --A Mathematical Framework.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Rip van Winkle's Razor: A Simple Estimate of Overfit to Test Data.
CoRR, 2021

Technical perspective: Why don't today's deep nets overfit to their training data?
Commun. ACM, 2021

Opening the Black Box of Deep Learning: Some Lessons and Take-aways.
Proceedings of the SIGMETRICS '21: ACM SIGMETRICS / International Conference on Measurement and Modeling of Computer Systems, 2021

Gradient Descent on Two-layer Nets: Margin Maximization and Simplicity Bias.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

On the Validity of Modeling SGD with Stochastic Differential Equations (SDEs).
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Evaluating Gradient Inversion Attacks and Defenses in Federated Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks.
Proceedings of the 9th International Conference on Learning Representations, 2021

Why Are Convolutional Nets More Sample-Efficient than Fully-Connected Nets?
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Privacy-preserving Learning via Deep Net Pruning.
CoRR, 2020

Over-parameterized Adversarial Training: An Analysis Overcoming the Curse of Dimensionality.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Reconciling Modern Deep Learning with Traditional Optimization Analyses: The Intrinsic Learning Rate.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

A Sample Complexity Separation between Non-Convex and Convex Meta-Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

InstaHide: Instance-hiding Schemes for Private Distributed Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

Provable Representation Learning for Imitation Learning via Bi-level Optimization.
Proceedings of the 37th International Conference on Machine Learning, 2020

Harnessing the Power of Infinitely Wide Deep Nets on Small-data Tasks.
Proceedings of the 8th International Conference on Learning Representations, 2020

An Exponential Learning Rate Schedule for Deep Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

The Quest for Mathematical Understanding of Deep Learning (Invited Talk).
Proceedings of the 40th IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science, 2020

TextHide: Tackling Data Privacy for Language Understanding Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

2019
Enhanced Convolutional Neural Tangent Kernels.
CoRR, 2019

A Simple Saliency Method That Passes the Sanity Checks.
CoRR, 2019

Explaining Landscape Connectivity of Low-cost Solutions for Multilayer Nets.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

On Exact Computation with an Infinitely Wide Neural Net.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Implicit Regularization in Deep Matrix Factorization.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

A Theoretical Analysis of Contrastive Unsupervised Representation Learning.
Proceedings of the 36th International Conference on Machine Learning, 2019

Fine-Grained Analysis of Optimization and Generalization for Overparameterized Two-Layer Neural Networks.
Proceedings of the 36th International Conference on Machine Learning, 2019

Theoretical Analysis of Auto Rate-Tuning by Batch Normalization.
Proceedings of the 7th International Conference on Learning Representations, 2019

A Convergence Analysis of Gradient Descent for Deep Linear Neural Networks.
Proceedings of the 7th International Conference on Learning Representations, 2019

2018
Linear Algebraic Structure of Word Senses, with Applications to Polysemy.
Trans. Assoc. Comput. Linguistics, 2018

Mapping between fMRI responses to movies and their natural language annotations.
NeuroImage, 2018

Learning topic models - provably and efficiently.
Commun. ACM, 2018

On the Optimization of Deep Networks: Implicit Acceleration by Overparameterization.
Proceedings of the 35th International Conference on Machine Learning, 2018

Stronger Generalization Bounds for Deep Nets via a Compression Approach.
Proceedings of the 35th International Conference on Machine Learning, 2018

Do GANs learn the distribution? Some Theory and Empirics.
Proceedings of the 6th International Conference on Learning Representations, 2018

A Compressed Sensing View of Unsupervised Text Embeddings, Bag-of-n-Grams, and LSTMs.
Proceedings of the 6th International Conference on Learning Representations, 2018

Towards Provable Control for Unknown Linear Dynamical Systems.
Proceedings of the 6th International Conference on Learning Representations, 2018

An Analysis of the t-SNE Algorithm for Data Visualization.
Proceedings of the Conference On Learning Theory, 2018

A La Carte Embedding: Cheap but Effective Induction of Semantic Feature Vectors.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Theoretical limitations of Encoder-Decoder GAN architectures.
CoRR, 2017

Extending and Improving Wordnet via Unsupervised Word Embeddings.
CoRR, 2017

Do GANs actually learn the distribution? An empirical study.
CoRR, 2017

Provable benefits of representation learning.
CoRR, 2017

Provable learning of noisy-OR networks.
Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing, 2017

Generalization and Equilibrium in Generative Adversarial Nets (GANs).
Proceedings of the 34th International Conference on Machine Learning, 2017

A Simple but Tough-to-Beat Baseline for Sentence Embeddings.
Proceedings of the 5th International Conference on Learning Representations, 2017

On the Ability of Neural Nets to Express Distributions.
Proceedings of the 30th Conference on Learning Theory, 2017

2016
A Latent Variable Model Approach to PMI-based Word Embeddings.
Trans. Assoc. Comput. Linguistics, 2016

Computing a Nonnegative Matrix Factorization - Provably.
SIAM J. Comput., 2016

A Combinatorial, Primal-Dual Approach to Semidefinite Programs.
J. ACM, 2016

Mapping Between Natural Movie fMRI Responses and Word-Sequence Representations.
CoRR, 2016

Provable Algorithms for Inference in Topic Models.
Proceedings of the 33nd International Conference on Machine Learning, 2016

2015
Subexponential Algorithms for Unique Games and Related Problems.
J. ACM, 2015

Making strategy process intelligent with business intelligence: an empirical investigation.
Int. J. Data Anal. Tech. Strateg., 2015

Why are deep nets reversible: A simple theory, with implications for training.
CoRR, 2015

Random Walks on Context Spaces: Towards an Explanation of the Mysteries of Semantic Word Embeddings.
CoRR, 2015

Provable ICA with Unknown Gaussian Noise, and Implications for Gaussian Mixtures and Autoencoders.
Algorithmica, 2015

Overcoming Intractability in Unsupervised Learning (Invited Talk).
Proceedings of the 32nd International Symposium on Theoretical Aspects of Computer Science, 2015

Simple, Efficient, and Neural Algorithms for Sparse Coding.
Proceedings of The 28th Conference on Learning Theory, 2015

2014
Thoughts on Paper Publishing in the Digital Age.
Bull. EATCS, 2014

More Algorithms for Provable Dictionary Learning.
CoRR, 2014

Technology Enhanced Learning in Addiction Mental Health: Developing a Virtual Knowledge Network: NIMHANS ECHO.
Proceedings of the Sixth IEEE International Conference on Technology for Education, 2014

Provable Bounds for Learning Some Deep Representations.
Proceedings of the 31th International Conference on Machine Learning, 2014

New Algorithms for Learning Incoherent and Overcomplete Dictionaries.
Proceedings of The 27th Conference on Learning Theory, 2014

2013
A Practical Algorithm for Topic Modeling with Provable Guarantees.
Proceedings of the 30th International Conference on Machine Learning, 2013

Towards a Better Approximation for Sparsest Cut?
Proceedings of the 54th Annual IEEE Symposium on Foundations of Computer Science, 2013

2012
The Multiplicative Weights Update Method: a Meta-Algorithm and Applications.
Theory Comput., 2012

Message-Passing Algorithms and Improved LP Decoding.
IEEE Trans. Inf. Theory, 2012

Local Versus Global Properties of Metric Spaces.
SIAM J. Comput., 2012

Testing Permanent Oracles - Revisited.
Electron. Colloquium Comput. Complex., 2012

The Gödel Price 2013. Call for Nominations.
Bull. EATCS, 2012

Finding overlapping communities in social networks: toward a rigorous approach.
Proceedings of the 13th ACM Conference on Electronic Commerce, 2012

<i>Modified</i> hierarchical privacy-aware role based access control model.
Proceedings of the Research in Applied Computation Symposium, 2012

"Provable ICA with Unknown Gaussian Noise, with Implications for Gaussian Mixtures and Autoencoders".
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Learning Topic Models - Going beyond SVD.
Proceedings of the 53rd Annual IEEE Symposium on Foundations of Computer Science, 2012

2011
A Reformulation of the Arora-Rao-Vazirani Structure Theorem
CoRR, 2011

Towards Strong Nonapproximability Results in the Lovász-Schrijver Hierarchy.
Comput. Complex., 2011

Computational complexity and information asymmetry in financial products.
Commun. ACM, 2011

New Algorithms for Learning in Presence of Errors.
Proceedings of the Automata, Languages and Programming - 38th International Colloquium, 2011

Visualizing conductive and convective heat transfer using thermographic techniques.
Proceedings of the 2011 Frontiers in Education Conference, 2011

New Tools for Graph Coloring.
Proceedings of the Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, 2011

2010
O(sqrt(log(n)) Approximation to SPARSEST CUT in Õ(n<sup>2</sup>) Time.
SIAM J. Comput., 2010

Improved Algorithms for Unique Games via Divide and Conquer.
Electron. Colloquium Comput. Complex., 2010

Learning Parities with Structured Noise.
Electron. Colloquium Comput. Complex., 2010

Limits of Approximation Algorithms: PCPs and Unique Games (DIMACS Tutorial Lecture Notes)
CoRR, 2010

Semidefinite Programming and Approximation Algorithms: A Survey.
Proceedings of the Algorithm Theory, 2010

Computational Complexity and Information Asymmetry in Financial Products (Extended Abstract).
Proceedings of the Innovations in Computer Science, 2010

2009
Expander flows, geometric embeddings and graph partitioning.
J. ACM, 2009

Towards a Study of Low-Complexity Graphs.
Proceedings of the Automata, Languages and Programming, 36th International Colloquium, 2009

Computational Complexity - A Modern Approach.
Cambridge University Press, ISBN: 978-0-521-42426-4, 2009

2008
Geometry, flows, and graph-partitioning algorithms.
Commun. ACM, 2008

Unique games on expanding constraint graphs are easy: extended abstract.
Proceedings of the 40th Annual ACM Symposium on Theory of Computing, 2008

2007
Fréchet Embeddings of Negative Type Metrics.
Discret. Comput. Geom., 2007

2006
Proving Integrality Gaps without Knowing the Linear Program.
Theory Comput., 2006

A 2 + <i>epsilon</i> approximation algorithm for the <i>k</i>-MST problem.
Math. Program., 2006

New approximation guarantee for chromatic number.
Proceedings of the 38th Annual ACM Symposium on Theory of Computing, 2006

A Fast Random Sampling Algorithm for Sparsifying Matrices.
Proceedings of the Approximation, 2006

2005
On Non-Approximability for Quadratic Programs
Electron. Colloquium Comput. Complex., 2005

Is the thrill gone?
Commun. ACM, 2005

Euclidean distortion and the sparsest cut.
Proceedings of the 37th Annual ACM Symposium on Theory of Computing, 2005

Fast Algorithms for Approximate Semide.nite Programming using the Multiplicative Weights Update Method.
Proceedings of the 46th Annual IEEE Symposium on Foundations of Computer Science (FOCS 2005), 2005

2004
A Randomized Online Algorithm for Bandwidth Utilization.
J. Sched., 2004

Approximation Schemes for Degree-Restricted MST and Red-Blue Separation Problems.
Algorithmica, 2004

0(sqrt (log n)) Approximation to SPARSEST CUT in Õ(n<sup>2</sup>) Time.
Proceedings of the 45th Symposium on Foundations of Computer Science (FOCS 2004), 2004

Exploring complexity through reductions.
Proceedings of the Computational Complexity Theory., 2004

2003
Approximation Schemes for Minimum Latency Problems.
SIAM J. Comput., 2003

Approximation schemes for NP-hard geometric optimization problems: a survey.
Math. Program., 2003

Fitting algebraic curves to noisy data.
J. Comput. Syst. Sci., 2003

How NP got a new definition: a survey of probabilistically checkable proofs
CoRR, 2003

Approximation Schemes for Degree-Restricted MST and Red-Blue Separation Problem.
Proceedings of the Automata, Languages and Programming, 30th International Colloquium, 2003

Proving Integrality Gaps without Knowing the Linear Program.
Proceedings of the Fundamentals of Computation Theory, 14th International Symposium, 2003

2002
A new rounding procedure for the assignment problem with applications to dense graph arrangement problems.
Math. Program., 2002

A Note on the Representational Incompatibility of Function Approximation and Factored Dynamics.
Proceedings of the Advances in Neural Information Processing Systems 15 [Neural Information Processing Systems, 2002

Proving Integrality Gaps without Knowing the Linear Program.
Proceedings of the 43rd Symposium on Foundations of Computer Science (FOCS 2002), 2002

2001
Learning mixtures of arbitrary gaussians.
Proceedings of the Proceedings on 33rd Annual ACM Symposium on Theory of Computing, 2001

Approximation Schemes for Geometric NP-Hard Problems: A Survey.
Proceedings of the FST TCS 2001: Foundations of Software Technology and Theoretical Computer Science, 2001

2000
A 2+epsilon approximation algorithm for the <i>k</i>-MST problem.
Proceedings of the Eleventh Annual ACM-SIAM Symposium on Discrete Algorithms, 2000

Approximation algorithms that take advice.
Proceedings of the Approximation Algorithms for Combinatorial Optimization, 2000

1999
Polynomial Time Approximation Schemes for Dense Instances of NP-Hard Problems.
J. Comput. Syst. Sci., 1999

Page Replacement for General Caching Problems.
Proceedings of the Tenth Annual ACM-SIAM Symposium on Discrete Algorithms, 1999

1998
Probabilistic Checking of Proofs: A New Characterization of NP.
J. ACM, 1998

Polynomial Time Approximation Schemes for Euclidean Traveling Salesman and other Geometric Problems.
J. ACM, 1998

Proof verification and the hardness of approximation problems.
Electron. Colloquium Comput. Complex., 1998

Approximation Schemes for Euclidean <i>k</i>-Medians and Related Problems.
Proceedings of the Thirtieth Annual ACM Symposium on the Theory of Computing, 1998

The Approximability of NP-hard Problems.
Proceedings of the Thirtieth Annual ACM Symposium on the Theory of Computing, 1998

A Polynomial-Time Approximation Scheme for Weighted Planar Graph TSP.
Proceedings of the Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, 1998

1997
On Winning Strategies in Ehrenfeucht-Fraïssé Games.
Theor. Comput. Sci., 1997

The Hardness of Approximate Optima in Lattices, Codes, and Systems of Linear Equations.
J. Comput. Syst. Sci., 1997

Improved low-degree testing and its applications
Electron. Colloquium Comput. Complex., 1997

Nearly Linear Time Approximation Schemes for Euclidean TSP and other Geometric Problems.
Proceedings of the 38th Annual Symposium on Foundations of Computer Science, 1997

1996
On-Line Algorithms for Path Selection in a Nonblocking Network.
SIAM J. Comput., 1996

Polynomial Time Approximation Schemes for Euclidean TSP and Other Geometric Problems.
Proceedings of the 37th Annual Symposium on Foundations of Computer Science, 1996

1995
Polynomial time approximation schemes for dense instances of <i>NP</i>-hard problems.
Proceedings of the Twenty-Seventh Annual ACM Symposium on Theory of Computing, 1995

Reductions, Codes, PCPs, and Inapproximability.
Proceedings of the 36th Annual Symposium on Foundations of Computer Science, 1995

1994
Simulating quadratic dynamical systems is PSPACE-complete (preliminary version).
Proceedings of the Twenty-Sixth Annual ACM Symposium on Theory of Computing, 1994

1993
The Hardness of Approximate Optimia in Lattices, Codes, and Systems of Linear Equations
Proceedings of the 34th Annual Symposium on Foundations of Computer Science, 1993

1990
On-line Algorithms for Path Selection in a Nonblocking Network (Extended Abstract)
Proceedings of the 22nd Annual ACM Symposium on Theory of Computing, 1990


  Loading...