Zhao Song
Orcid: 0000-0003-4589-5234Affiliations:
- Adobe Research
- Institute for Advanced Study, Princeton, NJ, USA (former)
- Princeton University, NJ, USA (former)
- University of Washington, DC, USA (former)
- University of Texas at Austin, Department of Computer Science, USA (PhD 2019)
- Harvard University, Cambridge, MA, USA (former)
- University of California Berkeley, CA, USA (former)
- Simon Fraser University, School of Computing Science, Burnaby, Canada (former)
According to our database1,
Zhao Song
authored at least 241 papers
between 2012 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on ias.edu
-
on twitter.com
-
on orcid.org
On csauthors.net:
Bibliography
2024
Advancing the Understanding of Fixed Point Iterations in Deep Neural Networks: A Detailed Analytical Study.
CoRR, 2024
Bypassing the Exponential Dependency: Looped Transformers Efficiently Learn In-context by Multi-step Gradient Descent.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Log-concave Sampling over a Convex Body with a Barrier: a Robust and Unified Dikin Walk.
CoRR, 2024
CoRR, 2024
CoRR, 2024
On Statistical Rates and Provably Efficient Criteria of Latent Diffusion Transformers (DiTs).
CoRR, 2024
CoRR, 2024
Unraveling the Smoothness Properties of Diffusion Models: A Gaussian Mixture Perspective.
CoRR, 2024
CoRR, 2024
Conv-Basis: A New Paradigm for Efficient Attention Inference and Gradient Computation in Transformers.
CoRR, 2024
Exploring the Frontiers of Softmax: Provable Optimization, Applications in Diffusion Model, and Beyond.
CoRR, 2024
Fourier Circuits in Neural Networks: Unlocking the Potential of Large Language Models in Mathematical Reasoning and Modular Arithmetic.
CoRR, 2024
The Fine-Grained Complexity of Gradient Computation for Training Large Language Models.
CoRR, 2024
Enhancing Stochastic Gradient Descent: A Unified Framework and Novel Acceleration Methods for Faster Convergence.
CoRR, 2024
Proceedings of the 2024 ACM-SIAM Symposium on Discrete Algorithms, 2024
Proceedings of the 15th Innovations in Theoretical Computer Science Conference, 2024
On Computational Limits of Modern Hopfield Models: A Fine-Grained Complexity Analysis.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Low Rank Matrix Completion via Robust Alternating Minimization in Nearly Linear Time.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
How to Capture Higher-order Correlations? Generalizing Matrix Softmax Attention to Kronecker Computation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
CoRR, 2023
Revisiting Quantum Algorithms for Linear Regressions: Quadratic Speedups without Data-Dependent Parameters.
CoRR, 2023
One Pass Streaming Algorithm for Super Long Token Attention Approximation in Sublinear Space.
CoRR, 2023
CoRR, 2023
Unmasking Transformers: A Theoretical Approach to Data Recovery via Attention Weights.
CoRR, 2023
CoRR, 2023
An Automatic Learning Rate Schedule Algorithm for Achieving Faster Convergence and Steeper Descent.
CoRR, 2023
CoRR, 2023
A Fast Optimization View: Reformulating Single Layer Attention in LLM Based on Tensor and SVM Trick, and Solving It in Matrix Multiplication Time.
CoRR, 2023
CoRR, 2023
CoRR, 2023
In-Context Learning for Attention Scheme: from Single Softmax Regression to Multiple Softmax Regression via a Tensor Trick.
CoRR, 2023
H<sub>2</sub>O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
CoRR, 2023
Efficient Alternating Minimization with Applications to Weighted Low Rank Approximation.
CoRR, 2023
Query Complexity of Active Learning for Function Family With Nearly Orthogonal Basis.
CoRR, 2023
A Mathematical Abstraction for Balancing the Trade-off Between Creativity and Reality in Large Language Models.
CoRR, 2023
CoRR, 2023
CoRR, 2023
Randomized and Deterministic Attention Sparsification Algorithms for Over-parameterized Feature Dimension.
CoRR, 2023
CoRR, 2023
A Theoretical Analysis Of Nearest Neighbor Search On Approximate Near Neighbor Graph.
CoRR, 2023
Super-resolution and Robust Sparse Continuous Fourier Transform in Any Constant Dimension: Nearly Linear Time and Sample Complexity.
Proceedings of the 2023 ACM-SIAM Symposium on Discrete Algorithms, 2023
H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
InfoPrompt: Information-Theoretic Soft Prompt Tuning for Natural Language Understanding.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Bypass Exponential Time Preprocessing: Fast Neural Network Training via Weight-Data Correlation Preprocessing.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Sketching for First Order Method: Efficient Algorithm for Low-Bandwidth Channel and Vulnerability.
Proceedings of the International Conference on Machine Learning, 2023
Sketching Meets Differential Privacy: Fast Algorithm for Dynamic Kronecker Projection Maintenance.
Proceedings of the International Conference on Machine Learning, 2023
Space-Efficient Interior Point Method, with Applications to Linear Programming and Maximum Weight Bipartite Matching.
Proceedings of the 50th International Colloquium on Automata, Languages, and Programming, 2023
Proceedings of the 64th IEEE Annual Symposium on Foundations of Computer Science, 2023
Proceedings of the IEEE International Conference on Big Data, 2023
Fast Heavy Inner Product Identification Between Weights and Inputs in Neural Network Training.
Proceedings of the IEEE International Conference on Big Data, 2023
Proceedings of the IEEE International Conference on Big Data, 2023
A Tale of Two Efficient Value Iteration Algorithms for Solving Linear MDPs with Large Action Space.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023
An Online and Unified Algorithm for Projection Matrix Vector Multiplication with Application to Empirical Risk Minimization.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
Electron. Colloquium Comput. Complex., 2022
Sketching Meets Differential Privacy: Fast Algorithm for Dynamic Kronecker Projection Maintenance.
CoRR, 2022
Dynamic Maintenance of Kernel Density Estimation Data Structure: From Practice to Theory.
CoRR, 2022
Accelerating Frank-Wolfe Algorithm using Low-Dimensional and Adaptive Data Structures.
CoRR, 2022
CoRR, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the 13th Innovations in Theoretical Computer Science Conference, 2022
Proceedings of the International Conference on Machine Learning, 2022
Bounding the Width of Neural Networks via Coupled Initialization A Worst Case Analysis.
Proceedings of the International Conference on Machine Learning, 2022
Perfectly Balanced: Improving Transfer and Robustness of Supervised Contrastive Learning.
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the 63rd IEEE Annual Symposium on Foundations of Computer Science, 2022
Proceedings of the IEEE International Conference on Big Data, 2022
Proceedings of the IEEE International Conference on Big Data, 2022
Proceedings of the Approximation, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
Electron. Colloquium Comput. Complex., 2021
CoRR, 2021
FL-NTK: A Neural Tangent Kernel-based Framework for Federated Learning Convergence Analysis.
CoRR, 2021
When is particle filtering efficient for planning in partially observed linear dynamical systems?
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021
Proceedings of the STOC '21: 53rd Annual ACM SIGACT Symposium on Theory of Computing, 2021
Minimum cost flows, MDPs, and ℓ<sub>1</sub>-regression in nearly linear time for dense instances.
Proceedings of the STOC '21: 53rd Annual ACM SIGACT Symposium on Theory of Computing, 2021
Breaking the Linear Iteration Cost Barrier for Some Well-known Conditional Gradient Methods Using MaxIP Data-structures.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 12th Innovations in Theoretical Computer Science Conference, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Near-Optimal Two-Pass Streaming Algorithm for Sampling Random Walks over Directed Graphs.
Proceedings of the 48th International Colloquium on Automata, Languages, and Programming, 2021
2020
CoRR, 2020
CoRR, 2020
CoRR, 2020
An improved cutting plane method for convex optimization, convex-concave games, and its applications.
Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing, 2020
Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing, 2020
Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing, 2020
Proceedings of the 2020 ACM-SIAM Symposium on Discrete Algorithms, 2020
Over-parameterized Adversarial Training: An Analysis Overcoming the Curse of Dimensionality.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 61st IEEE Annual Symposium on Foundations of Computer Science, 2020
Proceedings of the 61st IEEE Annual Symposium on Foundations of Computer Science, 2020
Proceedings of the 61st IEEE Annual Symposium on Foundations of Computer Science, 2020
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020
2019
J. Mach. Learn. Res., 2019
Proceedings of the 51st Annual ACM SIGACT Symposium on Theory of Computing, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 7th International Conference on Learning Representations, 2019
Proceedings of the 60th IEEE Annual Symposium on Foundations of Computer Science, 2019
(Nearly) Sample-Optimal Sparse Fourier Transform in Any Dimension; RIPless and Filterless.
Proceedings of the 60th IEEE Annual Symposium on Foundations of Computer Science, 2019
Proceedings of the Conference on Learning Theory, 2019
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019
2018
Proceedings of the Encyclopedia of Social Network Analysis and Mining, 2nd Edition, 2018
Electron. Colloquium Comput. Complex., 2018
CoRR, 2018
Sensitivity Sampling Over Dynamic Geometric Data Streams with Applications to k-Clustering.
CoRR, 2018
Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing, 2018
Proceedings of the 35th International Conference on Machine Learning, 2018
Proceedings of the 35th International Conference on Machine Learning, 2018
A Matrix Chernoff Bound for Strongly Rayleigh Distributions and Spectral Sparsifiers from a few Random Spanning Trees.
Proceedings of the 59th IEEE Annual Symposium on Foundations of Computer Science, 2018
Proceedings of the 59th IEEE Annual Symposium on Foundations of Computer Science, 2018
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018
2017
CoRR, 2017
Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing, 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
Proceedings of the 44th International Colloquium on Automata, Languages, and Programming, 2017
2016
Proceedings of the 15th Scandinavian Symposium and Workshops on Algorithm Theory, 2016
Proceedings of the 48th Annual ACM SIGACT Symposium on Theory of Computing, 2016
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
Proceedings of the IEEE 57th Annual Symposium on Foundations of Computer Science, 2016
2015
Discret. Appl. Math., 2015
Proceedings of the IEEE 56th Annual Symposium on Foundations of Computer Science, 2015
Proceedings of the 52nd Annual Design Automation Conference, 2015
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
2014
Encyclopedia of Social Network Analysis and Mining, 2014
Electron. Colloquium Comput. Complex., 2014
Algorithmica, 2014
Proceedings of the LATIN 2014: Theoretical Informatics - 11th Latin American Symposium, Montevideo, Uruguay, March 31, 2014
Proceedings of the Computing and Combinatorics - 20th International Conference, 2014
Proceedings of the Combinatorial Optimization and Applications, 2014
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014
2013
Sustainable robot foraging: Adaptive fine-grained multi-robot task allocation for maximum sustainable yield of biological resources.
Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2013
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013
2012
Computing Minmax Regret 1-Median on a Tree Network with Positive/Negative Vertex Weights.
Proceedings of the Algorithms and Computation - 23rd International Symposium, 2012
Proceedings of the International Conference on Advances in Social Networks Analysis and Mining, 2012
MO-LOST: adaptive ant trail untangling in multi-objective multi-colony robot foraging.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012