Bit Complexity of Jordan Normal Form and Polynomial Spectral Factorization.
Proceedings of the 14th Innovations in Theoretical Computer Science Conference, 2023
Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer.
CoRR, 2022
Bit Complexity of Jordan Normal Form and Spectral Factorization.
CoRR, 2021
Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Language Models are Few-Shot Learners.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Asymmetric Random Projections.
CoRR, 2019
Exponential Lower Bounds on Spectrahedral Representations of Hyperbolicity Cones.
Proceedings of the Thirtieth Annual ACM-SIAM Symposium on Discrete Algorithms, 2019
The geometry of rank decompositions of matrix multiplication II: 3×3 matrices.
CoRR, 2018
Proceedings of the 8th Innovations in Theoretical Computer Science Conference, 2017