2025

Accelerating neural network training: An analysis of the AlgoPerf competition.

[DOI]

Priya Kasimbeg

Frank Schneider

Runa Eschenhagen

Juhan Bae

Chandramouli Shama Sastry

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Pre-trained Gaussian Processes for Bayesian Optimization.

[DOI]

J. Mach. Learn. Res., 2024

A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs.

[DOI]

Ankit Singh Rawat

Veeranjaneyulu Sadhanala

CoRR, 2024

2023

A Simple Approach to Improve Single-Model Deep Uncertainty via Distance-Awareness.

[DOI]

Balaji Lakshminarayanan

J. Mach. Learn. Res., 2023

Benchmarking Neural Network Training Algorithms.

[DOI]

CoRR, 2023

Kernel Regression with Infinite-Width Neural Networks on Millions of Examples.

[DOI]

CoRR, 2023

2022

Underspecification Presents Challenges for Credibility in Modern Machine Learning.

[DOI]

J. Mach. Learn. Res., 2022

Adaptive Gradient Methods at the Edge of Stability.

[DOI]

CoRR, 2022

Plex: Towards Reliability using Pretrained Large Model Extensions.

[DOI]

CoRR, 2022

Pre-training helps Bayesian optimization too.

[DOI]

CoRR, 2022

A Loss Curvature Perspective on Training Instabilities of Deep Learning Models.

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Predicting the utility of search spaces for black-box optimization: a simple, budget-aware approach.

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021

A Loss Curvature Perspective on Training Instability in Deep Learning.

[DOI]

CoRR, 2021

Automatic prior selection for meta Bayesian optimization with a case study on tuning deep neural network optimizers.

[DOI]

CoRR, 2021

Uncertainty Baselines: Benchmarks for Uncertainty & Robustness in Deep Learning.

[DOI]

CoRR, 2021

A Large Batch Optimizer Reality Check: Traditional, Generic Optimizers Suffice Across Batch Sizes.

[DOI]

Zachary Nado

Justin Gilmer

Christopher J. Shallue

Rohan Anil

George E. Dahl

CoRR, 2021

Benchmarking Bayesian Deep Learning on Diabetic Retinopathy Detection Tasks.

[DOI]

Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

2020

Revisiting One-vs-All Classifiers for Predictive Uncertainty and Out-of-Distribution Detection in Neural Networks.

[DOI]

Balaji Lakshminarayanan

CoRR, 2020

Evaluating Prediction-Time Batch Normalization for Robustness under Covariate Shift.

[DOI]

Balaji Lakshminarayanan

Jasper Snoek

CoRR, 2020

2019

On Empirical Comparisons of Optimizers for Deep Learning.

[DOI]

Dami Choi

Christopher J. Shallue

CoRR, 2019

Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model.

[DOI]

Christopher J. Shallue

Roger B. Grosse

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Can you trust your model's uncertainty? Evaluating predictive uncertainty under dataset shift.

[DOI]

Jasper Snoek

Yaniv Ovadia

Emily Fertig

Balaji Lakshminarayanan

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

AutoGraph: Imperative-style Coding with Graph-based Performance.

[DOI]

Alexander B. Wiltschko

Proceedings of the Second Conference on Machine Learning and Systems, SysML 2019, 2019

2018

Stochastic Gradient Langevin dynamics that Exploit Neural Network Structure.

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018