Jeremy Bernstein

According to our database1, Jeremy Bernstein authored at least 22 papers between 2017 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Old Optimizer, New Norm: An Anthology.
CoRR, 2024

Scalable Optimization in the Modular Norm.
CoRR, 2024

Training Neural Networks from Scratch with Parallel Low-Rank Adapters.
CoRR, 2024

Devising and Detecting Phishing Emails Using Large Language Models.
IEEE Access, 2024

2023
A Spectral Condition for Feature Learning.
CoRR, 2023

Devising and Detecting Phishing: Large Language Models vs. Smaller Human Models.
CoRR, 2023

SketchOGD: Memory-Efficient Continual Learning.
CoRR, 2023

Automatic Gradient Descent: Deep Learning without Hyperparameters.
CoRR, 2023

2022
Optimisation & Generalisation in Networks of Neurons.
CoRR, 2022

Investigating Generalization by Controlling Normalized Margin.
Proceedings of the International Conference on Machine Learning, 2022

2021
On the Implicit Biases of Architecture & Gradient Descent.
CoRR, 2021

Computing the Information Content of Trained Neural Networks.
CoRR, 2021

Fine-Grained System Identification of Nonlinear Neural Circuits.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Learning by Turning: Neural Architecture Aware Optimisation.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Learning compositional functions via multiplicative weight updates.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

On the distance between two neural networks and the stability of learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019
signSGD with Majority Vote is Communication Efficient and Fault Tolerant.
Proceedings of the 7th International Conference on Learning Representations, 2019

2018
signSGD with Majority Vote is Communication Efficient And Byzantine Fault Tolerant.
CoRR, 2018

SIGNSGD: Compressed Optimisation for Non-Convex Problems.
Proceedings of the 35th International Conference on Machine Learning, 2018

Stochastic Activation Pruning for Robust Adversarial Defense.
Proceedings of the 6th International Conference on Learning Representations, 2018

Compression by the signs: distributed learning is a two-way street.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
Markov Transitions between Attractor States in a Recurrent Neural Network.
Proceedings of the 2017 AAAI Spring Symposia, 2017


  Loading...