Andrew M. Saxe

Orcid: 0000-0002-9831-8812

According to our database1, Andrew M. Saxe authored at least 50 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Abrupt and spontaneous strategy switches emerge in simple regularised neural networks.
PLoS Comput. Biol., 2024

From Lazy to Rich: Exact Learning Dynamics in Deep Linear Networks.
CoRR, 2024

Early learning of the optimal constant solution in neural networks and humans.
CoRR, 2024

When Are Bias-Free ReLU Networks Like Linear Networks?
CoRR, 2024

Get rich quick: exact solutions reveal how unbalanced initializations promote rapid feature learning.
CoRR, 2024

Understanding Unimodal Bias in Multimodal Deep Linear Networks.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

When Representations Align: Universality in Representation Learning Dynamics.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Tilting the Odds at the Lottery: the Interplay of Overparameterisation and Curricula in Neural Networks.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Modelling continual learning in humans with Hebbian context gating and exponentially decaying task signals.
PLoS Comput. Biol., January, 2023

A Theory of Unimodal Bias in Multimodal Learning.
CoRR, 2023

Meta-Learning Strategies through Value Maximization in Neural Networks.
CoRR, 2023

The RL Perceptron: Generalisation Dynamics of Policy Learning in High Dimensions.
CoRR, 2023

Regularised neural networks mimic human insight.
CoRR, 2023

The Transient Nature of Emergent In-Context Learning in Transformers.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On The Specialization of Neural Modules.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Know your audience: specializing grounded language models with listener subtraction.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

2022
Probing transfer learning with a model of synthetic correlated datasets.
Mach. Learn. Sci. Technol., 2022

Continual task learning in natural and artificial agents.
CoRR, 2022

Know your audience: specializing grounded language models with the game of Dixit.
CoRR, 2022

An Analytical Theory of Curriculum Learning in Teacher-Student Networks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Exact learning dynamics of deep linear networks with prior knowledge.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

The Neural Race Reduction: Dynamics of Abstraction in Gated Networks.
Proceedings of the International Conference on Machine Learning, 2022

Maslow's Hammer in Catastrophic Forgetting: Node Re-Use vs. Node Activation.
Proceedings of the International Conference on Machine Learning, 2022

2021
Continual Learning in the Teacher-Student Setup: Impact of Task Similarity.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
High-dimensional dynamics of generalization error in neural networks.
Neural Networks, 2020

Characterizing emergent representations in a space of candidate learning rules for deep networks.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019
Generalisation dynamics of online learning in over-parameterised neural networks.
CoRR, 2019

Dynamics of stochastic gradient descent for two-layer neural networks in the teacher-student setup.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018
A mathematical theory of semantic development in deep neural networks.
CoRR, 2018

Minnorm training: an algorithm for training over-parameterized deep neural networks.
CoRR, 2018

Energy-entropy competition and the effectiveness of stochastic gradient descent in machine learning.
CoRR, 2018

On the Information Bottleneck Theory of Deep Learning.
Proceedings of the 6th International Conference on Learning Representations, 2018

Hierarchical Subtask Discovery with Non-Negative Matrix Factorization.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
High-dimensional dynamics of generalization error in neural networks.
CoRR, 2017

Hierarchy Through Composition with Multitask LMDPs.
Proceedings of the 34th International Conference on Machine Learning, 2017

2016
Hierarchy through Composition with Linearly Solvable Markov Decision Processes.
CoRR, 2016

Active Long Term Memory Networks.
CoRR, 2016

Tensor Switching Networks.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Tutorial Workshop on Contemporary Deep Neural Network Models.
Proceedings of the 38th Annual Meeting of the Cognitive Science Society, 2016

2014
Exact solutions to the nonlinear dynamics of learning in deep linear neural networks.
Proceedings of the 2nd International Conference on Learning Representations, 2014

Multitask model-free reinforcement learning.
Proceedings of the 36th Annual Meeting of the Cognitive Science Society, 2014

Deep Learning and the Brain.
Proceedings of the 36th Annual Meeting of the Cognitive Science Society, 2014

Modeling Perceptual Learning with Deep Networks.
Proceedings of the 36th Annual Meeting of the Cognitive Science Society, 2014

2013
Learning hierarchical categories in deep neural networks.
Proceedings of the 35th Annual Meeting of the Cognitive Science Society, 2013

2011
Unsupervised learning models of primary cortical receptive fields and receptive field plasticity.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

On Random Weights and Unsupervised Feature Learning.
Proceedings of the 28th International Conference on Machine Learning, 2011

2009
Measuring Invariances in Deep Networks.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

2006
Prospect Eleven: Princeton University's entry in the 2005 DARPA Grand Challenge.
J. Field Robotics, 2006


  Loading...