We stand with Ukraine

We stand with Ukraine

Andrew M. Saxe

Orcid: 0000-0002-9831-8812

According to our database¹, Andrew M. Saxe authored at least 51 papers between 2006 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

2006

2008

2010

2012

2014

2016

2018

2020

2022

2024

0

5

10

6

5

3

1

1

3

1

2

1

5

3

4

1

1

1

2

1

2

4

1

2

1

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

on orcid.org
on scholar.google.com

On csauthors.net:

Bibliography

2024

Abrupt and spontaneous strategy switches emerge in simple regularised neural networks.

[BibT_eX]

[DOI]

,

,

Paul S. Muhle-Karbe

,

,

Christopher Summerfield

,

Nicolas W. Schuck

PLoS Comput. Biol., 2024

Flexible task abstractions emerge in linear networks with fast and bounded units.

[BibT_eX]

[DOI]

,

,

Alexandra M. Proca

,

,

Christopher Summerfield

,

CoRR, 2024

From Lazy to Rich: Exact Learning Dynamics in Deep Linear Networks.

[BibT_eX]

[DOI]

Clémentine C. J. Dominé

,

Nicolas Anguita

,

Alexandra M. Proca

,

,

,

Pedro A. M. Mediano

,

CoRR, 2024

Early learning of the optimal constant solution in neural networks and humans.

[BibT_eX]

[DOI]

,

,

,

Christopher Summerfield

CoRR, 2024

When Are Bias-Free ReLU Networks Like Linear Networks?

[BibT_eX]

[DOI]

,

,

Peter E. Latham

CoRR, 2024

Get rich quick: exact solutions reveal how unbalanced initializations promote rapid feature learning.

[BibT_eX]

[DOI]

,

Allan Raventós

,

Clémentine Dominé

,

,

David A. Klindt

,

,

CoRR, 2024

Understanding Unimodal Bias in Multimodal Deep Linear Networks.

[BibT_eX]

[DOI]

,

Peter E. Latham

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation.

[BibT_eX]

[DOI]

Aaditya K. Singh

,

,

,

Stephanie C. Y. Chan

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

When Representations Align: Universality in Representation Learning Dynamics.

[BibT_eX]

[DOI]

Loek van Rossem

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Tilting the Odds at the Lottery: the Interplay of Overparameterisation and Curricula in Neural Networks.

[BibT_eX]

[DOI]

Stefano Sarao Mannelli

,

Yaraslau Ivashinka

,

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning.

[BibT_eX]

[DOI]

,

Stefano Sarao Mannelli

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

Modelling continual learning in humans with Hebbian context gating and exponentially decaying task signals.

[BibT_eX]

[DOI]

,

,

,

Christopher Summerfield

PLoS Comput. Biol., January, 2023

A Theory of Unimodal Bias in Multimodal Learning.

[BibT_eX]

[DOI]

,

Peter E. Latham

,

CoRR, 2023

Meta-Learning Strategies through Value Maximization in Neural Networks.

[BibT_eX]

[DOI]

Rodrigo Carrasco-Davis

,

,

CoRR, 2023

The RL Perceptron: Generalisation Dynamics of Policy Learning in High Dimensions.

[BibT_eX]

[DOI]

,

,

Stefano Sarao Mannelli

,

Sebastian Goldt

,

CoRR, 2023

Regularised neural networks mimic human insight.

[BibT_eX]

[DOI]

,

,

Paul S. Muhle-Karbe

,

,

Christopher Summerfield

,

Nicolas W. Schuck

CoRR, 2023

The Transient Nature of Emergent In-Context Learning in Transformers.

[BibT_eX]

[DOI]

Aaditya K. Singh

,

Stephanie C. Y. Chan

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On The Specialization of Neural Modules.

[BibT_eX]

[DOI]

,

,

Benjamin Rosman

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Know your audience: specializing grounded language models with listener subtraction.

[BibT_eX]

[DOI]

Aaditya K. Singh

,

,

,

,

Andrew K. Lampinen

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

2022

Probing transfer learning with a model of synthetic correlated datasets.

[BibT_eX]

[DOI]

Federica Gerace

,

,

Stefano Sarao Mannelli

,

,

Lenka Zdeborová

Mach. Learn. Sci. Technol., 2022

Continual task learning in natural and artificial agents.

[BibT_eX]

[DOI]

,

,

Christopher Summerfield

CoRR, 2022

Know your audience: specializing grounded language models with the game of Dixit.

[BibT_eX]

[DOI]

Aaditya K. Singh

,

,

,

,

Andrew K. Lampinen

CoRR, 2022

An Analytical Theory of Curriculum Learning in Teacher-Student Networks.

[BibT_eX]

[DOI]

,

Stefano Sarao Mannelli

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Exact learning dynamics of deep linear networks with prior knowledge.

[BibT_eX]

[DOI]

,

Clémentine Dominé

,

James Fitzgerald

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

The Neural Race Reduction: Dynamics of Abstraction in Gated Networks.

[BibT_eX]

[DOI]

,

,

Sam Jay Lewallen

Proceedings of the International Conference on Machine Learning, 2022

Maslow's Hammer in Catastrophic Forgetting: Node Re-Use vs. Node Activation.

[BibT_eX]

[DOI]

,

Stefano Sarao Mannelli

,

Claudia Clopath

,

Sebastian Goldt

,

Proceedings of the International Conference on Machine Learning, 2022

2021

Continual Learning in the Teacher-Student Setup: Impact of Task Similarity.

[BibT_eX]

[DOI]

,

Sebastian Goldt

,

Proceedings of the 38th International Conference on Machine Learning, 2021

2020

High-dimensional dynamics of generalization error in neural networks.

[BibT_eX]

[DOI]

Madhu S. Advani

,

,

Haim Sompolinsky

Neural Networks, 2020

Characterizing emergent representations in a space of candidate learning rules for deep networks.

[BibT_eX]

[DOI]

,

Christopher Summerfield

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019

Generalisation dynamics of online learning in over-parameterised neural networks.

[BibT_eX]

[DOI]

Sebastian Goldt

,

Madhu S. Advani

,

,

Florent Krzakala

,

Lenka Zdeborová

CoRR, 2019

Dynamics of stochastic gradient descent for two-layer neural networks in the teacher-student setup.

[BibT_eX]

[DOI]

Sebastian Goldt

,

,

,

Florent Krzakala

,

Lenka Zdeborová

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018

A mathematical theory of semantic development in deep neural networks.

[BibT_eX]

[DOI]

,

James L. McClelland

,

CoRR, 2018

Minnorm training: an algorithm for training over-parameterized deep neural networks.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2018

Energy-entropy competition and the effectiveness of stochastic gradient descent in machine learning.

[BibT_eX]

[DOI]

,

,

Madhu S. Advani

,

CoRR, 2018

On the Information Bottleneck Theory of Deep Learning.

[BibT_eX]

[DOI]

,

,

,

,

Artemy Kolchinsky

,

Brendan D. Tracey

,

Proceedings of the 6th International Conference on Learning Representations, 2018

Hierarchical Subtask Discovery with Non-Negative Matrix Factorization.

[BibT_eX]

[DOI]

Adam Christopher Earle

,

,

Benjamin Rosman

Proceedings of the 6th International Conference on Learning Representations, 2018

2017

High-dimensional dynamics of generalization error in neural networks.

[BibT_eX]

[DOI]

Madhu S. Advani

,

CoRR, 2017

Hierarchy Through Composition with Multitask LMDPs.

[BibT_eX]

[DOI]

,

Adam Christopher Earle

,

Benjamin Rosman

Proceedings of the 34th International Conference on Machine Learning, 2017

2016

Hierarchy through Composition with Linearly Solvable Markov Decision Processes.

[BibT_eX]

[DOI]

,

Adam Christopher Earle

,

Benjamin Rosman

CoRR, 2016

Active Long Term Memory Networks.

[BibT_eX]

[DOI]

Tommaso Furlanello

,

,

,

,

CoRR, 2016

Tensor Switching Networks.

[BibT_eX]

[DOI]

Chuan-Yung Tsai

,

,

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Tutorial Workshop on Contemporary Deep Neural Network Models.

[BibT_eX]

[DOI]

James L. McClelland

,

Steven Stenberg Hansen

,

Proceedings of the 38th Annual Meeting of the Cognitive Science Society, 2016

2014

Exact solutions to the nonlinear dynamics of learning in deep linear neural networks.

[BibT_eX]

[DOI]

,

James L. McClelland

,

Proceedings of the 2nd International Conference on Learning Representations, 2014

Multitask model-free reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the 36th Annual Meeting of the Cognitive Science Society, 2014

Deep Learning and the Brain.

[BibT_eX]

[DOI]

Proceedings of the 36th Annual Meeting of the Cognitive Science Society, 2014

Modeling Perceptual Learning with Deep Networks.

[BibT_eX]

[DOI]

,

Proceedings of the 36th Annual Meeting of the Cognitive Science Society, 2014

2013

Learning hierarchical categories in deep neural networks.

[BibT_eX]

[DOI]

,

James L. McClelland

,

Proceedings of the 35th Annual Meeting of the Cognitive Science Society, 2013

2011

Unsupervised learning models of primary cortical receptive fields and receptive field plasticity.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

On Random Weights and Unsupervised Feature Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 28th International Conference on Machine Learning, 2011

2009

Measuring Invariances in Deep Networks.

[BibT_eX]

[DOI]

Ian J. Goodfellow

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

2006

Prospect Eleven: Princeton University's entry in the 2005 DARPA Grand Challenge.

[BibT_eX]

[DOI]

Anand R. Atreya

,

Bryan C. Cattle

,

Brendan M. Collins

,

Benjamin Essenburg

,

Gordon H. Franken

,

,

Scott N. Schiffres

,

Alain L. Kornhauser

J. Field Robotics, 2006

Loading...