Stéphane d'Ascoli

Orcid: 0000-0002-3131-3371

According to our database¹, Stéphane d'Ascoli authored at least 25 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

2018

2019

2020

2021

2022

2023

2024

2025

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Scaling laws for decoding images from brain activity.

[BibT_eX]

[DOI]

CoRR, January, 2025

2024

Decoding individual words from non-invasive brain recordings across 723 participants.

[BibT_eX]

[DOI]

CoRR, 2024

A Polar coordinate system represents syntax in large language models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

ODEFormer: Symbolic Regression of Dynamical Systems with Transformers.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Elucidating the Hierarchical Nature of Behavior with Masked Autoencoders.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

Boolformer: Symbolic Regression of Logic Functions with Transformers.

[BibT_eX]

[DOI]

CoRR, 2023

Length Generalization in Arithmetic Transformers.

[BibT_eX]

[DOI]

Samy Jelassi

Stéphane d'Ascoli

Carles Domingo-Enrich

Yuhuai Wu

Yuanzhi Li

François Charton

CoRR, 2023

2022

Optimal learning rate schedules in high-dimensional non-convex optimization problems.

[BibT_eX]

[DOI]

Stéphane d'Ascoli

Maria Refinetti

Giulio Biroli

CoRR, 2022

Deep Symbolic Regression for Recurrent Sequences.

[BibT_eX]

[DOI]

Stéphane d'Ascoli

Pierre-Alexandre Kamienny

Guillaume Lample

François Charton

CoRR, 2022

End-to-end Symbolic Regression with Transformers.

[BibT_eX]

[DOI]

Pierre-Alexandre Kamienny

Stéphane d'Ascoli

Guillaume Lample

François Charton

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Deep symbolic regression for recurrence prediction.

[BibT_eX]

[DOI]

Stéphane d'Ascoli

Pierre-Alexandre Kamienny

Guillaume Lample

François Charton

Proceedings of the International Conference on Machine Learning, 2022

2021

Transformed CNNs: recasting pre-trained convolutional layers with self-attention.

[BibT_eX]

[DOI]

CoRR, 2021

More data or more parameters? Investigating the effect of data structure on generalization.

[BibT_eX]

[DOI]

CoRR, 2021

On the interplay between data structure and loss function in classification problems.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Align, then memorise: the dynamics of learning with feedback alignment.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

2020

The dynamics of learning with feedback alignment.

[BibT_eX]

[DOI]

CoRR, 2020

Conditioned Text Generation with Transfer for Closed-Domain Dialogue Systems.

[BibT_eX]

[DOI]

Stéphane d'Ascoli

Alice Coucke

Francesco Caltagirone

Alexandre Caulier

Marc Lelarge

Proceedings of the Statistical Language and Speech Processing, 2020

Triple descent and the two kinds of overfitting: where & why do they appear?

[BibT_eX]

[DOI]

Stéphane d'Ascoli

Levent Sagun

Giulio Biroli

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Double Trouble in Double Descent: Bias and Variance(s) in the Lazy Regime.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

2019

Conditioned Query Generation for Task-Oriented Dialogue Systems.

[BibT_eX]

[DOI]

Stéphane d'Ascoli

Alice Coucke

Francesco Caltagirone

Alexandre Caulier

Marc Lelarge

CoRR, 2019

Scaling description of generalization with number of parameters in deep learning.

[BibT_eX]

[DOI]

CoRR, 2019

Finding the Needle in the Haystack with Convolutions: on the benefits of architectural bias.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018

A jamming transition from under- to over-parametrization affects loss landscape and generalization.

[BibT_eX]

[DOI]

CoRR, 2018

The jamming transition as a paradigm to understand the loss landscape of deep neural networks.

[BibT_eX]

[DOI]

CoRR, 2018

Stéphane d'Ascoli

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...