Stéphane d'Ascoli

Orcid: 0000-0002-3131-3371

According to our database1, Stéphane d'Ascoli authored at least 22 papers between 2018 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
ODEFormer: Symbolic Regression of Dynamical Systems with Transformers.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Elucidating the Hierarchical Nature of Behavior with Masked Autoencoders.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Boolformer: Symbolic Regression of Logic Functions with Transformers.
CoRR, 2023

Length Generalization in Arithmetic Transformers.
CoRR, 2023

2022
Optimal learning rate schedules in high-dimensional non-convex optimization problems.
CoRR, 2022

Deep Symbolic Regression for Recurrent Sequences.
CoRR, 2022

End-to-end Symbolic Regression with Transformers.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Deep symbolic regression for recurrence prediction.
Proceedings of the International Conference on Machine Learning, 2022

2021
Transformed CNNs: recasting pre-trained convolutional layers with self-attention.
CoRR, 2021

More data or more parameters? Investigating the effect of data structure on generalization.
CoRR, 2021

On the interplay between data structure and loss function in classification problems.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases.
Proceedings of the 38th International Conference on Machine Learning, 2021

Align, then memorise: the dynamics of learning with feedback alignment.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
The dynamics of learning with feedback alignment.
CoRR, 2020

Conditioned Text Generation with Transfer for Closed-Domain Dialogue Systems.
Proceedings of the Statistical Language and Speech Processing, 2020

Triple descent and the two kinds of overfitting: where & why do they appear?
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Double Trouble in Double Descent: Bias and Variance(s) in the Lazy Regime.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Conditioned Query Generation for Task-Oriented Dialogue Systems.
CoRR, 2019

Scaling description of generalization with number of parameters in deep learning.
CoRR, 2019

Finding the Needle in the Haystack with Convolutions: on the benefits of architectural bias.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018
A jamming transition from under- to over-parametrization affects loss landscape and generalization.
CoRR, 2018

The jamming transition as a paradigm to understand the loss landscape of deep neural networks.
CoRR, 2018


  Loading...