Nicolas Le Roux

Orcid: 0000-0003-0365-1469

According to our database1, Nicolas Le Roux authored at least 68 papers between 2003 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
fPLSA: Learning Semantic Structures in Document Collections Using Foundation Models.
CoRR, 2024

VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment.
CoRR, 2024

Improving Context-Aware Preference Modeling for Language Models.
CoRR, 2024

Towards Modular LLMs by Building and Reusing a Library of LoRAs.
CoRR, 2024

Towards Modular LLMs by Building and Reusing a Library of LoRAs.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Language-guided Skill Learning with Temporal Variational Inference.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Deep Language Networks: Joint Prompt Training of Stacked LLMs using Variational Inference.
CoRR, 2023

Decision-Aware Actor-Critic with Function Approximation and Theoretical Guarantees.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Joint Prompt Optimization of Stacked LLMs using Variational Inference.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Multi-Head Adapter Routing for Cross-Task Generalization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Target-based Surrogates for Stochastic Optimization.
Proceedings of the International Conference on Machine Learning, 2023

Unraveling the Interconnected Axes of Heterogeneity in Machine Learning for Democratic and Inclusive Advancements.
Proceedings of the 3rd ACM Conference on Equity and Access in Algorithms, 2023

2022
Multi-Head Adapter Routing for Data-Efficient Fine-Tuning.
CoRR, 2022

A general class of surrogate functions for stable and efficient reinforcement learning.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

On the Convergence of Stochastic Extragradient for Bilinear Games using Restarted Iteration Averaging.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
A functional mirror ascent view of policy gradient methods with function approximation.
CoRR, 2021

On the Convergence of Stochastic Extragradient for Bilinear Games with Restarted Iteration Averaging.
CoRR, 2021

Bridging the Gap Between Adversarial Robustness and Optimization Bias.
CoRR, 2021

Beyond Variance Reduction: Understanding the True Impact of Baselines on Policy Optimization.
Proceedings of the 38th International Conference on Machine Learning, 2021

Batch Reinforcement Learning Through Continuation Method.
Proceedings of the 9th International Conference on Learning Representations, 2021

Impact of Aliasing on Generalization in Deep Convolutional Networks.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
An Effective Anti-Aliasing Approach for Residual Networks.
CoRR, 2020

To Each Optimizer a Norm, To Each Norm its Generalization.
CoRR, 2020

The Geometry of Sign Gradient Descent.
CoRR, 2020

An operator view of policy gradient methods.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

On the interplay between noise and curvature and its effect on optimization and generalization.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

2019
Information matrices and generalization.
CoRR, 2019

Anytime Tail Averaging.
CoRR, 2019

A Geometric Perspective on Optimal Representations for Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Reducing the variance in online optimization by transporting past gradients.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

The Value Function Polytope in Reinforcement Learning.
Proceedings of the 36th International Conference on Machine Learning, 2019

Understanding the Impact of Entropy on Policy Optimization.
Proceedings of the 36th International Conference on Machine Learning, 2019

Distributional reinforcement learning with linear function approximation.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018
Online variance-reducing optimization.
Proceedings of the 6th International Conference on Learning Representations, 2018

Negative eigenvalues of the Hessian in deep neural networks.
Proceedings of the 6th International Conference on Learning Representations, 2018

Tracking the gradients using the Hessian: A new look at variance reducing stochastic methods.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

2017
Erratum to: Minimizing finite sums with the stochastic average gradient.
Math. Program., 2017

Minimizing finite sums with the stochastic average gradient.
Math. Program., 2017

A comparative study of counterfactual estimators.
CoRR, 2017

Distributed SAGA: Maintaining linear convergence rate with limited communication.
CoRR, 2017

Tighter bounds lead to improved classifiers.
Proceedings of the 5th International Conference on Learning Representations, 2017

2016
Efficient iterative policy optimization.
CoRR, 2016

2015
Large-Scale Real-Time Product Recommendation at Criteo.
Proceedings of the 9th ACM Conference on Recommender Systems, 2015

2013
Local Component Analysis
Proceedings of the 1st International Conference on Learning Representations, 2013

2012
A Stochastic Gradient Method with an Exponential Convergence Rate for Strongly-Convex Optimization with Finite Training Sets
CoRR, 2012

A Stochastic Gradient Method with an Exponential Convergence Rate for Finite Training Sets.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

A latent factor model for highly multi-relational data.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

2011
Learning a Generative Model of Images by Factoring Appearance and Shape.
Neural Comput., 2011

Convergence Rates of Inexact Proximal-Gradient Methods for Convex Optimization.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Ask the locals: Multi-way local pooling for image recognition.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Weakly Supervised Learning of Foreground-Background Segmentation Using Masked RBMs.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2011, 2011

2010
Deep Belief Networks Are Compact Universal Approximators.
Neural Comput., 2010

A fast natural Newton method.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

2008
Representational Power of Restricted Boltzmann Machines and Deep Belief Networks.
Neural Comput., 2008

Products of ordinary differential operators by evaluation and interpolation.
Proceedings of the Symbolic and Algebraic Computation, International Symposium, 2008

2007
Continuous Neural Networks.
Proceedings of the Eleventh International Conference on Artificial Intelligence and Statistics, 2007

Topmoumoute Online Natural Gradient Algorithm.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Learning the 2-D Topology of Images.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

2006
Rank reduction of a class of pfaffian systems in two variables.
Proceedings of the Symbolic and Algebraic Computation, International Symposium, 2006

Spectral Dimensionality Reduction.
Proceedings of the Feature Extraction - Foundations and Applications, 2006

Large-Scale Algorithms.
Proceedings of the Semi-Supervised Learning, 2006

Label Propagation and Quadratic Criterion.
Proceedings of the Semi-Supervised Learning, 2006

2005
Convex Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005

The Curse of Highly Variable Functions for Local Kernel Machines.
Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005

Efficient Non-Parametric Function Induction in Semi-Supervised Learning.
Proceedings of the Tenth International Workshop on Artificial Intelligence and Statistics, 2005

2004
Learning Eigenfunctions Links Spectral Embedding and Kernel PCA.
Neural Comput., 2004

2003
Out-of-Sample Extensions for LLE, Isomap, MDS, Eigenmaps, and Spectral Clustering.
Proceedings of the Advances in Neural Information Processing Systems 16 [Neural Information Processing Systems, 2003

Computing power series solutions of a nonlinear PDE system.
Proceedings of the Symbolic and Algebraic Computation, 2003


  Loading...