Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

From Sparse Dependence to Sparse Attention: Unveiling How Chain-of-Thought Enhances Transformer Sample Efficiency.

[DOI]

Kaiyue Wen

Huaqing Zhang

Hongzhou Lin

Jingzhao Zhang

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Unmemorization in Large Language Models via Self-Distillation and Deliberate Imagination.

[DOI]

CoRR, 2024

2023

Deep hybrid model with satellite imagery: how to combine demand modeling and computer vision for behavior analysis?

[DOI]

CoRR, 2023

2022

Beyond Worst-Case Analysis in Stochastic Approximation: Moment Estimation Improves Instance Complexity.

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

2021

Delayed Gradient Averaging: Tolerate the Communication Latency for Federated Learning.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020

Stochastic Optimization with Non-stationary Noise.

[DOI]

CoRR, 2020

On Complexity of Finding Stationary Points of Nonsmooth Nonconvex Functions.

[DOI]

CoRR, 2020

On the Complexity of Minimizing Convex Finite Sums Without Using the Indices of the Individual Functions.

[DOI]

CoRR, 2020

IDEAL: Inexact DEcentralized Accelerated Augmented Lagrangian Method.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Complexity of Finding Stationary Points of Nonconvex Nonsmooth Functions.

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

2019

An Inexact Variable Metric Proximal Point Algorithm for Generic Quasi-Newton Acceleration.

[DOI]

Hongzhou Lin

Julien Mairal

Zaïd Harchaoui

SIAM J. Optim., 2019

2018

ResNet with one-neuron hidden layers is a Universal Approximator.

[DOI]

Hongzhou Lin

Stefanie Jegelka

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Catalyst for Gradient-based Nonconvex Optimization.

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

2017

Generic acceleration schemes for gradient-based optimization in machine learning. (Algorithmes d'accélération générique pour les méthodes d'optimisation en apprentissage statistique).

[DOI]

Hongzhou Lin

PhD thesis, 2017

Catalyst Acceleration for First-order Convex Optimization: from Theory to Practice.

[DOI]

Hongzhou Lin

Julien Mairal

Zaïd Harchaoui

J. Mach. Learn. Res., 2017

2015

A Universal Catalyst for First-Order Optimization.

[DOI]

Hongzhou Lin

Julien Mairal

Zaïd Harchaoui

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015