Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads.

Tianle Cai

Yuhong Li

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Large Language Models as Tool Makers.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

FlexAttention for Efficient High-Resolution Vision-Language Models.

Proceedings of the Computer Vision - ECCV 2024, 2024

DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Scaling In-Context Demonstrations with Structured Attention.

CoRR, 2023

Reward Collapse in Aligning Large Language Models.

CoRR, 2023

What Makes Convolutional Models Great on Long Sequence Modeling?

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

Is Vertical Logistic Regression Privacy-Preserving? A Comprehensive Privacy Analysis and Beyond.

CoRR, 2022

2021

First Place Solution of KDD Cup 2021 & OGB Large-Scale Challenge Graph Prediction Track.

CoRR, 2021

Do Transformers Really Perform Bad for Graph Representation?

CoRR, 2021

Towards Certifying ℓ∞ Robustness using Neural Networks with ℓ∞-dist Neurons.

CoRR, 2021

Do Transformers Really Perform Badly for Graph Representation?

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Towards a Theoretical Framework of Out-of-Distribution Generalization.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Stable, Fast and Accurate: Kernelized Attention with Relative Positional Encoding.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Towards Certifying L-infinity Robustness using Neural Networks with L-inf-dist Neurons.

Proceedings of the 38th International Conference on Machine Learning, 2021

GraphNorm: A Principled Approach to Accelerating Graph Neural Network Training.

Proceedings of the 38th International Conference on Machine Learning, 2021

A Theory of Label Propagation for Subpopulation Shift.

Proceedings of the 38th International Conference on Machine Learning, 2021

2020

RANDOM MASK: Towards Robust Convolutional Neural Networks.

CoRR, 2020

Sanity-Checking Pruning Methods: Random Tickets can Win the Jackpot.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Locally Differentially Private (Contextual) Bandits Learning.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019

Defective Convolutional Layers Learn Robust CNNs.

CoRR, 2019

Convergence of Adversarial Training in Overparametrized Networks.

CoRR, 2019

Adversarially Robust Generalization Just Requires More Unlabeled Data.

CoRR, 2019

A Gram-Gauss-Newton Method Learning Overparameterized Deep Neural Networks for Regression Problems.

CoRR, 2019

Convergence of Adversarial Training in Overparametrized Neural Networks.

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019