2025

Why Do Some Inputs Break Low-Bit LLM Quantization?

Ting-Yun Chang

Muru Zhang

Jesse Thomason

Robin Jia

CoRR, June, 2025

Large-Scale Data Selection for Instruction Tuning.

CoRR, March, 2025

Ladder-Residual: Parallelism-Aware Architecture for Accelerating Large Model Inference with Communication Overlapping.

Jonathan Ragan-Kelley

Shuaiwen Leon Song

Ben Athiwaratkun

Tri Dao

CoRR, January, 2025

2024

Toward a More Complete OMR Solution.

Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024

How Language Model Hallucinations Can Snowball.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Learning to Build by Building Your Own Instructions.

Proceedings of Computer Vision - ECCV 2024, 2024

2023

Impossibly Good Experts and How to Follow Them.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Measuring and Narrowing the Compositionality Gap in Language Models.

Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

STOW: Discrete-Frame Segmentation and Tracking of Unseen Objects for Warehouse Picking Robots.

Proceedings of the Conference on Robot Learning, 2023

2022

Break and Make: Interactive Structural Understanding Using LEGO Bricks.

Proceedings of Computer Vision - ECCV 2022, 2022