2025
Why Do Some Inputs Break Low-Bit LLM Quantization?
CoRR, June, 2025

Large-Scale Data Selection for Instruction Tuning.
CoRR, March, 2025

Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping.
CoRR, January, 2025

2024
Toward a More Complete OMR Solution.
Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024

How Language Model Hallucinations Can Snowball.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Learning to Build by Building Your Own Instructions.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Impossibly Good Experts and How to Follow Them.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Measuring and Narrowing the Compositionality Gap in Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

STOW: Discrete-Frame Segmentation and Tracking of Unseen Objects for Warehouse Picking Robots.
Proceedings of the Conference on Robot Learning, 2023

2022
Break and Make: Interactive Structural Understanding Using LEGO Bricks.
Proceedings of the Computer Vision - ECCV 2022, 2022