Why Do Some Inputs Break Low-Bit LLM Quantization?
CoRR, June, 2025
Large-Scale Data Selection for Instruction Tuning.
CoRR, March, 2025
Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping.
CoRR, January, 2025
Toward a More Complete OMR Solution.
Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024
How Language Model Hallucinations Can Snowball.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Learning to Build by Building Your Own Instructions.
Proceedings of the Computer Vision - ECCV 2024, 2024
Impossibly Good Experts and How to Follow Them.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Measuring and Narrowing the Compositionality Gap in Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
STOW: Discrete-Frame Segmentation and Tracking of Unseen Objects for Warehouse Picking Robots.
Proceedings of the Conference on Robot Learning, 2023
Break and Make: Interactive Structural Understanding Using LEGO Bricks.
Proceedings of the Computer Vision - ECCV 2022, 2022