VOCABTRIM: Vocabulary Pruning for Efficient Speculative Decoding in LLMs.
,
,
,
,
,
,
,
,
,
,
,
CoRR, June, 2025
Distilling Optimal Neural Networks: Rapid Search in Diverse Spaces.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Automatic Grammar Augmentation for Robust Voice Command Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019