2024
Speculative Streaming: Fast LLM Inference without Auxiliary Models.
CoRR, 2024

Conformer-Based Speech Recognition On Extreme Edge-Computing Devices.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Track, 2024

2023
Conformer-Based Speech Recognition On Extreme Edge-Computing Devices.
CoRR, 2023

Variable Attention Masking for Configurable Transformer Transducer Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

2022
Optimizing Bilingual Neural Transducer with Synthetic Code-switching Text Generation.
CoRR, 2022

2021
Federated Evaluation and Tuning for On-Device Personalization: System Design & Applications.
CoRR, 2021

2020
SNDCNN: Self-Normalizing Deep CNNs with Scaled Exponential Linear Units for Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, 2020

2019
Voice Trigger Detection from LVCSR Hypothesis Lattices Using Bidirectional Lattice Recurrent Neural Networks.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2019