Trevor Strohman

According to our database1, Trevor Strohman authored at least 60 papers between 2004 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Efficient Adapter Finetuning for Tail Languages in Streaming Multilingual ASR.
CoRR, 2024

2023
Controlled Decoding from Language Models.
CoRR, 2023

Practical Conformer: Optimizing size, speed and flops of Conformer for on-Device and cloud ASR.
CoRR, 2023

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages.
CoRR, 2023

UML: A Universal Monolingual Output Layer For Multilingual Asr.
Proceedings of the IEEE International Conference on Acoustics, 2023

From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Efficient Domain Adaptation for Speech Foundation Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

Comparison of Soft and Hard Target RNN-T Distillation for Large-Scale ASR.
Proceedings of the IEEE International Conference on Acoustics, 2023

Resource-Efficient Transfer Learning from Speech Foundation Model Using Hierarchical Feature Fusion.
Proceedings of the IEEE International Conference on Acoustics, 2023

E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model.
Proceedings of the IEEE International Conference on Acoustics, 2023

Massively Multilingual Shallow Fusion with Large Language Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

Context-Aware end-to-end ASR Using Self-Attentive Embedding and Tensor Fusion.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Internal Language Model Personalization of E2E Automatic Speech Recognition Using Random Encoder Features.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

JOIST: A Joint Speech and Text Streaming Model for ASR.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Modular Hybrid Autoregressive Transducer.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

A Truly Multilingual First Pass and Monolingual Second Pass Streaming on-Device ASR System.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Flickering Reduction with Partial Hypothesis Reranking for Streaming ASR.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification.
Proceedings of the Interspeech 2022, 2022

Improving Rare Word Recognition with LM-aware MWER Training.
Proceedings of the Interspeech 2022, 2022

A Language Agnostic Multilingual Streaming On-Device ASR System.
Proceedings of the Interspeech 2022, 2022

Pseudo Label Is Better Than Human Label.
Proceedings of the Interspeech 2022, 2022

Incremental Layer-Wise Self-Supervised Learning for Efficient Unsupervised Speech Domain Adaptation On Device.
Proceedings of the Interspeech 2022, 2022

Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition.
Proceedings of the Interspeech 2022, 2022

Improving Deliberation by Text-Only and Semi-Supervised Training.
Proceedings of the Interspeech 2022, 2022

A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes.
Proceedings of the Interspeech 2022, 2022

Streaming Intended Query Detection using E2E Modeling for Continued Conversation.
Proceedings of the Interspeech 2022, 2022

Turn-Taking Prediction for Natural Conversational Speech.
Proceedings of the Interspeech 2022, 2022


Fast Contextual Adaptation with Neural Associative Memory for On-Device Personalized Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Massively Multilingual ASR: A Lifelong Learning Solution.
Proceedings of the IEEE International Conference on Acoustics, 2022

Large-Scale ASR Domain Adaptation Using Self- and Semi-Supervised Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022

Transducer-Based Streaming Deliberation for Cascaded Encoders.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Input Length Matters: An Empirical Study Of RNN-T And MWER Training For Long-form Telephony Speech Recognition.
CoRR, 2021

Incremental Layer-wise Self-Supervised Learning for Efficient Speech Domain Adaptation On Device.
CoRR, 2021

Transformer Based Deliberation for Two-Pass Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Lookup-Table Recurrent Language Models for Long Tail Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Less is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging.
Proceedings of the IEEE International Conference on Acoustics, 2021

Cascaded Encoders for Unifying Streaming and Non-Streaming ASR.
Proceedings of the IEEE International Conference on Acoustics, 2021

Confidence Estimation for Attention-Based Sequence-to-Sequence Models for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

A Better and Faster end-to-end Model for Streaming ASR.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency.
CoRR, 2020

Emitting Word Timings with End-to-End Models.
Proceedings of the Interspeech 2020, 2020

Low Latency Speech Recognition Using End-to-End Prefetching.
Proceedings of the Interspeech 2020, 2020

An Attention-Based Joint Acoustic and Text on-Device End-To-End Model.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020


Towards Fast and Accurate Streaming End-To-End ASR.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Recognizing Long-Form Speech Using Streaming End-to-End Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Toward Domain-Invariant Speech Recognition via Large Scale Training.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

2017
Speech Research at Google to Enable Universal Speech Interfaces.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2015
Fix it where it fails: Pronunciation learning by mining error corrections from speech logs.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2009
Search Engines - Information Retrieval in Practice.
Pearson Education, ISBN: 978-0-13-136489-9, 2009

2008
A Statistical View of Binned Retrieval Models.
Proceedings of the Advances in Information Retrieval , 2008

2007
Recommending citations for academic papers.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

Efficient document retrieval in main memory.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

2006
Indri TREC Notebook 2006: Lessons Learned From Three Terabyte Tracks.
Proceedings of the Fifteenth Text REtrieval Conference, 2006

2005
Indri at TREC 2005: Terabyte Track.
Proceedings of the Fourteenth Text REtrieval Conference, 2005

UMass Robust 2005: Using Mixtures of Relevance Models for Query Expansion.
Proceedings of the Fourteenth Text REtrieval Conference, 2005

Optimization strategies for complex queries.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

2004
Indri at TREC 2004: Terabyte Track.
Proceedings of the Thirteenth Text REtrieval Conference, 2004


  Loading...