Tara N. Sainath
Orcid: 0000-0002-4126-6556Affiliations:
- Google Inc., New York, NY, USA
- IBM T. J. Watson Research Center, Yorktown Heights, NY, USA
According to our database1,
Tara N. Sainath
authored at least 210 papers
between 2006 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on linkedin.com
-
on orcid.org
-
on ai.google
On csauthors.net:
Bibliography
2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models.
CoRR, 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
A Comparison of Parameter-Efficient ASR Domain Adaptation Methods for Universal Speech and Language Models.
Proceedings of the IEEE International Conference on Acoustics, 2024
Augmenting Conformers With Structured State-Space Sequence Models For Online Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024
Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models.
Proceedings of the IEEE International Conference on Acoustics, 2024
Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
USM-Lite: Quantization and Sparsity Aware Fine-Tuning for Speech Recognition with Universal Speech Models.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Handling Ambiguity in Emotion: From Out-of-Domain Detection to Distribution Estimation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Augmenting conformers with structured state space models for online speech recognition.
CoRR, 2023
CoRR, 2023
Practical Conformer: Optimizing size, speed and flops of Conformer for on-Device and cloud ASR.
CoRR, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Re-investigating the Efficient Transfer Learning of Speech Foundation Model using Feature Fusion Methods.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023
From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023
Multi-Output RNN-T Joint Networks for Multi-Task Learning of ASR and Auxiliary Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Resource-Efficient Transfer Learning from Speech Foundation Model Using Hierarchical Feature Fusion.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Lego-Features: Exporting Modular Encoder Features for Streaming and Deliberation ASR.
Proceedings of the IEEE International Conference on Acoustics, 2023
Improving Multilingual and Code-Switching ASR Using Large Language Model Generated Text.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Improved Long-Form Speech Recognition By Jointly Modeling The Primary And Non-Primary Speakers.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition.
IEEE J. Sel. Top. Signal Process., 2022
IEEE J. Sel. Top. Signal Process., 2022
Editorial Editorial of Special Issue on Self-Supervised Learning for Speech and Audio Processing.
IEEE J. Sel. Top. Signal Process., 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
A Truly Multilingual First Pass and Monolingual Second Pass Streaming on-Device ASR System.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Unified End-to-End Speech Recognition and Endpointing for Fast and Efficient Speech Systems.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
An Efficient Streaming Non-Recurrent On-Device End-to-End Model with Improvements to Rare-Word Modeling.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
2020
CoRR, 2020
A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency.
CoRR, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
A Streaming On-Device End-To-End Model Surpassing Server-Side Conventional Model Quality and Latency.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Improving Proper Noun Recognition in End-To-End Asr by Customization of the Mwer Loss Criterion.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
CoRR, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Bytes Are All You Need: End-to-end Multilingual Speech Recognition and Synthesis with Bytes.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
2018
A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Contextual Speech Recognition in End-to-end Neural Network Systems Using Beam Search.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Domain Adaptation Using Factorized Hidden Layer for Robust Automatic Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica in End-to-End Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Spectral Distortion Model for Training Phase-Sensitive Deep-Neural Networks for Far-Field Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
An Analysis of Incorporating an External Language Model into a Sequence-to-Sequence Model.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
IEEE Trans. Parallel Distributed Syst., 2017
Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
CoRR, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Generation of Large-Scale Simulated Utterances in Virtual Rooms to Train Deep-Neural Networks for Far-Field Speech Recognition in Google Home.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Endpoint Detection Using Grid Long Short-Term Memory Networks for Streaming Speech Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Improving the efficiency of forward-backward algorithm using batched computation in TensorFlow.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017
2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Complex Linear Projection (CLP): A Discriminative Approach to Joint Feature Extraction and Acoustic Modeling.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Reducing the Computational Complexity of Multimicrophone Acoustic Models with Integrated Feature Extraction.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Modeling Time-Frequency Patterns with LSTM vs. Convolutional Architectures for LVCSR Tasks.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
Neural Networks, 2015
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Locally-connected and convolutional neural networks for small footprint speaker recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Automatic gain control and multi-style training for robust small-footprint keyword spotting with deep neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
Speaker location and microphone spacing invariant acoustic modeling from raw multichannel waveforms.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Improvements to filterbank and delta learning within a deep neural network framework.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
Optimization Techniques to Improve Training Speed of Deep Neural Networks for Large Speech Tasks.
IEEE Trans. Speech Audio Process., 2013
Improving training time of Hessian-free optimization for deep neural networks using preconditioning and sampling.
CoRR, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Low-rank matrix factorization for Deep Neural Network training with high-dimensional output targets.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Developing speech recognition systems for corpus indexing under the IARPA Babel program.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
Accelerating Hessian-free optimization for Deep Neural Networks by implicit preconditioning and sampling.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
2012
IEEE Signal Process. Mag., 2012
Proceedings of the Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Scalable Minimum Bayes Risk Training of Deep Neural Network Acoustic Models Using Distributed Hessian-free Optimization.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Improved pre-training of Deep Belief Networks using Sparse Encoding Symmetric Machines.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
2011
IEEE ACM Trans. Audio Speech Lang. Process., 2011
Reducing Computational Complexities of Exemplar-Based Sparse Representations with Applications to Large Vocabulary Speech Recognition.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
A-Functions: A generalization of Extended Baum-Welch transformations to convex optimization.
Proceedings of the IEEE International Conference on Acoustics, 2011
A convex hull approach to sparse representations for exemplar-based speech recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011
Making Deep Belief Networks effective for large vocabulary continuous speech recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011
2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
An analysis of sparseness and regularization in exemplar-based methods for speech classification.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Incorporating sparse representation phone identification features in automatic speech recognition using exponential families.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
A voice-commandable robotic forklift working alongside humans in minimally-prepared outdoor environments.
Proceedings of the IEEE International Conference on Robotics and Automation, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
The Use of isometric transformations and bayesian estimation in compressive sensing for fMRI classification.
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the 13th Conference on Information Fusion, 2010
2009
PhD thesis, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
2008
A comparison of broad phonetic and acoustic units for noise robust segment-based phonetic recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Generalization of extended baum-welch parameter estimation for discriminative training and decoding.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Gradient steepness metrics using extended Baum-Welch transformations for universal pattern recognition tasks.
Proceedings of the IEEE International Conference on Acoustics, 2008
2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Broad phonetic class recognition in a Hidden Markov model framework using extended Baum-Welch transformations.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
2006
A Sinusoidal Model Approach to Acoustic Landmark Detection and Segmentation for Robust Segment-Based Speech Recognition.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006