We stand with Ukraine

We stand with Ukraine

James Qin

According to our database¹, James Qin authored at least 21 papers between 2019 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

Massive End-to-end Speech Recognition Models with Time Reduction.

[BibT_eX]

[DOI]

,

Rohit Prabhavalkar

,

,

,

Dongseong Hwang

,

,

,

,

,

,

,

Chengjian Zheng

,

,

Tara N. Sainath

,

Pedro Moreno Mengibar

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Shuo-Yiin Chang

,

Tara N. Sainath

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Massive End-to-end Models for Short Search Queries.

[BibT_eX]

[DOI]

,

Rohit Prabhavalkar

,

Dongseong Hwang

,

,

,

,

,

,

,

,

,

,

Tara N. Sainath

,

Pedro Moreno Mengibar

CoRR, 2023

AudioPaLM: A Large Language Model That Can Speak and Listen.

[BibT_eX]

[DOI]

CoRR, 2023

Efficient Adapters for Giant Speech Models.

[BibT_eX]

[DOI]

,

,

,

Chung-Cheng Chiu

,

,

,

CoRR, 2023

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages.

[BibT_eX]

[DOI]

CoRR, 2023

2022

BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., 2022

LaMDA: Language Models for Dialog Applications.

[BibT_eX]

[DOI]

Romal Thoppilan

,

Daniel De Freitas

,

,

,

Apoorv Kulshreshtha

,

,

,

,

,

,

,

,

Huaixiu Steven Zheng

,

,

Marcelo Menegali

,

,

,

Dmitry Lepikhin

,

,

,

,

,

,

,

,

Chung-Ching Chang

,

,

,

,

Kathleen S. Meier-Hellstern

,

Meredith Ringel Morris

,

,

Renelito Delos Santos

,

,

,

Ben Zevenbergen

,

Vinodkumar Prabhakaran

,

,

,

,

Alejandra Molina

,

Erin Hoffman-John

,

,

,

,

,

,

Viktoriya Kuzmina

,

,

,

Rachel Bernstein

,

,

Blaise Agüera y Arcas

,

,

,

,

CoRR, 2022

Self-supervised learning with random-projection quantizer for speech recognition.

[BibT_eX]

[DOI]

Chung-Cheng Chiu

,

,

,

,

Proceedings of the International Conference on Machine Learning, 2022

Vector-quantized Image Modeling with Improved VQGAN.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Jason Baldridge

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

Improving The Latency And Quality Of Cascaded Encoders.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Scaling End-to-End Models for Large-Scale Multilingual ASR.

[BibT_eX]

[DOI]

,

,

Tara N. Sainath

,

,

,

,

,

,

CoRR, 2021

An Efficient Streaming Non-Recurrent On-Device End-to-End Model with Improvements to Rare-Word Modeling.

[BibT_eX]

[DOI]

Tara N. Sainath

,

,

,

,

,

,

,

,

,

Quoc-Nam Le-The

,

Shuo-Yiin Chang

,

,

,

,

Chung-Cheng Chiu

,

Diamantino Caseiro

,

,

,

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

A Better and Faster end-to-end Model for Streaming ASR.

[BibT_eX]

[DOI]

,

,

,

Tara N. Sainath

,

Chung-Cheng Chiu

,

,

Shuo-Yiin Chang

,

,

,

,

,

,

,

Trevor Strohman

,

Proceedings of the IEEE International Conference on Acoustics, 2021

Scaling End-to-End Models for Large-Scale Multilingual ASR.

[BibT_eX]

[DOI]

,

,

Tara N. Sainath

,

,

,

,

,

,

,

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

w2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training.

[BibT_eX]

[DOI]

,

,

,

Chung-Cheng Chiu

,

,

,

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020

Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition.

[BibT_eX]

[DOI]

,

,

,

,

Chung-Cheng Chiu

,

,

,

CoRR, 2020

Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition.

[BibT_eX]

[DOI]

,

,

Chung-Cheng Chiu

,

,

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context.

[BibT_eX]

[DOI]

,

Zhengdong Zhang

,

,

,

Chung-Cheng Chiu

,

,

,

,

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Conformer: Convolution-augmented Transformer for Speech Recognition.

[BibT_eX]

[DOI]

,

,

Chung-Cheng Chiu

,

,

,

,

,

,

Zhengdong Zhang

,

,

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Tara N. Sainath

,

,

Chung-Cheng Chiu

,

,

,

,

Stella Laurenzo

,

,

,

Wolfgang Macherey

,

,

,

,

,

,

Rohit Prabhavalkar

,

,

,

,

,

,

Sébastien Jean

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Kuan-Chieh Wang

,

Ekaterina Gonina

,

,

,

,

,

,

,

,

,

George F. Foster

,

John Richardson

,

,

Antoine Bruguier

,

,

,

,

,

,

,

Vijayaditya Peddinti

,

,

Michiel Bacchiani

,

Thomas B. Jablin

,

Robert Suderman

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Dmitry Lepikhin

,

,

,

,

Shubham Toshniwal

,

,

Michael Nirschl

,

CoRR, 2019

Loading...