We stand with Ukraine

We stand with Ukraine

Ankur Bapna

According to our database¹, Ankur Bapna authored at least 59 papers between 2017 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

2017

2018

2019

2020

2021

2022

2023

2024

0

5

10

15

1

4

4

2

2

7

2

1

1

9

11

2

3

5

3

2

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

STAB: Speech Tokenizer Assessment Benchmark.

[BibT_eX]

[DOI]

Shikhar Vashishth

,

,

Shikhar Bharadwaj

,

Sriram Ganapathy

,

Chulayuth Asawaroengchai

,

Kartik Audhkhasi

,

Andrew Rosenberg

,

,

Bhuvana Ramabhadran

CoRR, 2024

Multimodal Modeling for Spoken Language Identification.

[BibT_eX]

[DOI]

Shikhar Bharadwaj

,

,

Shikhar Vashishth

,

,

Sriram Ganapathy

,

,

Siddharth Dalmia

,

,

,

,

,

Partha Talukdar

,

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Multimodal Modeling For Spoken Language Identification.

[BibT_eX]

[DOI]

Shikhar Bharadwaj

,

,

Shikhar Vashishth

,

,

Sriram Ganapathy

,

,

Siddharth Dalmia

,

,

,

,

,

Partha Talukdar

,

CoRR, 2023

MADLAD-400: A Multilingual And Document-Level Large Audited Dataset.

[BibT_eX]

[DOI]

Sneha Kudugunta

,

,

,

,

Christopher A. Choquette-Choo

,

,

,

Aditya Kusupati

,

,

,

CoRR, 2023

AudioPaLM: A Large Language Model That Can Speak and Listen.

[BibT_eX]

[DOI]

CoRR, 2023

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages.

[BibT_eX]

[DOI]

CoRR, 2023

Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations.

[BibT_eX]

[DOI]

,

,

,

,

,

Nobuyuki Morioka

,

,

,

,

Michiel Bacchiani

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

MADLAD-400: A Multilingual And Document-Level Large Audited Dataset.

[BibT_eX]

[DOI]

Sneha Kudugunta

,

,

,

,

,

Aditya Kusupati

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PronScribe: Highly Accurate Multimodal Phonemic Transcription From Speech and Text.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Label Aware Speech Representation Learning For Language Identification.

[BibT_eX]

[DOI]

Shikhar Vashishth

,

Shikhar Bharadwaj

,

Sriram Ganapathy

,

,

,

,

,

Partha Talukdar

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus.

[BibT_eX]

[DOI]

,

,

,

,

,

Nobuyuki Morioka

,

Michiel Bacchiani

,

,

,

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Mu<sup>2</sup>SLAM: Multitask, Multilingual Speech and Language Models.

[BibT_eX]

[DOI]

,

,

,

Wolfgang Macherey

,

Proceedings of the International Conference on Machine Learning, 2023

Understanding Shared Speech-Text Representations.

[BibT_eX]

[DOI]

,

,

,

,

Andrew Rosenberg

,

Bhuvana Ramabhadran

,

Proceedings of the IEEE International Conference on Acoustics, 2023

SQuId: Measuring Speech Naturalness in Many Languages.

[BibT_eX]

[DOI]

Thibault Sellam

,

,

,

Diana Mackinnon

,

Ankur P. Parikh

,

Proceedings of the IEEE International Conference on Acoustics, 2023

Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-to-Speech.

[BibT_eX]

[DOI]

,

,

,

Nobuyuki Morioka

,

,

,

,

Andrew Rosenberg

,

Bhuvana Ramabhadran

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets.

[BibT_eX]

[DOI]

,

,

,

,

,

Nasanbayar Ulzii-Orshikh

,

,

Nishant Subramani

,

,

Claytone Sikasote

,

Monang Setyawan

,

Supheakmungkol Sarin

,

,

,

,

,

Isabel Papadimitriou

,

,

Pedro Javier Ortiz Suárez

,

,

,

Andre Niyongabo Rubungo

,

,

Mathias Müller

,

,

Shamsuddeen Hassan Muhammad

,

,

Ayanda Mnyakeni

,

Jamshidbek Mirzakhalov

,

Tapiwanashe Matangira

,

,

,

Sneha Kudugunta

,

,

,

,

Bonaventure F. P. Dossou

,

Sakhile Dlamini

,

Nisansa de Silva

,

Sakine Çabuk Balli

,

Stella Biderman

,

Alessia Battisti

,

,

,

Pallavi Baljekar

,

Israel Abebe Azime

,

Ayodele Awokoya

,

,

Orevaoghene Ahia

,

Oghenefego Ahia

,

,

Mofetoluwa Adeyemi

Trans. Assoc. Comput. Linguistics, 2022

Building Machine Translation Systems for the Next Thousand Languages.

[BibT_eX]

[DOI]

CoRR, 2022

mSLAM: Massively multilingual joint pre-training for speech and text.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2022

Towards the Next 1000 Languages in Multilingual Machine Translation: Exploring the Synergy Between Supervised and Self-Supervised Learning.

[BibT_eX]

[DOI]

Aditya Siddhant

,

,

,

,

,

,

CoRR, 2022

JOIST: A Joint Speech and Text Streaming Model for ASR.

[BibT_eX]

[DOI]

Tara N. Sainath

,

Rohit Prabhavalkar

,

,

,

,

,

,

,

Trevor Strohman

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

FLEURS: FEW-Shot Learning Evaluation of Universal Representations of Speech.

[BibT_eX]

[DOI]

,

,

,

,

,

Siddharth Dalmia

,

,

,

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Maestro-U: Leveraging Joint Speech-Text Representation Learning for Zero Supervised Speech ASR.

[BibT_eX]

[DOI]

,

,

Andrew Rosenberg

,

,

Bhuvana Ramabhadran

,

Pedro J. Moreno

,

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

XTREME-S: Evaluating Cross-lingual Speech Representations.

[BibT_eX]

[DOI]

,

,

,

,

Patrick von Platen

,

,

,

,

,

,

,

,

,

Jonathan H. Clark

,

,

,

Sebastian Ruder

,

,

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

MAESTRO: Matched Speech Text Representations through Modality Matching.

[BibT_eX]

[DOI]

,

,

Andrew Rosenberg

,

Bhuvana Ramabhadran

,

Pedro J. Moreno

,

,

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Examining Scaling and Transfer of Language Model Architectures for Machine Translation.

[BibT_eX]

[DOI]

,

Behrooz Ghorbani

,

,

,

,

,

Proceedings of the International Conference on Machine Learning, 2022

Scaling Laws for Neural Machine Translation.

[BibT_eX]

[DOI]

Behrooz Ghorbani

,

,

,

,

,

,

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

Joint Unsupervised and Supervised Training for Multilingual ASR.

[BibT_eX]

[DOI]

,

,

,

,

Nikhil Siddhartha

,

,

Tara N. Sainath

Proceedings of the IEEE International Conference on Acoustics, 2022

Multilingual Document-Level Translation Enables Zero-Shot Transfer From Sentences to Documents.

[BibT_eX]

[DOI]

,

,

,

Ali Dabirmoghaddam

,

Naveen Arivazhagan

,

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Multilingual Mix: Example Interpolation Improves Multilingual Neural Machine Translation.

[BibT_eX]

[DOI]

,

,

,

,

,

Wolfgang Macherey

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training.

[BibT_eX]

[DOI]

,

,

,

,

,

Jonathan H. Clark

,

,

,

,

CoRR, 2021

Gradient-guided Loss Masking for Neural Machine Translation.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2021

Share or Not? Learning to Schedule Language-Specific Capacity for Multilingual Translation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 9th International Conference on Learning Representations, 2021

Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference.

[BibT_eX]

[DOI]

Sneha Kudugunta

,

,

,

,

Dmitry Lepikhin

,

Minh-Thang Luong

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

2020

Controlling Computation versus Quality for Neural Sequence Models.

[BibT_eX]

[DOI]

,

Naveen Arivazhagan

,

CoRR, 2020

Faster Transformer Decoding: N-gram Masked Self-Attention.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2020

Language ID in the Wild: Unexpected Challenges on the Path to a Thousand-Language Web Text Corpus.

[BibT_eX]

[DOI]

,

Theresa Breiner

,

,

Proceedings of the 28th International Conference on Computational Linguistics, 2020

Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation.

[BibT_eX]

[DOI]

Aditya Siddhant

,

,

,

,

,

Sneha Reddy Kudugunta

,

Naveen Arivazhagan

,

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Evaluating the Cross-Lingual Effectiveness of Massively Multilingual Neural Machine Translation.

[BibT_eX]

[DOI]

Aditya Siddhant

,

,

,

,

,

,

,

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Fill in the Blanks: Imputing Missing Sentences for Larger-Context Neural Machine Translation.

[BibT_eX]

[DOI]

Sébastien Jean

,

,

CoRR, 2019

Simple, Scalable Adaptation for Neural Machine Translation.

[BibT_eX]

[DOI]

,

Naveen Arivazhagan

,

CoRR, 2019

Investigating Multilingual NMT Representations at Scale.

[BibT_eX]

[DOI]

Sneha Reddy Kudugunta

,

,

,

Naveen Arivazhagan

,

CoRR, 2019

Evaluating the Cross-Lingual Effectiveness of Massively Multilingual Neural Machine Translation.

[BibT_eX]

[DOI]

Aditya Siddhant

,

,

,

Naveen Arivazhagan

,

,

,

,

CoRR, 2019

Massively Multilingual Neural Machine Translation in the Wild: Findings and Challenges.

[BibT_eX]

[DOI]

Naveen Arivazhagan

,

,

,

Dmitry Lepikhin

,

,

,

,

,

George F. Foster

,

,

Wolfgang Macherey

,

,

CoRR, 2019

The Missing Ingredient in Zero-Shot Neural Machine Translation.

[BibT_eX]

[DOI]

Naveen Arivazhagan

,

,

,

,

,

Wolfgang Macherey

CoRR, 2019

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Tara N. Sainath

,

,

Chung-Cheng Chiu

,

,

,

,

Stella Laurenzo

,

,

,

Wolfgang Macherey

,

,

,

,

,

,

Rohit Prabhavalkar

,

,

,

,

,

,

Sébastien Jean

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Kuan-Chieh Wang

,

Ekaterina Gonina

,

,

,

,

,

,

,

,

,

George F. Foster

,

John Richardson

,

,

Antoine Bruguier

,

,

,

,

,

,

,

Vijayaditya Peddinti

,

,

Michiel Bacchiani

,

Thomas B. Jablin

,

Robert Suderman

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Dmitry Lepikhin

,

,

,

,

Shubham Toshniwal

,

,

Michael Nirschl

,

CoRR, 2019

GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Non-Parametric Adaptation for Neural Machine Translation.

[BibT_eX]

[DOI]

,

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Large-Scale Multilingual Speech Recognition with a Streaming End-to-End Model.

[BibT_eX]

[DOI]

,

Arindrima Datta

,

Tara N. Sainath

,

Eugene Weinstein

,

Bhuvana Ramabhadran

,

,

,

,

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Investigating Multilingual NMT Representations at Scale.

[BibT_eX]

[DOI]

Sneha Reddy Kudugunta

,

,

,

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Simple, Scalable Adaptation for Neural Machine Translation.

[BibT_eX]

[DOI]

,

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

2018

The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation.

[BibT_eX]

[DOI]

,

,

,

,

Wolfgang Macherey

,

George F. Foster

,

,

,

,

,

,

CoRR, 2018

Building a Conversational Agent Overnight with Dialogue Self-Play.

[BibT_eX]

[DOI]

,

Dilek Hakkani-Tür

,

,

Abhinav Rastogi

,

,

,

CoRR, 2018

Revisiting Character-Based Neural Machine Translation with Capacity and Compression.

[BibT_eX]

[DOI]

,

George F. Foster

,

,

,

Wolfgang Macherey

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Training Deeper Neural Machine Translation Models with Transparent Attention.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation.

[BibT_eX]

[DOI]

,

,

,

,

Wolfgang Macherey

,

George F. Foster

,

,

,

,

,

,

Jakob Uszkoreit

,

,

,

,

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017

Improving Frame Semantic Parsing with Hierarchical Dialogue Encoders.

[BibT_eX]

[DOI]

,

,

Dilek Hakkani-Tür

,

CoRR, 2017

Sequential Dialogue Context Modeling for Spoken Language Understanding.

[BibT_eX]

[DOI]

,

,

Dilek Hakkani-Tür

,

Proceedings of the 18th Annual SIGdial Meeting on Discourse and Dialogue, 2017

Towards Zero-Shot Frame Semantic Parsing for Domain Scaling.

[BibT_eX]

[DOI]

,

,

Dilek Hakkani-Tür

,

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Loading...