Michael Auli

Orcid: 0000-0001-5974-4459

According to our database, Michael Auli authored at least 95 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Scaling Speech Technology to 1,000+ Languages.
J. Mach. Learn. Res., 2024

Improving Multilingual ASR in the Wild Using Simple N-best Re-ranking.
CoRR, 2024

Scaling A Simple Approach to Zero-Shot Speech Recognition.
CoRR, 2024

2023
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language.
Proceedings of the International Conference on Machine Learning, 2023

Measuring the Impact of Domain Factors in Self-Supervised Pre-Training.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Toward Joint Language Modeling for Speech Units and Text.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

AV-data2vec: Self-Supervised Learning of Audio-Visual Speech Representations with Contextualized Target Representations.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Simple and Effective Unsupervised Speech Translation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Measuring the Impact of Individual Domain Factors in Self-Supervised Pre-Training.
CoRR, 2022

Towards End-to-End Unsupervised Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Masked Autoencoders that Listen.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Simple and Effective Zero-shot Cross-lingual Phoneme Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

On-demand compute reduction with stochastic wav2vec 2.0.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Wav2Vec-Aug: Improved self-supervised training with limited data.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Simple and Effective Unsupervised Speech Synthesis.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

XTREME-S: Evaluating Cross-lingual Speech Representations.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language.
Proceedings of the International Conference on Machine Learning, 2022

Improved Language Identification Through Cross-Lingual Self-Supervised Learning.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

Unified Speech-Text Pre-training for Speech Translation and Recognition.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Beyond English-Centric Multilingual Machine Translation.
J. Mach. Learn. Res., 2021

Improved Language Identification Through Cross-Lingual Self-Supervised Learning.
CoRR, 2021

A Comparison of Approaches to Document-level Machine Translation.
CoRR, 2021

Unsupervised Speech Recognition.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Self-training Improves Pre-training for Natural Language Understanding.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Large-Scale Self- and Semi-Supervised Learning for Speech Translation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Unsupervised Cross-Lingual Representation Learning for Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

A Comparison of Discrete Latent Variable Models for Speech Representation Learning.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

Self-Training and Pre-Training are Complementary for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

The Source-Target Domain Mismatch Problem in Machine Translation.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Reservoir Transformers.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Multilingual Speech Translation from Efficient Finetuning of Pretrained Models.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Discriminative Reranking for Neural Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Modeling Human Motion with Quaternion-Based Neural Networks.
Int. J. Comput. Vis., 2020

Reservoir Transformer.
CoRR, 2020

Beyond English-Centric Multilingual Machine Translation.
CoRR, 2020

wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations.
CoRR, 2020

Robust and On-the-fly Dataset Denoising for Image Classification.
CoRR, 2020

Language Models not just for Pre-training: Fast Online Neural Noisy Channel Modeling.
Proceedings of the Fifth Conference on Machine Translation, 2020

wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Depth-Adaptive Transformer.
Proceedings of the 8th International Conference on Learning Representations, 2020

vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations.
Proceedings of the 8th International Conference on Learning Representations, 2020

Robust and On-the-Fly Dataset Denoising for Image Classification.
Proceedings of the Computer Vision - ECCV 2020, 2020

On The Evaluation of Machine Translation Systems Trained With Back-Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Improving Conditioning in Context-Aware Sequence to Sequence Models.
CoRR, 2019

Effectiveness of self-supervised pre-training for speech recognition.
CoRR, 2019

Simple and Effective Noisy Channel Modeling for Neural Machine Translation.
CoRR, 2019

On The Evaluation of Machine Translation Systems Trained With Back-Translation.
CoRR, 2019

GLOSS: Generative Latent Optimization of Sentence Representations.
CoRR, 2019

Facebook FAIR's WMT19 News Translation Task Submission.
Proceedings of the Fourth Conference on Machine Translation, 2019

fairseq: A Fast, Extensible Toolkit for Sequence Modeling.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Pre-trained language model representations for language generation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

wav2vec: Unsupervised Pre-Training for Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Mixture Models for Diverse Machine Translation: Tricks of the Trade.
Proceedings of the 36th International Conference on Machine Learning, 2019

Pay Less Attention with Lightweight and Dynamic Convolutions.
Proceedings of the 7th International Conference on Learning Representations, 2019

Wizard of Wikipedia: Knowledge-Powered Conversational Agents.
Proceedings of the 7th International Conference on Learning Representations, 2019

Adaptive Input Representations for Neural Language Modeling.
Proceedings of the 7th International Conference on Learning Representations, 2019

Simple and Effective Noisy Channel Modeling for Neural Machine Translation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Cloze-driven Pretraining of Self-attention Networks.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

3D Human Pose Estimation in Video With Temporal Convolutions and Semi-Supervised Training.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

ELI5: Long Form Question Answering.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Scaling Neural Machine Translation.
Proceedings of the Third Conference on Machine Translation: Research Papers, 2018

QuickEdit: Editing Text & Translations by Crossing Words Out.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Classical Structured Prediction Losses for Sequence to Sequence Learning.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Analyzing Uncertainty in Neural Machine Translation.
Proceedings of the 35th International Conference on Machine Learning, 2018

Understanding Back-Translation at Scale.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

QuaterNet: A Quaternion-based Recurrent Model for Human Motion.
Proceedings of the British Machine Vision Conference 2018, 2018

Controllable Abstractive Summarization.
Proceedings of the 2nd Workshop on Neural Machine Translation and Generation, 2018

2017
QuickEdit: Editing Text & Translations via Simple Delete Actions.
CoRR, 2017

Convolutional Sequence to Sequence Learning.
Proceedings of the 34th International Conference on Machine Learning, 2017

Language Modeling with Gated Convolutional Networks.
Proceedings of the 34th International Conference on Machine Learning, 2017

A Convolutional Encoder Model for Neural Machine Translation.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Sequence Level Training with Recurrent Neural Networks.
Proceedings of the 4th International Conference on Learning Representations, 2016

Iterative Refinement for Machine Translation.
CoRR, 2016

Generating Text from Structured Data with Application to the Biography Domain.
CoRR, 2016

Vocabulary Selection Strategies for Neural Machine Translation.
CoRR, 2016

Neural Network-based Word Alignment through Score Aggregation.
Proceedings of the First Conference on Machine Translation, 2016

Expected F-Measure Training for Shift-Reduce Parsing with Recurrent Neural Networks.
Proceedings of the NAACL HLT 2016, 2016

Abstractive Sentence Summarization with Attentive Recurrent Neural Networks.
Proceedings of the NAACL HLT 2016, 2016

Neural Text Generation from Structured Data with Application to the Biography Domain.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Strategies for Training Large Vocabulary Neural Language Models.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Learning Translation Models from Monolingual Continuous Representations.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

A Neural Network Approach to Context-Sensitive Generation of Conversational Responses.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

CCG Supertagging with a Recurrent Neural Network.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

deltaBLEU: A Discriminative Metric for Generation Tasks with Intrinsically Diverse Targets.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Large-scale Expected BLEU Training of Phrase-based Reordering Models.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Minimum Translation Modeling with Recurrent Neural Networks.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

Decoder Integration and Expected BLEU Training for Recurrent Neural Network Language Models.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
Joint Language and Translation Modeling with Recurrent Neural Networks.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

2011
Training a Log-Linear Parser with Loss Functions via Softmax-Margin.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Efficient CCG Parsing: A* versus Adaptive Supertagging.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

A Comparison of Loopy Belief Propagation and Dual Decomposition for Integrated CCG Supertagging and Parsing.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

2009
A Systematic Analysis of Translation Model Search Spaces.
Proceedings of the Fourth Workshop on Statistical Machine Translation, 2009
