2024
Scaling Speech Technology to 1,000+ Languages.
J. Mach. Learn. Res., 2024
Improving Multilingual ASR in the Wild Using Simple N-best Re-ranking.
CoRR, 2024
Scaling A Simple Approach to Zero-Shot Speech Recognition.
CoRR, 2024
2023
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language.
Proceedings of the International Conference on Machine Learning, 2023
Measuring the Impact of Domain Factors in Self-Supervised Pre-Training.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023
Toward Joint Language Modeling for Speech Units and Text.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Av-Data2Vec: Self-Supervised Learning of Audio-Visual Speech Representations with Contextualized Target Representations.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Simple and Effective Unsupervised Speech Translation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
Measuring the Impact of Individual Domain Factors in Self-Supervised Pre-Training.
CoRR, 2022
Towards End-to-End Unsupervised Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Masked Autoencoders that Listen.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Simple and Effective Zero-shot Cross-lingual Phoneme Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
On-demand compute reduction with stochastic wav2vec 2.0.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Wav2Vec-Aug: Improved self-supervised training with limited data.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Simple and Effective Unsupervised Speech Synthesis.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
XTREME-S: Evaluating Cross-lingual Speech Representations.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language.
Proceedings of the International Conference on Machine Learning, 2022
Improved Language Identification Through Cross-Lingual Self-Supervised Learning.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022
Unified Speech-Text Pre-training for Speech Translation and Recognition.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
2021
Beyond English-Centric Multilingual Machine Translation.
J. Mach. Learn. Res., 2021
Improved Language Identification Through Cross-Lingual Self-Supervised Learning.
CoRR, 2021
A Comparison of Approaches to Document-level Machine Translation.
CoRR, 2021
Unsupervised Speech Recognition.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Self-training Improves Pre-training for Natural Language Understanding.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
Large-Scale Self- and Semi-Supervised Learning for Speech Translation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Unsupervised Cross-Lingual Representation Learning for Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
A Comparison of Discrete Latent Variable Models for Speech Representation Learning.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021
Self-Training and Pre-Training are Complementary for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021
The Source-Target Domain Mismatch Problem in Machine Translation.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
Multilingual Speech Translation from Efficient Finetuning of Pretrained Models.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
Discriminative Reranking for Neural Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
2020
Modeling Human Motion with Quaternion-Based Neural Networks.
Int. J. Comput. Vis., 2020
Beyond English-Centric Multilingual Machine Translation.
CoRR, 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations.
CoRR, 2020
Robust and On-the-fly Dataset Denoising for Image Classification.
CoRR, 2020
Language Models not just for Pre-training: Fast Online Neural Noisy Channel Modeling.
Proceedings of the Fifth Conference on Machine Translation, 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Depth-Adaptive Transformer.
Proceedings of the 8th International Conference on Learning Representations, 2020
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations.
Proceedings of the 8th International Conference on Learning Representations, 2020
Robust and On-the-Fly Dataset Denoising for Image Classification.
Proceedings of the Computer Vision - ECCV 2020, 2020
On The Evaluation of Machine Translation Systems Trained With Back-Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
2019
Improving Conditioning in Context-Aware Sequence to Sequence Models.
CoRR, 2019
Effectiveness of self-supervised pre-training for speech recognition.
CoRR, 2019
Simple and Effective Noisy Channel Modeling for Neural Machine Translation.
CoRR, 2019
On The Evaluation of Machine Translation Systems Trained With Back-Translation.
CoRR, 2019
GLOSS: Generative Latent Optimization of Sentence Representations.
CoRR, 2019
Facebook FAIR's WMT19 News Translation Task Submission.
Proceedings of the Fourth Conference on Machine Translation, 2019
fairseq: A Fast, Extensible Toolkit for Sequence Modeling.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Pre-trained language model representations for language generation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
wav2vec: Unsupervised Pre-Training for Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Mixture Models for Diverse Machine Translation: Tricks of the Trade.
Proceedings of the 36th International Conference on Machine Learning, 2019
Pay Less Attention with Lightweight and Dynamic Convolutions.
Proceedings of the 7th International Conference on Learning Representations, 2019
Wizard of Wikipedia: Knowledge-Powered Conversational Agents.
Proceedings of the 7th International Conference on Learning Representations, 2019
Adaptive Input Representations for Neural Language Modeling.
Proceedings of the 7th International Conference on Learning Representations, 2019
Simple and Effective Noisy Channel Modeling for Neural Machine Translation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Cloze-driven Pretraining of Self-attention Networks.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
3D Human Pose Estimation in Video With Temporal Convolutions and Semi-Supervised Training.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
ELI5: Long Form Question Answering.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
2018
Scaling Neural Machine Translation.
Proceedings of the Third Conference on Machine Translation: Research Papers, 2018
QuickEdit: Editing Text & Translations by Crossing Words Out.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
Classical Structured Prediction Losses for Sequence to Sequence Learning.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
Analyzing Uncertainty in Neural Machine Translation.
Proceedings of the 35th International Conference on Machine Learning, 2018
Understanding Back-Translation at Scale.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
QuaterNet: A Quaternion-based Recurrent Model for Human Motion.
Proceedings of the British Machine Vision Conference 2018, 2018
Controllable Abstractive Summarization.
Proceedings of the 2nd Workshop on Neural Machine Translation and Generation, 2018
2017
QuickEdit: Editing Text & Translations via Simple Delete Actions.
CoRR, 2017
Convolutional Sequence to Sequence Learning.
Proceedings of the 34th International Conference on Machine Learning, 2017
Language Modeling with Gated Convolutional Networks.
Proceedings of the 34th International Conference on Machine Learning, 2017
A Convolutional Encoder Model for Neural Machine Translation.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017
2016
Sequence Level Training with Recurrent Neural Networks.
Proceedings of the 4th International Conference on Learning Representations, 2016
Iterative Refinement for Machine Translation.
CoRR, 2016
Generating Text from Structured Data with Application to the Biography Domain.
CoRR, 2016
Vocabulary Selection Strategies for Neural Machine Translation.
CoRR, 2016
Neural Network-based Word Alignment through Score Aggregation.
Proceedings of the First Conference on Machine Translation, 2016
Expected F-Measure Training for Shift-Reduce Parsing with Recurrent Neural Networks.
Proceedings of the NAACL HLT 2016, 2016
Abstractive Sentence Summarization with Attentive Recurrent Neural Networks.
Proceedings of the NAACL HLT 2016, 2016
Neural Text Generation from Structured Data with Application to the Biography Domain.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016
Strategies for Training Large Vocabulary Neural Language Models.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016
2015
Learning Translation Models from Monolingual Continuous Representations.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015
A Neural Network Approach to Context-Sensitive Generation of Conversational Responses.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015
CCG Supertagging with a Recurrent Neural Network.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015
deltaBLEU: A Discriminative Metric for Generation Tasks with Intrinsically Diverse Targets.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015
2014
Large-scale Expected BLEU Training of Phrase-based Reordering Models.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014
Minimum Translation Modeling with Recurrent Neural Networks.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014
Decoder Integration and Expected BLEU Training for Recurrent Neural Network Language Models.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014
2013
Joint Language and Translation Modeling with Recurrent Neural Networks.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013
2011
Training a Log-Linear Parser with Loss Functions via Softmax-Margin.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011
Efficient CCG Parsing: A* versus Adaptive Supertagging.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011
A Comparison of Loopy Belief Propagation and Dual Decomposition for Integrated CCG Supertagging and Parsing.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011
2009
A Systematic Analysis of Translation Model Search Spaces.
Proceedings of the Fourth Workshop on Statistical Machine Translation, 2009