Dani Yogatama

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022

Emergent Abilities of Large Language Models.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2022

Relational Memory-Augmented Language Models.

[BibT_eX]

[DOI]

Qi Liu

Phil Blunsom

Trans. Assoc. Comput. Linguistics, 2022

Language Models Can See: Plugging Visual Controls in Text Generation.

[BibT_eX]

[DOI]

CoRR, 2022

HighMMT: Towards Modality and Task Generalization for High-Modality Representation Learning.

[BibT_eX]

[DOI]

Louis-Philippe Morency

Ruslan Salakhutdinov

CoRR, 2022

A Contrastive Framework for Neural Text Generation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Scale Efficiently: Insights from Pretraining and Finetuning Transformers.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

ABC: Attention with Bounded-memory Control.

[BibT_eX]

[DOI]

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

Adaptive Semiparametric Language Models.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2021

Balancing Average and Worst-case Accuracy in Multitask Learning.

[BibT_eX]

[DOI]

Paul Michel

Sebastian Ruder

CoRR, 2021

Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers.

[BibT_eX]

[DOI]

CoRR, 2021

Pitfalls of Static Language Modelling.

[BibT_eX]

[DOI]

CoRR, 2021

End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering.

[BibT_eX]

[DOI]

Devendra Singh Sachan

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Mind the Gap: Assessing Temporal Generalization in Neural Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

LiRo: Benchmark and leaderboard for Romanian language tasks.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Random Feature Attention.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

Finetuning Pretrained Transformers into RNNs.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020

Syntactic Structure Distillation Pretraining for Bidirectional Encoders.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2020

Modelling Latent Skills for Multitask Language Generation.

[BibT_eX]

[DOI]

Kris Cao

CoRR, 2020

A Mutual Information Maximization Perspective of Language Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

Reducing Sentiment Bias in Language Models via Counterfactual Evaluation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

A Call for More Rigor in Unsupervised Cross-lingual Learning.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

On the Cross-lingual Transferability of Monolingual Representations.

[BibT_eX]

[DOI]

Mikel Artetxe

Sebastian Ruder

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Jointly learning sentence embeddings and syntax with unsupervised Tree-LSTMs.

[BibT_eX]

[DOI]

Jean Maillard

Stephen Clark

Nat. Lang. Eng., 2019

Grandmaster level in StarCraft II using multi-agent reinforcement learning.

[BibT_eX]

[DOI]

Nat., 2019

Learning and Evaluating General Linguistic Intelligence.

[BibT_eX]

[DOI]

CoRR, 2019

Episodic Memory in Lifelong Language Learning.

[BibT_eX]

[DOI]

Sebastian Ruder

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Variational Smoothing in Recurrent Neural Network Language Models.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

Achieving Verified Robustness to Symbol Substitutions via Interval Bound Propagation.

[BibT_eX]

[DOI]

Krishnamurthy Dvijotham

Pushmeet Kohli

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

2018

Memory Architectures in Recurrent Neural Network Language Models.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

LSTMs Can Learn Syntax-Sensitive Dependencies Well, But Modeling Structure Makes Them Better.

[BibT_eX]

[DOI]

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017

Generative and Discriminative Text Classification with Recurrent Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2017

Learning to Compose Words into Sentences with Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Learning Representations, 2017

Program Induction by Rationale Generation: Learning to Solve and Explain Algebraic Word Problems.

[BibT_eX]

[DOI]

Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016

Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin.

[BibT_eX]

[DOI]

Proceedings of the 33nd International Conference on Machine Learning, 2016

2015

Bayesian Optimization of Text Representations.

[BibT_eX]

[DOI]

CoRR, 2015

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin.

[BibT_eX]

[DOI]

CoRR, 2015

Learning Word Representations with Hierarchical Sparse Coding.

[BibT_eX]

[DOI]

Proceedings of the 32nd International Conference on Machine Learning, 2015

Bayesian Optimization of Text Representations.

[BibT_eX]

[DOI]

Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Extractive Summarization by Maximizing Semantic Volume.

[BibT_eX]

[DOI]

Fei Liu

Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Embedding Methods for Fine Grained Entity Type Classification.

[BibT_eX]

[DOI]

Daniel Gillick

Nevena Lazic

Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Sparse Overcomplete Word Vector Representations.

[BibT_eX]

[DOI]

2014

Dynamic Language Models for Streaming Text.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2014

Making the Most of Bag of Words: Sentence Regularization with Alternating Direction Method of Multipliers.

[BibT_eX]

[DOI]

Proceedings of the 31th International Conference on Machine Learning, 2014

Efficient Transfer Learning Method for Automatic Hyperparameter Tuning.

[BibT_eX]

[DOI]

Gideon Mann

Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics, 2014

Linguistic Structured Sparsity in Text Categorization.

[BibT_eX]

[DOI]

Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013

A Sparse and Adaptive Prior for Time-Dependent Model Parameters.

[BibT_eX]

[DOI]

Bryan R. Routledge

CoRR, 2013

A Penny for Your Tweets: Campaign Contributions and Capitol Hill Microblogs.

[BibT_eX]

[DOI]

Tae Yano

Proceedings of the Seventh International Conference on Weblogs and Social Media, 2013

2012

A Probabilistic Model for Canonicalizing Named Entity Mentions.

[BibT_eX]

[DOI]

Yanchuan Sim

Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011

Predicting a Scientific Community's Response to an Article.

[BibT_eX]

[DOI]

Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments.

[BibT_eX]

[DOI]

Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

2009

Multilingual Spectral Clustering Using Document Similarity Propagation.

[BibT_eX]

[DOI]