Sebastian Ruder

According to our database1, Sebastian Ruder authored at least 110 papers between 2016 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
M-RewardBench: Evaluating Reward Models in Multilingual Settings.
CoRR, 2024

BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts.
CoRR, 2024

LLM See, LLM Do: Guiding Data Generation to Target Non-Differentiable Objectives.
CoRR, 2024

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages.
CoRR, 2024

Aya 23: Open Weight Releases to Further Multilingual Progress.
CoRR, 2024

Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning.
CoRR, 2024

BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

LLM See, LLM Do: Leveraging Active Inheritance to Target Non-Differentiable Objectives.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Understanding and Mitigating Language Confusion in LLMs.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

How Does Quantization Affect Multilingual LLMs?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024


Connecting Language Technologies with Rich, Diverse Data Sources Covering Thousands of Languages.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024


2023
Modular Deep Learning.
Trans. Mach. Learn. Res., 2023

QAmeleon: Multilingual QA with Only 5 Examples.
Trans. Assoc. Comput. Linguistics, 2023

Language and Task Arithmetic with Parameter-Efficient Layers for Zero-Shot Summarization.
CoRR, 2023

PaLM 2 Technical Report.
CoRR, 2023

AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages.
CoRR, 2023

AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages.
CoRR, 2023

SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval).
Proceedings of the The 17th International Workshop on Semantic Evaluation, 2023

Language models are multilingual chain-of-thought reasoners.
Proceedings of the Eleventh International Conference on Learning Representations, 2023


Romanization-based Large-scale Adaptation of Multilingual Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

mmT5: Modular Multilingual Pre-Training Solves Source Language Hallucinations.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023


Evaluating and Modeling Attribution for Cross-Lingual Question Answering.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023


TaTA: A Multilingual Table-to-Text Dataset for African Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Evaluating the Diversity, Equity, and Inclusion of NLP Technology: A Case Study for Indian Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Empowering Cross-lingual Behavioral Testing of NLP Models with Typological Features.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023


2022
NusaCrowd: Open Source Initiative for Indonesian NLP Resources.
CoRR, 2022

NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages.
CoRR, 2022

Evaluating Inclusivity, Equity, and Accessibility of NLP Technology: A Case Study for Indian Languages.
CoRR, 2022

NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis.
CoRR, 2022

Writing System and Speaker Metadata for 2, 800+ Language Varieties.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

XTREME-S: Evaluating Cross-lingual Speech Representations.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Charformer: Fast Character Transformers via Gradient-based Subword Tokenization.
Proceedings of the Tenth International Conference on Learning Representations, 2022

ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Hyper-X: A Unified Hypernetwork for Multi-Task Multilingual Transfer.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022


FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Memorisation versus Generalisation in Pre-trained Language Models.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the Research Manifold.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
MasakhaNER: Named Entity Recognition for African Languages.
Trans. Assoc. Comput. Linguistics, 2021

NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2021

Balancing Average and Worst-case Accuracy in Multitask Learning.
CoRR, 2021

FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding.
CoRR, 2021

BERT memorisation and pitfalls in low-resource scenarios.
CoRR, 2021

XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation.
CoRR, 2021

Pitfalls of Static Language Modelling.
CoRR, 2021

Compacter: Efficient Low-Rank Hypercomplex Adapter Layers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Mind the Gap: Assessing Temporal Generalization in Neural Language Models.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021


Multi-view Subword Regularization.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Long Range Arena : A Benchmark for Efficient Transformers.
Proceedings of the 9th International Conference on Learning Representations, 2021

Rethinking Embedding Coupling in Pre-trained Language Models.
Proceedings of the 9th International Conference on Learning Representations, 2021

Efficient Test Time Adapter Ensembling for Low-resource Language Varieties.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

UNKs Everywhere: Adapting Multilingual Language Models to New Scripts.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

IndoNLG: Benchmark and Resources for Evaluating Indonesian Natural Language Generation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

MAD-G: Multilingual Adapter Generation for Efficient Cross-Lingual Transfer.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Analogy Training Multilingual Encoders.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization.
CoRR, 2020

XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalisation.
Proceedings of the 37th International Conference on Machine Learning, 2020

Are All Good Word Vector Spaces Isomorphic?
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

MAD-X: An Adapter-Based Framework for Multi-Task Cross-Lingual Transfer.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

AdapterHub: A Framework for Adapting Transformers.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2020

AxCell: Automatic Extraction of Results from Machine Learning Papers.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Morphologically Aware Word-Level Translation.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

A Call for More Rigor in Unsupervised Cross-lingual Learning.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

On the Cross-lingual Transferability of Monolingual Representations.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Cross-Lingual Word Embeddings
Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers, ISBN: 978-3-031-02171-8, 2019

A Survey of Cross-lingual Word Embedding Models.
J. Artif. Intell. Res., 2019

What do Deep Networks Like to Read?
CoRR, 2019

To Tune or Not to Tune? Adapting Pretrained Representations to Diverse Tasks.
Proceedings of the 4th Workshop on Representation Learning for NLP, 2019

Episodic Memory in Lifelong Language Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Transfer Learning in Natural Language Processing.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

MultiFiT: Efficient Multi-lingual Language Model Fine-tuning.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Don't Forget the Long Tail! A Comprehensive Analysis of Morphological Generalization in Bilingual Lexicon Induction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Unsupervised Cross-Lingual Representation Learning.
Proceedings of the 57th Conference of the Association for Computational Linguistics: Tutorial Abstracts, 2019

How to (Properly) Evaluate Cross-Lingual Word Embeddings: On Strong Baselines, Comparative Analyses, and Some Misconceptions.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

A Hierarchical Multi-Task Approach for Learning Embeddings from Semantic Tasks.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Latent Multi-Task Architecture Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Off-the-Shelf Unsupervised NMT.
CoRR, 2018

Fine-tuned Language Models for Text Classification.
CoRR, 2018

360° Stance Detection.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics, 2018

Multi-Task Learning of Pairwise Sequence Classification Tasks over Disparate Label Spaces.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

A Discriminative Latent-Variable Model for Bilingual Lexicon Induction.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Generalizing Procrustes Analysis for Better Bilingual Dictionary Induction.
Proceedings of the 22nd Conference on Computational Natural Language Learning, 2018

On the Limitations of Unsupervised Bilingual Dictionary Induction.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Universal Language Model Fine-tuning for Text Classification.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Strong Baselines for Neural Semi-Supervised Learning under Domain Shift.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Data Selection Strategies for Multi-Domain Sentiment Analysis.
CoRR, 2017

Knowledge Adaptation: Teaching to Adapt.
CoRR, 2017

Sluice networks: Learning what to share between loosely related tasks.
CoRR, 2017

An Overview of Multi-Task Learning in Deep Neural Networks.
CoRR, 2017

A survey of cross-lingual embedding models.
CoRR, 2017

Learning to select data for transfer learning with Bayesian Optimization.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

2016
Towards a continuous modeling of natural language domains.
CoRR, 2016

Character-level and Multi-channel Convolutional Neural Networks for Large-scale Authorship Attribution.
CoRR, 2016

An overview of gradient descent optimization algorithms.
CoRR, 2016

INSIGHT-1 at SemEval-2016 Task 5: Deep Learning for Multilingual Aspect-based Sentiment Analysis.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

INSIGHT-1 at SemEval-2016 Task 4: Convolutional Neural Networks for Sentiment Classification and Quantification.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

A Hierarchical Model of Reviews for Aspect-based Sentiment Analysis.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016


  Loading...