Ahmet Üstün

According to our database, Ahmet Üstün authored at least 35 papers between 2016 and 2024.

Bibliography

2024
MURI: High-Quality Instruction Tuning Datasets for Low-Resource Languages via Reverse Instructions.
CoRR, 2024

Nexus: Specialization meets Adaptability for Efficiently Training Mixture of Experts.
CoRR, 2024

To Code, or Not To Code? Exploring Impact of Code in Pre-training.
CoRR, 2024

BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts.
CoRR, 2024

Aya 23: Open Weight Releases to Further Multilingual Progress.
CoRR, 2024

Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning.
CoRR, 2024

Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

How Does Quantization Affect Multilingual LLMs?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024


Back to Basics: Revisiting REINFORCE-Style Optimization for Learning from Human Feedback in LLMs.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale.
CoRR, 2023

Intriguing Properties of Quantization at Scale.
CoRR, 2023

Intriguing Properties of Quantization at Scale.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
UDapter: Typology-based Language Adapters for Multilingual Dependency Parsing and Sequence Labeling.
Comput. Linguistics, 2022

When does Parameter-Efficient Transfer Learning Work for Machine Translation?
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Hyper-X: A Unified Hypernetwork for Multi-Task Multilingual Transfer.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
On the Difficulty of Translating Free-Order Case-Marking Languages.
Trans. Assoc. Comput. Linguistics, 2021

Incorporating word embeddings in unsupervised morphological segmentation.
Nat. Lang. Eng., 2021

On the Effectiveness of Dataset Embeddings in Mono-lingual, Multi-lingual and Zero-shot Conditions.
CoRR, 2021

Unsupervised Translation of German-Lower Sorbian: Exploring Training and Novel Transfer Methods on a Low-Resource Language.
Proceedings of the Sixth Conference on Machine Translation, 2021

From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Automatic Judgement Forecasting for Pending Applications of the European Court of Human Rights.
Proceedings of the Joint Proceedings of the Workshops on Automated Semantic Analysis of Information in Legal Text (ASAIL 2021) & AI and Intelligent Assistance for Legal Professionals in the Digital Workplace (LegalAIIA 2021) held online in conjunction with 18th International Conference on Artificial Intelligence and Law (ICAIL 2021), 2021

Multilingual Unsupervised Neural Machine Translation with Denoising Adapters.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Massive Choice, Ample Tasks (MaChAmp): A Toolkit for Multi-task Learning in NLP.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, 2021

2020
Massive Choice, Ample Tasks (MaChAmp): A Toolkit for Multi-task Learning in NLP.
CoRR, 2020

FiSSA at SemEval-2020 Task 9: Fine-tuned for Feelings.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

UDapter: Language Adaptation for Truly Universal Dependency Parsing.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019
Cross-Lingual Word Embeddings for Morphologically Rich Languages.
Proceedings of the International Conference on Recent Advances in Natural Language Processing, 2019

There and Back Again: Cross-Lingual Transfer Learning for Event Detection.
Proceedings of the Sixth Italian Conference on Computational Linguistics, 2019

2018
Characters or Morphemes: How to Represent Words?
Proceedings of The Third Workshop on Representation Learning for NLP, 2018

2017
A Trie-structured Bayesian Model for Unsupervised Morphological Segmentation.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2017

2016
Unsupervised Morphological Segmentation Using Neural Word Embeddings.
Proceedings of the Statistical Language and Speech Processing, 2016

Turkish PoS Tagging by Reducing Sparsity with Morpheme Tags in Small Datasets.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2016

