Lintang Sutawika

According to our database, Lintang Sutawika authored at least 17 papers between 2021 and 2024.

Bibliography

2024
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages.
CoRR, 2024

Lessons from the Trenches on Reproducible Evaluation of Language Models.
CoRR, 2024

Re-Evaluating Evaluation for Multilingual Summarization.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
Utilizing Weak Supervision To Generate Indonesian Conservation Dataset.
CoRR, 2023

Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages.
CoRR, 2023

Emergent and Predictable Memorization in Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling.
Proceedings of the International Conference on Machine Learning, 2023

BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Crosslingual Generalization through Multitask Finetuning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting.
CoRR, 2022

What Language Model to Train if You Have One Million GPU Hours?
CoRR, 2022

Samsung Research Philippines - Datasaur AI's Submission for the WMT22 Large Scale Multilingual Translation Task.
Proceedings of the Seventh Conference on Machine Translation, 2022


What Language Model to Train if You Have One Million GPU Hours?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Towards better structured and less noisy Web data: Oscar with Register annotations.
Proceedings of the Eighth Workshop on Noisy User-generated Text, 2022

2021
Multitask Prompted Training Enables Zero-Shot Task Generalization.
CoRR, 2021

Data Processing Matters: SRPH-Konvergen AI's Machine Translation System for WMT'21.
Proceedings of the Sixth Conference on Machine Translation, 2021

