Samuel Cahyawijaya

Muhammad Dehan Al Kautsar

Chenxi Whitehouse

Ivan Halim Parmonangan

Sonny Lazuardi Hermawan

Dan John Velasco

Willy Fitra Hendria

Yasmin Moslem

Noah Flynn

Peerat Limkonchotiwat

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Re-Evaluating Evaluation for Multilingual Summarization.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

LinguAlchemy: Fusing Typological and Geographical Elements for Unseen Language Generalization.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Cendol: Open Instruction-tuned Generative Large Language Models for Indonesian Languages.

[BibT_eX]

[DOI]

Salsabil Maulana Akbar

Emmanuel Dave

Nuur Shadieq

Muhammad Ihza Mahendra

Dea Annisayanti Putri

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

IndoRobusta: Towards Robustness Against Diverse Code-Mixed Indonesian Local Languages.

[BibT_eX]

[DOI]

Muhammad Dehan Al Kautsar

CoRR, 2023

IndoToD: A Multi-Domain Indonesian Benchmark For End-to-End Task-Oriented Dialogue Systems.

[BibT_eX]

[DOI]

Rahmah Khoirussyifa' Nurdini

Genta Indra Winata

Ayu Purwarianti

CoRR, 2023

InstructTODS: Large Language Models for End-to-End Task-Oriented Dialogue Systems.

[BibT_eX]

[DOI]

CoRR, 2023

Negative Object Presence Evaluation (NOPE) to Measure Object Hallucination in Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Survey of Social Bias in Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Cross-Lingual Cross-Age Group Adaptation for Low-Resource Elderly Speech Emotion Recognition.

[BibT_eX]

[DOI]

CoRR, 2023

GlobalBench: A Benchmark for Global Progress in Natural Language Processing.

[BibT_eX]

[DOI]

Antonios Anastasopoulos

Graham Neubig

CoRR, 2023

Multilingual Large Language Models Are Not (Yet) Code-Switchers.

[BibT_eX]

[DOI]

Ruochen Zhang

Alham Fikri Aji

CoRR, 2023

Instruct-Align: Teaching Novel Languages with to LLMs through Alignment-based Cross-Lingual Instruction.

[BibT_eX]

[DOI]

CoRR, 2023

Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages.

[BibT_eX]

[DOI]

Long Phan

Yin Lin Tan

Alham Fikri Aji

CoRR, 2023

Biomedical Image Reconstruction: A Survey.

[BibT_eX]

[DOI]

CoRR, 2023

Cross-Lingual Cross-Age Adaptation for Low-Resource Elderly Speech Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

PICK: Polished & Informed Candidate Scoring for Knowledge-Grounded Dialogue Systems.

[BibT_eX]

[DOI]

Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages.

[BibT_eX]

[DOI]

Salsabil Maulana Akbar

Jhonson Lee

Nuur Shadieq

Tjeng Wawan Cenggoro

Hanung Wahyuning Linuwih

Bryan Wilie

Galih Pradipta Muridan

Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity.

[BibT_eX]

[DOI]

Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

The Obscure Limitation of Modular Multilingual Language Models.

[BibT_eX]

[DOI]

Ayu Purwarianti

Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

Multilingual Large Language Models Are Not (Yet) Code-Switchers.

[BibT_eX]

[DOI]

Ruochen Zhang

Genta Indra Winata

Alham Fikri Aji

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

GlobalBench: A Benchmark for Global Progress in Natural Language Processing.

[BibT_eX]

[DOI]

Antonios Anastasopoulos

Graham Neubig

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages.

[BibT_eX]

[DOI]

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Which One Are You Referring To? Multimodal Object Identification in Situated Dialogue.

[BibT_eX]

[DOI]

Holy Lovenia

Pawan Sasanka Ammanamanchi

Pascale Fung

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: EACL 2023, 2023

Multi-lingual and Multi-cultural Figurative Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

NusaCrowd: Open Source Initiative for Indonesian NLP Resources.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

NusaCrowd: Open Source Initiative for Indonesian NLP Resources.

[BibT_eX]

[DOI]

CoRR, 2022

Every picture tells a story: Image-grounded controllable stylistic story generation.

[BibT_eX]

[DOI]

CoRR, 2022

NusaCrowd: A Call for Open and Reproducible NLP Research in Indonesian Languages.

[BibT_eX]

[DOI]

CoRR, 2022

Kaggle Competition: Cantonese Audio-Visual Speech Recognition for In-car Commands.

[BibT_eX]

[DOI]

CoRR, 2022

BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing.

[BibT_eX]

[DOI]

CoRR, 2022

GEMv2: Multilingual NLG Benchmarking in a Single Line of Code.

[BibT_eX]

[DOI]

Alexandros Papangelis

Aman Madaan

Angelina McMillan-Major

Khyathi Raghavi Chandu

Laura Perez-Beltrachini

Leonardo F. R. Ribeiro

CoRR, 2022

NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages.

[BibT_eX]

[DOI]

CoRR, 2022

VScript: Controllable Script Generation with Audio-Visual Presentation.

[BibT_eX]

[DOI]

CoRR, 2022

CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition.

[BibT_eX]

[DOI]

Cheuk Tung Shadow Yiu

CoRR, 2022

Clozer": " Adaptable Data Augmentation for Cloze-style Reading Comprehension.

[BibT_eX]

[DOI]

Proceedings of the 7th Workshop on Representation Learning for NLP, 2022

BigBio: A Framework for Data-Centric Biomedical Natural Language Processing.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset.

[BibT_eX]

[DOI]

Cheuk Tung Shadow Yiu

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

CI-AVSR: A Cantonese Audio-Visual Speech Datasetfor In-car Command Recognition.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

VScript: Controllable Script Generation with Visual Presentation.

[BibT_eX]

[DOI]

Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

SNP2Vec: Scalable Self-Supervised Pre-Training for Genome-Wide Association Study.

[BibT_eX]

[DOI]

Proceedings of the 21st Workshop on Biomedical Language Processing, 2022

Integrating Question Rewrites in Conversational Question Answering: A Reinforcement Learning Approach.

[BibT_eX]

[DOI]

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2022

One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia.

[BibT_eX]

[DOI]

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

How Long Is Enough? Exploring the Optimal Intervals of Long-Range Clinical Note Language Modeling.

[BibT_eX]

[DOI]

Proceedings of the 13th International Workshop on Health Text Mining and Information Analysis, 2022

Can Question Rewriting Help Conversational Question Answering?

[BibT_eX]

[DOI]

Proceedings of the Third Workshop on Insights from Negative Results in NLP, 2022

Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters.

[BibT_eX]

[DOI]

Proceedings of the Second DialDoc Workshop on Document-grounded Dialogue and Conversational Question Answering, 2022

2021

ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation.

[BibT_eX]

[DOI]

CoRR, 2021

NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation.

[BibT_eX]

[DOI]

Jascha Sohl-Dickstein

Marco Antonio Sobrevilla Cabezudo

Paulo Henrique Santos Vasconcellos

K. V. Aditya Srivatsa

CoRR, 2021

Greenformer: Factorization Toolkit for Efficient Deep Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2021

Greenformers: Improving Computation and Memory Efficiency in Transformer Models via Low-Rank Approximation.

[BibT_eX]

[DOI]