Genta Indra Winata

According to our database1, Genta Indra Winata authored at least 98 papers between 2017 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization.
CoRR, 2024

MetaMetrics: Calibrating Metrics For Generation Tasks Using Human Preferences.
CoRR, 2024

Towards Efficient and Robust VQA-NLE Data Generation with Large Vision-Language Models.
CoRR, 2024

Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey.
CoRR, 2024

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages.
CoRR, 2024

ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy Models.
CoRR, 2024

Lessons from the Trenches on Reproducible Evaluation of Language Models.
CoRR, 2024

SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 14 Languages.
CoRR, 2024

MINERS: Multilingual Language Models as Semantic Retrievers.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024


Re-Evaluating Evaluation for Multilingual Summarization.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Academics Can Contribute to Domain-Specialized Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

LinguAlchemy: Fusing Typological and Geographical Elements for Unseen Language Generalization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024


Cendol: Open Instruction-tuned Generative Large Language Models for Indonesian Languages.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Transfer learning application of self-supervised learning in ARPES.
Mach. Learn. Sci. Technol., September, 2023

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Trans. Mach. Learn. Res., 2023

IndoRobusta: Towards Robustness Against Diverse Code-Mixed Indonesian Local Languages.
CoRR, 2023

IndoToD: A Multi-Domain Indonesian Benchmark For End-to-End Task-Oriented Dialogue Systems.
CoRR, 2023

Multilingual Few-Shot Learning via Language Model Retrieval.
CoRR, 2023

GlobalBench: A Benchmark for Global Progress in Natural Language Processing.
CoRR, 2023

Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages.
CoRR, 2023

Efficient Zero-Shot Cross-lingual Inference via Retrieval.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

Multilingual Large Language Models Are Not (Yet) Code-Switchers.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

GlobalBench: A Benchmark for Global Progress in Natural Language Processing.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Towards a Unified Multi-Domain Multilingual Named Entity Recognition Model.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Overcoming Catastrophic Forgetting in Massively Multilingual Continual Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

The Decades Progress on Code-Switching Research in NLP: A Systematic Survey on Trends and Challenges.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

On "Scientific Debt" in NLP: A Case for More Rigour in Language Model Pre-Training Research.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Multi-lingual and Multi-cultural Figurative Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023


2022
NusaCrowd: Open Source Initiative for Indonesian NLP Resources.
CoRR, 2022

BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting.
CoRR, 2022

Transfer Learning Application of Self-supervised Learning in ARPES.
CoRR, 2022

NusaCrowd: A Call for Open and Reproducible NLP Research in Indonesian Languages.
CoRR, 2022

GEMv2: Multilingual NLG Benchmarking in a Single Line of Code.
CoRR, 2022

NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages.
CoRR, 2022

A Comparative Study on Language Models for Task-Oriented Dialogue Systems.
CoRR, 2022

CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition.
CoRR, 2022

ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

CI-AVSR: A Cantonese Audio-Visual Speech Datasetfor In-car Command Recognition.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Cross-lingual Few-Shot Learning on Unseen Languages.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters.
Proceedings of the Second DialDoc Workshop on Document-grounded Dialogue and Conversational Question Answering, 2022

2021
ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation.
CoRR, 2021

NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2021

Few-Shot Bot: Prompt-Based Learning for Dialogue Systems.
CoRR, 2021

Language Models are Few-shot Multilingual Learners.
CoRR, 2021

Greenformer: Factorization Toolkit for Efficient Deep Neural Networks.
CoRR, 2021

Nora: The Well-Being Coach.
CoRR, 2021

Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters.
CoRR, 2021

Multilingual Transfer Learning for Code-Switched Language and Speech Neural Modeling.
CoRR, 2021

ERICA: An Empathetic Android Companion for Covid-19 Quarantine.
Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2021

X2Parser: Cross-Lingual and Cross-Domain Framework for Task-Oriented Compositional Semantic Parsing.
Proceedings of the 6th Workshop on Representation Learning for NLP, 2021

Preserving Cross-Linguality of Pre-trained Models via Continual Learning.
Proceedings of the 6th Workshop on Representation Learning for NLP, 2021

BiToD: A Bilingual Multi-Domain Dataset For Task-Oriented Dialogue Modeling.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Adapt-and-Adjust: Overcoming the Long-Tail Problem of Multilingual Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

IndoNLG: Benchmark and Resources for Evaluating Indonesian Natural Language Generation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Continual Mixed-Language Pre-Training for Extremely Low-Resource Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

CAiRE in DialDoc21: Data Augmentation for Information Seeking Dialogue System.
Proceedings of the 1st Workshop on Document-grounded Dialogue and Conversational Question Answering, 2021

Are Multilingual Models Effective in Code-Switching?
Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching, 2021

On the Importance of Word Order Information in Cross-lingual Sequence Labeling.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
EmoGraph: Capturing Emotion Correlations using Graph Networks.
CoRR, 2020

Exploring Fine-tuning Techniques for Pre-trained Cross-lingual Models via Continual Learning.
CoRR, 2020

Variational Transformers for Diverse Response Generation.
CoRR, 2020

XPersona: Evaluating Multilingual Personalized Chatbot.
CoRR, 2020

Zero-Resource Cross-Domain Named Entity Recognition.
Proceedings of the 5th Workshop on Representation Learning for NLP, 2020

Learning Fast Adaptation on Cross-Accented Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Lightweight and Efficient End-To-End Speech Recognition Using Low-Rank Transformer.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Learning Knowledge Bases with Parameters for Task-Oriented Dialogue Systems.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Cross-lingual Spoken Language Understanding with Regularized Representation Alignment.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Meta-Transfer Learning for Code-Switched Speech Recognition.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Coach: A Coarse-to-Fine Approach for Cross-domain Slot Filling.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Attention-Informed Mixed-Language Training for Zero-Shot Cross-Lingual Task-Oriented Dialogue Systems.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

CAiRE: An End-to-End Empathetic Chatbot.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
On the Effectiveness of Low-Rank Matrix Factorization for LSTM Model Compression.
CoRR, 2019

CAiRE: An End-to-End Empathetic Chatbot.
CoRR, 2019

Incorporating Word and Subword Units in Unsupervised Machine Translation Using Language Model Rescoring.
Proceedings of the Fourth Conference on Machine Translation, 2019

CAiRE_HKUST at SemEval-2019 Task 3: Hierarchical Attention for Dialogue Emotion Classification.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

Learning Multilingual Meta-Embeddings for Code-Switching Named Entity Recognition.
Proceedings of the 4th Workshop on Representation Learning for NLP, 2019

Learning Comment Generation by Leveraging User-generated Data.
Proceedings of the IEEE International Conference on Acoustics, 2019

Hierarchical Meta-Embeddings for Code-Switching Named Entity Recognition.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Zero-shot Cross-lingual Dialogue Systems with Transferable Latent Variables.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Code-Switched Language Models Using Neural Based Synthetic Data from Parallel Sentences.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

Generalizing Question Answering System with Pre-trained Language Model Fine-tuning.
Proceedings of the 2nd Workshop on Machine Reading for Question Answering, 2019

2018
Towards End-to-end Automatic Code-Switching Speech Recognition.
CoRR, 2018

Handling Imbalanced Dataset in Multi-label Text Categorization using Bagging and Adaptive Boosting.
CoRR, 2018

Learn to Code-Switch: Data Augmentation using Copy Mechanism on Language Modeling.
CoRR, 2018

End-to-End Dynamic Query Memory Network for Entity-Value Independent Task-Oriented Dialog.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Attention-Based LSTM for Psychological Stress Detection from Spoken Language Using Distant Supervision.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Bilingual Character Representation for Efficiently Addressing Out-of-Vocabulary Words in Code-Switching Named Entity Recognition.
Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching@ACL 2018, 2018

Code-Switching Language Modeling using Syntax-Aware Multi-Task Learning.
Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching@ACL 2018, 2018

2017
Nora the Empathetic Psychologist.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017


  Loading...