Alham Fikri Aji
According to our database, Alham Fikri Aji authored at least 100 papers between 2014 and 2025.
Bibliography
2025
Proceedings of the 31st International Conference on Computational Linguistics, 2025
Proceedings of the 31st International Conference on Computational Linguistics, 2025
2024
SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Black-Box Machine-Generated Text Detection.
Dataset, April, 2024
Data Laundering: Artificially Boosting Benchmark Results through Knowledge Distillation.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Thank You, Stingray: Multilingual Large Language Models Can Not (Yet) Disambiguate Cross-Lingual Word Sense.
CoRR, 2024
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines.
CoRR, 2024
CoRR, 2024
IndoToxic2024: A Demographically-Enriched Dataset of Hate Speech and Toxicity Types for Indonesian Language.
CoRR, 2024
The Privileged Students: On the Value of Initialization in Multilingual Knowledge Distillation.
CoRR, 2024
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages.
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection.
CoRR, 2024
CoRR, 2024
Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition.
CoRR, 2024
Beyond Probabilities: Unveiling the Misalignment in Evaluating Large Language Models.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Daisy at WASSA 2024 Empathy and Personality Shared Task: A Quick Exploration on Emotional Pattern of Empathy and Distress.
Proceedings of the 14th Workshop on Computational Approaches to Subjectivity, 2024
Proceedings of the 18th International Workshop on Semantic Evaluation, 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic Prompting.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
LinguAlchemy: Fusing Typological and Geographical Elements for Unseen Language Generalization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024
M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Cendol: Open Instruction-tuned Generative Large Language Models for Indonesian Languages.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models.
CoRR, 2023
Knowledge-Augmented Language Model Prompting for Zero-Shot Knowledge Graph Question Answering.
CoRR, 2023
Bactrian-X: A Multilingual Replicable Instruction-Following Model with Low-Rank Adaptation.
CoRR, 2023
M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection.
CoRR, 2023
CoRR, 2023
Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages.
CoRR, 2023
NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
The Decades Progress on Code-Switching Research in NLP: A Systematic Survey on Trends and Challenges.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
On "Scientific Debt" in NLP: A Case for More Rigour in Language Model Pre-Training Research.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
CoRR, 2022
CoRR, 2022
Nix-TTS: An Incredibly Lightweight End-to-End Text-to-Speech Model via Non End-to-End Distillation.
CoRR, 2022
Documenting Geographically and Contextually Diverse Data Sources: The BigScience Catalogue of Language Data and Resources.
CoRR, 2022
Which Student is Best? A Comprehensive Knowledge Distillation Exam for Task-Specific BERT Models.
CoRR, 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the 29th International Conference on Computational Linguistics, 2022
Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering.
Proceedings of the 29th International Conference on Computational Linguistics, 2022
Proceedings of the Eighth Workshop on Noisy User-generated Text, 2022
One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
2021
The University of Edinburgh's Bengali-Hindi Submissions to the WMT21 News Translation Task.
Proceedings of the Sixth Conference on Machine Translation, 2021
Proceedings of the Sixth Conference on Machine Translation, 2021
BERT Goes Brrr: A Venture Towards the Lesser Error in Classifying Medical Self-Reporters on Twitter.
Proceedings of the Sixth Social Media Mining for Health Workshop and Shared Task, 2021
ParaCotta: Synthetic Multilingual Paraphrase Corpora from the Most Diverse Translation Sample Pair.
Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation, 2021
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
IndoCollex: A Testbed for Morphological Transformation of Indonesian Word Colloquialism.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
2020
Exploring Monolingual Data for Neural Machine Translation with Knowledge Distillation.
CoRR, 2020
CoRR, 2020
No Budget? Don't Flex! Cost Consideration when Planning to Adopt NLP for Your Business.
CoRR, 2020
Semi-Supervised Low-Resource Style Transfer of Indonesian Informal to Formal Language with Iterative Forward-Translation.
CoRR, 2020
Semi-Supervised Low-Resource Style Transfer of Indonesian Informal to Formal Language with Iterative Forward-Translation.
Proceedings of the International Conference on Asian Language Processing, 2020
Proceedings of the Fourth Workshop on Neural Generation and Translation, 2020
Proceedings of the Fourth Workshop on Neural Generation and Translation, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the 13th Workshop on Building and Using Comparable Corpora, 2020
2019
Proceedings of the 3rd Workshop on Neural Generation and Translation@EMNLP-IJCNLP 2019, 2019
Combining Global Sparse Gradients with Local Gradients in Distributed Neural Network Training.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Proceedings of the 3rd Workshop on Neural Generation and Translation@EMNLP-IJCNLP 2019, 2019
2018
Proceedings of the 2018 International Conference on Asian Language Processing, 2018
Accelerating Asynchronous Stochastic Gradient Descent for Neural Machine Translation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
Proceedings of ACL 2018, Melbourne, Australia, July 15-20, 2018, System Demonstrations, 2018
2017
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017
2014
Can smartphones be used to detect an earthquake? Using a machine learning approach to identify an earthquake event.
Proceedings of the IEEE International Systems Conference, 2014