Antonios Anastasopoulos

ORCID: 0000-0002-8544-246X

Affiliations:
  • George Mason University, USA


According to our database, Antonios Anastasopoulos authored at least 131 papers between 2014 and 2024.

Bibliography

2024
Clinical risk prediction using language models: benefits and considerations.
J. Am. Medical Informatics Assoc., 2024

Birdie: Advancing State Space Models with Reward-Driven Objectives and Curricula.
CoRR, 2024

mHumanEval - A Multilingual Benchmark to Evaluate Large Language Models for Code Generation.
CoRR, 2024

The LLM Effect: Are Humans Truly Using LLMs, or Are They Being Influenced By Them Instead?
CoRR, 2024

Crossroads of Continents: Automated Artifact Extraction for Cultural Adaptation with Large Multimodal Models.
CoRR, 2024

Breaking Bias, Building Bridges: Evaluation and Mitigation of Social Biases in LLMs via Contact Hypothesis.
CoRR, 2024

Script-Agnostic Language Identification.
CoRR, 2024

Unlearning Climate Misinformation in Large Language Models.
CoRR, 2024

EmoMix-3L: A Code-Mixed Dataset for Bangla-English-Hindi Emotion Detection.
CoRR, 2024

Data-Augmentation-Based Dialectal Adaptation for LLMs.
CoRR, 2024

CMULAB: An Open-Source Framework for Training and Deployment of Natural Language Processing Models.
CoRR, 2024

An Efficient Approach for Studying Cross-Lingual Transfer in Multilingual Language Models.
CoRR, 2024

DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages.
CoRR, 2024

A Case Study on Filtering for End-to-End Speech Translation.
CoRR, 2024

A Morphologically-Aware Dictionary-based Data Augmentation Technique for Machine Translation of Under-Represented Languages.
CoRR, 2024

Findings of the WMT 2024 Shared Task of the Open Language Data Initiative.
Proceedings of the Ninth Conference on Machine Translation, 2024

Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024

Global Gallery: The Fine Art of Painting Culture Portraits through Multilingual Instruction Tuning.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

A Study on Scaling Up Multilingual News Framing Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing.
Proceedings of the IEEE International Conference on Acoustics, 2024

Enhancing End-to-End Conversational Speech Translation Through Target Language Context Utilization.
Proceedings of the IEEE International Conference on Acoustics, 2024

Trajectory Anomaly Detection with Language Models.
Proceedings of the 32nd ACM International Conference on Advances in Geographic Information Systems, 2024

Urban Mobility Assessment Using LLMs.
Proceedings of the 32nd ACM International Conference on Advances in Geographic Information Systems, 2024

BiasDora: Exploring Hidden Biased Associations in Vision-Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Back to School: Translation Using Grammar Books.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Gloss2Text: Sign Language Gloss translation using LLMs and Semantically Aware Label Smoothing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

The LLM Effect: Are Humans Truly Using LLMs, or Are They Being Influenced By Them Instead?
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Birdie: Advancing State Space Language Modeling with Dynamic Mixtures of Training Objectives.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

SALSA: Salience-Based Switching Attack for Adversarial Perturbations in Fake News Detection Models.
Proceedings of the Advances in Information Retrieval, 2024

CODET: A Benchmark for Contrastive Dialectal Evaluation of Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

Language and Speech Technology for Central Kurdish Varieties.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

DIALECTBENCH: An NLP Benchmark for Dialects, Varieties, and Closely-Related Languages.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Dictionary-Aided Translation for Handling Multi-Word Expressions in Low-Resource Languages.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Offensive Language Identification in Transliterated and Code-Mixed Bangla.
CoRR, 2023

To token or not to token: A Comparative Study of Text Representations for Cross-Lingual Transfer.
CoRR, 2023

GlobalBench: A Benchmark for Global Progress in Natural Language Processing.
CoRR, 2023

Approaches to Corpus Creation for Low-Resource Language Technology: the Case of Southern Kurdish and Laki.
CoRR, 2023

User-Centric Evaluation of OCR Systems for Kwak'wala.
CoRR, 2023

PALI: A Language Identification Benchmark for Perso-Arabic Scripts.
Proceedings of the Tenth Workshop on NLP for Similar Languages, Varieties and Dialects, 2023

GMNLP at SemEval-2023 Task 12: Sentiment Analysis with Phylogeny-Based Adapters.
Proceedings of the 17th International Workshop on Semantic Evaluation, 2023

GMU Systems for the IWSLT 2023 Dialect and Low-resource Speech Translation Tasks.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023


Zambezi Voice: A Multilingual Speech Corpus for Zambian Languages.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Towards a Universal Python: Translating the Natural Modality of Python into Other Human Languages.
Proceedings of the IEEE International Conference on Software Maintenance and Evolution, 2023

Are Large Language Models Geospatially Knowledgeable?
Proceedings of the 31st ACM International Conference on Advances in Geographic Information Systems, 2023

GlobalBench: A Benchmark for Global Progress in Natural Language Processing.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Global Voices, Local Biases: Socio-Cultural Prejudices across Languages.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Teacher Perception of Automatically Extracted Grammar Concepts for L2 Language Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ Languages.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Noisy Parallel Data Alignment.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Language Generation Models Can Cause Harm: So What Can We Do About It? An Actionable Survey.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

BIG-C: a Multimodal Multi-Purpose Dataset for Bemba.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Script Normalization for Unconventional Writing of Under-Resourced Languages in Bilingual Communities.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Geographic and Geopolitical Biases of Language Models.
CoRR, 2022

Educational Tools for Mapuzugun.
CoRR, 2022

AUTOLEX: An Automatic Framework for Linguistic Exploration.
CoRR, 2022

Language Adapters for Large-Scale MT: The GMU System for the WMT 2022 Large-Scale Machine Translation Evaluation for African Languages Shared Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

Findings of the WMT'22 Shared Task on Large-Scale Machine Translation Evaluation for African Languages.
Proceedings of the Seventh Conference on Machine Translation, 2022

Quand être absent de mBERT n'est que le commencement : Gérer de nouvelles langues à l'aide de modèles de langues multilingues (When Being Unseen from mBERT is just the Beginning : Handling New Languages With Multilingual Language Models).
Proceedings of the Actes de la 29e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale, 2022

UniMorph 4.0: Universal Morphology.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022


Phylogeny-Inspired Adaptation of Multilingual Models to New Languages.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

PROBER: A System for Real-time Propaganda Behavior Analytics on Social Media and Web Data Streams.
Proceedings of the IEEE International Conference on Big Data, 2022

Cross-Lingual Text Classification of Transliterated Hindi and Malayalam.
Proceedings of the IEEE International Conference on Big Data, 2022

Revisiting the Effects of Leakage on Dependency Parsing.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Dataset Geography: Mapping Language Data to Language Users.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Systematic Inequalities in Language Technology Performance across the World's Languages.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Lexically Aware Semi-Supervised Learning for OCR Post-Correction.
Trans. Assoc. Comput. Linguistics, 2021

Reducing Confusion in Active Learning for Part-Of-Speech Tagging.
Trans. Assoc. Comput. Linguistics, 2021

Investigating Post-pretraining Representation Alignment for Cross-Lingual Question Answering.
CoRR, 2021

On the Evaluation of Machine Translation for Terminology Consistency.
CoRR, 2021

Code to Comment Translation: A Comparative Study on Model Effectiveness & Errors.
CoRR, 2021

Phoneme Recognition through Fine Tuning of Phonetic Representations: a Case Study on Luhya Language Varieties.
Proceedings of the 2nd AfricaNLP Workshop, AfricaNLP@EACL 2021, Virtual Event, 2021

Multilingual Code-Switching for Zero-Shot Cross-Lingual Intent Prediction and Slot Filling.
CoRR, 2021

BembaSpeech: A Speech Recognition Corpus for the Bemba Language.
Proceedings of the 2nd AfricaNLP Workshop, AfricaNLP@EACL 2021, Virtual Event, 2021

Findings of the WMT Shared Task on Machine Translation Using Terminologies.
Proceedings of the Sixth Conference on Machine Translation, 2021

When Being Unseen from mBERT is just the Beginning: Handling New Languages With Multilingual Language Models.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021


Evaluating the Morphosyntactic Well-formedness of Generated Texts.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

SD-QA: Spoken Dialectal Question Answering for the Real World.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

When is Wall a Pared and when a Muro?: Extracting Rules Governing Lexical Selection.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Machine Translation into Low-resource Language Varieties.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Towards more equitable question answering systems: How much more data do you need?
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Comparison of Interactive Knowledge Base Spelling Correction Models for Low-Resource Languages.
CoRR, 2020

Practical Comparable Data Collection for Low-Resource Languages via Images.
CoRR, 2020

Towards Minimal Supervision BERT-based Grammar Error Correction.
CoRR, 2020

A Summary of the First Workshop on Language Technology for Language Documentation and Revitalization.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020


Transliteration for Cross-Lingual Morphological Inflection.
Proceedings of the 17th SIGMORPHON Workshop on Computational Research in Phonetics, 2020

The CMU-LTI submission to the SIGMORPHON 2020 Shared Task 0: Language-Specific Cross-Lingual Transfer.
Proceedings of the 17th SIGMORPHON Workshop on Computational Research in Phonetics, 2020

AlloVera: A Multilingual Allophone Database.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

A Resource for Computational Experiments on Mapudungun.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

A Resource for Studying Chatino Verbal Morphology.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Optimizing Data Usage via Differentiable Rewards.
Proceedings of the 37th International Conference on Machine Learning, 2020

Universal Phone Recognition with a Multilingual Allophone System.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

OCR Post Correction for Endangered Language Texts.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

X-FACTR: Multilingual Factual Knowledge Retrieval from Pretrained Language Models.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

It's not a Non-Issue: Negation as a Source of Error in Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Dynamic Data Selection and Weighting for Iterative Back-Translation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Automatic Extraction of Rules Governing Morphological Agreement.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

TICO-19: the Translation Initiative for COvid-19.
Proceedings of the 1st Workshop on NLP for COVID-19@EMNLP 2020, Online, December 2020, 2020

Automatic Interlinear Glossing for Under-Resourced Languages Leveraging Translations.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Endangered Languages meet Modern NLP.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Fine-Tuning MT systems for Robustness to Second-Language Speaker Variations.
Proceedings of the Sixth Workshop on Noisy User-generated Text, 2020

Predicting Performance for Natural Language Processing Tasks.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

It's Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Should All Cross-Lingual Embeddings Speak English?
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Towards Minimal Supervision BERT-Based Grammar Error Correction (Student Abstract).
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Towards Robust Toxic Content Classification.
CoRR, 2019

Neural Language Modeling with Visual Features.
CoRR, 2019

Improving Robustness of Neural Machine Translation with Multi-task Learning.
Proceedings of the Fourth Conference on Machine Translation, 2019

Findings of the First Shared Task on Machine Translation Robustness.
Proceedings of the Fourth Conference on Machine Translation, 2019

Neural Machine Translation of Text from Non-Native Speakers.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Investigating Meta-Learning Algorithms for Low-Resource Natural Language Understanding Tasks.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Unsupervised Domain Adaptation for Neural Machine Translation with Domain-Aware Feature Embeddings.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Pushing the Limits of Low-Resource Morphological Inflection.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

An Analysis of Source-Side Grammatical Errors in NMT.
Proceedings of the 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, 2019

Generalized Data Augmentation for Low-Resource Translation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Choosing Transfer Languages for Cross-Lingual Learning.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Neural Machine Translation of Text from Non-Native Speakers.
CoRR, 2018

Freezing Subnetworks to Analyze Domain Adaptation in Neural Machine Translation.
Proceedings of the Third Conference on Machine Translation: Research Papers, 2018

A Small Griko-Italian Speech Translation Corpus.
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

Tied Multitask Learning for Neural Speech Translation.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Leveraging Translations for Speech Transcription in Low-resource Settings.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Part-of-Speech Tagging on an Endangered Language: a Parallel Griko-Italian Resource.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

2017
DyNet: The Dynamic Neural Network Toolkit.
CoRR, 2017

A case study on using speech-to-translation alignments for language documentation.
CoRR, 2017

Spoken Term Discovery for Language Documentation using Translations.
Proceedings of the Workshop on Speech-Centric Natural Language Processing, 2017

2016
An Attentional Model for Speech Translation Without Transcription.
Proceedings of the NAACL HLT 2016, 2016

An Unsupervised Probability Model for Speech-to-Translation Alignment of Low-Resource Languages.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

2014
Adaptive Quality Estimation for Machine Translation.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

