Hamdy Mubarak

Orcid: 0000-0002-9051-6240

According to our database, Hamdy Mubarak authored at least 88 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Wikidata as a Source of Demographic Information.
Proceedings of The Second Arabic Natural Language Processing Conference, 2024

LLMeBench: A Flexible Framework for Accelerating LLMs Benchmarking.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

LAraBench: Benchmarking Arabic AI with Large Language Models.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

So Hateful! Building a Multi-Label Hate Speech Annotated Arabic Dataset.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Halwasa: Quantify and Analyze Hallucinations in Large Language Models: Arabic as a Case Study.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Beyond Orthography: Automatic Recovery of Short Vowels and Dialectal Sounds in Arabic.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Emojis as anchors to detect Arabic offensive language and hate speech.
Nat. Lang. Eng., November, 2023

Detecting and identifying the reasons for deleted tweets before they are posted.
Frontiers Artif. Intell., February, 2023

LLMeBench: A Flexible Framework for Accelerating LLMs Benchmarking.
CoRR, 2023

Benchmarking Arabic AI with Large Language Models.
CoRR, 2023

Detecting and Reasoning of Deleted Tweets before they are Posted.
CoRR, 2023

Unsupervised Data Selection for TTS: Using Arabic Broadcast News as a Case Study.
CoRR, 2023

ArAIEval Shared Task: Persuasion Techniques and Disinformation Detection in Arabic Text.
Proceedings of ArabicNLP 2023, Singapore (Hybrid), December 7, 2023

SpeechBlender: Speech Augmentation Framework for Mispronunciation Data Generation.
Proceedings of the 9th Workshop on Speech and Language Technology in Education, 2023

QVoice: Arabic Speech Pronunciation Learning Application.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Towards Generalization of Machine Learning Models: A Case Study of Arabic Sentiment Analysis.
Proceedings of the Seventeenth International AAAI Conference on Web and Social Media, 2023

Overview of the CLEF-2023 CheckThat! Lab Task 1 on Check-Worthiness in Multimodal and Multigenre Content.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023), 2023

2022
Overview of the WANLP 2022 Shared Task on Propaganda Detection in Arabic.
Proceedings of the Seventh Arabic Natural Language Processing Workshop, 2022

NatiQ: An End-to-end Text-to-Speech System for Arabic.
Proceedings of the Seventh Arabic Natural Language Processing Workshop, 2022

Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

ArCovidVac: Analyzing Arabic Tweets About COVID-19 Vaccination.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022


ArabGend: Gender Analysis and Inference on Arabic Twitter.
Proceedings of the Eighth Workshop on Noisy User-generated Text, 2022

Overview of the CLEF-2022 CheckThat! Lab Task 2 on Detecting Previously Fact-Checked Claims.
Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum, Bologna, Italy, September 5th - to, 2022

Overview of the CLEF-2022 CheckThat! Lab on Fighting the COVID-19 Infodemic and Fake News Detection.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2022

Overview of the CLEF-2022 CheckThat! Lab Task 1 on Identifying Relevant Claims in Tweets.
Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum, Bologna, Italy, September 5th - to, 2022

2021
Arabic Diacritic Recovery Using a Feature-rich biLSTM Model.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2021

Automatic Expansion and Retargeting of Arabic Offensive Language Training.
CoRR, 2021

Pre-Training BERT on Arabic Tweets: Practical Considerations.
CoRR, 2021

A panoramic survey of natural language processing in the Arab world.
Commun. ACM, 2021

Arabic Offensive Language on Twitter: Analysis and Experiments.
Proceedings of the Sixth Arabic Natural Language Processing Workshop, 2021

Adult Content Detection on Arabic Twitter: Analysis and Experiments.
Proceedings of the Sixth Arabic Natural Language Processing Workshop, 2021

UL2C: Mapping User Locations to Countries on Arabic Twitter.
Proceedings of the Sixth Arabic Natural Language Processing Workshop, 2021

QADI: Arabic Dialect Identification in the Wild.
Proceedings of the Sixth Arabic Natural Language Processing Workshop, 2021

Fighting the COVID-19 Infodemic in Social Media: A Holistic Perspective and a Call to Arms.
Proceedings of the Fifteenth International AAAI Conference on Web and Social Media, 2021

Fighting the COVID-19 Infodemic: Modeling the Perspective of Journalists, Fact-Checkers, Social Media Platforms, Policy Makers, and the Society.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

ASAD: Arabic Social media Analytics and unDerstanding.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, 2021

QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

ArCorona: Analyzing Arabic Tweets in the Early Days of Coronavirus (COVID-19) Pandemic.
Proceedings of the 12th International Workshop on Health Text Mining and Information Analysis, 2021

2020
SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020).
Dataset, July, 2020

Effective multi-dialectal Arabic POS tagging.
Nat. Lang. Eng., 2020

Arabic Dialect Identification in the Wild.
CoRR, 2020

Fighting the COVID-19 Infodemic: Modeling the Perspective of Journalists, Fact-Checkers, Social Media Platforms, Policy Makers, and the Society.
CoRR, 2020

Spam Detection on Arabic Twitter.
Proceedings of the Social Informatics - 12th International Conference, 2020

SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020).
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

ALT at SemEval-2020 Task 12: Arabic and English Offensive Language Identification in Social Media.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

A Multi-Platform Arabic News Comment Dataset for Offensive Language Detection.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

ADI17: A Fine-Grained Arabic Dialect Identification Dataset.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Arabic Curriculum Analysis.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Constructing a Bilingual Corpus of Parallel Tweets.
Proceedings of the 13th Workshop on Building and Using Comparable Corpora, 2020

2019
Arabic community question answering.
Nat. Lang. Eng., 2019

Language processing and learning models for community question answering in Arabic.
Inf. Process. Manag., 2019

QC-GO Submission for MADAR Shared Task: Arabic Fine-Grained Dialect Identification.
Proceedings of the Fourth Arabic Natural Language Processing Workshop, 2019

POS Tagging for Improving Code-Switching Identification in Arabic.
Proceedings of the Fourth Arabic Natural Language Processing Workshop, 2019

Arabic Offensive Language Classification on Twitter.
Proceedings of the Social Informatics - 11th International Conference, 2019

Highly Effective Arabic Diacritization using Sequence to Sequence Modeling.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

A System for Diacritizing Four Varieties of Arabic.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

The MGB-5 Challenge: Recognition and Dialect Identification of Dialectal Arabic Speech.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Diacritization of Maghrebi Arabic Sub-Dialects.
CoRR, 2018

Build Fast and Accurate Lemmatization for Arabic.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Multi-Dialect Arabic POS Tagging: A CRF Approach.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Part-of-Speech Tagging for Arabic Gulf Dialect Using Bi-LSTM.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

A single-model approach for Arabic segmentation, POS tagging, and named entity recognition.
Proceedings of the 2nd International Conference on Natural Language and Speech Processing, 2018

2017
Arabic Multi-Dialect Segmentation: bi-LSTM-CRF vs. SVM.
CoRR, 2017

A Neural Architecture for Dialectal Arabic Segmentation.
Proceedings of the Third Arabic Natural Language Processing Workshop, 2017

Arabic POS Tagging: Don't Abandon Feature Engineering Just Yet.
Proceedings of the Third Arabic Natural Language Processing Workshop, 2017

Arabic Diacritization: Stats, Rules, and Hacks.
Proceedings of the Third Arabic Natural Language Processing Workshop, 2017

SemEval-2017 Task 3: Community Question Answering.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

QCRI Live Speech Translation System.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Learning from Relatives: Unified Dialectal Arabic Segmentation.
Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017), 2017

Crowdsourcing Speech and Language Data for Resource-Poor Languages.
Proceedings of the International Conference on Advanced Intelligent Systems and Informatics 2017, 2017

Abusive Language Detection on Arabic Social Media.
Proceedings of the First Workshop on Abusive Language Online, 2017

2016
The MGB-2 challenge: Arabic multi-dialect broadcast media recognition.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

SemEval-2016 Task 3: Community Question Answering.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Farasa: A Fast and Furious Segmenter for Arabic.
Proceedings of the Demonstrations Session, 2016

Arabic to English Person Name Transliteration using Twitter.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Farasa: A New Fast and Accurate Arabic Word Segmenter.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2015
Best Practices for Crowdsourcing Dialectal Arabic Speech Transcription.
Proceedings of the Second Workshop on Arabic Natural Language Processing, 2015

QCRI@QALB-2015 Shared Task: Correction of Arabic Text for Native and Non-Native Speakers' Errors.
Proceedings of the Second Workshop on Arabic Natural Language Processing, 2015

Classifying Arab Names Geographically.
Proceedings of the Second Workshop on Arabic Natural Language Processing, 2015

Answer Selection in Arabic Community Question Answering: A Feature-Rich Approach.
Proceedings of the Second Workshop on Arabic Natural Language Processing, 2015

QCRI: Answer Selection for Community Question Answering - Experiments for Arabic and English.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

2014
Automatic Correction of Arabic Text: a Cascaded Approach.
Proceedings of the EMNLP 2014 Workshop on Arabic Natural Language Processing, 2014

Using Twitter to Collect a Multi-Dialectal Corpus of Arabic.
Proceedings of the EMNLP 2014 Workshop on Arabic Natural Language Processing, 2014

Using Stem-Templates to Improve Arabic POS and Gender/Number Tagging.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Advances in dialectal Arabic speech recognition: a study using Twitter to improve Egyptian ASR.
Proceedings of the 11th International Workshop on Spoken Language Translation: Papers, 2014

Verifiably Effective Arabic Dialect Identification.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

