Kareem Darwish

According to our database, Kareem Darwish authored at least 127 papers between 2001 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.





Fanar: An Arabic-Centric Multimodal Generative AI Platform.
CoRR, January, 2025

Creating Arabic LLM Prompts at Scale.
CoRR, 2024

An Automated End-to-End Open-Source Software for High-Quality Text-to-Speech Dataset Generation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Arabic Diacritization Using Morphologically Informed Character-Level Model.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Evaluating Multilingual Speech Translation under Realistic Conditions with Resegmentation and Terminology.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

News Consumption in Time of Conflict: 2021 Palestinian-Israel War as an Example.
Proceedings of the WebSci '22: 14th ACM Web Science Conference 2022, Barcelona, Spain, June 26, 2022

NatiQ: An End-to-end Text-to-Speech System for Arabic.
Proceedings of the Seventh Arabic Natural Language Processing Workshop, 2022

MTLens: Machine Translation Output Debugging.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Cross-lingual Emotion Detection.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Arabic Diacritic Recovery Using a Feature-rich biLSTM Model.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2021

Automatic Expansion and Retargeting of Arabic Offensive Language Training.
CoRR, 2021

Pre-Training BERT on Arabic Tweets: Practical Considerations.
CoRR, 2021

BERT Transformer model for Detecting Arabic GPT2 Auto-Generated Tweets.
CoRR, 2021

A panoramic survey of natural language processing in the Arab world.
Commun. ACM, 2021

Arabic Offensive Language on Twitter: Analysis and Experiments.
Proceedings of the Sixth Arabic Natural Language Processing Workshop, 2021

QADI: Arabic Dialect Identification in the Wild.
Proceedings of the Sixth Arabic Natural Language Processing Workshop, 2021

Embeddings-Based Clustering for Target Specific Stances: The Case of a Polarized Turkey.
Proceedings of the Fifteenth International AAAI Conference on Web and Social Media, 2021

Fighting the COVID-19 Infodemic in Social Media: A Holistic Perspective and a Call to Arms.
Proceedings of the Fifteenth International AAAI Conference on Web and Social Media, 2021

Fighting the COVID-19 Infodemic: Modeling the Perspective of Journalists, Fact-Checkers, Social Media Platforms, Policy Makers, and the Society.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

A Few Topical Tweets are Enough for Effective User Stance Detection.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

ASAD: Arabic Social media Analytics and unDerstanding.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, 2021

Effective multi-dialectal Arabic POS tagging.
Nat. Lang. Eng., 2020

Arabic Dialect Identification in the Wild.
CoRR, 2020

Fighting the COVID-19 Infodemic: Modeling the Perspective of Journalists, Fact-Checkers, Social Media Platforms, Policy Makers, and the Society.
CoRR, 2020

A Few Topical Tweets are Enough for Effective User-Level Stance Detection.
CoRR, 2020

BERT Transformer model for Detecting Arabic GPT2 Auto-Generated Tweets.
Proceedings of the Fifth Arabic Natural Language Processing Workshop, 2020

Improving Arabic Text Categorization Using Transformer Training Diversification.
Proceedings of the Fifth Arabic Natural Language Processing Workshop, 2020

Political Framing: US COVID19 Blame Game.
Proceedings of the Social Informatics - 12th International Conference, 2020

Spam Detection on Arabic Twitter.
Proceedings of the Social Informatics - 12th International Conference, 2020

Unsupervised User Stance Detection on Twitter.
Proceedings of the Fourteenth International AAAI Conference on Web and Social Media, 2020

Arabic Curriculum Analysis.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Predicting the Topical Stance and Political Leaning of Media using Tweets.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Language processing and learning models for community question answering in Arabic.
Inf. Process. Manag., 2019

A set of parameters for automatically annotating a Sentiment Arabic Corpus.
Int. J. Web Inf. Syst., 2019

Embedding-based Qualitative Analysis of Polarization in Turkey.
CoRR, 2019

Predicting the Topical Stance of Media and Popular Twitter Users.
CoRR, 2019

QC-GO Submission for MADAR Shared Task: Arabic Fine-Grained Dialect Identification.
Proceedings of the Fourth Arabic Natural Language Processing Workshop, 2019

POS Tagging for Improving Code-Switching Identification in Arabic.
Proceedings of the Fourth Arabic Natural Language Processing Workshop, 2019

Arabic Offensive Language Classification on Twitter.
Proceedings of the Social Informatics - 11th International Conference, 2019

Quantifying Polarization on Twitter: The Kavanaugh Nomination.
Proceedings of the Social Informatics - 11th International Conference, 2019

Highly Effective Arabic Diacritization using Sequence to Sequence Modeling.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

FarSpeech: Arabic Natural Language Processing for Live Arabic Speech.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Tanbih: Get To Know What You Are Reading.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

A System for Diacritizing Four Varieties of Arabic.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Predicting Online Islamophobic Behavior after #ParisAttacks.
J. Web Sci., 2018

To Kavanaugh or Not to Kavanaugh: That is the Polarizing Question.
CoRR, 2018

Diacritization of Maghrebi Arabic Sub-Dialects.
CoRR, 2018

Devam vs. Tamam: 2018 Turkish Elections.
CoRR, 2018

Multi-Dialect Arabic POS Tagging: A CRF Approach.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Part-of-Speech Tagging for Arabic Gulf Dialect Using Bi-LSTM.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Arabic Multi-Dialect Segmentation: bi-LSTM-CRF vs. SVM.
CoRR, 2017

A Neural Architecture for Dialectal Arabic Segmentation.
Proceedings of the Third Arabic Natural Language Processing Workshop, 2017

Arabic POS Tagging: Don't Abandon Feature Engineering Just Yet.
Proceedings of the Third Arabic Natural Language Processing Workshop, 2017

Arabic Diacritization: Stats, Rules, and Hacks.
Proceedings of the Third Arabic Natural Language Processing Workshop, 2017

Trump vs. Hillary: What Went Viral During the 2016 US Presidential Election.
Proceedings of the Social Informatics, 2017

Seminar Users in the Arabic Twitter Sphere.
Proceedings of the Social Informatics, 2017

Learning from Relatives: Unified Dialectal Arabic Segmentation.
Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017), 2017

Improved Stance Prediction in a User Similarity Feature Space.
Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017, Sydney, Australia, July 31, 2017

Abusive Language Detection on Arabic Social Media.
Proceedings of the First Workshop on Abusive Language Online, 2017

#FailedRevolutions: Using Twitter to study the antecedents of ISIS support.
First Monday, 2016

Trump vs. Hillary Analyzing Viral Tweets during US Presidential Elections 2016.
CoRR, 2016

#ISISisNotIslam or #DeportAllMuslims?: predicting unspoken views.
Proceedings of the 8th ACM Conference on Web Science, 2016

QCRI @ DSL 2016: Spoken Arabic Dialect Identification Using Textual Features.
Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects, 2016

Farasa: A Fast and Furious Segmenter for Arabic.
Proceedings of the Demonstrations Session, 2016

Farasa: A New Fast and Accurate Arabic Word Segmenter.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Quantifying Public Response towards Islam on Twitter after Paris Attacks.
CoRR, 2015

Attitudes towards Refugees in Light of the Paris Attacks.
CoRR, 2015

QCRI @ QALB-2015 Shared Task: Correction of Arabic Text for Native and Non-Native Speakers' Errors.
Proceedings of the Second Workshop on Arabic Natural Language Processing, 2015

Classifying Arab Names Geographically.
Proceedings of the Second Workshop on Arabic Natural Language Processing, 2015

QCRI: Answer Selection for Community Question Answering - Experiments for Arabic and English.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

Randomized Greedy Inference for Joint Segmentation, POS Tagging and Dependency Parsing.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

"I like ISIS, but I want to watch Chris Nolan's new movie": Exploring ISIS Supporters on Twitter.
Proceedings of the 26th ACM Conference on Hypertext & Social Media, 2015

Overview of the AraPlagDet PAN@FIRE2015 Shared Task on Arabic Plagiarism Detection.
Proceedings of the Post Proceedings of the Workshops at the 7th Forum for Information Retrieval Evaluation, 2015

Content and Network Dynamics Behind Egyptian Political Polarization on Twitter.
Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing, 2015

Statistical Machine Translation.
Proceedings of the Natural Language Processing of Semitic Languages, 2014

Information Retrieval.
Proceedings of the Natural Language Processing of Semitic Languages, 2014

Arabic Information Retrieval.
Found. Trends Inf. Retr., 2014

Automatic Correction of Arabic Text: a Cascaded Approach.
Proceedings of the EMNLP 2014 Workshop on Arabic Natural Language Processing, 2014

Using Twitter to Collect a Multi-Dialectal Corpus of Arabic.
Proceedings of the EMNLP 2014 Workshop on Arabic Natural Language Processing, 2014

Arabizi Detection and Conversion to Arabic.
Proceedings of the EMNLP 2014 Workshop on Arabic Natural Language Processing, 2014

Query Term Expansion by Automatic Learning of Morphological Equivalence Patterns from Wikipedia.
Proceedings of Workshop on Semantic Matching in Information Retrieval co-located with the 37th international ACM SIGIR conference on research and development in information retrieval, 2014

Simple Effective Microblog Named Entity Recognition: Arabic as an Example.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Using Stem-Templates to Improve Arabic POS and Gender/Number Tagging.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Verifiably Effective Arabic Dialect Identification.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Subjectivity and Sentiment Analysis of Modern Standard Arabic and Arabic Microblogs.
Proceedings of the 4th Workshop on Computational Approaches to Subjectivity, 2013

Detecting Comments on News Articles in Microblogs.
Proceedings of the Seventh International Conference on Weblogs and Social Media, 2013

Translating Dialectal Arabic to English.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Named Entity Recognition using Cross-lingual Resources: Arabic as an Example.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Transliteration Mining Using Large Training and Test Sets.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

Statistical denormalization for Arabic text.
Proceedings of the 11th Conference on Natural Language Processing, 2012

A summarization tool for time-sensitive social media.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Joint topic modeling for event summarization across news and social media streams.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Language processing for Arabic microblog retrieval.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Arabic Retrieval Revisited: Morphological Hole Filling.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

QCRI @ TREC 2011: Microblog Track.
Proceedings of The Twentieth Text REtrieval Conference, 2011

Improved Transliteration Mining Using Graph Reinforcement.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Is a Query Worth Translating: Ask the Users!
Proceedings of the Advances in Information Retrieval, 2011

ICE-TEA: In-Context Expansion and Translation of English Abbreviations.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2011

Omni font OCR error correction with effect on retrieval.
Proceedings of the 10th International Conference on Intelligent Systems Design and Applications, 2010

Improved relevance feedback using density-based clustering.
Proceedings of the 10th International Conference on Intelligent Systems Design and Applications, 2010

Classifying Wikipedia Articles into NE's Using SVM's with Threshold Adjustment.
Proceedings of the 2010 Named Entities Workshop, 2010

Transliteration Mining with Phonetic Conflation and Iterative Training.
Proceedings of the 2010 Named Entities Workshop, 2010

Simplified Feature Set for Arabic Named Entity Recognition.
Proceedings of the 2010 Named Entities Workshop, 2010

CMIC@TREC 2009: Relevance Feedback Track.
Proceedings of The Eighteenth Text REtrieval Conference, 2009

Efficient Language-Independent Retrieval of Printed Documents without OCR.
Proceedings of the String Processing and Information Retrieval, 2009

Effect of OCR error correction on Arabic retrieval.
Inf. Retr., 2008

Automatic Extraction of Textual Elements from News Web Pages.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

CMIC@INEX 2008: Link-the-Wiki Track.
Proceedings of the Advances in Focused Retrieval, 2008

Book search: indexing the valuable parts.
Proceedings of the 2008 ACM Workshop on Research Advances in Large Digital Book Repositories, 2008

Error correction vs. query garbling for Arabic OCR document retrieval.
ACM Trans. Inf. Syst., 2007

CMIC at INEX 2007: Book Search Track.
Proceedings of the Focused Access to XML Documents, 2007

BioNoculars: Extracting Protein-Protein Interactions from Biomedical Text.
Proceedings of the Biological, translational, and clinical language processing, 2007

Arabic Cross-Document Person Name Normalization.
Proceedings of the 2007 Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources, 2007

Word-Based Correction for Retrieval of Arabic OCR Degraded Documents.
Proceedings of the String Processing and Information Retrieval, 2006

Building a Heterogeneous Information Retrieval Collection of Printed Arabic Documents.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Arabic OCR Error Correction Using Character Segment Correction, Language Modeling, and Shallow Morphology.
Proceedings of the EMNLP 2006, 2006

Providing Multilingual Access to FLICKR for Arabic Users.
Proceedings of the Working Notes for CLEF 2006 Workshop co-located with the 10th European Conference on Digital Libraries (ECDL 2006), 2006

Examining the Effect of Improved Context Sensitive Morphology on Arabic Information Retrieval.
Proceedings of the Workshop on Computational Approaches to Semitic Languages, 2005

The GUC Goes to TREC 2004: Using Whole or Partial Documents for Retrieval and Classification in the Genomics Track.
Proceedings of the Thirteenth Text REtrieval Conference, 2004

Making MIRACLEs: Interactive translingual search for Cebuano and Hindi.
ACM Trans. Asian Lang. Inf. Process., 2003

Probabilistic structured query methods.
Proceedings of the SIGIR 2003: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 28, 2003

CLIR Experiments at Maryland for TREC 2002: Evidence Combination for Arabic-English Retrieval.
Proceedings of The Eleventh Text REtrieval Conference, 2002

Term selection for searching printed Arabic.
Proceedings of the SIGIR 2002: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2002

Building a Shallow Arabic Morphological Analyser in One Day.
Proceedings of the Workshop on Computational Approaches to Semitic Languages, 2002

TREC-10 Experiments at University of Maryland CLIR and Video.
Proceedings of The Tenth Text REtrieval Conference, 2001
