2023
Generative Models for Product Attribute Extraction.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023

PV2TEA: Patching Visual Modality to Textual-Established Information Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Prototype-Representations for Training Data Filtering in Weakly-Supervised Information Extraction.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: EMNLP 2022 - Industry Track, Abu Dhabi, UAE, December 7, 2022

Ask-and-Verify: Span Candidate Generation and Verification for Attribute Value Extraction.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: EMNLP 2022 - Industry Track, Abu Dhabi, UAE, December 7, 2022

2021
All You Need to Know to Build a Product Knowledge Graph.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

PAM: Understanding Product Images in Cross Product Category Attribute Extraction.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

End-to-End Conversational Search for Online Shopping with Utterance Transfer.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

AdaTag: Multi-Attribute Value Extraction from Product Profiles with Adaptive Decoding.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
CAMeL Tools: An Open Source Python Toolkit for Arabic Natural Language Processing.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Morphological Analysis and Disambiguation for Gulf Arabic: The Interplay between Resources and Methods.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Utilizing Subword Entities in Character-Level Sequence-to-Sequence Lemmatization Models.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Joint Diacritization, Lemmatization, Normalization, and Fine-Grained Morphological Tagging.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Unsupervised Neologism Normalization Using Embedding Space Mapping.
Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

Adversarial Multitask Learning for Joint Multi-Feature and Multi-Dialect Morphological Modeling.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Noise-Robust Morphological Disambiguation for Dialectal Arabic.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Unified Guidelines and Resources for Arabic Dialect Orthography.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Utilizing Character and Word Embeddings for Text Normalization with Sequence-to-Sequence Models.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Addressing Noise in Multidialectal Word Embeddings.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Optimizing Tokenization Choice for Machine Translation across Multiple Target Languages.
Prague Bull. Math. Linguistics, 2017

Curras: an annotated corpus for the Palestinian Arabic dialect.
Lang. Resour. Evaluation, 2017

Don't Throw Those Morphological Analyzers Away Just Yet: Neural Morphological Disambiguation for Arabic.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

A Parallel Corpus for Evaluating Machine Translation between Arabic and European Languages.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

2016
Multivariate adaptive community detection in Twitter.
Int. J. Big Data Intell., 2016

YAMAMA: Yet Another Multi-Dialect Arabic Morphological Analyzer.
Proceedings of the COLING 2016, 2016

Analysis of Foreign Language Teaching Methods: An Automatic Readability Approach.
Proceedings of the 3rd Workshop on Natural Language Processing Techniques for Educational Applications, 2016

2014
Building a Corpus for Palestinian Arabic: a Preliminary Study.
Proceedings of the EMNLP 2014 Workshop on Arabic Natural Language Processing, 2014

2013
Multidimensional community detection in Twitter.
Proceedings of the 8th International Conference for Internet Technology and Secured Transactions, 2013