Younes Samih

Orcid: 0000-0002-0485-7920

According to our database1, Younes Samih authored at least 42 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Can a Multichoice Dataset be Repurposed for Extractive Question Answering?
CoRR, 2024

Multilingual Nonce Dependency Treebanks: Understanding how Language Models Represent and Process Syntactic Structure.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

2023
Multilingual Nonce Dependency Treebanks: Understanding how LLMs represent and process syntactic structure.
CoRR, 2023

2022
Probing for Constituency Structure in Neural Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Automatic Expansion and Retargeting of Arabic Offensive Language Training.
CoRR, 2021

Pre-Training BERT on Arabic Tweets: Practical Considerations.
CoRR, 2021

Arabic Offensive Language on Twitter: Analysis and Experiments.
Proceedings of the Sixth Arabic Natural Language Processing Workshop, 2021

QADI: Arabic Dialect Identification in the Wild.
Proceedings of the Sixth Arabic Natural Language Processing Workshop, 2021

A Few Topical Tweets are Enough for Effective User Stance Detection.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

2020
Effective multi-dialectal arabic POS tagging.
Nat. Lang. Eng., 2020

Arabic Dialect Identification in the Wild.
CoRR, 2020

A Few Topical Tweets are Enough for Effective User-Level Stance Detection.
CoRR, 2020

ALT at SemEval-2020 Task 12: Arabic and English Offensive Language Identification in Social Media.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

Effects of Dialectal Code-Switching on Speech Modules: A Study Using Egyptian Arabic Broadcast Speech.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

ADI17: A Fine-Grained Arabic Dialect Identification Dataset.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
QC-GO Submission for MADAR Shared Task: Arabic Fine-Grained Dialect Identification.
Proceedings of the Fourth Arabic Natural Language Processing Workshop, 2019

POS Tagging for Improving Code-Switching Identification in Arabic.
Proceedings of the Fourth Arabic Natural Language Processing Workshop, 2019

Highly Effective Arabic Diacritization using Sequence to Sequence Modeling.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

A System for Diacritizing Four Varieties of Arabic.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

The MGB-5 Challenge: Recognition and Dialect Identification of Dialectal Arabic Speech.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Diacritization of Maghrebi Arabic Sub-Dialects.
CoRR, 2018

GHH at SemEval-2018 Task 10: Discovering Discriminative Attributes in Distributional Semantics.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

Multi-Dialect Arabic POS Tagging: A CRF Approach.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Multilingual Multi-class Sentiment Classification Using Convolutional Neural Networks.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Mumpitz at PARSEME Shared Task 2018: A Bidirectional LSTM for the Identification of Verbal Multiword Expressions.
Proceedings of the Joint Workshop on Linguistic Annotation, 2018

German and French Neural Supertagging Experiments for LTAG Parsing.
Proceedings of ACL 2018, Melbourne, Australia, July 15-20, 2018, Student Research Workshop, 2018

GHHT at CALCS 2018: Named Entity Recognition for Dialectal Arabic Using Neural Networks.
Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching@ACL 2018, 2018

2017
Arabic Multi-Dialect Segmentation: bi-LSTM-CRF vs. SVM.
CoRR, 2017

A Neural Architecture for Dialectal Arabic Segmentation.
Proceedings of the Third Arabic Natural Language Processing Workshop, 2017

Learning from Relatives: Unified Dialectal Arabic Segmentation.
Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017), 2017

2016
Arabic spelling error detection and correction.
Nat. Lang. Eng., 2016

An Arabic-Moroccan Darija Code-Switched Corpus.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

CogALex-V Shared Task: GHHH - Detecting Semantic Relations via Word Embeddings.
Proceedings of the 5th Workshop on Cognitive Aspects of the Lexicon, 2016

SAWT: Sequence Annotation Web Tool.
Proceedings of the Second Workshop on Computational Approaches to Code Switching@EMNLP 2016, 2016

Multilingual Code-switching Identification via LSTM Recurrent Neural Networks.
Proceedings of the Second Workshop on Computational Approaches to Code Switching@EMNLP 2016, 2016

2015
Une métagrammaire de l'interface morpho-sémantique dans les verbes en arabe.
Proceedings of the Actes de la 22e conference sur le Traitement Automatique des Langues Naturelles. Articles courts, 2015

2013
Synchronous Regular Relations and Morphological Analysis.
Proceedings of the 11th International Conference on Finite State Methods and Natural Language Processing, 2013

2012
Arabic Word Generation and Modelling for Spell Checking.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Conversion of Procedural Morphologies to Finite-State Morphologies: A Case Study of Arabic.
Proceedings of the 10th International Workshop on Finite State Methods and Natural Language Processing, 2012

The Floating Arabic Dictionary: An Automatic Method for Updating a Lexical Database through the Detection and Lemmatization of Unknown Words.
Proceedings of the COLING 2012, 2012

Improved Spelling Error Detection and Correction for Arabic.
Proceedings of the COLING 2012, 2012

2011
FTrace: A Tool for Finite-State Morphology.
Proceedings of the Finite-State Methods and Natural Language Processing, 2011


  Loading...