Samia Touileb

Orcid: 0000-0003-4584-2554

According to our database, Samia Touileb authored at least 32 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Learning Horn envelopes via queries from language models.
Int. J. Approx. Reason., 2024

BabelBot at AraFinNLP2024: Fine-tuning T5 for Multi-dialect Intent Detection with Synthetic Data and Model Ensembling.
Proceedings of The Second Arabic Natural Language Processing Conference, 2024

Incorporating Editorial Feedback in the Evaluation of News Recommender Systems.
Proceedings of the Adjunct Proceedings of the 32nd ACM Conference on User Modeling, 2024

EDEN: A Dataset for Event Detection in Norwegian News.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Learning Horn Envelopes via Queries from Large Language Models.
CoRR, 2023

Arabic dialect identification: An in-depth error analysis on the MADAR parallel corpus.
Proceedings of ArabicNLP 2023, Singapore (Hybrid), December 7, 2023, 2023

JSEEGraph: Joint Structured Event Extraction as Graph Parsing.
Proceedings of the 12th Joint Conference on Lexical and Computational Semantics, 2023

Automated Claim Detection for Fact-checking: A Case Study using Norwegian Pre-trained Language Models.
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023

NorBench - A Benchmark for Norwegian Language Models.
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023

Identifying Token-Level Dialectal Features in Social Media.
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023

Making Sense of Nonsense: Integrated Gradient-Based Input Reduction to Improve Recall for Check-Worthy Claim Detection.
Proceedings of the 5th Symposium of the Norwegian AI Society, 2023

Measuring Normative and Descriptive Biases in Language Models Using Census Data.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

2022
Measuring Harmful Representations in Scandinavian Language Models.
CoRR, 2022

Annotating Norwegian Language Varieties on Twitter for Part-of-Speech.
CoRR, 2022

NorDiaChange: Diachronic Semantic Change Dataset for Norwegian.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Exploring the Effects of Negation and Grammatical Tense on Bias Probes.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

NERDz: A Preliminary Dataset of Named Entities for Algerian.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

EventGraph at CASE 2021 Task 1: A General Graph-based Approach to Protest Event Extraction.
Proceedings of the 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text, 2022

EventGraph: Event Extraction as Semantic Graph Parsing.
Proceedings of the 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text, 2022

2021
NorDial: A Preliminary Corpus of Written Norwegian Dialect Use.
Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

The interplay between language similarity and script on a novel multi-layer Algerian dialect corpus.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
LTG-ST at NADI Shared Task 1: Arabic Dialect Identification using a Stacking Classifier.
Proceedings of the Fifth Arabic Natural Language Processing Workshop, 2020

Identifying Sentiments in Algerian Code-switched User-generated Comments.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Named Entity Recognition without Labelled Data: A Weak Supervision Approach.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Lexicon information in neural sentiment analysis: a multi-task learning approach.
Proceedings of the 22nd Nordic Conference on Computational Linguistics, NoDaLiDa 2019, Turku, Finland, September 30, 2019

Measuring Diachronic Evolution of Evaluative Adjectives with Word Embeddings: the Case for English, Norwegian, and Russian.
Proceedings of the 1st International Workshop on Computational Approaches to Historical Language Change, 2019

2018
NoReC: The Norwegian Review Corpus.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Automatic identification of unknown names with specific roles.
Proceedings of the Second Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, 2018

2016
ADIOS LDA: When Grammar Induction Meets Topic Modeling.
Proceedings of the 29th Norsk Informatikkonferanse, 2016

2014
Constructions: a New Unit of Analysis for Corpus-based Discourse Analysis.
Proceedings of the 28th Pacific Asia Conference on Language, Information and Computation, 2014

Inducing Information Structures for Data-driven Text Analysis.
Proceedings of the Workshop on Language Technologies and Computational Social Science@ACL 2014, 2014

Applying Grammar Induction to Text Mining.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014
