José Camacho-Collados

ORCID: 0000-0003-1618-7239

Affiliations:
  • Cardiff University, UK


According to our database, José Camacho-Collados authored at least 111 papers between 2014 and 2024.

Bibliography

2024
Federated Learning for Exploiting Annotators' Disagreements in Natural Language Processing.
Trans. Assoc. Comput. Linguistics, 2024

Towards Quality Benchmarking in Question Answering over Tabular Data in Spanish.
Proces. del Leng. Natural, 2024

Analysing Zero-Shot Readability-Controlled Sentence Simplification.
CoRR, 2024

BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages.
CoRR, 2024

Words as Trigger Points in Social Media Discussions.
CoRR, 2024

A Systematic Analysis on the Temporal Generalization of Language Models in Social Media.
Proceedings of the 14th Workshop on Computational Approaches to Subjectivity, 2024

A Multi-Faceted NLP Analysis of Misinformation Spreaders in Twitter.
Proceedings of the 14th Workshop on Computational Approaches to Subjectivity, 2024

Exploring Cross-Cultural Differences in English Hate Speech Annotations: From Dataset Construction to Analysis.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Evaluating Short-Term Temporal Fluctuations of Social Biases in Social Media Data and Masked Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Multilingual Topic Classification in X: Dataset and Analysis.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

A RelEntLess Benchmark for Modelling Graded Relations between Named Entities.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

TweetTER: A Benchmark for Target Entity Retrieval on Twitter without Knowledge Bases.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Do Large Language Models Understand Mansplaining? Well, Actually...
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Question Answering over Tabular Data with DataBench: A Large-Scale Empirical Evaluation of LLMs.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Language Models for Text Classification: Is In-Context Learning Enough?
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Meemi: A simple method for post-processing and integrating cross-lingual word embeddings.
Nat. Lang. Eng., May, 2023

Negativity spreads faster: A large-scale multilingual twitter analysis on the role of sentiment in political communication.
Online Soc. Networks Media, 2023

RelBERT: Embedding Relations with Language Models.
CoRR, 2023

Tweet Insights: A Visualization Platform to Extract Temporal Insights from Twitter.
CoRR, 2023

Robust Hate Speech Detection in Social Media: A Cross-Dataset Empirical Evaluation.
CoRR, 2023

An Efficient Multilingual Language Model Compression through Vocabulary Trimming.
CoRR, 2023

SemEval-2023 Task 1: Visual Word Sense Disambiguation.
Proceedings of the 17th International Workshop on Semantic Evaluation, 2023

A Predictive Factor Analysis of Social Biases and Task-Performance in Pretrained Masked Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Efficient Multilingual Language Model Compression through Vocabulary Trimming.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Construction Artifacts in Metaphor Identification Datasets.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

SuperTweetEval: A Challenging, Unified and Heterogeneous Benchmark for Social Media NLP Research.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023


Overview of the CLEF-2023 LongEval Lab on Longitudinal Evaluation of Model Performance.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2023

Extended Overview of the CLEF-2023 LongEval Lab on Longitudinal Evaluation of Model Performance.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023), 2023

A Practical Toolkit for Multilingual Question and Answer Generation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2023

An Empirical Comparison of LM-based Question and Answer Generation Methods.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
TweetNLP: Cutting-Edge Natural Language Processing for Social Media.
CoRR, 2022

Politics and Virality in the Time of Twitter: A Large-Scale Cross-Party Sentiment Analysis in Greece, Spain and United Kingdom.
CoRR, 2022

LMMS reloaded: Transformer-based sense embeddings for disambiguation and beyond.
Artif. Intell., 2022

Assessing the Limits of the Distributional Hypothesis in Semantic Spaces: Trait-based Relational Knowledge and the Impact of Co-occurrences.
Proceedings of the 11th Joint Conference on Lexical and Computational Semantics, 2022

NLP4SM: Natural Language Processing for social media.
Proceedings of the Annual Conference of the Spanish Association for Natural Language Processing: Projects and Demonstrations (SEPLN-PD 2022) co-located with the Conference of the Spanish Society for Natural Language Processing (SEPLN 2022), 2022

CardiffNLP-Metaphor at SemEval-2022 Task 2: Targeted Fine-tuning of Transformer-based Language Models for Idiomaticity Detection.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

XLM-T: Multilingual Language Models in Twitter for Sentiment Analysis and Beyond.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

Generative Language Models for Paragraph-Level Question Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Probing Relational Knowledge in Language Models via Word Analogies.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

TweetNLP: Cutting-Edge Natural Language Processing for Social Media.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

TempoWiC: An Evaluation Benchmark for Detecting Meaning Shift in Social Media.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Twitter Topic Classification.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

TimeLMs: Diachronic Language Models from Twitter.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022

2021
Guiding Generative Language Models for Data Augmentation in Few-Shot Text Classification.
CoRR, 2021

Distilling Relation Embeddings from Pre-trained Language Models.
CoRR, 2021

Deriving Disinformation Insights from Geolocalized Twitter Callouts.
CoRR, 2021

XLM-T: A Multilingual Language Model Toolkit for Twitter.
CoRR, 2021

Analysis and Evaluation of Language Models for Word Sense Disambiguation.
Comput. Linguistics, 2021

Modelling General Properties of Nouns by Selectively Averaging Contextualised Embeddings.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Predicting Themes within Complex Unstructured Texts: A Case Study on Safeguarding Reports.
Proceedings of the Joint Proceedings of the 2nd International Workshop on Deep Learning meets Ontologies and Natural Language Processing (DeepOntoNLP 2021) & 6th International Workshop on Explainable Sentiment Mining and Emotion Detection (X-SENTIMENT 2021) co-located with 18th Extended Semantic Web Conference 2021, Hersonissos, Greece, June 6th, 2021

Back to the Basics: A Quantitative Analysis of Statistical and Graph-Based Term Weighting Schemes for Keyword Extraction.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Distilling Relation Embeddings from Pretrained Language Models.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

T-NER: An All-Round Python Library for Transformer-based Named Entity Recognition.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, 2021

WiC-TSV: An Evaluation Benchmark for Target Sense Verification of Words in Context.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language Models Identify Analogies?
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

COVID-19 and Misinformation: A Large-Scale Lexical Analysis on Twitter.
Proceedings of the ACL-IJCNLP 2021 Student Research Workshop, 2021

2020
Embeddings in Natural Language Processing: Theory and Advances in Vector Representations of Meaning
Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers, ISBN: 978-3-031-02177-0, 2020

Language Models and Word Sense Disambiguation: An Overview and Analysis.
CoRR, 2020

Towards Preemptive Detection of Depression and Anxiety in Twitter.
Proceedings of the Fifth Social Media Mining for Health Applications Workshop & Shared Task, 2020

A Short Survey on Sense-Annotated Corpora.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

On the Robustness of Unsupervised and Semi-supervised Cross-lingual Word Embedding Learning.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Learning Cross-Lingual Word Embeddings from Twitter via Distant Supervision.
Proceedings of the Fourteenth International AAAI Conference on Web and Social Media, 2020

XL-WiC: A Multilingual Benchmark for Evaluating Semantic Contextualization.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Don't Neglect the Obvious: On the Role of Unambiguous Words in Word Sense Disambiguation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

TweetEval: Unified Benchmark and Comparative Evaluation for Tweet Classification.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Capturing Word Order in Averaging Based Sentence Embeddings.
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, Santiago de Compostela, Spain, August 29 - September 8, 2020

Understanding the Source of Semantic Regularities in Word Embeddings.
Proceedings of the 24th Conference on Computational Natural Language Learning, 2020

Go Simple and Pre-Train on Domain-Specific Corpora: On the Role of Training Data for Text Classification.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Embeddings in Natural Language Processing.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Inducing Relational Knowledge from BERT.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Modelling Semantic Categories Using Conceptual Neighborhood.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
SenseDefs: a multilingual corpus of semantically annotated textual definitions - Exploiting multiple languages and resources jointly for high-quality Word Sense Disambiguation and Entity Linking.
Lang. Resour. Evaluation, 2019

Knowledge-enhanced document embeddings for text classification.
Knowl. Based Syst., 2019

Meemi: A Simple Method for Post-processing Cross-lingual Word Embeddings.
CoRR, 2019

Learning Cross-lingual Embeddings from Twitter via Distant Supervision.
CoRR, 2019

UA at SemEval-2019 Task 5: Setting A Strong Linear Baseline for Hate Speech Detection.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

A Latent Variable Model for Learning Distributional Relation Vectors.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Relational Word Embeddings.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Applying automatic text-based detection of deceptive language to police reports: Extracting behavioral patterns from a multi-step classification model to understand how we lie to the police.
Knowl. Based Syst., 2018

From Word To Sense Embeddings: A Survey on Vector Representations of Meaning.
J. Artif. Intell. Res., 2018

WiC: 10,000 Example Pairs for Evaluating Context-Sensitive Representations.
CoRR, 2018

A Short Survey on Sense-Annotated Corpora for Diverse Languages and Resources.
CoRR, 2018

How Gender and Skin Tone Modifiers Affect Emoji Semantics in Twitter.
Proceedings of the Seventh Joint Conference on Lexical and Computational Semantics, 2018

SemEval-2018 Task 9: Hypernym Discovery.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

SemEval 2018 Task 2: Multilingual Emoji Prediction.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

The interplay between lexical resources and Natural Language Processing.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics, 2018

Improving Cross-Lingual Word Embeddings by Meeting in the Middle.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

On the Role of Text Preprocessing in Neural Network Architectures: An Evaluation Study on Text Categorization and Sentiment Analysis.
Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018

Interpretable Emoji Prediction via Label-Wise Attention LSTMs.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2017
Why we have switched from building full-fledged taxonomies to simply detecting hypernymy relations.
CoRR, 2017

SemEval-2017 Task 2: Multilingual and Cross-lingual Semantic Word Similarity.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

Word Sense Disambiguation: A Unified Evaluation Framework and Empirical Comparison.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

BabelDomains: Large-Scale Domain Labeling of Lexical Resources.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Embedding Words and Senses Together via Joint Knowledge-Enhanced Training.
Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017), 2017

Towards a Seamless Integration of Word Senses into Downstream NLP Applications.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

EuroSense: Automatic Harvesting of Multilingual Sense Annotations from Parallel Text.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Semantic Representations of Word Senses and Concepts.
CoRR, 2016

Nasari: Integrating explicit knowledge and corpus statistics for a multilingual representation of concepts and entities.
Artif. Intell., 2016

Find the word that does not belong: A Framework for an Intrinsic Evaluation of Word Vector Representations.
Proceedings of the 1st Workshop on Evaluating Vector-Space Representations for NLP, 2016

Semantic Indexing of Multilingual Corpora and its Application on the History Domain.
Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities, 2016

A Large-Scale Multilingual Disambiguation of Glosses.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Supervised Distributional Hypernym Discovery via Domain Adaptation.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Extending WordNet with Fine-Grained Collocational Information via Supervised Distributional Learning.
Proceedings of the COLING 2016, 2016

Finding and Expanding Hypernymic Relations in the Music Domain.
Proceedings of the Artificial Intelligence Research and Development, 2016

2015
NASARI: a Novel Approach to a Semantically-Aware Representation of Items.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

A Framework for the Construction of Monolingual and Cross-lingual Word Similarity Datasets.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

A Unified Multilingual Semantic Representation of Concepts.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Semantic Annotation and Terminology Validation in full scientific articles in Social Sciences and Humanities (Annotation sémantique et validation terminologique en texte intégral en SHS) [in French].
Proceedings of the Traitement Automatique des Langues Naturelles, 2014

