Leon Derczynski

Orcid: 0000-0002-8656-3431

Affiliations:
  • IT University of Copenhagen, Department of Computer Science, Denmark
  • University of Sheffield, UK (PhD)


According to our database1, Leon Derczynski authored at least 104 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Nemotron-4 340B Technical Report.
CoRR, 2024

garak: A Framework for Security Probing Large Language Models.
CoRR, 2024

Introducing v0.5 of the AI Safety Benchmark from MLCommons.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2024

2023
Efficient Methods for Natural Language Processing: A Survey.
Trans. Assoc. Comput. Linguistics, 2023

Summon a Demon and Bind it: A Grounded Theory of LLM Red Teaming in the Wild.
CoRR, 2023

Surveying (Dis)Parities and Concerns of Compute Hungry NLP Research.
CoRR, 2023

Assessing Language Model Deployment with Risk Cards.
CoRR, 2023

TeamAmpa at SemEval-2023 Task 3: Exploring Multilabel and Multilingual RoBERTa Models for Persuasion and Framing Detection.
Proceedings of the The 17th International Workshop on Semantic Evaluation, 2023

The Catalog Problem: Clustering and Ordering Variable-Sized Sets.
Proceedings of the International Conference on Machine Learning, 2023


Anchoring Fine-tuning of Sentence Transformer with Semantic Label Information for Efficient Truly Few-shot Classification.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Efficient Methods for Natural Language Processing: A Survey.
CoRR, 2022

Training a T5 Using Lab-sized Resources.
CoRR, 2022

Sparse Probability of Agreement.
CoRR, 2022

The ITU Faroese Pairs Dataset.
CoRR, 2022

Handling and Presenting Harmful Text.
CoRR, 2022

Bridging the Domain Gap for Stance Detection for the Zulu Language.
Proceedings of the Intelligent Systems and Applications, 2022

Set Interdependence Transformer: Set-to-Sequence Neural Networks for Permutation Learning and Structure Prediction.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Handling and Presenting Harmful Text in NLP Research.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Set-to-Sequence Methods in Machine Learning: A Review.
J. Artif. Intell. Res., 2021

Detecting Abusive Albanian.
CoRR, 2021

Optimal Size-Performance Tradeoffs: Weighing PoS Tagger Models.
CoRR, 2021

Discriminating Between Similar Nordic Languages.
Proceedings of the Eighth Workshop on NLP for Similar Languages, Varieties and Dialects, 2021


DanFEVER: claim verification dataset for Danish.
Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

PROCAT: Product Catalogue Dataset for Implicit Clustering, Permutation Learning and Structure Prediction.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Hyperparameter Power Impact in Transformer Language Model Training.
Proceedings of the Second Workshop on Simple and Efficient Natural Language Processing, 2021

Annotating Online Misogyny.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020).
Dataset, July, 2020

Power Consumption Variation over Activation Functions.
CoRR, 2020

The Danish Gigaword Project.
CoRR, 2020

Directions in Abusive Language Training Data: Garbage In, Garbage Out.
CoRR, 2020

SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020).
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

Offensive Language and Hate Speech Detection for Danish.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Accelerated High-Quality Mutual-Information Based Word Clustering.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Detection and Resolution of Rumors and Misinformation with NLP.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

The Rumour Mill: Making the Spread of Misinformation Explicit and Tangible.
Proceedings of the Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems, 2020

2019
Normalisation of imprecise temporal expressions extracted from text.
Knowl. Inf. Syst., 2019

Simple Natural Language Processing Tools for Danish.
CoRR, 2019

Joint Rumour Stance and Veracity.
Proceedings of the 2019 Truth and Trust Online Conference (TTO 2019), 2019

Misinformation on Twitter During the Danish National Election: A Case Study.
Proceedings of the 2019 Truth and Trust Online Conference (TTO 2019), 2019

SemEval-2019 Task 7: RumourEval, Determining Rumour Veracity and Support for Rumours.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

Joint Rumour Stance and Veracity Prediction.
Proceedings of the 22nd Nordic Conference on Computational Linguistics, NoDaLiDa 2019, Turku, Finland, September 30, 2019

Political Stance in Danish.
Proceedings of the 22nd Nordic Conference on Computational Linguistics, NoDaLiDa 2019, Turku, Finland, September 30, 2019

The Lacunae of Danish Natural Language Processing.
Proceedings of the 22nd Nordic Conference on Computational Linguistics, NoDaLiDa 2019, Turku, Finland, September 30, 2019

Bornholmsk Natural Language Processing: Resources and Tools.
Proceedings of the 22nd Nordic Conference on Computational Linguistics, NoDaLiDa 2019, Turku, Finland, September 30, 2019

Quantifying the morphosyntactic content of Brown Clusters.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

2018
RumourEval 2019: Determining Rumour Veracity and Support for Rumours.
CoRR, 2018

IUCM at SemEval-2018 Task 11: Similar-Topic Texts as a Comprehension Knowledge Source.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

Stance Prediction for Russian: Data and Analysis.
Proceedings of 6th International Conference in Software Engineering for Defence Applications, 2018

Helping Crisis Responders Find the Informative Needle in the Tweet Haystack.
Proceedings of the 15th International Conference on Information Systems for Crisis Response and Management, 2018

2017
Automatically Ordering Events and Times in Text
Studies in Computational Intelligence 677, Springer, ISBN: 978-3-319-47241-6, 2017

Domain-Sensitive Temporal Tagging, by Jannik Strötgen , Michael Gertz . CA, USA. Morgan & Claypool, 2016. ISBN 9781627054591.
Nat. Lang. Eng., 2017

Generalisation in named entity recognition: A quantitative analysis.
Comput. Speech Lang., 2017

Tracking the Diffusion of Named Entities.
CoRR, 2017

SemEval-2017 Task 8: RumourEval: Determining rumour veracity and support for rumours.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

Simple Open Stance Classification for Rumour Analysis.
Proceedings of the International Conference Recent Advances in Natural Language Processing, 2017

Results of the WNUT2017 Shared Task on Novel and Emerging Entity Recognition.
Proceedings of the 3rd Workshop on Noisy User-generated Text, 2017

2016
Desiderata for Vector-Space Word Representations.
CoRR, 2016

SemEval-2016 Task 12: Clinical TempEval.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

GATE-Time: Extraction of Temporal Expressions and Events.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Complementarity, F-score, and NLP Evaluation.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Broad Twitter Corpus: A Diverse Named Entity Recognition Resource.
Proceedings of the COLING 2016, 2016

Representation and Learning of Temporal Relations.
Proceedings of the COLING 2016, 2016

Twitter Geolocation Prediction Shared Task of the 2016 Workshop on Noisy User-generated Text.
Proceedings of the 2nd Workshop on Noisy User-generated Text, 2016

Generalised Brown Clustering and Roll-Up Feature Generation.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Time and information retrieval: Introduction to the special issue.
Inf. Process. Manag., 2015

Analysis of named entity recognition and linking for tweets.
Inf. Process. Manag., 2015

Entity Grouping for Accessing Social Streams via Word Clouds.
Proceedings of the Web Information Systems and Technologies, 2015

Enhanced Information Access to Social Streams Through Word Clouds with Entity Grouping.
Proceedings of the WEBIST 2015, 2015

Swiss-Chocolate: Combining Flipout Regularization and Random Forests with Artificially Built Subsystems to Boost Text-Classification for Sentiment.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

UFPRSheffield: Contrasting Rule-based and Support Vector Machine Approaches to Time Expression Identification in Clinical TempEval.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

SemEval-2015 Task 6: Clinical TempEval.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

Temporal Relation Classification using a Model of Tense and Aspect.
Proceedings of the Recent Advances in Natural Language Processing, 2015

Tune Your Brown Clustering, Please.
Proceedings of the Recent Advances in Natural Language Processing, 2015

Efficient Named Entity Annotation through Pre-empting.
Proceedings of the Recent Advances in Natural Language Processing, 2015

USFD: Twitter NER with Drift Compensation and Linked Data.
Proceedings of the Workshop on Noisy User-generated Text, 2015

2014
Clinical TempEval.
CoRR, 2014

Pheme: Veracity in Digital Social Networks.
Proceedings of the Posters, 2014

Corpus Annotation through Crowdsourcing: Towards Best Practice Guidelines.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

DKIE: Open Source Information Extraction for Danish.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

Passive-Aggressive Sequence Labeling with Discriminative Post-Editing for Recognising Person Entities in Tweets.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

The GATE Crowdsourcing Plugin: Crowdsourcing Annotated Corpora Made Easy.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

2013
TimeML-strict: clarifying temporal annotation
CoRR, 2013

Question Answering Against Very-Large Text Collections
CoRR, 2013

SemEval-2013 Task 1: TempEval-3: Evaluating Time Expressions, Events, and Temporal Relations.
Proceedings of the 7th International Workshop on Semantic Evaluation, 2013

Twitter Part-of-Speech Tagging for All: Overcoming Sparse and Noisy Data.
Proceedings of the Recent Advances in Natural Language Processing, 2013

Recognising and Interpreting Named Temporal Expressions.
Proceedings of the Recent Advances in Natural Language Processing, 2013

TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text.
Proceedings of the Recent Advances in Natural Language Processing, 2013

Empirical Validation of Reichenbach's Tense Framework.
Proceedings of the 10th International Conference on Computational Semantics, 2013

Information Retrieval for Temporal Bounding.
Proceedings of the International Conference on the Theory of Information Retrieval, 2013

Microblog-genre noise and impact on semantic annotation accuracy.
Proceedings of the 24th ACM Conference on Hypertext and Social Media (part of ECRC), 2013

Towards context-aware search and analysis on social media data.
Proceedings of the Joint 2013 EDBT/ICDT Conferences, 2013

Temporal Signals Help Label Temporal Relations.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
TempEval-3: Evaluating Events, Time Expressions, and Temporal Relations
CoRR, 2012

A Data Driven Approach to Query Expansion in Question Answering
CoRR, 2012

A Corpus-based Study of Temporal Signals
CoRR, 2012

An Annotation Scheme for Reichenbach's Verbal Tense Structure
CoRR, 2012

Using Signals to Improve Automatic Classification of Temporal Relations
CoRR, 2012

TIMEN: An Open Temporal Expression Normalisation Resource.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Massively Increasing TIMEX3 Resources: A Transduction Approach.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

2011
USFD at KBP 2011: Entity Linking, Slot Filling and Temporal Bounding.
Proceedings of the Fourth Text Analysis Conference, 2011

2010
USFD2: Annotating Temporal Expresions and TLINKs for TempEval-2.
Proceedings of the 5th International Workshop on Semantic Evaluation, 2010

Analysing Temporally Annotated Corpora with CAVaT.
Proceedings of the International Conference on Language Resources and Evaluation, 2010


  Loading...