Rob van der Goot

According to our database, Rob van der Goot authored at least 61 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
data2lang2vec: Data Driven Typological Features Completion.
CoRR, 2024

Big City Bias: Evaluating the Impact of Metropolitan Size on Computational Job Market Abilities of Language Models.
CoRR, 2024

Deep Learning-based Computational Job Market Analysis: A Survey on Skill Extraction and Classification from Job Postings.
CoRR, 2024

EEVEE: An Easy Annotation Tool for Natural Language Processing.
CoRR, 2024

What's wrong with your model? A Quantitative Analysis of Relation Classification.
Proceedings of the 13th Joint Conference on Lexical and Computational Semantics, 2024

Entity Linking in the Job Market Domain.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

NNOSE: Nearest Neighbor Occupational Skill Extraction.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Slot and Intent Detection Resources for Bavarian and Lithuanian: Assessing Translations vs Natural Queries to Digital Assistants.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Enough Is Enough! a Case Study on the Effect of Data Size for Evaluation Using Universal Dependencies.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

How to Encode Domain Information in Relation Classification.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Can Humans Identify Domains?
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Cross-Domain Evaluation of POS Taggers: From Wall Street Journal to Fandom Wiki.
CoRR, 2023

Findings of the VarDial Evaluation Campaign 2023.
Proceedings of the Tenth Workshop on NLP for Similar Languages, Varieties and Dialects, 2023

MaChAmp at SemEval-2023 tasks 2, 3, 4, 5, 7, 8, 9, 10, 11, and 12: On the Effectiveness of Intermediate Training on an Uncurated Collection of Datasets.
Proceedings of the 17th International Workshop on Semantic Evaluation, 2023

DanTok: Domain Beats Language for Danish Social Media POS Tagging.
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023

Multi-CrossRE: A Multi-Lingual Multi-Domain Dataset for Relation Extraction.
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023

Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Establishing Trustworthiness: Rethinking Tasks and Model Evaluation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Native Language Prediction from Gaze: a Reproducibility Study.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2023

Silver Syntax Pre-training for Cross-Domain Relation Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
MaChAmp at SemEval-2022 Tasks 2, 3, 4, 6, 10, 11, and 12: Multi-task Multi-lingual Learning for a Pre-selected Set of Semantic Datasets.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

Sort by Structure: Language Model Ranking as Dependency Probing.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Frustratingly Easy Performance Improvements for Low-resource Setups: A Tale on BERT and Segment Embeddings.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Skill Extraction from Job Postings using Weak Supervision.
Proceedings of the 2nd Workshop on Recommender Systems for Human Resources (RecSys-in-HR 2022) co-located with the 16th ACM Conference on Recommender Systems (RecSys 2022), 2022

Experimental Standards for Deep Learning in Natural Language Processing Research.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Spectral Probing.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers.
Proceedings of the 26th Conference on Computational Natural Language Learning, 2022

Increasing Robustness for Cross-domain Dialogue Act Classification on Social Media Data.
Proceedings of the Eighth Workshop on Noisy User-generated Text, 2022

Tafsir Dataset: A Novel Multi-Task Benchmark for Named Entity Recognition and Topic Modeling in Classical Arabic Literature.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Probing for Labeled Dependency Trees.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
How Universal is Genre in Universal Dependencies?
CoRR, 2021

Parsing with Pretrained Language Models, Multiple Datasets, and Dataset Embeddings.
CoRR, 2021

On the Effectiveness of Dataset Embeddings in Mono-lingual, Multi-lingual and Zero-shot Conditions.
CoRR, 2021

Creating a Universal Dependencies Treebank of Spoken Frisian-Dutch Code-switched Data.
CoRR, 2021

From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Genre as Weak Supervision for Cross-lingual Dependency Parsing.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

We Need to Talk About train-dev-test Splits.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Massive Choice, Ample Tasks (MaChAmp): A Toolkit for Multi-task Learning in NLP.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, 2021

Lexical Normalization for Code-switched Data and its Effect on POS Tagging.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Much Gracias: Semi-supervised Code-switch Detection for Spanish-English: How far can we get?
Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching, 2021

2020
Massive Choice, Ample Tasks (MaChAmp): A Toolkit for Multi-task Learning in NLP.
CoRR, 2020

Fair Is Better than Sensational: Man Is to Doctor as Woman Is to Doctor.
Comput. Linguistics, 2020

Norm It! Lexical Normalization for Italian and Its Downstream Effects for Dependency Parsing.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Synthetic Data for English Lexical Normalization: How Close Can We Get to Manually Annotated Data?
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Biomedical Event Extraction as Sequence Labeling.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

DaN+: Danish Nested Named Entities and Lexical Normalization.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

NLP North at WNUT-2020 Task 2: Pre-training versus Ensembling for Detection of Informative COVID-19 English Tweets.
Proceedings of the Sixth Workshop on Noisy User-generated Text, 2020

2019
sthruggle at SemEval-2019 Task 5: An Ensemble Approach to Hate Speech Detection.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

An In-depth Analysis of the Effect of Lexical Normalization on the Dependency Parsing of Social Media.
Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

MoNoise: A Multi-lingual and Easy-to-use Lexical Normalization Tool.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
A Taxonomy for In-depth Evaluation of Normalization for User Generated Content.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Modeling Input Uncertainty in Neural Network Dependency Parsing.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Bleaching Text: Abstract Features for Cross-lingual Gender Prediction.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
MoNoise: Modeling Noise Using a Modular Normalization System.
CoRR, 2017

Sharing Is Caring: The Future of Shared Tasks.
Comput. Linguistics, 2017

To normalize, or not to normalize: The impact of normalization on Part-of-Speech tagging.
Proceedings of the 3rd Workshop on Noisy User-generated Text, 2017

Parser Adaptation for Social Media by Integrating Normalization.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
The Denoised Web Treebank: Evaluating Dependency Parsing under Noisy Input Conditions.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2015
ROB: Using Semantic Meaning to Recognize Paraphrases.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

2014
The Meaning Factory: Formal Semantics for Recognizing Textual Entailment and Determining Semantic Similarity.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

