Gertjan van Noord

Orcid: 0000-0001-5564-6341

According to our database, Gertjan van Noord authored at least 93 papers between 1989 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Are Character-level Translations Worth the Wait? Comparing ByT5 and mT5 for Machine Translation.
Trans. Assoc. Comput. Linguistics, 2024

Endowing Neural Language Learners with Human-like Biases: A Case Study on Dependency Length Minimization.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Are Character-level Translations Worth the Wait? An Extensive Comparison of Character- and Subword-level Models for Machine Translation.
CoRR, 2023

2022
Patching Leaks in the Charformer for Efficient Character-Level Generation.
CoRR, 2022

UDapter: Typology-based Language Adapters for Multilingual Dependency Parsing and Sequence Labeling.
Comput. Linguistics, 2022

Evaluating Pre-training Objectives for Low-Resource Translation into Morphologically Rich Languages.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Hyper-X: A Unified Hypernetwork for Multi-Task Multilingual Transfer.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Subword-Delimited Downsampling for Better Character-Level Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Unsupervised Translation of German-Lower Sorbian: Exploring Training and Novel Transfer Methods on a Low-Resource Language.
Proceedings of the Sixth Conference on Machine Translation, 2021

The Importance of Context in Very Low Resource Language Modeling.
Proceedings of the 18th International Conference on Natural Language Processing (ICON 2021), National Institute of Technology Silchar, Silchar, India, December 16, 2021

Optimal Word Segmentation for Neural Machine Translation into Dravidian Languages.
Proceedings of the 8th Workshop on Asian Translation, 2021

2020
Data Selection for Unsupervised Translation of German-Upper Sorbian.
Proceedings of the Fifth Conference on Machine Translation, 2020

Linguistically Motivated Subwords for English-Tamil Translation: University of Groningen's Submission to WMT-2020.
Proceedings of the Fifth Conference on Machine Translation, 2020

AlpinoGraph: A Graph-based Search Engine for Flexible and Efficient Treebank Search.
Proceedings of the 19th International Workshop on Treebanks and Linguistic Theories, 2020

A Shared Task of a New, Collaborative Type to Foster Reproducibility: A First Exercise in the Area of Language Science and Technology with REPROLANG2020.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

UDapter: Language Adaptation for Truly Universal Dependency Parsing.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Low-Resource Unsupervised NMT: Diagnosing the Problem and Providing a Linguistically Motivated Solution.
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2020

2019
BERTje: A Dutch BERT Model.
CoRR, 2019

Cross-Lingual Word Embeddings for Morphologically Rich Languages.
Proceedings of the International Conference on Recent Advances in Natural Language Processing, 2019

2018
Reproducibility in Computational Linguistics: Are We Willing to Share?
Comput. Linguistics, 2018

Simple Embedding-Based Word Sense Disambiguation.
Proceedings of the 9th Global Wordnet Conference, 2018

A Taxonomy for In-depth Evaluation of Normalization for User Generated Content.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Modeling Input Uncertainty in Neural Network Dependency Parsing.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2017
MoNoise: Modeling Noise Using a Modular Normalization System.
CoRR, 2017

Distributional Lesk: Effective Knowledge-Based Word Sense Disambiguation.
Proceedings of the IWCS 2017 - 12th International Conference on Computational Semantics - Short papers, Montpellier, France, September 19, 2017

The Power of Character N-grams in Native Language Identification.
Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications, 2017

Increasing Return on Annotation Investment: The Automatic Construction of a Universal Dependency Treebank for Dutch.
Proceedings of the NoDaLiDa Workshop on Universal Dependencies, 2017

Parser Adaptation for Social Media by Integrating Normalization.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
In Memoriam: Susan Armstrong.
Comput. Linguistics, 2016

SMT and Hybrid systems of the QTLeap project in the WMT16 IT-task.
Proceedings of the First Conference on Machine Translation, 2016

Bilingual Learning of Multi-sense Embeddings with Discrete Autoencoders.
Proceedings of the NAACL HLT 2016, 2016

2015
Word Representations, Tree Models and Syntactic Functions.
CoRR, 2015

ROB: Using Semantic Meaning to Recognize Paraphrases.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

Comparison of Coreference Resolvers for Deep Syntax Translation.
Proceedings of the Second Workshop on Discourse in Machine Translation, 2015

Lexical choice in Abstract Dependency Trees.
Proceedings of the 1st Deep Machine Translation Workshop, 2015

2014
Treelet Probabilities for HPSG Parsing and Error Correction.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

From neighborhood to parenthood: the advantages of dependency representation over bigrams in Brown clustering.
Proceedings of the COLING 2014, 2014

2013
Parse and Corpus-Based Machine Translation.
Proceedings of the Essential Speech and Language Technology for Dutch, 2013

Large Scale Syntactic Annotation of Written Dutch: Lassy.
Proceedings of the Essential Speech and Language Technology for Dutch, 2013

Question Answering of Informative Web Pages: How Summarisation Technology Helps.
Proceedings of the Essential Speech and Language Technology for Dutch, 2013

2011
Adaptability of Lexical Acquisition for Large-scale Grammars.
Proceedings of the Recent Advances in Natural Language Processing, 2011

An Empirical Comparison of Unknown Word Prediction Methods.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

Effective Measures of Domain Similarity for Parsing.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Reversible Stochastic Attribute-Value Grammars.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

2010
POS Multi-tagging Based on Combined Models.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Using Unknown Word Techniques to Learn Known Words.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

Acquisition of Unknown Word Paradigms for Large-Scale Grammars.
Proceedings of the COLING 2010, 2010

Self-Trained Bilexical Preferences to Improve Disambiguation Accuracy.
Proceedings of the Trends in Parsing Technology, 2010

2009
Combining Finite State and Corpus-based Techniques for Unknown Word Prediction.
Proceedings of the Recent Advances in Natural Language Processing, 2009

Learning Efficient Parsing.
Proceedings of the EACL 2009, 12th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, Athens, Greece, March 30, 2009

2008
From D-Coi to SoNaR: a reference corpus for Dutch.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Exploring an Auxiliary Distribution Based Approach to Domain Adaptation of a Syntactic Disambiguation Model.
Proceedings of the workshop on Cross-Framework and Cross-Domain Parser Evaluation@COLING 2008, 2008

Question Answering with Joost at CLEF 2008.
Proceedings of the Working Notes for CLEF 2008 Workshop co-located with the 12th European Conference on Digital Libraries (ECDL 2008), 2008

2007
Using Self-Trained Bilexical Preferences to Improve Disambiguation Accuracy.
Proceedings of the Tenth International Conference on Parsing Technologies, 2007

The Impact of Deep Linguistic Processing on Parsing Technology.
Proceedings of the Tenth International Conference on Parsing Technologies, 2007

Question Answering with Joost at CLEF 2007.
Proceedings of the Advances in Multilingual and Multimodal Information Retrieval, 2007

2006
At Last Parsing Is Now Operational.
Proceedings of the Actes de la 13ème conférence sur le Traitement Automatique des Langues Naturelles. Conférences invitées, 2006

Syntactic Annotation of Large Corpora in STEVIN.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

The University of Groningen at QA@CLEF 2006: Using Syntactic Knowledge for QA.
Proceedings of the Working Notes for CLEF 2006 Workshop co-located with the 10th European Conference on Digital Libraries (ECDL 2006), 2006

Using Syntactic Knowledge for QA.
Proceedings of the Evaluation of Multilingual and Multi-modal Information Retrieval, 2006

2005
Parsing Partially Bracketed Input.
Proceedings of the Computational Linguistics in the Netherlands 2005, 2005

Question Answering for Dutch Using Dependency Relations.
Proceedings of the Accessing Multilingual Information Repositories, 2005

2004
Finite automata for compact representation of tuple dictionaries.
Theor. Comput. Sci., 2004

Error Mining for Wide-Coverage Grammar Engineering.
Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, 2004

2003
Finite state methods in natural language processing.
Nat. Lang. Eng., 2003

2001
Finite State Transducers with Predicates and Identities.
Grammars, 2001

Finite Automata for Compact Representation of Language Models in NLP.
Proceedings of the Implementation and Application of Automata, 2001

Statistical Parsing of Dutch using Maximum Entropy Models with Feature Merging.
Proceedings of the Sixth Natural Language Processing Pacific Rim Symposium, 2001

Unsupervised POS-Tagging Improves Parsing Accuracy and Parsing Efficiency.
Proceedings of the Seventh International Workshop on Parsing Technologies (IWPT-2001), 2001

The Alpino Dependency Treebank.
Proceedings of the Computational Linguistics in the Netherlands 2001, 2001

2000
Approximation and Exactness in Finite State Optimality Theory.
CoRR, 2000

Treatment of Epsilon Moves in Subset Construction.
Comput. Linguistics, 2000

Alpino: Wide-coverage Computational Analysis of Dutch.
Proceedings of the Computational Linguistics in the Netherlands 2000, 2000

1999
Robust grammatical analysis for spoken dialogue systems.
Nat. Lang. Eng., 1999

Evaluation of the NLP Components of the OVIS2 Spoken Dialogue System.
CoRR, 1999

An Extendible Regular Expression Compiler for Finite-State Approaches in Natural Language Processing.
Proceedings of the Automata Implementation, 1999

Transducers from Rewrite Rules with Backreferences.
Proceedings of the EACL 1999, 1999

1997
An Efficient Implementation of the Head-Corner Parser.
Comput. Linguistics, 1997

Grammatical analysis in the OVIS spoken-dialogue system.
Proceedings of the Interactive Spoken Dialog Systems: Bringing Speech and NLP Together in Real Applications@ACL/EACL 1997, 1997

1996
FSA Utilities: A Toolbox to Manipulate Finite-State Automata.
Proceedings of the Automata Implementation, 1996

1995
The Intersection of Finite State Automata and Definite Clause Grammars.
Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics, 1995

1994
Constraint-Based Categorial Grammar.
CoRR, 1994

Head-Corner Parsing for TAG.
Comput. Intell., 1994

Adjuncts and the Processing of Lexical Rules.
Proceedings of the 15th International Conference on Computational Linguistics, 1994

Constraint-Based Categorial Grammar.
Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, 1994

1993
Head-driven Parsing for Lexicalist Grammars: Experimental Results.
Proceedings of the Sixth Conference of the European Chapter of the Association for Computational Linguistics, 1993

1992
Self-Monitoring with Reversible Grammars.
Proceedings of the 14th International Conference on Computational Linguistics, 1992

1991
An overview of MiMo2.
Mach. Transl., 1991

Head Corner Parsing for Discontinuous Constituency.
Proceedings of the 29th Annual Meeting of the Association for Computational Linguistics, 1991

1990
Semantic-Head-Driven Generation.
Comput. Linguistics, 1990

Reversible Unification Based Machine Translation.
Proceedings of the 13th International Conference on Computational Linguistics, 1990

1989
An Approach To Sentence-Level Anaphora In Machine Translation.
Proceedings of the EACL 1989, 1989

A Semantic-Head-Driven Generation Algorithm for Unification-Based Formalisms.
Proceedings of the 27th Annual Meeting of the Association for Computational Linguistics, 1989

