Maarten Janssen

Orcid: 0000-0003-0272-6318

According to our database1, Maarten Janssen authored at least 20 papers between 2007 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of five.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
UDMorph: Morphosyntactically Tagged UD Corpora.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
TEITOK API - Programmable DH Corpora.
Proceedings of the Annual International Conference of the Alliance of Digital Humanities Organizations, 2023

How Corpus Analysis Helps Operationalize Research Questions and Entices Literary Scholars to Learn Programming.
Proceedings of the Annual International Conference of the Alliance of Digital Humanities Organizations, 2023

2021
A Corpus with Wavesurfer and TEI: Speech and Video in TEITOK.
Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021

2020
Integrating TEITOK and KonText/PMLTQ at LINDAT.
Proceedings of the Selected Papers from the CLARIN Annual Conference 2020, 2020

2018
TEITOK as a tool for Dependency Grammar.
Proces. del Leng. Natural, 2018

Technical Implementation of the Vocabulário Ortográfico Comum da Língua Portuguesa.
Proceedings of the Computational Processing of the Portuguese Language, 2018

Dependency Graphs and TEITOK: Exploiting Dependency Parsing.
Proceedings of the Computational Processing of the Portuguese Language, 2018

Adding Words to Manuscripts: From PagesXML to TEITOK.
Proceedings of the Digital Libraries for Open Knowledge, 2018

2017
Improving POS Tagging in Old Spanish Using TEITOK.
Proceedings of the NoDaLiDa 2017 Workshop on Processing Historical Language, 2017

2016
The COPLE2 corpus: a learner corpus for Portuguese.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

TEITOK: Text-Faithful Annotated Corpora.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2013
POS Tagging and Less Resources Languages Individuated Features in CorpusWiki.
Proceedings of the Human Language Technology. Challenges for Computer Science and Linguistics, 2013

2012
NeoTag: a POS Tagger for Grammatical Neologism Detection.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

A Rule Based Pronunciation Generator and Regional Accent Databank for Portuguese.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011
Towards an Enterprise Information Subsystem for Measuring (Perceived) Landside Accessibility of Airports.
Proceedings of the ENTERprise Information Systems - International Conference, 2011

2010
Combining Resources: Taxonomy Extraction from Multiple Dictionaries.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

2008
Spock - a Spoken Corpus Client.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

2007
Portal da Lingua Portuguesa.
Proces. del Leng. Natural, 2007


  Loading...