Walter Daelemans

Orcid: 0000-0002-9832-7890

Affiliations:
  • University of Antwerp, Belgium


According to our database, Walter Daelemans authored at least 256 papers between 1987 and 2024.

Bibliography

2024
Towards a large scale analysis of claims: developing a machine learning method for detecting and classifying politicians' claims of representation.
J. Comput. Soc. Sci., April, 2024

Bag of Lies: Robustness in Continuous Pre-training BERT.
CoRR, 2024

PersonalityChat: Conversation Distillation for Personalized Dialog Modeling with Facts and Traits.
CoRR, 2024

Model Priming with Triplet Loss for Few-Shot Emotion Classification in Text.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

2023
Transfer Learning for the Visual Arts: The Multi-modal Retrieval of Iconclass Codes.
ACM Journal on Computing and Cultural Heritage, June, 2023

Who are the haters? A corpus-based demographic analysis of authors of hate speech.
Frontiers Artif. Intell., February, 2023

Proposal for a framework of contextual metadata in selected research infrastructures of the life sciences and the social sciences & humanities.
Int. J. Metadata Semant. Ontologies, 2023

Improving Dutch Vaccine Hesitancy Monitoring via Multi-Label Data Augmentation with GPT-3.5.
Proceedings of the 13th Workshop on Computational Approaches to Subjectivity, 2023

Combining Active Learning and Task Adaptation with BERT for Cost-Effective Annotation of Social Media Datasets.
Proceedings of the 13th Workshop on Computational Approaches to Subjectivity, 2023

Advancing Topical Text Classification: A Novel Distance-Based Method with Contextual Embeddings.
Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, 2023

2022
Language Report Dutch.
Proceedings of the European Language Equality, 2022

EmoLabel: Semi-Automatic Methodology for Emotion Annotation of Social Media Text.
IEEE Trans. Affect. Comput., 2022

Linguistic Accommodation in Teenagers' Social Media Writing: Convergence Patterns in Mixed-gender Conversations.
J. Quant. Linguistics, 2022

An Ensemble Approach for Dutch Cross-Domain Hate Speech Detection.
Proceedings of the Natural Language Processing and Information Systems, 2022

Detecting Vaccine Skepticism on Twitter Using Heterogeneous Information Networks.
Proceedings of the Natural Language Processing and Information Systems, 2022

Cyberbullying Classifiers are Sensitive to Model-Agnostic Perturbations.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

CoNTACT: A Dutch COVID-19 Adapted BERT for Vaccine Hesitancy and Argumentation Detection.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Domain- and Task-Adaptation for VaccinChatNL, a Dutch COVID-19 FAQ Answering Corpus and Classification Model.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Open-Domain Dialog Evaluation Using Follow-Ups Likelihood.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Is It Smaller Than a Tennis Ball? Language Models Play the Game of Twenty Questions.
Proceedings of the Fifth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2022

2021
Current limitations in cyberbullying detection: On evaluation criteria, reproducibility, and data scarcity.
Lang. Resour. Evaluation, 2021

Interlocutors' Age Impacts Teenagers' Online Writing Style: Accommodation in Intra- and Intergenerational Online Conversations.
Frontiers Artif. Intell., 2021

Advances in Digital Music Iconography: Benchmarking the detection of musical instruments in unrestricted, non-photorealistic images from the artistic domain.
Digit. Humanit. Q., 2021

Teach Me What to Say and I Will Learn What to Pick: Unsupervised Knowledge Selection Through Response Generation with Pretrained Generative Models.
CoRR, 2021

MFAQ: a Multilingual FAQ Dataset.
CoRR, 2021

ConveRT for FAQ Answering.
CoRR, 2021

Exploring Stylometric and Emotion-Based Features for Multilingual Cross-Domain Hate Speech Detection.
Proceedings of the Eleventh Workshop on Computational Approaches to Subjectivity, 2021

Multi-modal Label Retrieval for the Visual Arts: The Case of Iconclass.
Proceedings of the 13th International Conference on Agents and Artificial Intelligence, 2021

Mapping probability word problems to executable representations.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Transfer Learning with Style Transfer between the Photorealistic and Artistic Domain.
Proceedings of the Computer Vision and Image Analysis of Art 2021, 2021

Conceptual Grounding Constraints for Truly Robust Biomedical Name Representations.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Contextual explanation rules for neural clinical classifiers.
Proceedings of the 20th Workshop on Biomedical Language Processing, 2021

Are we there yet? Exploring clinical domain knowledge of BERT models.
Proceedings of the 20th Workshop on Biomedical Language Processing, 2021

Scalable Few-Shot Learning of Robust Biomedical Name Representations.
Proceedings of the 20th Workshop on Biomedical Language Processing, 2021

Integrating Higher-Level Semantics into Robust Biomedical Name Representations.
Proceedings of the 12th International Workshop on Health Text Mining and Information Analysis, 2021

2020
Distilling neural networks into skipgram-level decision lists.
CoRR, 2020

Character-Level Transformer-Based Neural Machine Translation.
Proceedings of the NLPIR 2020: 4th International Conference on Natural Language Processing and Information Retrieval, 2020

Orthographic Codes and the Neighborhood Effect: Lessons from Information Theory.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020


BART for Knowledge Grounded Conversations.
Proceedings of the KDD 2020 Workshop on Conversational Systems Towards Mainstream Adoption co-located with the 26TH ACM SIGKDD Conference on Knowledge Discovery and Data Mining (SIGKDD 2020), 2020

Transfer Learning for Digital Heritage Collections: Comparing Neural Machine Translation at the Subword-level and Character-level.
Proceedings of the 12th International Conference on Agents and Artificial Intelligence, 2020

A Deep Generative Approach to Native Language Identification.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Streaming Language-Specific Twitter Data with Optimal Keywords.
Proceedings of the 12th Web as Corpus Workshop, 2020

Sarcasm Detection Using an Ensemble Approach.
Proceedings of the Second Workshop on Figurative Language Processing, 2020

2019
PAN19 Authorship Analysis: Cross-Domain Authorship Attribution.
Dataset, November, 2019

Discourse lexicon induction for multiple languages and its use for gender profiling.
Digit. Scholarsh. Humanit., 2019

Unsupervised concept extraction from clinical text through semantic composition.
J. Biomed. Informatics, 2019

Why can't memory networks read effectively?
CoRR, 2019

A weakly supervised sequence tagging and grammar induction approach to semantic frame slot filling.
CoRR, 2019

Effective weakly supervised semantic frame induction using expression sharing in hierarchical hidden Markov models.
CoRR, 2019

Overview of the CLIN29 Shared Task on Cross-Genre Gender Prediction in Dutch.
Proceedings of the Shared Task on Cross-Genre Gender Prediction in Dutch at CLIN29 (GxG@CLIN29) co-located with the 29th Conference on Computational Linguistics in The Netherlands (CLIN29), 2019

Overview of the Cross-domain Authorship Attribution Task at PAN 2019.
Proceedings of the Working Notes of CLEF 2019, 2019

Overview of PAN 2019: Bots and Gender Profiling, Celebrity Profiling, Cross-Domain Authorship Attribution and Style Change Detection.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2019

Evolution of the PAN Lab on Digital Text Forensics.
Proceedings of the Information Retrieval Evaluation in a Changing World, 2019

2018
PAN18 Multi-Author Analysis: Style-Change-Detection.
Dataset, September, 2018

PAN18 Author Identification: Attribution.
Dataset, September, 2018

Patient representation learning and interpretable evaluation using clinical notes.
J. Biomed. Informatics, 2018

Multilingual Cross-domain Perspectives on Online Hate Speech.
CoRR, 2018

Automatic Detection of Cyberbullying in Social Media Text.
CoRR, 2018

Predicting Adolescents' Educational Track from Chat Messages on Dutch Social Media.
Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, 2018

Exploring Classifier Combinations for Language Variety Identification.
Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects, 2018

CliCR: a Dataset of Clinical Case Reports for Machine Reading Comprehension.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

WordKit: a Python Package for Orthographic and Phonological Featurization.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Rule induction for global explanation of trained models.
Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018

Deep Transfer Learning for Art Classification Problems.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

From Strings to Other Things: Linking the Neighborhood and Transposition Effects in Word Reading.
Proceedings of the 22nd Conference on Computational Natural Language Learning, 2018

Enhancing General Sentiment Lexicons for Domain-Specific Use.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Overview of the Author Identification Task at PAN-2018: Cross-domain Authorship Attribution and Style Change Detection.
Proceedings of the Working Notes of CLEF 2018, 2018

Revisiting neural relation classification in clinical notes with external information.
Proceedings of the Ninth International Workshop on Health Text Mining and Information Analysis, 2018

2017
POS Tagging.
Encyclopedia of Machine Learning and Data Mining, 2017

Lemmatization for variation-rich languages using deep learning.
Digit. Scholarsh. Humanit., 2017

Literary detective work on the computer. Michael P. Oakes.
Digit. Scholarsh. Humanit., 2017

Assigning clinical codes with data-driven concept representation on Dutch clinical free text.
J. Biomed. Informatics, 2017

Selecting relevant features from the electronic health record for clinical code prediction.
J. Biomed. Informatics, 2017

Unsupervised patient representations from clinical notes with interpretable classification decisions.
CoRR, 2017

Unsupervised Context-Sensitive Spelling Correction of English and Dutch Clinical Free-Text with Word and Character N-Gram Embeddings.
CoRR, 2017

Assessing the Stylistic Properties of Neurally Generated Text in Authorship Attribution.
CoRR, 2017

Towards the Improvement of Automatic Emotion Pre-annotation with Polarity and Subjective Information.
Proceedings of the International Conference Recent Advances in Natural Language Processing, 2017

A Short Review of Ethical Challenges in Clinical Natural Language Processing.
Proceedings of the First ACL Workshop on Ethics in Natural Language Processing, 2017

Evidence for a facilitatory effect of multi-word units on child word learning.
Proceedings of the 39th Annual Meeting of the Cognitive Science Society, 2017

Distributional learning and lexical category acquisition: What makes words easy to categorize?
Proceedings of the 39th Annual Meeting of the Cognitive Science Society, 2017

Overview of the Author Identification Task at PAN-2017: Style Breach Detection and Author Clustering.
Proceedings of the Working Notes of CLEF 2017, 2017

Unsupervised Context-Sensitive Spelling Correction of Clinical Free-Text with Word and Character N-Gram Embeddings.
Proceedings of the BioNLP 2017, Vancouver, Canada, August 4, 2017, 2017

Clinical Machine Comprehension Using Case Reports.
Proceedings of the AMIA 2017, 2017

Simple Queries as Distant Labels for Predicting Gender on Twitter.
Proceedings of the 3rd Workshop on Noisy User-generated Text, 2017

2016

TwiSty: a multilingual Twitter Stylometry corpus for gender and personality profiling.
Dataset, March, 2016

Multimodular Text Normalization of Dutch User-Generated Content.
ACM Trans. Intell. Syst. Technol., 2016

The strategic impact of META-NET on the regional, national and international level.
Lang. Resour. Evaluation, 2016

Data integration of structured and unstructured sources for assigning clinical codes to patient stays.
J. Am. Medical Informatics Assoc., 2016

Authenticating the writings of Julius Caesar.
Expert Syst. Appl., 2016

A Dictionary-based Approach to Racism Detection in Dutch Social Media.
CoRR, 2016

The Effects of Age, Gender and Region on Non-standard Linguistic Variation in Online Social Networks.
CoRR, 2016

Predicting the Effectiveness of Self-Training: Application to Sentiment Classification.
CoRR, 2016

TwiSty: A Multilingual Twitter Stylometry Corpus for Gender and Personality Profiling.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Authorship Verification with the Ruzicka Metric.
Proceedings of the 11th Annual International Conference of the Alliance of Digital Humanities Organizations, 2016

Constraining the Search Space in Cross-Situational Word Learning: Different Models Make Different Predictions.
Proceedings of the 38th Annual Meeting of the Cognitive Science Society, 2016

Clustering by Authorship Within and Across Documents.
Proceedings of the Working Notes of CLEF 2016, 2016

Overview of the 4th Author Profiling Task at PAN 2016: Cross-Genre Evaluations.
Proceedings of the Working Notes of CLEF 2016, 2016

Using Distributed Representations to Disambiguate Biomedical and Clinical Concepts.
Proceedings of the 15th Workshop on Biomedical Natural Language Processing, 2016

2015
CLiPS Stylometry Investigation (CSI) Corpus.
Dataset, October, 2015

PAN15 Author Identification: Verification.
Dataset, September, 2015

Automatic monitoring of cyberbullying on social networking sites: From technological feasibility to desirability.
Telematics Informatics, 2015

Detection and Fine-Grained Classification of Cyberbullying Events.
Proceedings of the Recent Advances in Natural Language Processing, 2015

Overview of the Author Identification Task at PAN 2015.
Proceedings of the Working Notes of CLEF 2015, 2015

Overview of the 3rd Author Profiling Task at PAN 2015.
Proceedings of the Working Notes of CLEF 2015, 2015

2014

AuCoPro - Semantics.
Dataset, January, 2014

Evaluating and understanding text-based stock price prediction models.
Inf. Process. Manag., 2014

Lazy and Eager Relational Learning Using Graph-Kernels.
Proceedings of the Statistical Language and Speech Processing, 2014

Using Wiktionary to Build an Italian Part-of-Speech Tagger.
Proceedings of the Natural Language Processing and Information Systems, 2014

Evaluating Content-Independent Features for Personality Recognition.
Proceedings of the 2014 ACM Multi Media on Workshop on Computational Personality Recognition, 2014

CLiPS Stylometry Investigation (CSI) corpus: A Dutch corpus for the detection of age, gender, personality, sentiment and deception in text.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Creative Web Services with Pattern.
Proceedings of the Fifth International Conference on Computational Creativity, 2014

Overview of the Author Identification Task at PAN 2014.
Proceedings of the Working Notes for CLEF 2014 Conference, 2014

Overview of the Author Profiling Task at PAN 2014.
Proceedings of the Working Notes for CLEF 2014 Conference, 2014

2013

Personae corpus.
Dataset, November, 2013

COREA: Coreference Resolution for Extracting Answers for Dutch.
Proceedings of the Essential Speech and Language Technology for Dutch, 2013

A Self Learning Vocal Interface for Speech-impaired Users.
Proceedings of the Fourth Workshop on Speech and Language Processing for Assistive Technologies, 2013

Self-taught assistive vocal interfaces: an overview of the ALADIN project.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Explanation in Computational Stylometry.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2013

2012
Robust Rhymes? The Stability of Authorial Style in Medieval Narratives.
J. Quant. Linguistics, 2012

Pattern for Python.
J. Mach. Learn. Res., 2012

Media coverage in times of political crisis: A text mining approach.
Expert Syst. Appl., 2012

"Vreselijk mooi!" (terribly beautiful): A Subjectivity Lexicon for Dutch Adjectives.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

ConanDoyle-neg: Annotation of negation cues and their scope in Conan Doyle stories.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

The Netlog Corpus. A Resource for the Study of Flemish Dutch Internet Language.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

A Self-Learning Assistive Vocal Interface Based on Vocabulary Learning and Grammar Induction.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

A Statistical Relational Learning Approach to Identifying Evidence Based Medicine Categories.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Evaluating Unmasking for Cross-Genre Authorship Verification.
Proceedings of the 7th Annual International Conference of the Alliance of Digital Humanities Organizations, 2012

Improving Topic Classification for Highly Inflective Languages.
Proceedings of the COLING 2012, 2012

Conversation Level Constraints on Pedophile Detection in Chat Rooms.
Proceedings of the CLEF 2012 Evaluation Labs and Workshop, 2012

Machine Reading of Biomedical Texts about Alzheimer's Disease.
Proceedings of the CLEF 2012 Evaluation Labs and Workshop, 2012

Annotating Modality and Negation for a Machine Reading Evaluation.
Proceedings of the CLEF 2012 Evaluation Labs and Workshop, 2012

Towards a Self-Learning Assistive Vocal Interface: Vocabulary and Grammar Learning.
Proceedings of the 1st Workshop on Speech and Multimodal Interaction in Assistive Environments, 2012

2011
Constraint-Satisfaction Inference for Entity Recognition.
Proceedings of the Interactive Multi-modal Question-Answering, 2011

The effect of author set size and data size in authorship attribution.
Lit. Linguistic Comput., 2011

Assessment of NER solutions against the first and second CALBC Silver Standard Corpus.
J. Biomed. Semant., 2011

Automatic Emotion Classification for Interpersonal Communication.
Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis, 2011

Corpus-based approaches to processing the scope of negation cues: an evaluation of the state of the art.
Proceedings of the Ninth International Conference on Computational Semantics, 2011

Kernel-Based Logical and Relational Learning with kLog for Hedge Cue Detection.
Proceedings of the Inductive Logic Programming - 21st International Conference, 2011

BioGraph: Knowledge Discovery and Exploration in the Biomedical Domain.
Proceedings of the Data Mining Workshops (ICDMW), 2011

Generative Art Inspired by Nature, Using NodeBox.
Proceedings of the Applications of Evolutionary Computation, 2011

Overview of the QA4MRE Pilot Task: Annotating Modality and Negation for a Machine Reading Evaluation.
Proceedings of the CLEF 2011 Labs and Workshop, 2011

Intrinsic Plagiarism Detection Using Character Trigram Distance Scores - Notebook for PAN at CLEF 2011.
Proceedings of the CLEF 2011 Labs and Workshop, 2011

Text Mining in Biograph.
Proceedings of the CLEF 2011 Labs and Workshop, 2011

Predicting age and gender in online social networks.
Proceedings of the 3rd International CIKM Workshop on Search and Mining User-Generated Contents, 2011

2010
POS Tagging.
Encyclopedia of Machine Learning, 2010

Colin de la Higuera: Grammatical inference: learning automata and grammars - Cambridge University Press, 2010, iv + 417 pages.
Mach. Transl., 2010

Weigh your words - memory-based lemmatization for Middle Dutch.
Lit. Linguistic Comput., 2010

Highlights of the BioTM 2010 workshop on advances in bio text mining.
BMC Bioinform., 2010

On the Limits of Sentence Compression by Deletion.
Proceedings of the Empirical Methods in Natural Language Generation: Data-oriented Methods and Empirical Evaluation, 2010

Memory-Based Resolution of In-Sentence Scopes of Hedge Cues.
Proceedings of the Fourteenth Conference on Computational Natural Language Learning: Shared Task, 2010

A Chunk-Driven Bootstrapping Approach to Extracting Translation Patterns.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2010

2009
Prototype-based Active Learning for Lemmatization.
Proceedings of the Recent Advances in Natural Language Processing, 2009

Prepositional Phrase Attachment in Shallow Parsing.
Proceedings of the Recent Advances in Natural Language Processing, 2009

Is Sentence Compression an NLG task?
Proceedings of the ENLG 2009, 2009

A Robust and Extensible Exemplar-Based Model of Thematic Fit.
Proceedings of the EACL 2009, 12th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, Athens, Greece, March 30, 2009

A Metalearning Approach to Processing the Scope of Negation.
Proceedings of the Thirteenth Conference on Computational Natural Language Learning, 2009

Learning the Scope of Hedge Cues in Biomedical Texts.
Proceedings of the BioNLP Workshop, BioNLP@HLT-NAACL 2009, 2009

A memory-based learning approach to event extraction in biomedical texts.
Proceedings of the BioNLP 2009 Workshop Companion Volume for Shared Task, BioNLP@HLT-NAACL 2009, 2009

Memory-Based Language Processing.
Studies in natural language processing, Cambridge University Press, ISBN: 978-0-521-11445-5, 2009

2008
Guest Editors' introduction: special issue of selected papers from ECML PKDD 2008.
Mach. Learn., 2008

Personae: a Corpus for Author and Personality Prediction from Text.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

A Coreference Corpus and Resolution System for Dutch.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

CNTS: Memory-Based Learning of Generating Repeated References.
Proceedings of the INLG 2008, 2008

Learning the Scope of Negation in Biomedical Texts.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

A Combined Memory-Based Semantic Role Labeler of English.
Proceedings of the Twelfth Conference on Computational Natural Language Learning, 2008

Authorship Attribution and Verification with Many Authors and Limited Data.
Proceedings of the COLING 2008, 2008

Semantic and Syntactic Features for Dutch Coreference Resolution.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2008

2007
Letter to the Editor.
Comput. Linguistics, 2007

Disambiguation of the Neuter Pronoun and Its Effect on Pronominal Coreference Resolution.
Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

Invited talk: Text Analysis and Machine Learning for Stylometrics and Stylogenetics.
Proceedings of the 16th Nordic Conference of Computational Linguistics, 2007

Evaluating Hybrid Versus Data-Driven Coreference Resolution.
Proceedings of the Anaphora: Analysis, 2007

2006
A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Constraint Satisfaction Inference: Non-probabilistic Global Inference for Sequence Labelling.
Proceedings of the Workshop on Learning Structured Information in Natural Language Applications@EACL 2006, 2006

Investigating Lexical Substitution Scoring for Subtitle Generation.
Proceedings of the Tenth Conference on Computational Natural Language Learning, 2006

A Mission for Computational Natural Language Learning.
Proceedings of the Tenth Conference on Computational Natural Language Learning, 2006

2005
Improving sequence segmentation learning by predicting trigrams.
Proceedings of the BNAIC 2005, 2005

Machine learning of natural language (Maschinelles Lernen natürlicher Sprache).
Proceedings of the Quantitative Linguistik / Quantitative Linguistics, 2005

2004
Why Evaluate Ontology Technologies? Because It Works!
IEEE Intell. Syst., 2004

Using rule-induction techniques to model pronunciation variation in Dutch.
Comput. Speech Lang., 2004

Recent Advances in Example-Based Machine Translation edited by Michael Carl and Andy Way.
Comput. Linguistics, 2004

A Comparison of Two Different Approaches to Morphological Analysis of Dutch.
Proceedings of the 7th Meeting of the ACL Special Interest Group in Computational Phonology: Current Themes in Computational Phonology and Morphology, 2004

GAMBL, genetic algorithm optimization of memory-based WSD.
Proceedings of the Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, 2004

Unsupervised Text Mining for Ontology Extraction: An Evaluation of Statistical Measures.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Multimodal, Multilingual Resources in the Subtitling Process.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Evaluation and Adaptation of the Celex Dutch Morphological Database.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Automatic Sentence Simplification for Subtitling in Dutch and English.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Automatic Initiation of an Ontology.
Proceedings of the On the Move to Meaningful Internet Systems 2004: CoopIS, 2004

Memory-based semantic role labeling: Optimizing features, algorithm, and output.
Proceedings of the Eighth Conference on Computational Natural Language Learning, 2004

Shallow Text Analysis and Machine Learning for Authorship Attribution.
Proceedings of the Computational Linguistics in the Netherlands 2004, 2004

Learning Dutch Coreference Resolution.
Proceedings of the Computational Linguistics in the Netherlands 2004, 2004

2003
Combined Optimization of Feature Selection and Algorithm Parameters in Machine Learning of Language.
Proceedings of the Machine Learning: ECML 2003, 2003

Mining for Lexons: Applying Unsupervised Learning Methods to Create Ontology Bases.
Proceedings of the On The Move to Meaningful Internet Systems 2003: CoopIS, DOA, and ODBASE, 2003

Memory-Based Named Entity Recognition using Unannotated Data.
Proceedings of the Seventh Conference on Natural Language Learning, 2003

Reduction of Dutch Sentences for Automatic Subtitling.
Proceedings of the Computational Linguistics in the Netherlands 2003, 2003

Is Shallow Parsing Useful for Unsupervised Learning of Semantic Clusters?
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2003

Learning to Predict Pitch Accents and Prosodic Boundaries in Dutch.
Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, 2003

Feature-Rich Memory-Based Classification for Shallow NLP and Information Extraction.
Proceedings of the Text Mining, Theoretical Aspects and Applications, 2003

2002
Parameter optimization for machine-learning of word sense disambiguation.
Nat. Lang. Eng., 2002

Introduction to Special Issue on Machine Learning Approaches to Shallow Parsing.
J. Mach. Learn. Res., 2002

Logistic-based patient grouping for multi-disciplinary treatment.
Artif. Intell. Medicine, 2002

Evaluating the results of a memory-based word-expert approach to unrestricted word sense disambiguation.
Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions, 2002

Dutch Word Sense Disambiguation: Optimizing the Localness of Context.
Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions, 2002

Evaluation of Machine Learning Methods for Natural Language Processing Tasks.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

A Field Survey for Establishing Priorities in the Development of HLT Resources for Dutch.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

Dutch HLT resources: from BLARK to priority lists.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Combining information sources for memory-based pitch accent placement.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Transcription of out-of-vocabulary words in large vocabulary speech recognition based on phoneme-to-grapheme conversion.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Complex answers: a case study using a WWW question answering system.
Nat. Lang. Eng., 2001

Improving Accuracy in NLP Through Combination of Machine Learning Systems.
Comput. Linguistics, 2001

Predicting phrase breaks with memory-based learning.
Proceedings of the 4th ITRW on Speech Synthesis, 2001

Classifier Optimization and Combination in the English All Words Task.
Proceedings of Second International Workshop on Evaluating Word Sense Disambiguation Systems, 2001

A Named Entity Recognition System for Dutch.
Proceedings of the Computational Linguistics in the Netherlands 2001, 2001

Memory-Based Phoneme-to-Grapheme Conversion.
Proceedings of the Computational Linguistics in the Netherlands 2001, 2001

2000
Memory-Based Word Sense Disambiguation.
Comput. Humanit., 2000

Bootstrapping a Tagged Corpus through Combination of Existing Heterogeneous Taggers.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

Part of Speech Tagging and Lemmatisation for the Spoken Dutch Corpus.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

Meta-Learning for Phonemic Annotation of Corpora.
Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

The Role of Algorithm Bias vs Information Source in Learning Algorithms for Morphosyntactic Disambiguation.
Proceedings of the Fourth Conference on Computational Natural Language Learning, 2000

Genetic Algorithms for Feature Relevance Assignment in Memory-Based Language Processing.
Proceedings of the Fourth Conference on Computational Natural Language Learning, 2000

Applying System Combination to Base Noun Phrase Identification.
Proceedings of the COLING 2000, 18th International Conference on Computational Linguistics, Proceedings of the Conference, 2 Volumes, July 31, 2000

A Rule Induction Approach to Modeling Regional Pronunciation Variation.
Proceedings of the COLING 2000, 18th International Conference on Computational Linguistics, Proceedings of the Conference, 2 Volumes, July 31, 2000

1999
Forgetting Exceptions is Harmful in Language Learning.
Mach. Learn., 1999

Introduction to the special issue on memory-based language processing.
J. Exp. Theor. Artif. Intell., 1999

Machine learning of word pronunciation: the case against abstraction.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Cascaded Grammatical Relation Assignment.
Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, 1999

Memory-Based Shallow Parsing.
Proceedings of the 1999 Workshop on Computational Natural Language Learning, 1999

Machine learning for modeling Dutch pronunciation variation.
Proceedings of the Computational Linguistics in the Netherlands 1999, 1999

Lemmatisation and morphosyntactic annotation for the spoken Dutch corpus.
Proceedings of the Computational Linguistics in the Netherlands 1999, 1999

Memory-Based Morphological Analysis.
Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, 1999

1998
Abstraction is Harmful in Language Learning.
Proceedings of the Joint Conference on New Methods in Language Processing and Computational Natural Language Learning, 1998

Modularity in Inductively-Learned Word Pronunciation Systems.
Proceedings of the Joint Conference on New Methods in Language Processing and Computational Natural Language Learning, 1998

Do Not Forget: Full Memory in Memory-Based Learning of Word Pronunciation.
Proceedings of the Joint Conference on New Methods in Language Processing and Computational Natural Language Learning, 1998

Improving Data Driven Wordclass Tagging by System Combination.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

1997
IGTree: Using Trees for Compression and Classification in Lazy Learning Algorithms.
Artif. Intell. Rev., 1997

Empirical Learning of Natural Language Processing Tasks.
Proceedings of the Machine Learning: ECML-97, 1997

Resolving PP attachment Ambiguities with Memory-Based Learning.
Proceedings of the 1997 Meeting of the ACL Special Interest Group in Natural Language Learning: Computational Natural Language Learning, 1997

Memory-Based Learning: Using Similarity for Smoothing.
Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and 8th Conference of the European Chapter of the Association for Computational Linguistics, 1997

1996
Morphological Analysis as Classification: an Inductive-Learning Approach.
CoRR, 1996

Unsupervised Discovery of Phonological Categories through Supervised Learning of Morphological Rules.
Proceedings of the 16th International Conference on Computational Linguistics, 1996

MBT: A Memory-Based Part of Speech Tagger-Generator.
Proceedings of the Fourth Workshop on Very Large Corpora, 1996

1994
Measuring the Complexity of Writing Systems.
J. Quant. Linguistics, 1994

Default inheritance in an object-oriented representation of linguistic categories.
Int. J. Hum. Comput. Stud., 1994

The Acquisition of Stress: A Data-Oriented Approach.
Comput. Linguistics, 1994

A language-independent, data-oriented architecture for grapheme-to-phoneme conversion.
Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis, 1994

1993
Tabtalk: reusability in data-oriented grapheme-to-phoneme conversion.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Memory-Based Lexical Acquisition and Processing.
Proceedings of the Machine Translation and the Lexicon, 1993

Data-Oriented Methods for Grapheme-to-Phoneme Conversion.
Proceedings of the Sixth Conference of the European Chapter of the Association for Computational Linguistics, 1993

1992
Inheritance in Natural Language Processing.
Comput. Linguistics, 1992

1988
A Model of Dutch Morphophonology and its Applications.
AI Commun., 1988

GRAFON: a grapheme-to-phoneme conversion system for Dutch.
Proceedings of the 12th International Conference on Computational Linguistics, 1988

1987
A Tool For The Automatic Creation, Extension And Updating Of Lexical Knowledge Bases.
Proceedings of the EACL 1987, 1987

