Antal van den Bosch

Orcid: 0000-0003-2493-656X

  • Radboud University, Nijmegen, The Netherlands

According to our database, Antal van den Bosch authored at least 172 papers between 1993 and 2024.

A time-robust group recommender for featured comments on news platforms.
Frontiers Big Data, 2024

Re-evaluating the Tomes for the Times.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Corpus Creation and Automatic Alignment of Historical Dutch Dialect Speech.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Foundation models and the privatization of public knowledge.
Nat. Mac. Intell., August, 2023

Hybrid moderation in the newsroom: Recommending featured posts to content moderators.
CoRR, 2023

Contextual Profiling of Charged Terms in Historical Newspapers.
Proceedings of the 4th Conference on Language, Data and Knowledge, 2023

Lexicon or grammar? Using memory-based learning to investigate the syntactic relationship between Belgian and Netherlandic Dutch.
Nat. Lang. Eng., 2022

Unsupervised Text Segmentation Predicts Eye Fixations During Reading.
Frontiers Artif. Intell., 2022

Words matter: Case studies in Cultural AI.
Proceedings of the SETN 2022: 12th Hellenic Conference on Artificial Intelligence, Corfu, Greece, September 7, 2022

Correlating Political Party Names in Tweets, Newspapers and Election Results.
Proceedings of the LREC 2022 workshop on Natural Language Processing for Political Sciences, 2022

Detecting Minority Arguments for Mutual Understanding: A Moderation Tool for the Online Climate Change Debate.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Calculating Argument Diversity in Online Threads.
Proceedings of the 3rd Conference on Language, Data and Knowledge, 2021

Query-based summarization of discussion threads.
Nat. Lang. Eng., 2020

Uncovering the language of wine experts.
Nat. Lang. Eng., 2020

Monitoring stance towards vaccination in twitter messages.
BMC Medical Informatics Decis. Mak., 2020

Anticipointment Detection in Event Tweets.
Int. J. Artif. Intell. Tools, 2020

Optimising Twitter-based Political Election Prediction with Relevance and Sentiment Filters.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Frequency-specific brain dynamics related to prediction during language comprehension.
NeuroImage, 2019

Predicting life expectancy with a long short-term memory recurrent neural network using electronic medical records.
BMC Medical Informatics Decis. Mak., 2019

A Longitudinal Study on Twitter-Based Forecasting of Five Dutch National Elections.
Proceedings of the Social Informatics - 11th International Conference, 2019

Question Similarity in Community Question Answering: A Systematic Exploration of Preprocessing Methods and Models.
Proceedings of the International Conference on Recent Advances in Natural Language Processing, 2019

Quantifying Expectation Modulation in Human Speech Processing.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Listening with Great Expectations: An Investigation of Word Form Anticipations in Naturalistic Speech.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

A Comparative Study on Generalizability of Information Extraction Models on Protest News.
Proceedings of the Working Notes of CLEF 2019, 2019

Creating a reference data set for the summarization of discussion forum threads.
Lang. Resour. Evaluation, 2018

Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign.
Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects, 2018

Discovering the Language of Wine Reviews: A Text Mining Account.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

A Multilingual Wikified Data Set of Educational Material.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Aspect-based summarization of pros and cons in unstructured product reviews.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Hidden Markov Models.
Proceedings of the Encyclopedia of Machine Learning and Data Mining, 2017

Unraveling reported dreams with text analytics.
Digit. Humanit. Q., 2017

Exploring Lexical and Syntactic Features for Language Variety Identification.
Proceedings of the Fourth Workshop on NLP for Similar Languages, 2017

Supporting Experts to Handle Tweet Collections About Significant Events.
Proceedings of the Natural Language Processing and Information Systems, 2017

Overview of the 4th HistoInformatics Workshop.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Automatic Summarization of Domain-specific Forum Threads: Collecting Reference Data.
Proceedings of the 2017 Conference on Human Information Interaction and Retrieval, 2017

Predicting Liaison: an Example-Based Approach.
Trait. Autom. des Langues, 2016

Human-inspired modulation frequency features for noise-robust ASR.
Speech Commun., 2016

Estimating search engine index size variability: a 9-year longitudinal study.
Scientometrics, 2016

Open-domain extraction of future events from Twitter.
Nat. Lang. Eng., 2016

Information access in the art history domain: Evaluating a federated search engine for Rembrandt research.
Digit. Humanit. Q., 2016

Text analytics in industry: Challenges, desiderata and trends.
Comput. Ind., 2016

Editorial: Special issue on natural language processing and text analytics in industry.
Comput. Ind., 2016

Using Demographics in Predicting Election Results with Twitter.
Proceedings of the Social Informatics - 8th International Conference, 2016

Relevancer: Finding and Labeling Relevant Information in Tweet Collections.
Proceedings of the Social Informatics - 8th International Conference, 2016

Can Tweets Predict TV Ratings?
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Enhancing Access to Online Education: Quality Machine Translation of MOOC Content.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Nederlab: Towards a Single Portal and Research Environment for Diachronic Dutch Text Corpora.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Analysing the Role of Key Term Inflections in Knowledge Discovery on Twitter.
Proceedings of the 2nd International Workshop on Knowledge Discovery on the WEB, 2016

Abstractive Compression of Captions with Attentive Recurrent Neural Networks.
Proceedings of the INLG 2016, 2016

Using Relevancer to Detect Relevant Tweets: The Nepal Earthquake Case.
Proceedings of the Working notes of FIRE 2016, 2016

TraMOOC (Translation for Massive Open Online Courses): providing reliable MT for MOOCs.
Proceedings of the 19th Annual Conference of the European Association for Machine Translation: Projects/Products, 2016

Notoriously Toxic: The Language and Cost of Hate in the Chat Systems of Online Games.
Proceedings of the 11th Annual International Conference of the Alliance of Digital Humanities Organizations, 2016

Predicting Civil Unrest by Categorizing Dutch Twitter Events.
Proceedings of the BNAIC 2016: Artificial Intelligence, 2016

Improving cross-domain n-gram language modelling with skipgrams.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Very quaffable and great fun: Applying NLP to wine reviews.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Signaling sarcasm: From hyperbole to hashtag.
Inf. Process. Manag., 2015

Automatically Identifying Periodic Social Events from Twitter.
Proceedings of the Recent Advances in Natural Language Processing, 2015

A Longitudinal Analysis of Search Engine Index Size.
Proceedings of the 15th International Conference on Scientometrics and Informetrics, Istanbul, Turkey, June 29, 2015

Looking for Books in Social Media: An Analysis of Complex Search Requests.
Proceedings of the Advances in Information Retrieval, 2015

TraMOOC: Translation for Massive Open Online Courses.
Proceedings of the 18th Annual Conference of the European Association for Machine Translation, 2015

Animacy Detection in Stories.
Proceedings of the 6th Workshop on Computational Models of Narrative, 2015

The Love Equation: Computational Modeling of Romantic Relationships in French Classical Drama.
Proceedings of the 6th Workshop on Computational Models of Narrative, 2015

Multi-perspective Event Detection in Texts Documenting the 1944 Battle of Arnhem.
Proceedings of the Text Mining, 2014

Peter Spyns and Jan Odijk (eds): Essential speech and language technology for Dutch: results by the STEVIN programme - Springer, 2013, ISBN: 978-3-642-30909-0, xvii + 413 pp.
Mach. Transl., 2014

Automatic thematic classification of election manifestos.
Inf. Process. Manag., 2014

The 2nd HistoInformatics Workshop - Introduction.
Proceedings of the Social Informatics - SocInfo 2014 International Workshops, Barcelona, 2014

SemEval 2014 Task 5 - L2 Writing Assistant.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

Creating and using large monolingual parallel corpora for sentential paraphrase generation.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Using idiolects and sociolects to improve word prediction.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

Translation Assistance by Translation of L1 Fragments in an L2 Context.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

On the assessment of expertise profiles.
J. Assoc. Inf. Sci. Technol., 2013

The perfect solution for detecting sarcasm in tweets #not.
Proceedings of the 4th Workshop on Computational Approaches to Subjectivity, 2013

Documenting Social Unrest: Detecting Strikes in Historical Daily Newspapers.
Proceedings of the Social Informatics, 2013

WSD2: Parameter optimisation for Memory-based Cross-Lingual Word-Sense Disambiguation.
Proceedings of the 7th International Workshop on Semantic Evaluation, 2013

Memory-based Named Entity Recognition in Tweets.
Proceedings of the Concept Extraction Challenge at the Workshop on 'Making Sense of Microposts', 2013

Using character overlap to improve language transformation.
Proceedings of the 7th Workshop on Language Technology for Cultural Heritage, 2013

Relating Political Party Mentions on Twitter with Polls and Election Results.
Proceedings of the 13th Dutch-Belgian Workshop on Information Retrieval, 2013

Estimating the Time between Twitter Messages and Future Events.
Proceedings of the 13th Dutch-Belgian Workshop on Information Retrieval, 2013

Memory-based Grammatical Error Correction.
Proceedings of the Seventeenth Conference on Computational Natural Language Learning: Shared Task, 2013

The socialist network.
Decis. Support Syst., 2012

DutchSemCor: Targeting the ideal sense-tagged corpus.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

The effect of domain and text type on text prediction quality.
Proceedings of the EACL 2012, 2012

Memory-based text correction for preposition and determiner errors.
Proceedings of the Seventh Workshop on Building Educational Applications Using NLP, 2012

Sentence Simplification by Monolingual Machine Translation.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

Constraint-Satisfaction Inference for Entity Recognition.
Proceedings of the Interactive Multi-modal Question-Answering, 2011

Integrating source-language context into phrase-based statistical machine translation.
Mach. Transl., 2011

Fusing Recommendations for Social Bookmarking Web Sites.
Int. J. Electron. Commer., 2011

A Link to the Past: Constructing Historical Social Networks.
Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis, 2011

Enrichment and Structuring of Archival Description Metadata.
Proceedings of the 5th ACL Workshop on Language Technology for Cultural Heritage, 2011

Comparing Phrase-based and Syntax-based Paraphrase Generation.
Proceedings of the Workshop on Monolingual Text-To-Text Generation@ACL, 2011

Hidden Markov Models.
Proceedings of the Encyclopedia of Machine Learning, 2010

Paraphrase Generation as Monolingual Translation: Data and Evaluation.
Proceedings of the INLG 2010, 2010

On the Evaluation of Entity Profiles.
Proceedings of the Multilingual and Multimodal Information Access Evaluation, 2010

Supertags as Source Language Context in Hierarchical Phrase-Based SMT.
Proceedings of the 9th Conference of the Association for Machine Translation in the Americas: Research Papers, 2010

A Kids' Open Mind Common Sense.
Proceedings of the Commonsense Knowledge, 2010

Memory-Based Machine Translation and Language Modeling.
Prague Bull. Math. Linguistics, 2009

Making a Clean Sweep of Cultural Heritage.
IEEE Intell. Syst., 2009

Dependency Parsing and Semantic Role Labeling as a Single Task.
Proceedings of the Recent Advances in Natural Language Processing, 2009

Dependency Relations as Source Context in Phrase-Based SMT.
Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation, 2009

Instance-Driven Discovery of Ontological Relation Labels.
Proceedings of the Workshop on Language Technology and Resources for Cultural Heritage, Social Sciences, Humanities, and Education LaTeCH, 2009

A semantic relatedness metric based on free link structure (short paper).
Proceedings of the Eighth International Conference on Computational Semantics, 2009

Comparing Alternative Data-Driven Ontological Vistas of Natural History (short paper).
Proceedings of the Eighth International Conference on Computational Semantics, 2009

Clustering and Matching Headlines for Automatic Paraphrase Acquisition.
Proceedings of the ENLG 2009, 2009

A Constraint Satisfaction Approach to Machine Translation.
Proceedings of the 13th Annual conference of the European Association for Machine Translation, 2009

Joint Memory-Based Learning of Syntactic and Semantic Dependencies in Multiple Languages.
Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, 2009

Memory-Based Language Processing.
Studies in natural language processing, Cambridge University Press, ISBN: 978-0-521-11445-5, 2009

Recommending scientific articles using citeulike.
Proceedings of the 2008 ACM Conference on Recommender Systems, 2008

Efficient context-sensitive word completion for mobile devices.
Proceedings of the 10th Conference on Human-Computer Interaction with Mobile Devices and Services, 2008

A Personalized Recommender System for Writing in the Internet Age.
Proceedings of the ICEIS 2008, 2008

Letter to the Editor.
Comput. Linguistics, 2007

Exploiting source similarity for SMT using context-informed features.
Proceedings of the 11th Conference on Theoretical and Methodological Issues in Machine Translation of Natural Languages: Papers, 2007

Broad expertise retrieval in sparse data environments.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

ILK: Machine learning of semantic relations with shallow features and almost no data.
Proceedings of the 4th International Workshop on Semantic Evaluations, 2007

Comparing and evaluating information retrieval algorithms for news recommendation.
Proceedings of the 2007 ACM Conference on Recommender Systems, 2007

What a Proactive Recommendation System Needs - Relevance, Non-Intrusiveness, and a New Long-Term Memory.
Proceedings of the ICEIS 2007, 2007

A Rule-Based Approach for Process Discovery: Dealing with Noise and Imbalance in Process Logs.
Data Min. Knowl. Discov., 2006

Improved morpho-phonological sequence processing with constraint satisfaction inference.
Proceedings of the Eighth Meeting of the ACL Special Interest Group on Computational Phonology and Morphology, 2006

Identifying Named Entities in Text Databases from the Natural History Domain.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Authoritative Re-ranking of Search Results.
Proceedings of the Advances in Information Retrieval, 2006

Constraint Satisfaction Inference: Non-probabilistic Global Inference for Sequence Labelling.
Proceedings of the Workshop on Learning Structured Information in Natural Language Applications@EACL 2006, 2006

Dependency Parsing by Inference over High-recall Dependency Predictions.
Proceedings of the Tenth Conference on Computational Natural Language Learning, 2006

Memory-based understanding of user utterances in a spoken dialogue system: Effects of Feature selection and co-learning.
Proceedings of the 6th International Conference on Case-Based Reasoning, 2005

Hybrid Algorithms with Instance-Based Classification.
Proceedings of the Machine Learning: ECML 2005, 2005

Applying Spelling Error Correction Techniques for Improving Semantic Role Labelling.
Proceedings of the Ninth Conference on Computational Natural Language Learning, 2005

Improving sequence segmentation learning by predicting trigrams.
Proceedings of the BNAIC 2005, 2005

Memory-Based Morphological Analysis Generation and Part-of-Speech Tagging of Arabic.
Proceedings of the Workshop on Computational Approaches to Semitic Languages, 2005

GAMBL, genetic algorithm optimization of memory-based WSD.
Proceedings of the Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, 2004

Memory-based semantic role labeling: Optimizing features, algorithm, and output.
Proceedings of the Eighth Conference on Computational Natural Language Learning, 2004

Learning PP attachment for filtering prosodic phrasing.
Proceedings of the EACL 2003, 2003

Memory-based disfluency chunking.
Proceedings of the ISCA Tutorial and Research Workshop (ITRW) on Disfluency in Spontaneous Speech, 2003

Memory-based one-step named-entity recognition: Effects of seed list features, classifier stacking, and unannotated data.
Proceedings of the Seventh Conference on Natural Language Learning, 2003

A Memory-Based Shallow Parser for Spoken Dutch.
Proceedings of the Computational Linguistics in the Netherlands 2003, 2003

Learning to Predict Pitch Accents and Prosodic Boundaries in Dutch.
Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, 2003

Parameter optimization for machine-learning of word sense disambiguation.
Nat. Lang. Eng., 2002

Logistic-based patient grouping for multi-disciplinary treatment.
Artif. Intell. Medicine, 2002

Evaluating the results of a memory-based word-expert approach to unrestricted word sense disambiguation.
Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions, 2002

Dutch Word Sense Disambiguation: Optimizing the Localness of Context.
Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions, 2002

Combining information sources for memory-based pitch accent placement.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Process Mining: Discovering Direct Successors in Process Logs.
Proceedings of the Discovery Science, 5th International Conference, 2002

Shallow Parsing on the Basis of Words Only: A Case Study.
Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, 2002

Predicting phrase breaks with memory-based learning.
Proceedings of the 4th ITRW on Speech Synthesis, 2001

Dutch Word Sense Disambiguation: Data and Preliminary Results.
Proceedings of Second International Workshop on Evaluating Word Sense Disambiguation Systems, 2001

Multi-feature Error Detection in Spoken Dialogue Systems.
Proceedings of the Computational Linguistics in the Netherlands 2001, 2001

Detecting Problematic Turns in Human-Machine Interactions: Rule-induction Versus Memory-based Learning Approaches.
Proceedings of the Association for Computational Linguistics, 2001

Memory-Based Word Sense Disambiguation.
Comput. Humanit., 2000

Integrating Seed Names and ngrams for a Named Entity List and Classifier.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

Unpacking Multi-valued Symbolic Features and Classes in Memory-Based Language Learning.
Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

Systematic design of a 14-bit 150-MS/s CMOS current-steering D/A converter.
Proceedings of the 37th Conference on Design Automation, 2000

Single-Classifier Memory-Based Phrase Chunking.
Proceedings of the Fourth Conference on Computational Natural Language Learning, 2000

Using Induced Rules as Complex Features in Memory-Based Language Learning.
Proceedings of the Fourth Conference on Computational Natural Language Learning, 2000

Automatic detection of problematic turns in human-machine interactions.
Proceedings of the Computational Linguistics in the Netherlands 2000, 2000

Forgetting Exceptions is Harmful in Language Learning.
Mach. Learn., 1999

Careful abstraction from instance families in memory-based language learning.
J. Exp. Theor. Artif. Intell., 1999

Learning Statistically Neutral Tasks without Expert Guidance.
Proceedings of the Advances in Neural Information Processing Systems 12, NIPS Conference, Denver, Colorado, USA, November 29, 1999

Machine learning of word pronunciation: the case against abstraction.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Instance-Family Abstraction in Memory-Based Language Learning.
Proceedings of the Sixteenth International Conference on Machine Learning (ICML 1999), Bled, Slovenia, June 27, 1999

Memory-Based Morphological Analysis.
Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, 1999

Interpretable Neural Networks with BP-SOM.
Proceedings of the Tasks and Methods in Applied Artificial Intelligence, 1998

Interpretable Neural Networks with BP-SOM.
Proceedings of the Machine Learning: ECML-98, 1998

Modularity in Inductively-Learned Word Pronunciation Systems.
Proceedings of the Joint Conference on New Methods in Language Processing and Computational Natural Language Learning, 1998

Do Not Forget: Full Memory in Memory-Based Learning of Word Pronunciation.
Proceedings of the Joint Conference on New Methods in Language Processing and Computational Natural Language Learning, 1998

Behavioural Aspects of Combining Backpropagation Learning and Self-organizing Maps.
Connect. Sci., 1997

IGTree: Using Trees for Compression and Classification in Lazy Learning Algorithms.
Artif. Intell. Rev., 1997

Avoiding Overfitting with BP-SOM.
Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, 1997

Empirical Learning of Natural Language Processing Tasks.
Proceedings of the Machine Learning: ECML-97, 1997

Morphological Analysis as Classification: an Inductive-Learning Approach.
CoRR, 1996

Measuring the Complexity of Writing Systems.
J. Quant. Linguistics, 1994

A language-independent, data-oriented architecture for grapheme-to-phoneme conversion.
Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis, 1994

Tabtalk: reusability in data-oriented grapheme-to-phoneme conversion.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Data-Oriented Methods for Grapheme-to-Phoneme Conversion.
Proceedings of the Sixth Conference of the European Chapter of the Association for Computational Linguistics, 1993
