Chris Biemann

ORCID: 0000-0002-8449-9624

  • University of Hamburg, Department of Informatics, Germany
  • Darmstadt University of Technology, Computer Science Department, Germany

According to our database, Chris Biemann authored at least 280 papers between 2002 and 2024.


Text augmentation for semantic frame induction and parsing.
Lang. Resour. Evaluation, June, 2024

CogSteer: Cognition-Inspired Selective Layer Intervention for Efficient Semantic Steering in Large Language Models.
CoRR, 2024

Large Language Models Are Overparameterized Text Encoders.
CoRR, 2024

Demarked: A Strategy for Enhanced Abusive Speech Moderation through Counterspeech, Detoxification, and Message Management.
CoRR, 2024

Exploring Boundaries and Intensities in Offensive and Hate Speech: Unveiling the Complex Spectrum of Social Media Discourse.
CoRR, 2024

SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 14 Languages.
CoRR, 2024

Extending the Comparative Argumentative Machine: Multilingualism and Stance Detection.
Proceedings of the Robust Argumentation Machines - First International Conference, 2024

Concept Over Time Analysis: Unveiling Temporal Patterns for Qualitative Data Analysis.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: System Demonstrations, 2024

Fine-grained quotation detection and attribution in German news articles.
Proceedings of the 20th Conference on Natural Language Processing, 2024

Low-Resource Machine Translation through the Lens of Personalized Federated Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Story Embeddings - Narrative-Focused Representations of Fictional Stories.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

CAM 2.0: End-to-End Open Domain Comparative Question Answering System.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

On Zero-Shot Counterspeech Generation by LLMs.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Dataset of Quotation Attribution in German News Articles.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Tell Me Again! A Large-Scale Dataset of Multiple Summaries for the Same Story.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Extending the Discourse Analysis Tool Suite with Whiteboards for Visual Qualitative Analysis.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

On Improving Repository-Level Code QA for Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

Machine learning in computational literary studies.
it Inf. Technol., August, 2023

Dataset, February, 2023

Probing Language Models from A Human Behavioral Perspective.
CoRR, 2023

A system for Human-AI collaboration for Online Customer Support.
CoRR, 2023

DBLPLink: An Entity Linker for the DBLP Scholarly Knowledge Graph.
Proceedings of the ISWC 2023 Posters, 2023

LT at SemEval-2023 Task 1: Effective Zero-Shot Visual Word Sense Disambiguation Approaches using External Knowledge Sources.
Proceedings of the 17th International Workshop on Semantic Evaluation, 2023

Using Wikidata for Enhancing Compositionality in Pretrained Language Models.
Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, 2023

Exploring Amharic Hate Speech Data Collection and Classification Approaches.
Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, 2023

Multilingual Racial Hate Speech Detection Using Transfer Learning.
Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, 2023

From Qualitative to Quantitative Research: Semi-Automatic Annotation Scaling in the Digital Humanities.
Proceedings of the 19th Conference on Natural Language Processing (KONVENS 2023), 2023

Predicting Terms in IS-A Relations with Pre-trained Transformers.
Proceedings of the Findings of the Association for Computational Linguistics: IJCNLP-AACL 2023, 2023

Das Konzept des House of Computing and Data Science als Blaupause zur Digitalisierung der Forschung.
Proceedings of the 53. Jahrestagung der Gesellschaft für Informatik, INFORMATIK 2023, Designing Future, 2023

GETT-QA: Graph Embedding Based T2T Transformer for Knowledge Graph Question Answering.
Proceedings of the Semantic Web - 20th International Conference, 2023

Using Self-Supervised Dual Constraint Contrastive Learning for Cross-Modal Retrieval.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland

Dimensions of Similarity: Towards Interpretable Dimension-Based Text Similarity.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland

CodeAnno: Extending WebAnno with Hierarchical Document Level Annotation and Automation.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics. EACL 2023, 2023

Narrativität und Handlung.
Proceedings of the 9. Tagung des Verbands Digital Humanities im deutschsprachigen Raum, 2023

D-WISE - Digitale Wissenssoziologische Diskursanalyse.
Proceedings of the 9. Tagung des Verbands Digital Humanities im deutschsprachigen Raum, 2023

Open Science Prinzipien und interdisziplinäre Kollaboration in D-WISE: Zwischen Hermeneutik und Digitaler Methode in der Diskursanalyse.
Proceedings of the 9. Tagung des Verbands Digital Humanities im deutschsprachigen Raum, 2023

DBLP-QuAD: A Question Answering Dataset over the DBLP Scholarly Knowledge Graph.
Proceedings of the 13th International Workshop on Bibliometric-enhanced Information Retrieval co-located with 45th European Conference on Information Retrieval (ECIR 2023), 2023

Unsupervised Ultra-Fine Entity Typing with Distributionally Induced Word Senses.
Proceedings of the Analysis of Images, Social Networks and Texts, 2023

RuCAM: Comparative Argumentative Machine for the Russian Language.
Proceedings of the Analysis of Images, Social Networks and Texts, 2023

The D-WISE Tool Suite: Multi-Modal Machine-Learning-Powered Tools Supporting and Enhancing Digital Discourse Analysis.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2023

The Role of Output Vocabulary in T2T LMs for SPARQL Semantic Parsing.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

ARDIAS: AI-Enhanced Research Management, Discovery, and Advisory System.
Proceedings of the Workshop on Scientific Document Understanding co-located with 37th AAAI Conference on Artificial Intelligence (AAAI 2023), 2023

Hypernymy Detection for Low-resource Languages: A Study for Hindi, Bengali, and Amharic.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2022

Neural entity linking: A survey of models based on deep learning.
Semantic Web, 2022

Golden Retriever: A Real-Time Multi-Modal Text-Image Retrieval System with the Ability to Focus.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Modern Baselines for SPARQL Semantic Parsing.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Using distributional thesaurus to enhance transformer-based contextualized representations for low resource languages.
Proceedings of the SAC '22: The 37th ACM/SIGAPP Symposium on Applied Computing, Virtual Event, April 25, 2022

CWITR: A Corpus for Automatic Complex Word Identification in Turkish Texts.
Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval, 2022

MOTIF: Contextualized Images for Complex Words to Improve Human Reading.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Dataset of Student Solutions to Algorithm and Data Structure Programming Assignments.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Elvis vs. M. Jackson: Who has More Albums? Classification and Identification of Elements in Comparative Questions.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

More Like This: Semantic Retrieval with Linguistic Information.
Proceedings of the 18th Conference on Natural Language Processing (KONVENS 2022), 2022

Improved Open Source Automatic Subtitling for Lecture Videos.
Proceedings of the 18th Conference on Natural Language Processing (KONVENS 2022), 2022

Measuring Faithfulness of Abstractive Summaries.
Proceedings of the 18th Conference on Natural Language Processing (KONVENS 2022), 2022

Language over Labels: Contrastive Language Supervision Exceeds Purely Label-Supervised Classification Performance on Chest X-Rays.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

Modeling Referential Gaze in Task-oriented Settings of Varying Referential Complexity.
Proceedings of the Findings of the Association for Computational Linguistics: AACL-IJCNLP 2022, 2022

The 5Js in Ethiopia: Amharic Hate Speech Data Annotation Using Toloka Crowdsourcing Platform.
Proceedings of the International Conference on Information and Communication Technology for Development for Africa, 2022

D-WISE Tool Suite for the Sociology of Knowledge Approach to Discourse.
Proceedings of the Culture and Computing, 2022

Measuring Gender Bias in German Language Generation.
Proceedings of the 52. Jahrestagung der Gesellschaft für Informatik, INFORMATIK 2022, Informatik in den Naturwissenschaften, 2022

Overview of Touché 2022: Argument Retrieval - Extended Abstract.
Proceedings of the Advances in Information Retrieval, 2022

Overview of Touché 2022: Argument Retrieval.
Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum, Bologna, Italy, September 2022

Dataset, November, 2021

Dataset, November, 2021

Language Models Explain Word Reading Times Better Than Empirical Predictability.
Frontiers Artif. Intell., 2021

Introducing Various Semantic Models for Amharic: Experimentation and Evaluation with Multiple Tasks and Datasets.
Future Internet, 2021

How Hateful are Movies? A Study and Prediction on Movie Subtitles.
CoRR, 2021

Error Analysis of using BART for Multi-Document Summarization: A Study for English and German Language.
Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

ActiveAnno: General-Purpose Document-Level Annotation Tool with Active Learning Integration.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Demonstrations, 2021

Towards Layered Events and Schema Representations in Long Documents.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop, 2021

Word Complexity is in the Eye of the Beholder.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

LTUHH@STSS: Applying Coreference to Literary Scene Segmentation.
Proceedings of the Shared Task on Scene Segmentation co-located with the 17th Conference on Natural Language Processing (KONVENS 2021), 2021

MoM: Minutes of Meeting Bot.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Live Subtitling for BigBlueButton with Open-Source Software.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Impacts of Homophone Normalization on Semantic Models for Amharic.
Proceedings of the International Conference on Information and Communication Technology for Development for Africa, 2021

Gaze-based Multimodal Meaning Recovery for Noisy / Complex Environments.
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021

Probing Pre-trained Language Models for Semantic Attributes and their Values.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Overview of Touché 2021: Argument Retrieval - Extended Abstract.
Proceedings of the Advances in Information Retrieval, 2021

Forum 4.0: An Open-Source User Comment Analysis Framework.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, 2021

SCoT: Sense Clustering over Time: a tool for the analysis of lexical change.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, 2021

Which is Better for Deep Learning: Python or MATLAB? Answering Comparative Questions in Natural Language.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, 2021

Overview of Touché 2021: Argument Retrieval.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2021

Automated Event Annotation in Literary Texts.
Proceedings of the Conference on Computational Humanities Research, 2021

HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Dataset, September, 2020

Dataset, September, 2020

Answering Comparative Questions with Arguments.
Datenbank-Spektrum, 2020

Social Media Unrest Prediction during the COVID-19 Pandemic: Neural Implicit Motive Pattern Recognition as Psychometric Signs of Severe Crises.
CoRR, 2020

Introducing various Semantic Models for Amharic: Experimentation and Evaluation with multiple Tasks and Datasets.
CoRR, 2020

Individual corpora predict fast memory retrieval during reading.
CoRR, 2020

UHH-LT & LT2 at SemEval-2020 Task 12: Fine-Tuning of Pre-Trained Transformer Networks for Offensive Language Detection.
CoRR, 2020

Comparative Web Search Questions.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

Supervised Pun Detection and Location with Feature Engineering and Logistic Regression.
Proceedings of the 5th Swiss Text Analytics Conference and the 16th Conference on Natural Language Processing, 2020

UHH-LT at SemEval-2020 Task 12: Fine-Tuning of Pre-Trained Transformer Networks for Offensive Language Detection.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

Automatic Compilation of Resources for Academic Writing and Evaluating with Informal Word Identification and Paraphrasing System.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Word Sense Disambiguation for 158 Languages using Word Embeddings Only.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Improving Unsupervised Sparsespeech Acoustic Models with Categorical Reparameterization.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Enhancing a Theory-Focused Course Through the Introduction of Automatically Assessed Programming Exercises - Lessons Learned.
Proceedings of the Impact Papers at EC-TEL 2020, 2020

Touché: First Shared Task on Argument Retrieval.
Proceedings of the Advances in Information Retrieval, 2020

Exploring Amharic Sentiment Analysis from Social Media Texts: Building Annotation Tools and Classification Models.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Overview of Touché 2020: Argument Retrieval.
Proceedings of the Working Notes of CLEF 2020, 2020

Overview of Touché 2020: Argument Retrieval - Extended Abstract.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2020

Estimating the influence of auxiliary tasks for multi-task learning of sequence tagging tasks.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Datasets for Watset: Local-Global Graph Clustering with Applications in Sense and Frame Induction.
Dataset, April, 2019

Token-based spelling variant detection in Middle Low German texts.
Lang. Resour. Evaluation, 2019

Analysis of the Ethiopic Twitter Dataset for Abusive Speech in Amharic.
CoRR, 2019

Comput. Linguistics, 2019

Learning Graph Embeddings from WordNet-based Similarity Measures.
Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics, 2019

UHH-LT at SemEval-2019 Task 6: Supervised vs. Unsupervised Transfer Learning for Offensive Language Detection.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

HHMM at SemEval-2019 Task 2: Unsupervised Frame Induction using Contextualized Word Embeddings.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

LT Expertfinder: An Evaluation Framework for Expert Finding Methods.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Does BERT Make Any Sense? Interpretable Word Sense Disambiguation with Contextualized Embeddings.
Proceedings of the 15th Conference on Natural Language Processing, 2019

GermEval 2019 Task 1: Hierarchical Classification of Blurbs.
Proceedings of the 15th Conference on Natural Language Processing, 2019

Creating Information-maximizing Natural Language Messages Through Image Captioning-Retrieval.
Proceedings of the 15th Conference on Natural Language Processing, 2019

Neural classification with attention assessment of the implicit-association test OMT and prediction of subsequent academic success.
Proceedings of the 15th Conference on Natural Language Processing, 2019

Language-Agnostic Model for Aspect-Based Sentiment Analysis.
Proceedings of the 13th International Conference on Computational Semantics, 2019

SparseSpeech: Unsupervised Acoustic Unit Discovery with Memory-Augmented Sequence Autoencoders.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Annotation gesprochener Daten mit WebAnno-MM.
Proceedings of the 6. Tagung des Verbands Digital Humanities im deutschsprachigen Raum, 2019

Answering Comparative Questions: Better than Ten-Blue-Links?
Proceedings of the 2019 Conference on Human Information Interaction and Retrieval, 2019

Categorizing Comparative Sentences.
Proceedings of the 6th Workshop on Argument Mining, ArgMining@ACL 2019, Florence, Italy, 2019

Improving Neural Entity Disambiguation with Graph Embeddings.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Making Fast Graph-based Algorithms with Graph Metric Embeddings.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

On the Compositionality Prediction of Noun Phrases using Poincaré Embeddings.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Adversarial Learning of Privacy-Preserving Text Representations for De-Identification of Medical Records.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

TARGER: Neural Argument Mining at Your Fingertips.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Hierarchical Multi-label Classification of Text with Capsule Networks.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Every Child Should Have Parents: A Taxonomy Refinement Algorithm Based on Hyperbolic Term Embeddings.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

A framework for enriching lexical semantic resources with distributional semantics.
Nat. Lang. Eng., 2018

A comparison of graph-based word sense induction clustering algorithms in a pseudoword evaluation framework.
Lang. Resour. Evaluation, 2018

Transfer Learning from LDA to BiLSTM-CNN for Offensive Language Detection in Twitter.
CoRR, 2018

Categorization of Comparative Sentences for Argument Mining.
CoRR, 2018

Local-Global Graph Clustering with Applications in Sense and Frame Induction.
CoRR, 2018

Learning Graph Embeddings from WordNet-based Similarity Measures.
CoRR, 2018

Using Semantics for Granularities of Tokenization.
Comput. Linguistics, 2018

Simple Co-Occurrence Statistics Reproducibly Predict Association Ratings.
Cogn. Sci., 2018

Webis at TREC 2018: Common Core Track.
Proceedings of the Twenty-Seventh Text REtrieval Conference, 2018

New/s/leak 2.0 - Multilingual Information Extraction and Visualization for Investigative Journalism.
Proceedings of the Social Informatics, 2018

BomJi at SemEval-2018 Task 10: Combining Vector-, Pattern- and Graph-based Information to Identify Discriminative Attributes.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

Document-based Recommender System for Job Postings using Dense Representations.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

An Unsupervised Word Sense Disambiguation System for Under-Resourced Languages.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Retrofitting Word Representations for Unsupervised Sense Aware Word Similarities.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Improving Hypernymy Extraction with Distributional Semantic Classes.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Building a Web-Scale Dependency-Parsed Corpus from CommonCrawl.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Enriching Frame Representations with Distributionally Induced Senses.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

microNER: A Micro-Service for German Named Entity Recognition based on BiLSTM-CRF.
Proceedings of the 14th Conference on Natural Language Processing, 2018

Unsupervised Sense-Aware Hypernymy Extraction.
Proceedings of the 14th Conference on Natural Language Processing, 2018

Variations on the theme of variation: Dealing with spelling variation for finegrained POS tagging of historical texts.
Proceedings of the 14th Conference on Natural Language Processing, 2018

Unspeech: Unsupervised Speech Context Embeddings.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Demonstrating Par4Sem - A Semantic Writing Aid with Adaptive Paraphrasing.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 2018

A Multilingual Information Extraction Pipeline for Investigative Journalism.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 2018

Par4Sim - Adaptive Paraphrasing for Text Simplification.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

WebAnno-MM: EXMARaLDA meets WebAnno.
Proceedings of the Selected papers from the CLARIN Annual Conference 2018, 2018

LawStats - Large-Scale German Court Decision Evaluation Using Web Service Classifiers.
Proceedings of the Machine Learning and Knowledge Extraction, 2018

Between the Lines: Machine Learning for Prediction of Psychological Traits - A Survey.
Proceedings of the Machine Learning and Knowledge Extraction, 2018

A Report on the Complex Word Identification Shared Task 2018.
Proceedings of the Thirteenth Workshop on Innovative Use of NLP for Building Educational Applications@NAACL-HLT 2018, 2018

Unsupervised Semantic Frame Induction using Triclustering.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

What do we need to build explainable AI systems for the medical domain?
CoRR, 2017

Fighting with the Sparsity of Synonymy Dictionaries.
CoRR, 2017

Watset: Automatic Induction of Synsets from a Graph of Synonyms.
CoRR, 2017

Guidance for Multi-Type Entity Graphs from Text Collections.
Proceedings of the 8th International EuroVis Workshop on Visual Analytics, 2017

IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Question Answering and Implicit Dialogue Identification.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

IITPB at SemEval-2017 Task 5: Sentiment Prediction in Financial Text.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

STS-UHH at SemEval-2017 Task 1: Scoring Semantic Textual Similarity Using Supervised and Unsupervised Ensemble.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

Multilingual and Cross-Lingual Complex Word Identification.
Proceedings of the International Conference Recent Advances in Natural Language Processing, 2017

Entity-Centric Information Access with Human in the Loop for the Biomedical Domain.
Proceedings of the Biomedical NLP Workshop associated with RANLP 2017, Varna, 2017

Replacing OOV Words For Dependency Parsing With Distributional Semantics.
Proceedings of the 21st Nordic Conference on Computational Linguistics, 2017

Using Pseudowords for Algorithm Comparison: An Evaluation Framework for Graph-based Word Sense Induction.
Proceedings of the 21st Nordic Conference on Computational Linguistics, 2017

Feature Selection Using Multi-objective Optimization for Aspect Based Sentiment Analysis.
Proceedings of the Natural Language Processing and Information Systems, 2017

There's no 'Count or Predict' but task-based selection for distributional models.
Proceedings of the IWCS 2017 - 12th International Conference on Computational Semantics - Short papers, Montpellier, France, September 19, 2017

CWIG3G2 - Complex Word Identification Task across Three Text Genres and Two User Groups.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Unsupervised, Knowledge-Free, and Interpretable Word Sense Disambiguation.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Negative Sampling Improves Hypernymy Extraction Based on Projection Learning.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction and Disambiguation.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

The ContrastMedium Algorithm: Taxonomy Induction From Noisy Knowledge Graphs With Just A Few Links.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Storyfinder: Personalized Knowledge Base Construction and Management by Browsing the Web.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Matching, Re-Ranking and Scoring: Learning Textual Similarity by Incorporating Dependency Graph Alignment and Coverage Features.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2017

Fighting with the Sparsity of Synonymy Dictionaries for Automatic Synset Induction.
Proceedings of the Analysis of Images, Social Networks and Texts, 2017

Automatic Induction of Synsets from a Graph of Synonyms.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Data Knowl. Eng., 2016

An adaptive annotation approach for biomedical entity and relation recognition.
Brain Informatics, 2016

Linked Disambiguated Distributional Semantic Networks.
Proceedings of the Semantic Web - ISWC 2016, 2016

TAXI at SemEval-2016 Task 13: a Taxonomy Induction Method based on Lexico-Syntactic Patterns, Substrings and Focused Crawling.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain Dependency and Distributional Semantics Features for Aspect Based Sentiment Analysis.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Making Sense of Word Embeddings.
Proceedings of the 1st Workshop on Representation Learning for NLP, 2016

Unsupervised Compound Splitting With Distributional Semantics Rivals Supervised Methods.
Proceedings of the NAACL HLT 2016, 2016

Learning Paraphrasing for Multiword Expressions.
Proceedings of the 12th Workshop on Multiword Expressions, 2016

Impact of MWE Resources on Multiword Recognition.
Proceedings of the 12th Workshop on Multiword Expressions, 2016

A Web-based Tool for the Integrated Annotation of Semantic and Syntactic Structures.
Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities, 2016

Domain-Specific Corpus Expansion with Focused Webcrawling.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

SemRelData ― Multilingual Contextual Annotation of Semantic Relations between Nominals: Dataset and Guidelines.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Noun Sense Induction and Disambiguation using Graph-Based Distributional Semantics.
Proceedings of the 13th Conference on Natural Language Processing, 2016

Running into Brick Walls Attempting to Improve a Simple Unsupervised Parser.
Proceedings of the 13th Conference on Natural Language Processing, 2016

Demonstrating Ambient Search: Implicit Document Retrieval for Speech Streams.
Proceedings of the COLING 2016, 2016

Ambient Search: A Document Retrieval System for Speech Streams.
Proceedings of the COLING 2016, 2016

Towards a resource based on users' knowledge to overcome the Tip of the Tongue problem.
Proceedings of the 5th Workshop on Cognitive Aspects of the Lexicon, 2016

Vectors or Graphs? On Differences of Representations for Distributional Semantic Models.
Proceedings of the 5th Workshop on Cognitive Aspects of the Lexicon, 2016

Human and Machine Judgements for Russian Semantic Relatedness.
Proceedings of the Analysis of Images, Social Networks and Texts, 2016

EmpiriST: AIPHES - Robust Tokenization and POS-Tagging for Different Genres.
Proceedings of the 10th Web as Corpus Workshop, 2016

new/s/leak - Information Extraction and Visualization for Investigative Data Journalists.
Proceedings of ACL-2016 System Demonstrations, Berlin, Germany, August 7-12, 2016

Language Transfer Learning for Supervised Lexical Substitution.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

An automatic approach to identify word sense changes in text media across timescales.
Nat. Lang. Eng., 2015

Ontology-Based Interpretation of Natural Language Philipp Cimiano, Christina Unger, and John McCrae (University of Arminia Bielefeld, Germany) Morgan & Claypool, Synthesis Lectures on Human Language Technologies, March 2014, 178 pages, (doi: 10.2200/S00561ED1V01Y201401HLT024), $45.00.
Comput. Linguistics, 2015

Open Source German Distant Speech Recognition: Corpus and Acoustic Model.
Proceedings of the Text, Speech, and Dialogue - 18th International Conference, 2015

Distributional Semantics for Resolving Bridging Mentions.
Proceedings of the Recent Advances in Natural Language Processing, 2015

Do Supervised Distributional Methods Really Learn Lexical Inference Relations?
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

IIT-TUDA: System for Sentiment Analysis in Indian Languages Using Lexical Acquisition.
Proceedings of the Mining Intelligence and Knowledge Exploration, 2015

Using representation learning and out-of-domain data for a paralinguistic speech task.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Rule-based Dependency Parse Collapsing and Propagation for German and English.
Proceedings of the International Conference of the German Society for Computational Linguistics and Language Technology, 2015

Adaptive Natural Language Processing.
Proceedings of the International Conference of the German Society for Computational Linguistics and Language Technology, 2015

GermaNER: Free Open German Named Entity Recognition Tool.
Proceedings of the International Conference of the German Society for Computational Linguistics and Language Technology, 2015

A Single Word is not Enough: Ranking Multiword Expressions Using Distributional Semantics.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Interactive and Iterative Annotation for Biomedical Entity Recognition.
Proceedings of the Brain Informatics and Health - 8th International Conference, 2015

JoBimViz: A Web-based Visualization for Graph-based Distributional Semantic Models.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Computational Humanities - bridging the gap between Computer Science and Digital Humanities (Dagstuhl Seminar 14301).
Dagstuhl Reports, 2014

Networks of Names: Visual Exploration and Semi-Automatic Tagging of Social Networks from Newspaper Articles.
Comput. Graph. Forum, 2014

Towards Ambient Search.
Proceedings of the 16th LWA Workshops: KDML, 2014

Distributed Distributional Similarities of Google Books over the Centuries.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Lexical Substitution Dataset for German.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

NoSta-D Named Entity Annotation for German: Guidelines and Dataset.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Knowledge Discovery in Scientific Literature.
Proceedings of the 12th Edition of the Konvens Conference, 2014

Multiobjective Optimization and Unsupervised Lexical Acquisition for Named Entity Recognition and Classification.
Proceedings of the 11th International Conference on Natural Language Processing, 2014

Combining Supervised and Unsupervised Parsing for Distributional Similarity.
Proceedings of the COLING 2014, 2014

Automatic Annotation Suggestions and Custom Annotation Layers in WebAnno.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

That's sick dude!: Automatic identification of word sense change across different timescales.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

An Open Source Corpus and Recording Software for Distant Speech Recognition with the Microsoft Kinect.
Proceedings of the 11th ITG Symposium on Speech Communication, 2014

Creating a system for lexical substitutions from scratch using crowdsourcing.
Lang. Resour. Evaluation, 2013

Scalable Construction of High-Quality Web Corpora.
J. Lang. Technol. Comput. Linguistics, 2013

Text: now in 2D! A framework for lexical expansion with contextual similarity.
J. Lang. Model., 2013

JoBimText Visualizer: A Graph-based Approach to Contextualizing Distributional Similarity.
Proceedings of TextGraphs@EMNLP 2013: the 8th Workshop on Graph-based Methods for Natural Language Processing, 2013

From Global to Local Similarities: A Graph-Based Contextualization Method using Distributional Thesauri.
Proceedings of TextGraphs@EMNLP 2013: the 8th Workshop on Graph-based Methods for Natural Language Processing, 2013

SemEval-2013 Task 5: Evaluating Phrasal Semantics.
Proceedings of the 7th International Workshop on Semantic Evaluation, 2013

Supervised All-Words Lexical Substitution using Delexicalized Features.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Three Knowledge-Free Methods for Automatic Lexical Chain Extraction.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Scaling to Large³ Data: An Efficient and Effective Method to Compute Distributional Thesauri.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Exploring Cities in Crime: Significant Concordance and Co-occurrence in Quantitative Literary Analysis.
Proceedings of the Workshop on Computational Linguistics for Literature, 2013

WebAnno: A Flexible, Web-based and Visually Supported System for Distributed Annotations.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Text Segmentation with Topic Models.
J. Lang. Technol. Comput. Linguistics, 2012

Graph-Based Natural Language Processing and Information Retrieval Rada Mihalcea and Dragomir Radev (University of North Texas and University of Michigan) Cambridge, UK: Cambridge University Press, 2011, viii+192 pp; hardbound, ISBN 978-0-521-89613-9, $65.00.
Comput. Linguistics, 2012

UKP: Computing Semantic Textual Similarity by Combining Multiple Content Similarity Measures.
Proceedings of the 6th International Workshop on Semantic Evaluation, 2012

How Text Segmentation Algorithms Gain from Topic Models.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

Turk Bootstrap Word Sense Inventory 2.0: A Large-Scale Resource for Lexical Substitution.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Using Distributional Similarity for Lexical Expansion in Knowledge-based Word Sense Disambiguation.
Proceedings of the COLING 2012, 2012

Quantifying Semantics using Complex Network Analysis.
Proceedings of the COLING 2012, 2012

Structure Discovery in Natural Language.
Theory and Applications of Natural Language Processing, Springer, ISBN: 978-3-642-25923-4, 2012

Co-Occurrence Cluster Features for Lexical Substitutions in Context.
Proceedings of TextGraphs@ACL 2010 Workshop on Graph-based Methods for Natural Language Processing, 2010

Syntax is from Mars while Semantics from Venus! Insights from Spectral Analysis of Distributional Similarity Networks.
Proceedings of the ACL 2009, 2009

Unsupervised Parts-of-Speech Induction for Bengali.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

ASV Toolbox: a Modular Collection of Language Exploration Tools.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Unsupervised and Knowledge-Free Learning of Compound Splits and Periphrases.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2008

Similarity of Documents and Document Collections using Attributes with Low Noise.
Proceedings of the WEBIST 2007, 2007

Combining Contexts in Lexicon Learning for Semantic Parsing.
Proceedings of the 16th Nordic Conference of Computational Linguistics, 2007

Íslenskur Orðasjóður - Building a Large Icelandic Corpus.
Proceedings of the 16th Nordic Conference of Computational Linguistics, 2007

Unsupervised Natural Language Processing Using Graph Models.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

A Random Text Model for the Generation of Statistical Language Invariants.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

Das Structure Discovery Paradigma: Unüberwachte und Vorwissensfreie Automatische Sprachverarbeitung [Unsupervised and Knowledge-free Natural Language Processing in the Structure Discovery Paradigm].
Proceedings of the Ausgezeichnete Informatikdissertationen 2007, 2007

Supporting Web-based Address Extraction with Unsupervised Tagging.
Proceedings of the Data Analysis, Machine Learning and Applications, 2007

Unsupervised and knowledge-free natural language processing in the structure discovery paradigm.
PhD thesis, 2007

Corpus Portal for Search in Monolingual Corpora.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Ord i Dag: Mining Norwegian Daily Newswire.
Proceedings of the Advances in Natural Language Processing, 2006

Unsupervised Part-of-Speech Tagging Employing Efficient Graph Clustering.
Proceedings of the ACL 2006, 2006

Ontology Learning from Text: A Survey of Methods.
LDV Forum, 2005

Rigorous dimensionality reduction through linguistically motivated feature selection for text categorization.
Proceedings of the 15th Nordic Conference of Computational Linguistics, 2005

Dictionary acquisition using parallel text and co-occurrence statistics.
Proceedings of the 15th Nordic Conference of Computational Linguistics, 2005

Automatic Extension of Feature-based Semantic Lexicons via Contextual Attributes.
Proceedings of the From Data and Information Analysis to Knowledge Engineering, 2005

Disentangling from Babylonian Confusion - Unsupervised Language Identification.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2005

Lernen paradigmatischer Relationen auf iterierten Kollokationen.
LDV Forum, 2004

SemanticTalk: Software for Visualizing Brainstorming Sessions and Thematic Concept Trails on Document Collections.
Proceedings of the Knowledge Discovery in Databases: PKDD 2004, 2004

Linguistic Corpus Search.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Web Services for Language Resources and Language Technology Applications.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Automatic Acquisition of Paradigmatic Relations Using Iterated Co-occurrences.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Automatically Building Concept Structures and Displaying Concept Trails for the Use in Brainstorming Sessions and Content Management Systems.
Proceedings of the Innovative Internet Community Systems, 4th International Workshop, 2004

Semiautomatic Extension of CoreNet using a Bootstrapping Mechanism on Corpus-based Co-occurrences.
Proceedings of the COLING 2004, 2004

Language-Independent Methods for Compiling Monolingual Lexical Data.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2004

Extraktion von semantischen Relationen aus natürlichsprachlichem Text mit Hilfe von maschinellem Lernen.
LDV Forum, 2003

Automatic Discovery and Aggregation of Compound Names for the Use in Knowledge Representations.
J. Univers. Comput. Sci., 2003

Named Entity Learning and Verification: Expectation Maximization in Large Corpora.
Proceedings of the 6th Conference on Natural Language Learning, 2002
