Anna Korhonen

Orcid: 0000-0002-3692-3144

Affiliations:
  • University of Cambridge, Language Technology Laboratory, UK


According to our database, Anna Korhonen authored at least 226 papers between 1998 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning.
Trans. Assoc. Comput. Linguistics, 2024

MURI: High-Quality Instruction Tuning Datasets for Low-Resource Languages via Reverse Instructions.
CoRR, 2024

SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists.
CoRR, 2024

Culturally Aware and Adapted NLP: A Taxonomy and a Survey of the State of the Art.
CoRR, 2024

Spectral Editing of Activations for Large Language Model Alignment.
CoRR, 2024

Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators.
CoRR, 2024

Analyzing and Adapting Large Language Models for Few-Shot Multilingual NLU: Are We There Yet?
CoRR, 2024

Scaling Sparse Fine-Tuning to Large Language Models.
CoRR, 2024

DIALIGHT: Lightweight Multilingual Development and Evaluation of Task-Oriented Dialogue Systems with Large Language Models.
CoRR, 2024

CALRec: Contrastive Alignment of Generative LLMs for Sequential Recommendation.
Proceedings of the 18th ACM Conference on Recommender Systems, 2024

SQATIN: Supervised Instruction Tuning Meets Question Answering for Improved Dialogue NLU.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Are Large Language Models Temporally Grounded?
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

DIALIGHT: Lightweight Multilingual Development and Evaluation of Task-Oriented Dialogue Systems with Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: System Demonstrations, 2024

Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

SynthEval: Hybrid Behavioral Testing of NLP Models with Synthetic Evaluation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

TurkishMMLU: Measuring Massive Multitask Language Understanding in Turkish.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

TopViewRS: Vision-Language Models as Top-View Spatial Reasoners.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

LongForm: Effective Instruction Tuning with Reverse Instructions.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

"Seeing the Big through the Small": Can LLMs Approximate Human Judgment Distributions on NLI from a Few Explanations?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Investigating the Potential of Task Arithmetic for Cross-Lingual Transfer.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

LoSST-AD: A Longitudinal Corpus for Tracking Alzheimer's Disease Related Changes in Spontaneous Speech.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Reranking Overgenerated Responses for End-to-End Task-Oriented Dialogue Systems.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Can Rule-Based Insights Enhance LLMs for Radiology Report Classification? Introducing the RadPrompt Methodology.
Proceedings of the 23rd Workshop on Biomedical Natural Language Processing, 2024

Self-Augmented In-Context Learning for Unsupervised Word Translation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

Your Prompt Is My Command: On Assessing the Human-Centred Generality of Multimodal Models (Abstract Reprint).
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Navigating the development challenges in creating complex data systems.
Nat. Mac. Intell., July, 2023

Cross-Lingual Dialogue Dataset Creation via Outline-Based Generation.
Trans. Assoc. Comput. Linguistics, 2023

Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems.
Trans. Assoc. Comput. Linguistics, 2023

Your Prompt is My Command: On Assessing the Human-Centred Generality of Multimodal Models.
J. Artif. Intell. Res., 2023

On Task Performance and Model Calibration with Supervised and Self-Ensembled In-Context Learning.
CoRR, 2023

Are Large Language Models Temporally Grounded?
CoRR, 2023

Navigating Cultural Chasms: Exploring and Unlocking the Cultural POV of Text-To-Image Models.
CoRR, 2023

Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems.
CoRR, 2023

Language-Agnostic Bias Detection in Language Models.
CoRR, 2023

LongForm: Optimizing Instruction Tuning for Long Text Generation with Corpus Extraction.
CoRR, 2023

Ethical considerations in the early detection of Alzheimer's disease using speech and AI.
Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, 2023

Survival of the Most Influential Prompts: Efficient Black-Box Prompt Search via Clustering and Pruning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Transfer-Free Data-Efficient Multilingual Slot Labeling.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Detecting and Mitigating Hallucinations in Multilingual Summarisation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

On Bilingual Lexicon Induction with Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Language-Agnostic Bias Detection in Language Models with Bias Probing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Quantifying the Dialect Gap and its Correlates Across Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

A Systematic Study of Performance Disparities in Multilingual Task-Oriented Dialogue Systems.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Unifying Cross-Lingual Transfer across Scenarios of Resource Scarcity.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Can Pretrained Language Models (Yet) Reason Deductively?
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Probing Cross-Lingual Lexical Knowledge from Multilingual Sentence Encoders.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Delving Deeper into Cross-lingual Visual Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Cross-Lingual Transfer with Target Language-Ready Task Adapters.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Multi3NLU++: A Multilingual, Multi-Intent, Multi-Domain Dataset for Natural Language Understanding in Task-Oriented Dialogue.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Translation-Enhanced Multilingual Text-to-Image Generation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Distilling Efficient Language-Specific Models for Cross-Lingual Transfer.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems.
J. Artif. Intell. Res., 2022

Exposing Cross-Lingual Lexical Knowledge from Multilingual Sentence Encoders.
CoRR, 2022

BAD-X: Bilingual Adapters Improve Zero-Shot Cross-Lingual Transfer.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Measuring Context-Word Biases in Lexical Semantic Datasets.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Improving Bilingual Lexicon Induction with Cross-Encoder Reranking.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Data Augmentation and Learned Layer Aggregation for Improved Multilingual Language Understanding in Dialogue.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Improving Word Translation via Two-Stage Contrastive Learning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Composable Sparse Fine-Tuning for Cross-Lingual Transfer.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
PROTOTYPE-TO-STYLE: Dialogue Generation With Style-Aware Editing on Retrieval Memory.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Parameter Space Factorization for Zero-Shot Learning across Tasks and Languages.
Trans. Assoc. Comput. Linguistics, 2021

Context vs Target Word: Quantifying Biases in Lexical Semantic Datasets.
CoRR, 2021

AM2iCo: Evaluating Word Meaning in Context across Low-Resource Languages with Adversarial Examples.
CoRR, 2021

Crossing the Conversational Chasm: A Primer on Multilingual Task-Oriented Dialogue Systems.
CoRR, 2021

Fast, Effective and Self-Supervised: Transforming Masked Language Models into Universal Lexical and Sentence Encoders.
CoRR, 2021

Semantic Data Set Construction from Human Clustering and Spatial Arrangement.
Comput. Linguistics, 2021

BioVerbNet: a large semantic-syntactic classification of verbs in biomedicine.
J. Biomed. Semant., 2021

Improving Machine Translation of Rare and Unseen Word Senses.
Proceedings of the Sixth Conference on Machine Translation, 2021

AM2iCo: Evaluating Word Meaning in Context across Low-Resource Languages with Adversarial Examples.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

MAD-G: Multilingual Adapter Generation for Efficient Cross-Lingual Transfer.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Fast, Effective, and Self-Supervised: Transforming Masked Language Models into Universal Lexical and Sentence Encoders.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Combining Deep Generative Models and Multi-lingual Pretraining for Semi-supervised Document Classification.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

MirrorWiC: On Eliciting Word-in-Context Representations from Pretrained Language Models.
Proceedings of the 25th Conference on Computational Natural Language Learning, 2021

A Closer Look at Few-Shot Crosslingual Transfer: The Choice of Shots Matters.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

LexFit: Lexical Fine-Tuning of Pretrained Language Models.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Verb Knowledge Injection for Multilingual Event Processing.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Learning Domain-Specialised Representations for Cross-Lingual Biomedical Entity Linking.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
A systematic literature review of automatic Alzheimer's disease detection from speech and language.
J. Am. Medical Informatics Assoc., 2020

A Closer Look at Few-Shot Crosslingual Transfer: Variance, Benchmarks and Baselines.
CoRR, 2020

Prototype-to-Style: Dialogue Generation with Style-Aware Editing on Retrieval Memory.
CoRR, 2020

Stylistic Dialogue Generation via Information-Guided Reinforcement Learning Strategy.
CoRR, 2020

Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity.
CoRR, 2020

Lost in Embedding Space: Explaining Cross-Lingual Task Performance with Eigenvalue Divergence.
CoRR, 2020

Multi-SimLex: A Large-Scale Evaluation of Multilingual and Crosslingual Lexical Semantic Similarity.
Comput. Linguistics, 2020

SemEval-2020 Task 2: Predicting Multilingual and Cross-Lingual (Graded) Lexical Entailment.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

Improving Bilingual Lexicon Induction with Unsupervised Post-Processing of Monolingual Word Vector Spaces.
Proceedings of the 5th Workshop on Representation Learning for NLP, 2020

Spatial Multi-Arrangement for Clustering and Multi-way Similarity Dataset Construction.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Probing Pretrained Language Models for Lexical Semantics.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Towards Better Context-aware Lexical Semantics: Adjusting Contextualized Representations through Static Anchors.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

The Secret is in the Spectra: Predicting Cross-lingual Task Performance with Spectral Similarity Measures.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Manual Clustering and Spatial Arrangement of Verbs for Multilingual Evaluation and Typology Analysis.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Emergent Communication Pretraining for Few-Shot Machine Translation.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Investigating Word-Class Distributions in Word Vector Spaces.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Classification-Based Self-Learning for Weakly Supervised Bilingual Lexicon Induction.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Multidirectional Associative Optimization of Function-Specific Word Representations.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Informing Unsupervised Pretraining with External Linguistic Knowledge.
CoRR, 2019

Modeling Language Variation and Universals: A Survey on Typological Linguistics for Natural Language Processing.
Comput. Linguistics, 2019

A neural classification method for supporting the creation of BioVerbNet.
J. Biomed. Semant., 2019

LION LBD: a literature-based discovery system for cancer biology.
Bioinform., 2019

Second-order contexts from lexical substitutes for few-shot learning of word representations.
Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics, 2019

A Systematic Study of Leveraging Subword Information for Learning Word Representations.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Bayesian Learning for Neural Dependency Parsing.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Show Some Love to Your n-grams: A Bit of Progress and Stronger n-gram Language Modeling Baselines.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Do We Really Need Fully Unsupervised Cross-Lingual Embeddings?
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Semi-Supervised Bootstrapping of Dialogue State Trackers for Task-Oriented Modelling.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Cross-lingual Semantic Specialization via Lexical Relation Induction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Towards Zero-shot Language Modeling.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

On the Importance of Subword Information for Morphological Tasks in Truly Low-Resource Languages.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

Investigating Cross-Lingual Alignment Methods for Contextualized Embeddings with Token-Level Evaluation.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

Enhancing biomedical word embeddings by retrofitting to verb clusters.
Proceedings of the 18th BioNLP Workshop and Shared Task, 2019

2018
Language Modeling for Morphologically Rich Languages: Character-Aware Modeling for Word-Level Prediction.
Trans. Assoc. Comput. Linguistics, 2018

Investigating the cross-lingual translatability of VerbNet-style classification.
Lang. Resour. Evaluation, 2018

Neural networks for link prediction in realistic biomedical graphs: a multi-dimensional evaluation of graph embedding-based approaches.
BMC Bioinform., 2018

Bio-SimVerb and Bio-SimLex: wide-coverage evaluation sets of word similarity in biomedicine.
BMC Bioinform., 2018

Post-Specialisation: Retrofitting Vectors of Words Unseen in Lexical Resources.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Acquiring Verb Classes Through Bottom-Up Semantic Verb Clustering.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Adversarial Propagation and Zero-Shot Cross-Lingual Transfer of Word Vector Specialization.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

On the Relation between Linguistic Typology and (Limitations of) Multilingual Language Modeling.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Isomorphic Transfer of Syntactic Structures in Cross-Lingual NLP.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Semantic Specialization of Distributional Word Vector Spaces using Monolingual and Cross-Lingual Constraints.
Trans. Assoc. Comput. Linguistics, 2017

Erratum: Link prediction in drug-target interactions network using similarity indices.
CoRR, 2017

Semantic Specialisation of Distributional Word Vector Spaces using Monolingual and Cross-Lingual Constraints.
CoRR, 2017

HyperLex: A Large-Scale Evaluation of Graded Lexical Entailment.
Comput. Linguistics, 2017

Link prediction in drug-target interactions network using similarity indices.
BMC Bioinform., 2017

A neural network multi-task learning approach to biomedical named entity recognition.
BMC Bioinform., 2017

Cancer Hallmarks Analytics Tool (CHAT): a text mining approach to organize and evaluate scientific literature on cancer.
Bioinform., 2017

Decoding Sentiment from Distributed Representations of Sentences.
Proceedings of the 6th Joint Conference on Lexical and Computational Semantics, 2017

Cross-Lingual Induction and Transfer of Verb Classes Based on Word Vector Space Specialisation.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Event-Related Features in Feedforward Neural Networks Contribute to Identifying Causal Relations in Discourse.
Proceedings of the 2nd Workshop on Linking Models of Lexical, 2017

Evaluation by Association: A Systematic Study of Quantitative Word Association Evaluation.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Automatic Selection of Context Configurations for Improved Class-Specific Word Representations.
Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017), 2017

Initializing neural networks for hierarchical multi-label text classification.
Proceedings of the BioNLP 2017, Vancouver, Canada, August 4, 2017, 2017

Morph-fitting: Fine-Tuning Word Vector Spaces with Simple Language-Specific Rules.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Learning to Understand Phrases by Embedding the Dictionary.
Trans. Assoc. Comput. Linguistics, 2016

Automatic Selection of Context Configurations for Improved (and Fast) Class-Specific Word Representations.
CoRR, 2016

Bias and Agreement in Syntactic Annotations.
CoRR, 2016

Automatic semantic classification of scientific literature according to the hallmarks of cancer.
Bioinform., 2016

Intrinsic Evaluation of Word Vectors Fails to Predict Extrinsic Performance.
Proceedings of the 1st Workshop on Evaluating Vector-Space Representations for NLP, 2016

Learning Distributed Representations of Sentences from Unlabelled Data.
Proceedings of the NAACL HLT 2016, 2016

SimVerb-3500: A Large-Scale Evaluation Set of Verb Similarity.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Anchoring and Agreement in Syntactic Annotations.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Survey on the Use of Typological Information in Natural Language Processing.
Proceedings of the COLING 2016, 2016

Cancer Hallmark Text Classification Using Convolutional Neural Networks.
Proceedings of the Fifth Workshop on Building and Evaluating Resources for Biomedical Text Mining, 2016

Robust Text Classification for Sparsely Labelled Data Using Multi-level Embeddings.
Proceedings of the COLING 2016, 2016

How to Train good Word Embeddings for Biomedical NLP.
Proceedings of the 15th Workshop on Biomedical Natural Language Processing, 2016

Is "Universal Syntax" Universally Useful for Learning Distributed Word Representations?
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

On the Role of Seed Lexicons in Learning Bilingual Word Embeddings.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Unsupervised Declarative Knowledge Induction for Constraint-Based Learning of Information Structure in Scientific Documents.
Trans. Assoc. Comput. Linguistics, 2015

SimLex-999: Evaluating Semantic Models With (Genuine) Similarity Estimation.
Comput. Linguistics, 2015

Unsupervised discovery of information structure in biomedical documents.
Bioinform., 2015

Evaluating Learning Language Representations.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2015

2014
Multi-Modal Models for Concrete and Abstract Concept Meaning.
Trans. Assoc. Comput. Linguistics, 2014

Probabilistic Distributional Semantics with Latent Variable Models.
Comput. Linguistics, 2014

Automatic Extraction of Property Norm-Like Data From Large Text Corpora.
Cogn. Sci., 2014

A Quantitative Empirical Analysis of the Abstract/Concrete Distinction.
Cogn. Sci., 2014

Native Language Identification Using Large, Longitudinal Data.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Learning Abstract Concept Embeddings from Multi-Modal Data: Since You Probably Can't See What I Mean.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

An Unsupervised Model for Instance Level Subcategorization Acquisition.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

CRAB 2.0: A text mining tool for supporting literature review in chemical cancer risk assessment.
Proceedings of the COLING 2014, 2014

Verb Clustering for Brazilian Portuguese.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2014

Improving Literature-Based Discovery with Advanced Text Mining.
Proceedings of the Computational Intelligence Methods for Bioinformatics and Biostatistics, 2014

Improving Multi-Modal Representations Using Image Dispersion: Why Less is Sometimes More.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Concreteness and Subjectivity as Dimensions of Lexical Meaning.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
Computational Modeling as a Methodology for Studying Human Language Learning.
Proceedings of the Cognitive Aspects of Computational Language Acquisition, 2013

A computational model of logical metonymy.
ACM Trans. Speech Lang. Process., 2013

Conceptual metaphor theory meets the data: a corpus-based human annotation study.
Lang. Resour. Evaluation, 2013

Acquisition and evaluation of verb subcategorization resources for biomedicine.
J. Biomed. Informatics, 2013

Approaches to verb subcategorization for biomedicine.
J. Biomed. Informatics, 2013

Statistical Metaphor Processing.
Comput. Linguistics, 2013

Active learning-based information structure analysis of full scientific articles and two applications for biomedical literature review.
Bioinform., 2013

Improved Information Structure Analysis of Scientific Documents Through Discourse and Lexical Constraints.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

A Tensor-based Factorization Model of Semantic Compositionality.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Minimally Supervised Learning for Unconstrained Conceptual Property Extraction.
Proceedings of the 35th Annual Meeting of the Cognitive Science Society, 2013

Large-Scale Empirical Analyses of the Abstract/Concrete Distinction.
Proceedings of the 35th Annual Meeting of the Cognitive Science Society, 2013

Diathesis alternation approximation for verb clustering.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Improved Lexical Acquisition through DPP-based Verb Clustering.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Concreteness and Corpora: A Theoretical and Practical Study.
Proceedings of the Fourth Annual Workshop on Cognitive Modeling and Computational Linguistics, 2013

2012
Modelling selectional preferences in a lexical hierarchy.
Proceedings of the First Joint Conference on Lexical and Computational Semantics, 2012

Unsupervised Metaphor Paraphrasing using a Vector Space Model.
Proceedings of the COLING 2012, 2012

Document and Corpus Level Inference For Unsupervised and Transductive Learning of Information Structure of Scientific Documents.
Proceedings of the COLING 2012, 2012

CRAB Reader: A Tool for Analysis and Visualization of Argumentative Zones in Scientific Literature.
Proceedings of the COLING 2012, 2012

Multi-way Tensor Factorization for Unsupervised Lexical Acquisition.
Proceedings of the COLING 2012, 2012

Using Argumentative Zones for Extractive Summarization of Scientific Articles.
Proceedings of the COLING 2012, 2012

Learning Syntactic Verb Frames using Graphical Models.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, July 8-14, 2012, Jeju Island, Korea, 2012

Semi-supervised learning for automatic conceptual property extraction.
Proceedings of the 3rd Workshop on Cognitive Modeling and Computational Linguistics, 2012

2011
Exploring subdomain variation in biomedical language.
BMC Bioinform., 2011

A comparison and user-based evaluation of models of textual information structure in the context of cancer risk assessment.
BMC Bioinform., 2011

Weakly supervised learning of information structure of scientific abstracts - is it accurate enough to benefit real-world tasks in biomedicine?
Bioinform., 2011

Hierarchical Verb Clustering Using Graph Factorization.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Probabilistic models of similarity in syntactic context.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

A Weakly-supervised Approach to Argumentative Zoning of Scientific Documents.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Latent Vector Weighting for Word Meaning in Context.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

2010
Annotating the Enron Email Corpus with Number Senses.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Investigating the cross-linguistic potential of VerbNet-style classification.
Proceedings of the COLING 2010, 2010

Metaphor Identification Using Verb and Noun Clustering.
Proceedings of the COLING 2010, 2010

Exploring variation across biomedical subdomains.
Proceedings of the COLING 2010, 2010

Identifying the Information Structure of Scientific Abstracts: An Investigation of Three Different Schemes.
Proceedings of the 2010 Workshop on Biomedical Natural Language Processing, 2010

2009
The first step in the development of text mining technology for cancer risk assessment: identifying and organizing scientific evidence in risk assessment literature.
BMC Bioinform., 2009

Automatic Lexical Classification -- Balancing between Machine Learning and Linguistics.
Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation, 2009

VerbNet overview, extensions, mappings and applications.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Improving Verb Clustering with Automatically Acquired Selectional Preferences.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

User-Driven Development of Text Mining Resources for Cancer Risk Assessment.
Proceedings of the BioNLP Workshop, BioNLP@HLT-NAACL 2009, 2009

2008
A large-scale classification of English verbs.
Lang. Resour. Evaluation, 2008

LexSchem: a Large Subcategorization Lexicon for French Verbs.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Automatic Classification of English Verbs Using Rich Syntactic Features.
Proceedings of the Third International Joint Conference on Natural Language Processing, 2008

The Choice of Features for Classification of Verbs in Biomedical Texts.
Proceedings of the COLING 2008, 2008

Verb Class Discovery from Rich Syntactic Data.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2008

2007
A System for Large-Scale Acquisition of Verbal, Nominal and Adjectival Subcategorization Frames from Corpora.
Proceedings of the ACL 2007, 2007

2006
Zone analysis in biology articles as a basis for information extraction.
Int. J. Medical Informatics, 2006

A Large Subcategorization Lexicon for Natural Language Processing Applications.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Extending VerbNet with Novel Verb Classes.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Automatic Classification of Verbs in Biomedical Texts.
Proceedings of the ACL 2006, 2006

2005
Introduction to the special issue on multiword expressions: Having a crack at a hard nut.
Comput. Speech Lang., 2005

Automatic Acquisition of Adjectival Subcategorization from Corpora.
Proceedings of the ACL 2005, 2005

2004
WSD for subcategorization acquisition task description.
Proceedings of the Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, 2004

2003
Improving Subcategorization Acquisition Using Word Sense Disambiguation.
Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, 2003

Clustering Polysemic Subcategorization Frame Distributions Semantically.
Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, 2003

2002
Subcategorization acquisition.
PhD thesis, 2002

Improving Subcategorization Acquisition with WSD.
Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions, 2002

Subcategorization Acquisition as an Evaluation Method for WSD.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

On the Robustness of Entropy-Based Similarity Measures in Evaluation of Subcategorization Acquisition Systems.
Proceedings of the 6th Conference on Natural Language Learning, 2002

2000
Statistical Filtering and Subcategorization Frame Acquisition.
Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, 2000

Using Semantically Motivated Estimates to Help Subcategorization Acquisition.
Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, 2000

1998
Detecting Verbal Participation in Diathesis Alternations.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

