Anna Korhonen
Orcid: 0000-0002-3692-3144Affiliations:
- University of Cambridge, Language Technology Laboratory, UK
According to our database1,
Anna Korhonen
authored at least 226 papers
between 1998 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on zbmath.org
On csauthors.net:
Bibliography
2024
Trans. Assoc. Comput. Linguistics, 2024
MURI: High-Quality Instruction Tuning Datasets for Low-Resource Languages via Reverse Instructions.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators.
CoRR, 2024
Analyzing and Adapting Large Language Models for Few-Shot Multilingual NLU: Are We There Yet?
CoRR, 2024
DIALIGHT: Lightweight Multilingual Development and Evaluation of Task-Oriented Dialogue Systems with Large Language Models.
CoRR, 2024
Proceedings of the 18th ACM Conference on Recommender Systems, 2024
SQATIN: Supervised Instruction Tuning Meets Question Answering for Improved Dialogue NLU.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
DIALIGHT: Lightweight Multilingual Development and Evaluation of Task-Oriented Dialogue Systems with Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: System Demonstrations, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
"Seeing the Big through the Small": Can LLMs Approximate Human Judgment Distributions on NLI from a Few Explanations?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024
LoSST-AD: A Longitudinal Corpus for Tracking Alzheimer's Disease Related Changes in Spontaneous Speech.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Can Rule-Based Insights Enhance LLMs for Radiology Report Classification? Introducing the RadPrompt Methodology.
Proceedings of the 23rd Workshop on Biomedical Natural Language Processing, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024
Your Prompt Is My Command: On Assessing the Human-Centred Generality of Multimodal Models (Abstract Reprint).
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Nat. Mac. Intell., July, 2023
Trans. Assoc. Comput. Linguistics, 2023
Multi 3 WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems.
Trans. Assoc. Comput. Linguistics, 2023
Your Prompt is My Command: On Assessing the Human-Centred Generality of Multimodal Models.
J. Artif. Intell. Res., 2023
On Task Performance and Model Calibration with Supervised and Self-Ensembled In-Context Learning.
CoRR, 2023
Navigating Cultural Chasms: Exploring and Unlocking the Cultural POV of Text-To-Image Models.
CoRR, 2023
Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems.
CoRR, 2023
LongForm: Optimizing Instruction Tuning for Long Text Generation with Corpus Extraction.
CoRR, 2023
Ethical considerations in the early detection of Alzheimer's disease using speech and AI.
Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, 2023
Survival of the Most Influential Prompts: Efficient Black-Box Prompt Search via Clustering and Pruning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
A Systematic Study of Performance Disparities in Multilingual Task-Oriented Dialogue Systems.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Multi3NLU++: A Multilingual, Multi-Intent, Multi-Domain Dataset for Natural Language Understanding in Task-Oriented Dialogue.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
2022
Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems.
J. Artif. Intell. Res., 2022
CoRR, 2022
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
Data Augmentation and Learned Layer Aggregation for Improved Multilingual Language Understanding in Dialogue.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
2021
PROTOTYPE-TO-STYLE: Dialogue Generation With Style-Aware Editing on Retrieval Memory.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Trans. Assoc. Comput. Linguistics, 2021
AM2iCo: Evaluating Word Meaning in Context across Low-ResourceLanguages with Adversarial Examples.
CoRR, 2021
Crossing the Conversational Chasm: A Primer on Multilingual Task-Oriented Dialogue Systems.
CoRR, 2021
Fast, Effective and Self-Supervised: Transforming Masked LanguageModels into Universal Lexical and Sentence Encoders.
CoRR, 2021
Comput. Linguistics, 2021
J. Biomed. Semant., 2021
Proceedings of the Sixth Conference on Machine Translation, 2021
AM2iCo: Evaluating Word Meaning in Context across Low-Resource Languages with Adversarial Examples.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021
Fast, Effective, and Self-Supervised: Transforming Masked Language Models into Universal Lexical and Sentence Encoders.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Combining Deep Generative Models and Multi-lingual Pretraining for Semi-supervised Document Classification.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021
MirrorWiC: On Eliciting Word-in-Context Representations from Pretrained Language Models.
Proceedings of the 25th Conference on Computational Natural Language Learning, 2021
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
Learning Domain-Specialised Representations for Cross-Lingual Biomedical Entity Linking.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
2020
A systematic literature review of automatic Alzheimer's disease detection from speech and language.
J. Am. Medical Informatics Assoc., 2020
CoRR, 2020
Prototype-to-Style: Dialogue Generation with Style-Aware Editing on Retrieval Memory.
CoRR, 2020
Stylistic Dialogue Generation via Information-Guided Reinforcement Learning Strategy.
CoRR, 2020
Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity.
CoRR, 2020
Lost in Embedding Space: Explaining Cross-Lingual Task Performance with Eigenvalue Divergence.
CoRR, 2020
Multi-SimLex: A Large-Scale Evaluation of Multilingual and Crosslingual Lexical Semantic Similarity.
Comput. Linguistics, 2020
SemEval-2020 Task 2: Predicting Multilingual and Cross-Lingual (Graded) Lexical Entailment.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020
Improving Bilingual Lexicon Induction with Unsupervised Post-Processing of Monolingual Word Vector Spaces.
Proceedings of the 5th Workshop on Representation Learning for NLP, 2020
Spatial Multi-Arrangement for Clustering and Multi-way Similarity Dataset Construction.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Towards Better Context-aware Lexical Semantics: Adjusting Contextualized Representations through Static Anchors.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
The Secret is in the Spectra: Predicting Cross-lingual Task Performance with Spectral Similarity Measures.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Manual Clustering and Spatial Arrangement of Verbs for Multilingual Evaluation and Typology Analysis.
Proceedings of the 28th International Conference on Computational Linguistics, 2020
Proceedings of the 28th International Conference on Computational Linguistics, 2020
Proceedings of the 28th International Conference on Computational Linguistics, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Classification-Based Self-Learning for Weakly Supervised Bilingual Lexicon Induction.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
2019
Modeling Language Variation and Universals: A Survey on Typological Linguistics for Natural Language Processing.
Comput. Linguistics, 2019
J. Biomed. Semant., 2019
Second-order contexts from lexical substitutes for few-shot learning of word representations.
Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics, 2019
A Systematic Study of Leveraging Subword Information for Learning Word Representations.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Show Some Love to Your n-grams: A Bit of Progress and Stronger n-gram Language Modeling Baselines.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Semi-Supervised Bootstrapping of Dialogue State Trackers for Task-Oriented Modelling.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
On the Importance of Subword Information for Morphological Tasks in Truly Low-Resource Languages.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019
Investigating Cross-Lingual Alignment Methods for Contextualized Embeddings with Token-Level Evaluation.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019
Proceedings of the 18th BioNLP Workshop and Shared Task, 2019
2018
Language Modeling for Morphologically Rich Languages: Character-Aware Modeling for Word-Level Prediction.
Trans. Assoc. Comput. Linguistics, 2018
Lang. Resour. Evaluation, 2018
Neural networks for link prediction in realistic biomedical graphs: a multi-dimensional evaluation of graph embedding-based approaches.
BMC Bioinform., 2018
Bio-SimVerb and Bio-SimLex: wide-coverage evaluation sets of word similarity in biomedicine.
BMC Bioinform., 2018
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018
Adversarial Propagation and Zero-Shot Cross-Lingual Transfer of Word Vector Specialization.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
On the Relation between Linguistic Typology and (Limitations of) Multilingual Language Modeling.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018
2017
Semantic Specialization of Distributional Word Vector Spaces using Monolingual and Cross-Lingual Constraints.
Trans. Assoc. Comput. Linguistics, 2017
Erratum: Link prediction in drug-target interactions network using similarity indices.
CoRR, 2017
Semantic Specialisation of Distributional Word Vector Spaces using Monolingual and Cross-Lingual Constraints.
CoRR, 2017
Comput. Linguistics, 2017
BMC Bioinform., 2017
A neural network multi-task learning approach to biomedical named entity recognition.
BMC Bioinform., 2017
Cancer Hallmarks Analytics Tool (CHAT): a text mining approach to organize and evaluate scientific literature on cancer.
Bioinform., 2017
Proceedings of the 6th Joint Conference on Lexical and Computational Semantics, 2017
Cross-Lingual Induction and Transfer of Verb Classes Based on Word Vector Space Specialisation.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017
Event-Related Features in Feedforward Neural Networks Contribute to Identifying Causal Relations in Discourse.
Proceedings of the 2nd Workshop on Linking Models of Lexical, 2017
Evaluation by Association: A Systematic Study of Quantitative Word Association Evaluation.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017
Automatic Selection of Context Configurations for Improved Class-Specific Word Representations.
Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017), 2017
Proceedings of the BioNLP 2017, Vancouver, Canada, August 4, 2017, 2017
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017
2016
Trans. Assoc. Comput. Linguistics, 2016
Automatic Selection of Context Configurations for Improved (and Fast) Class-Specific Word Representations.
CoRR, 2016
Automatic semantic classification of scientific literature according to the hallmarks of cancer.
Bioinform., 2016
Proceedings of the 1st Workshop on Evaluating Vector-Space Representations for NLP, 2016
Proceedings of the NAACL HLT 2016, 2016
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016
Proceedings of the COLING 2016, 2016
Proceedings of the Fifth Workshop on Building and Evaluating Resources for Biomedical Text Mining, 2016
Proceedings of the COLING 2016, 2016
Proceedings of the 15th Workshop on Biomedical Natural Language Processing, 2016
Is "Universal Syntax" Universally Useful for Learning Distributed Word Representations?
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016
2015
Unsupervised Declarative Knowledge Induction for Constraint-Based Learning of Information Structure in Scientific Documents.
Trans. Assoc. Comput. Linguistics, 2015
Comput. Linguistics, 2015
Bioinform., 2015
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2015
2014
Trans. Assoc. Comput. Linguistics, 2014
Comput. Linguistics, 2014
Cogn. Sci., 2014
Cogn. Sci., 2014
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014
Learning Abstract Concept Embeddings from Multi-Modal Data: Since You Probably Can't See What I Mean.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014
CRAB 2.0: A text mining tool for supporting literature review in chemical cancer risk assessment.
Proceedings of the COLING 2014, 2014
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2014
Proceedings of the Computational Intelligence Methods for Bioinformatics and Biostatistics, 2014
Improving Multi-Modal Representations Using Image Dispersion: Why Less is Sometimes More.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014
2013
Proceedings of the Cognitive Aspects of Computational Language Acquisition, 2013
Lang. Resour. Evaluation, 2013
J. Biomed. Informatics, 2013
Active learning-based information structure analysis of full scientific articles and two applications for biomedical literature review.
Bioinform., 2013
Improved Information Structure Analysis of Scientific Documents Through Discourse and Lexical Constraints.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013
Proceedings of the 35th Annual Meeting of the Cognitive Science Society, 2013
Proceedings of the 35th Annual Meeting of the Cognitive Science Society, 2013
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013
Proceedings of the Fourth Annual Workshop on Cognitive Modeling and Computational Linguistics, 2013
2012
Proceedings of the First Joint Conference on Lexical and Computational Semantics, 2012
Proceedings of the COLING 2012, 2012
Document and Corpus Level Inference For Unsupervised and Transductive Learning of Information Structure of Scientific Documents.
Proceedings of the COLING 2012, 2012
CRAB Reader: A Tool for Analysis and Visualization of Argumentative Zones in Scientific Literature.
Proceedings of the COLING 2012, 2012
Proceedings of the COLING 2012, 2012
Proceedings of the COLING 2012, 2012
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012
Proceedings of the 3rd Workshop on Cognitive Modeling and Computational Linguistics, 2012
2011
A comparison and user-based evaluation of models of textual information structure in the context of cancer risk assessment.
BMC Bioinform., 2011
Weakly supervised learning of information structure of scientific abstracts - is it accurate enough to benefit real-world tasks in biomedicine?
Bioinform., 2011
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011
2010
Proceedings of the International Conference on Language Resources and Evaluation, 2010
Proceedings of the COLING 2010, 2010
Proceedings of the COLING 2010, 2010
Proceedings of the COLING 2010, 2010
Identifying the Information Structure of Scientific Abstracts: An Investigation of Three Different Schemes.
Proceedings of the 2010 Workshop on Biomedical Natural Language Processing, 2010
2009
The first step in the development of text mining technology for cancer risk assessment: identifying and organizing scientific evidence in risk assessment literature.
BMC Bioinform., 2009
Automatic Lexical Classification -- Balancing between Machine Learning and Linguistics.
Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation, 2009
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009
Proceedings of the BioNLP Workshop, BioNLP@HLT-NAACL 2009, 2009
2008
Proceedings of the International Conference on Language Resources and Evaluation, 2008
Proceedings of the Third International Joint Conference on Natural Language Processing, 2008
Proceedings of the COLING 2008, 2008
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2008
2007
A System for Large-Scale Acquisition of Verbal, Nominal and Adjectival Subcategorization Frames from Corpora.
Proceedings of the ACL 2007, 2007
2006
Int. J. Medical Informatics, 2006
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006
Proceedings of the ACL 2006, 2006
2005
Introduction to the special issue on multiword expressions: Having a crack at a hard nut.
Comput. Speech Lang., 2005
Proceedings of the ACL 2005, 2005
2004
Proceedings of the Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, 2004
2003
Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, 2003
Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, 2003
2002
Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions, 2002
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002
On the Robustness of Entropy-Based Similarity Measures in Evaluation of Subcategorization Acquisition Systems.
Proceedings of the 6th Conference on Natural Language Learning, 2002
2000
Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, 2000
Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, 2000
1998
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998