Sampo Pyysalo
ORCID: 0000-0002-6279-5000
Affiliations:
- University of Cambridge, Department of Theoretical and Applied Linguistics, UK
According to our database, Sampo Pyysalo authored at least 122 papers between 2004 and 2025.
LSD600: the first corpus of biomedical abstracts annotated with lifestyle-disease relations.
Database J. Biol. Databases Curation, 2025
RegulaTome: a corpus of typed, directed, and signed relations between biomedical entities in the scientific literature.
Database J. Biol. Databases Curation, January, 2024
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order.
CoRR, 2024
Bioinform., 2024
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Register identification from the unrestricted open Web using the Corpus of Online Registers of English.
Lang. Resour. Evaluation, September, 2023
The STRING database in 2023: protein-protein association networks and functional enrichment analyses for any sequenced genome of interest.
Nucleic Acids Res., January, 2023
Overview of DrugProt task at BioCreative VII: data and methods for large-scale text mining and knowledge graph generation of heterogenous chemical-protein relations.
Database J. Biol. Databases Curation, 2023
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the Eighth Workshop on Noisy User-generated Text, 2022
The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets.
Nucleic Acids Res., 2021
Quantitative Evaluation of Alternative Translations in a Corpus of Highly Dissimilar Finnish Paraphrases.
CoRR, 2021
Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021
Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021
Deep learning for sentence clustering in essay grading support.
Proceedings of the 14th International Conference on Educational Data Mining, 2021
Beyond the English Web: Zero-Shot Cross-Lingual and Lightweight Monolingual Classification of Registers.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop, 2021
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Turku Enhanced Parser Pipeline: From Raw Text to Enhanced Graphs in the IWPT 2020 Shared Task.
Proceedings of the 16th International Conference on Parsing Technologies and the IWPT 2020 Shared Task on Parsing into Enhanced Universal Dependencies, 2020
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
Proceedings of the 28th International Conference on Computational Linguistics, 2020
Proceedings of the 12th Web as Corpus Workshop, 2020
J. Biomed. Semant., 2019
Proceedings of the 22nd Nordic Conference on Computational Linguistics, NoDaLiDa 2019, Turku, Finland, September 30, 2019
Neural Dependency Parsing of Biomedical Text: TurkuNLP entry in the CRAFT Structural Annotation Task.
Proceedings of The 5th Workshop on BioNLP Open Shared Tasks, 2019
Proceedings of The 5th Workshop on BioNLP Open Shared Tasks, 2019
Proceedings of The 5th Workshop on BioNLP Open Shared Tasks, 2019
Neural networks for link prediction in realistic biomedical graphs: a multi-dimensional evaluation of graph embedding-based approaches.
BMC Bioinform., 2018
Bio-SimVerb and Bio-SimLex: wide-coverage evaluation sets of word similarity in biomedicine.
BMC Bioinform., 2018
A neural network multi-task learning approach to biomedical named entity recognition.
BMC Bioinform., 2017
Cancer Hallmarks Analytics Tool (CHAT): a text mining approach to organize and evaluate scientific literature on cancer.
Bioinform., 2017
Proceedings of the Fourth International Conference on Dependency Linguistics, 2017
CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.
Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, 2017
Cell line name recognition in support of the identification of synthetic lethality in cancer from text.
Bioinform., 2016
Proceedings of the 1st Workshop on Evaluating Vector-Space Representations for NLP, 2016
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016
Proceedings of the COLING 2016, 2016
Proceedings of the Fifth Workshop on Building and Evaluating Resources for Biomedical Text Mining, 2016
Deep Learning with Minimal Training Data: TurkuNLP Entry in the BioNLP Shared Task 2016.
Proceedings of the 4th BioNLP Shared Task Workshop, BioNLP 2016, 2016
Proceedings of the 15th Workshop on Biomedical Natural Language Processing, 2016
Overview of the Cancer Genetics and Pathway Curation tasks of BioNLP Shared Task 2013.
BMC Bioinform., December, 2015
Proceedings of the 20th Nordic Conference of Computational Linguistics, 2015
Towards the Classification of the Finnish Internet Parsebank: Detecting Translations and Informality.
Proceedings of the 20th Nordic Conference of Computational Linguistics, 2015
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015
Proceedings of the Third International Conference on Dependency Linguistics, 2015
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015
Generalising semantic category disambiguation with large lexical resources for fun and profit.
J. Biomed. Semant., 2014
Wide coverage biomedical event extraction using multiple partially overlapping corpora.
BMC Bioinform., 2013
BMC Bioinform., 2013
A method for integrating and ranking the evidence for biochemical pathways by mining reactions from text.
Bioinform., 2013
Proceedings of the BioNLP Shared Task 2013 Workshop, Sofia, 2013
Proceedings of the BioNLP Shared Task 2013 Workshop, Sofia, 2013
Proceedings of the BioNLP Shared Task 2013 Workshop, Sofia, 2013
BMC Bioinform., 2012
Proceedings of the EACL 2012, 2012
Proceedings of the 2012 Workshop on Biomedical Natural Language Processing, 2012
PubMed-Scale Event Extraction for Post-Translational Modifications, Epigenetics and Protein Structural Relations.
Proceedings of the 2012 Workshop on Biomedical Natural Language Processing, 2012
Bridging the Gap Between Scope-based and Event-based Negation/Speculation Annotations: A Bridge Not Too Far.
Proceedings of the Workshop on Extra-Propositional Aspects of Meaning in Computational Linguistics, 2012
Comput. Intell., 2011
BMC Bioinform., 2011
J. Biomed. Semant., 2011
Ontology design patterns to disambiguate relations between genes and gene products in GENIA.
J. Biomed. Semant., 2011
Proceedings of BioNLP Shared Task 2011 Workshop, Portland, Oregon, USA, June 24, 2011, 2011
SimSem: Fast Approximate String Matching in Relation to Semantic Category Disambiguation.
Proceedings of the 2011 Workshop on Biomedical Natural Language Processing, 2011
Proceedings of BioNLP Shared Task 2011 Workshop, Portland, Oregon, USA, June 24, 2011, 2011
Proceedings of BioNLP Shared Task 2011 Workshop, Portland, Oregon, USA, June 24, 2011, 2011
Proceedings of the 2011 Workshop on Biomedical Natural Language Processing, 2011
Overview of the Epigenetics and Post-translational Modifications (EPI) task of BioNLP Shared Task 2011.
Proceedings of BioNLP Shared Task 2011 Workshop, Portland, Oregon, USA, June 24, 2011, 2011
Proceedings of the 2011 Workshop on Biomedical Natural Language Processing, 2011
Proceedings of BioNLP Shared Task 2011 Workshop, Portland, Oregon, USA, June 24, 2011, 2011
J. Bioinform. Comput. Biol., 2010
J. Bioinform. Comput. Biol., 2010
BMC Bioinform., 2010
Proceedings of the Fourth International Symposium for Semantic Mining in Biomedicine, 2010
Proceedings of the COLING 2010, 2010
Proceedings of the 2010 Workshop on Biomedical Natural Language Processing, 2010
Proceedings of the 2010 Workshop on Biomedical Natural Language Processing, 2010
Proceedings of the 2010 Workshop on Biomedical Natural Language Processing, 2010
Proceedings of the 2010 Workshop on Biomedical Natural Language Processing, 2010
Proceedings of the 2010 Workshop on Biomedical Natural Language Processing, 2010
Matrix representations, linear transformations, and kernels for disambiguation in natural language.
Mach. Learn., 2009
Towards automated processing of clinical Finnish: Sublanguage analysis and a rule-based parser.
Int. J. Medical Informatics, 2009
Combining hidden Markov models and latent semantic analysis for topic segmentation and labeling: Method and clinical application.
Int. J. Medical Informatics, 2009
BMC Bioinform., 2009
Proceedings of the 17th Nordic Conference of Computational Linguistics, 2009
Proceedings of the BioNLP Workshop, BioNLP@HLT-NAACL 2009, 2009
Proceedings of the BioNLP Workshop, BioNLP@HLT-NAACL 2009, 2009
Proceedings of the BioNLP 2009 Workshop Companion Volume for Shared Task, BioNLP@HLT-NAACL 2009, 2009
BMC Bioinform., 2008
All-paths graph kernel for protein-protein interaction extraction with evaluation of cross-corpus learning.
BMC Bioinform., 2008
Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing, 2008
BMC Bioinform., 2007
On the unification of syntactic annotations under the Stanford dependency scheme: A case study on BioInfer and GENIA.
Proceedings of the Biological, translational, and clinical language processing, 2007
Evaluation of two dependency parsers on biomedical corpus targeted at protein-protein interactions.
Int. J. Medical Informatics, 2006
Lexical adaptation of link grammar to the biomedical sublanguage: a comparative evaluation of three approaches.
BMC Bioinform., 2006
Proceedings of the Advances in Natural Language Processing, 2006
Proceedings of the Advances in Intelligent Data Analysis VI, 2005
Kernels Incorporating Word Positional Information in Natural Language Disambiguation Tasks.
Proceedings of the Eighteenth International Florida Artificial Intelligence Research Society Conference, 2005
Proceedings of the Advances in Natural Language Processing, 4th International Conference, 2004
Extracting Protein-Protein Interaction Sentences by Applying Rough Set Data Analysis.
Proceedings of the Rough Sets and Current Trends in Computing, 2004
Analysis of Link Grammar on Biomedical Dependency Corpus Targeted at Protein-Protein Interactions.
Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications, 2004