Tapio Salakoski

Affiliations:
  • University of Turku, Finland


According to our database1, Tapio Salakoski authored at least 167 papers between 1993 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023

2022
Neural Network and Random Forest Models in Protein Function Prediction.
IEEE ACM Trans. Comput. Biol. Bioinform., 2022

Clustering Nursing Sentences - Comparing Three Sentence Embedding Methods.
Proceedings of the Challenges of Trustable AI and Added-Value on Health, 2022

2021
Universal Lemmatizer: A sequence-to-sequence model for lemmatizing Universal Dependencies treebanks.
Nat. Lang. Eng., 2021

Predictive Models as Early Warning Systems: A Bayesian Classification Model to Identify At-Risk Students of Programming.
Proceedings of the Intelligent Computing, 2021

Text Classification Model Explainability for Keyword Extraction - Towards Keyword-Based Summarization of Nursing Care Episodes.
Proceedings of the MEDINFO 2021: One World, One Health - Global Partnership for Digital Innovation, 2021

2020
Supporting the use of standardized nursing terminologies with automatic subject heading prediction: a comparison of sentence-level text classification methods.
J. Am. Medical Informatics Assoc., 2020

Long Term Effects on Technology Enhanced Learning: The Use of Weekly Digital Lessons in Mathematics.
Informatics Educ., 2020

Assisting nurses in care documentation: from automated sentence classification to coherent document structures with subject headings.
J. Biomed. Semant., 2020

Using early assessment performance as early warning signs to identify at-risk students in programming courses.
Proceedings of the IEEE Frontiers in Education Conference, 2020

Propagating AI Knowledge Across University Disciplines- The Design of A Multidisciplinary AI Study Module.
Proceedings of the IEEE Frontiers in Education Conference, 2020

Entity-Pair Embeddings for Improving Relation Extraction in the Biomedical Domain.
Proceedings of the 28th European Symposium on Artificial Neural Networks, 2020

2019
Multilingual is not enough: BERT for Finnish.
CoRR, 2019

Is Multilingual BERT Fluent in Language Generation?
CoRR, 2019

Leveraging Text Repetitions and Denoising Autoencoders in OCR Post-correction.
CoRR, 2019

An Unsupervised Query Rewriting Approach Using N-gram Co-occurrence Statistics to Find Similar Phrases in Large Text Corpora.
Proceedings of the 22nd Nordic Conference on Computational Linguistics, NoDaLiDa 2019, Turku, Finland, September 30, 2019

Template-free Data-to-Text Generation of Finnish Sports News.
Proceedings of the 22nd Nordic Conference on Computational Linguistics, NoDaLiDa 2019, Turku, Finland, September 30, 2019

Unsupervised Phrase-Level Query Rewriting for Assisting Search in Clinical Free Text.
Proceedings of the MEDINFO 2019: Health and Wellbeing e-Networks for All, 2019

2018
Potent pairing: ensemble of long short-term memory networks and support vector machine for chemical-protein relation extraction.
Database J. Biol. Databases Curation, 2018

Wide-scope biomedical named entity recognition and normalization with CRFs, fuzzy matching and character level modeling.
Database J. Biol. Databases Curation, 2018

Improving Layman Readability of Clinical Narratives with Unsupervised Synonym Replacement.
Proceedings of the Building Continents of Knowledge in Oceans of Data: The Future of Co-Created eHealth, 2018

Turku Neural Parser Pipeline: An End-to-End System for the CoNLL 2018 Shared Task.
Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, Brussels, Belgium, October 31, 2018

Biomedical Event Extraction Using Convolutional Neural Networks and Dependency Parsing.
Proceedings of the BioNLP 2018 workshop, Melbourne, Australia, July 19, 2018, 2018

Evaluation of a Prototype System that Automatically Assigns Subject Headings to Nursing Narratives Using Recurrent Neural Network.
Proceedings of the Ninth International Workshop on Health Text Mining and Information Analysis, 2018

2017
A System for Identifying and Exploring Text Repetition in Large Historical Document Corpora.
Proceedings of the 21st Nordic Conference on Computational Linguistics, 2017

Creating register sub-corpora for the Finnish Internet Parsebank.
Proceedings of the 21st Nordic Conference on Computational Linguistics, 2017

Applying BLAST to Text Reuse Detection in Finnish Newspapers and Journals, 1771-1910.
Proceedings of the NoDaLiDa 2017 Workshop on Processing Historical Language, 2017

Detecting mentions of pain and acute confusion in Finnish clinical text.
Proceedings of the BioNLP 2017, Vancouver, Canada, August 4, 2017, 2017

End-to-End System for Bacteria Habitat Extraction.
Proceedings of the BioNLP 2017, Vancouver, Canada, August 4, 2017, 2017

Ensemble of Convolutional Neural Networks for Medicine Intake Recognition in Twitter.
Proceedings of the 2nd Social Media Mining for Health Research and Applications Workshop co-located with the American Medical Informatics Association Annual Symposium (AMIA 2017), 2017

2016
Comparison of automatic summarisation methods for clinical free text notes.
Artif. Intell. Medicine, 2016

Factors Affecting the Availability of Electronic Patient Records for Secondary Purposes - A Case Study.
Proceedings of the Building Sustainable Health Ecosystems, 2016

UTU at SemEval-2016 Task 10: Binary Classification for Expression Detection (BCED).
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Medical warning system based on Internet of Things using fog computing.
Proceedings of the International Workshop on Big Data and Information Security, 2016

Programming Misconceptions in an Introductory Level Programming Course Exam.
Proceedings of the 2016 ACM Conference on Innovation and Technology in Computer Science Education, 2016

Deep Learning with Minimal Training Data: TurkuNLP Entry in the BioNLP Shared Task 2016.
Proceedings of the 4th BioNLP Shared Task Workshop, BioNLP 2016, 2016

Syntactic analyses and named entity recognition for PubMed and PubMed Central - up-to-the-minute.
Proceedings of the 15th Workshop on Biomedical Natural Language Processing, 2016

Automatically assessed electronic exams in programming courses.
Proceedings of the Australasian Computer Science Week Multiconference, 2016

2015
Application of the EVEX resource to event extraction and network construction: Shared Task entry and result analysis.
BMC Bioinform., December, 2015

TEES 2.2: Biomedical Event Extraction for Diverse Corpora.
BMC Bioinform., December, 2015

Smart Dosing: A mobile application for tracking the medication tray-filling and dispensation processes in hospital wards.
Proceedings of the Recent Advances in Ambient Assisted Living, 2015

Care episode retrieval: distributional semantic models for information retrieval in the clinical domain.
BMC Medical Informatics Decis. Mak., 2015

On study habits on an introductory course on programming.
Comput. Sci. Educ., 2015

Towards the Classification of the Finnish Internet Parsebank: Detecting Translations and Informality.
Proceedings of the 20th Nordic Conference of Computational Linguistics, 2015

Interactive Learning Content for Introductory Computer Science Course Using the ViLLE Exercise Framework.
Proceedings of the 2015 International Conference on Learning and Teaching in Computing and Engineering, 2015

Learning Low Cost Multi-target Models by Enforcing Sparsity.
Proceedings of the Current Approaches in Applied Artificial Intelligence, 2015

Comparing student performance between traditional and technologically enhanced programming course.
Proceedings of the 17th Australasian Computing Education Conference, 2015

2014
Multi-label learning under feature extraction budgets.
Pattern Recognit. Lett., 2014

Building the essential resources for Finnish: the Turku Dependency Treebank.
Lang. Resour. Evaluation, 2014

Predicting patient acuity from electronic patient records.
J. Biomed. Informatics, 2014

Statistical parsing of varieties of clinical Finnish.
Artif. Intell. Medicine, 2014

Documentation of the Clinical Phase of the Cardiac Rehabilitation Process in a Finnish University Hospital District.
Proceedings of the Safe and Secure Cities, 2014

Properties of Object-Level Cross-Validation Schemes for Symmetric Pair-Input Data.
Proceedings of the Structural, Syntactic, and Statistical Pattern Recognition, 2014

On Evaluation of Automatically Generated Clinical Discharge Summaries.
Proceedings of the 2nd European Workshop on Practical Aspects of Health Informatics, 2014

Automatic assessment and immediate feedback in first grade mathematics.
Proceedings of the 14th Koli Calling International Conference on Computing Education Research, 2014

Can IT health-care applications improve the medication tray-filling process at hospital wards? An exploratory study using eye-tracking and stress response.
Proceedings of the 16th IEEE International Conference on e-Health Networking, 2014

Care Episode Retrieval.
Proceedings of the 5th International Workshop on Health Text Mining and Information Analysis, 2014

2013
Using cluster analysis to identify weak signals of lethal trends in aviation and healthcare documentation.
Int. J. Netw. Virtual Organisations, 2013

UTurku: Drug Named Entity Recognition and Drug-Drug Interaction Extraction Using SVM Classification and Domain Knowledge.
Proceedings of the 7th International Workshop on Semantic Evaluation, 2013

Towards a Dependency-based PropBank of General Finnish.
Proceedings of the 19th Nordic Conference of Computational Linguistics, 2013

Building a Large Automatically Parsed Corpus of Finnish.
Proceedings of the 19th Nordic Conference of Computational Linguistics, 2013

Predicting Conjunct Propagation and Other Extended Stanford Dependencies.
Proceedings of the Second International Conference on Dependency Linguistics, 2013

Using Text Mining to Explore Factors Associated with Acute Confusion in Cardiac Patients Documentation.
Proceedings of the Working Notes for CLEF 2013 Conference , 2013

EVEX in ST'13: Application of a large-scale text mining resource to event extraction and network construction.
Proceedings of the BioNLP Shared Task 2013 Workshop, Sofia, 2013

TEES 2.1: Automated Annotation Scheme Learning in the BioNLP 2013 Shared Task.
Proceedings of the BioNLP Shared Task 2013 Workshop, Sofia, 2013

2012
A Kernel-Based Framework for Learning Graded Relations From Data.
IEEE Trans. Fuzzy Syst., 2012

Parallelized Online Regularized Least-Squares for Adaptive Embedded Systems.
Int. J. Embed. Real Time Commun. Syst., 2012

Semantically linking molecular entities in literature through entity relationships.
BMC Bioinform., 2012

University of Turku in the BioNLP'11 Shared Task.
BMC Bioinform., 2012

Wrapper-based selection of genetic features in genome-wide association studies through fast matrix operations.
Algorithms Mol. Biol., 2012

Exploring Biomolecular Literature with EVEX: Connecting Genes through Events, Homology, and Indirect Associations.
Adv. Bioinformatics, 2012

Avoiding Hazards - What Can Health Care Learn from Aviation?
Proceedings of the Exploring the Abyss of Inequalities, 2012

Parallel Feature Selection for Regularized Least-Squares.
Proceedings of the Applied Parallel and Scientific Computing, 2012

VirTu: virtual tutoring environment.
Proceedings of the 12th Koli Calling International Conference on Computing Education Research, 2012

Computer-assisted learning in primary school mathematics using ViLLE education tool.
Proceedings of the 12th Koli Calling International Conference on Computing Education Research, 2012

Breaking the Programming Language Barrier: Using Program Visualizations to Transfer Programming Knowledge in One Programming Language to Another.
Proceedings of the 12th IEEE International Conference on Advanced Learning Technologies, 2012

Implementation and Analysis of Block Dense Matrix Decomposition on Network-on-Chips.
Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, 2012

Vocabulary In Discharge Documents The Patient's Perspective.
Proceedings of the CLEF 2012 Evaluation Labs and Workshop, 2012

An Ontology to Improve Accessibility and Quality of Patient Instructions.
Proceedings of the CLEF 2012 Evaluation Labs and Workshop, 2012

PubMed-Scale Event Extraction for Post-Translational Modifications, Epigenetics and Protein Structural Relations.
Proceedings of the 2012 Workshop on Biomedical Natural Language Processing, 2012

2011
Training linear ranking SVMs in linearithmic time using red-black trees.
Pattern Recognit. Lett., 2011

On Learning and Cross-Validation with Decomposed Nyström Approximation of Kernel Matrix.
Neural Process. Lett., 2011

An experimental comparison of cross-validation techniques for estimating the area under the ROC curve.
Comput. Stat. Data Anal., 2011

Extracting Contextualized Complex Biological Events with Rich Graph-Based Feature Sets.
Comput. Intell., 2011

U-Compare bio-event meta-service: compatible BioNLP event extraction services.
BMC Bioinform., 2011

Learning Multi-Label Predictors under Sparsity Budget.
Proceedings of the Eleventh Scandinavian Conference on Artificial Intelligence, 2011

A Parallel Online Regularized Least-squares Machine Learning Algorithm for Future Multi-core Processors.
Proceedings of the PECCS 2011, 2011

Greedy Regularized Least-Squares for Multi-task Learning.
Proceedings of the Data Mining Workshops (ICDMW), 2011

An Improved Training Algorithm for the Linear Ranking Support Vector Machine.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2011, 2011

Fast and parallelized greedy forward selection of genetic variants in Genome-wide association studies.
Proceedings of the 2011 IEEE International Workshop on Genomic Signal Processing and Statistics, 2011

Comparing the collaborative and independent viewing of program visualizations.
Proceedings of the 2011 Frontiers in Education Conference, 2011

Learning Valued Relations from Data.
Proceedings of the Eurofuse 2011, 2011

A Dependency-based Analysis of Treebank Annotation Errors.
Proceedings of the Computational Dependency Theory [papers from the International Conference on Dependency Linguistics, 2011

EVEX: A PubMed-Scale Resource for Homology-Based Generalization of Text Mining Predictions.
Proceedings of the 2011 Workshop on Biomedical Natural Language Processing, 2011

Generalizing Biomedical Event Extraction.
Proceedings of BioNLP Shared Task 2011 Workshop, Portland, Oregon, USA, June 24, 2011, 2011

2010
A comparison of AUC estimators in small-sample studies.
Proceedings of the third International Workshop on Machine Learning in Systems Biology, 2010

Accuracy of protein hydropathy predictions.
Int. J. Data Min. Bioinform., 2010

Learning intransitive reciprocal relations with kernel methods.
Eur. J. Oper. Res., 2010

Complex event extraction at PubMed scale.
Bioinform., 2010

Host-IP clustering technique for deep web characterization.
Proceedings of the 2010 ACM Symposium on Applied Computing (SAC), 2010

Conditional Ranking on Relational Data.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2010

Reflections on Koli Calling Conference: emergence of the Koli spirit.
Proceedings of the 10th Koli Calling International Conference on Computing Education Research, 2010

Speeding Up Greedy Forward Selection for Regularized Least-Squares.
Proceedings of the Ninth International Conference on Machine Learning and Applications, 2010

Applying Permutation Tests for Assessing the Statistical Significance of Wrapper Based Feature Selection.
Proceedings of the Ninth International Conference on Machine Learning and Applications, 2010

Reconstruction of Semantic Relationships from Their Projections in Biomolecular Domain.
Proceedings of the 2010 Workshop on Biomedical Natural Language Processing, 2010

Scaling up Biomedical Event Extraction to the Entire PubMed.
Proceedings of the 2010 Workshop on Biomedical Natural Language Processing, 2010

Dependency-Based PropBanking of Clinical Finnish.
Proceedings of the Fourth Linguistic Annotation Workshop, 2010

Co-Regularized Least-Squares for Label Ranking.
Proceedings of the Preference Learning., 2010

2009
Matrix representations, linear transformations, and kernels for disambiguation in natural language.
Mach. Learn., 2009

Towards automated processing of clinical Finnish: Sublanguage analysis and a rule-based parser.
Int. J. Medical Informatics, 2009

Combining hidden Markov models and latent semantic analysis for topic segmentation and labeling: Method and clinical application.
Int. J. Medical Informatics, 2009

Effects, Experiences and Feedback from Studies of a Program Visualization Tool.
Informatics Educ., 2009

Locality kernels for sequential data and their applications to parse ranking.
Appl. Intell., 2009

Parsing Clinical Finnish: Experiments with Rule-Based and Statistical Dependency Parsers.
Proceedings of the 17th Nordic Conference of Computational Linguistics, 2009

Learning to Extract Biological Event and Relation Graphs.
Proceedings of the 17th Nordic Conference of Computational Linguistics, 2009

Evaluating Pain in Intensive Care.
Proceedings of the Connecting Health and Humans - Proceedings of NI2009, The 10th International Congress on Nursing Informatics, 28 June, 2009

Efficient Hold-Out for Subset of Regressors.
Proceedings of the Adaptive and Natural Computing Algorithms, 9th International Conference, 2009

Integrated Data Management and Analysis Environment for Medical Longitudinal Research with Machine Learning Based Prediction Models.
Proceedings of the CSIE 2009, 2009 WRI World Congress on Computer Science and Information Engineering, March 31, 2009

Extracting Complex Biological Events with Rich Graph-Based Feature Sets.
Proceedings of the BioNLP 2009 Workshop Companion Volume for Shared Task, BioNLP@HLT-NAACL 2009, 2009

2008
Kernels for Text Analysis.
Proceedings of the Advances of Computational Intelligence in Industrial Systems, 2008

Effectiveness of Program Visualization: A Case Study with the ViLLE Tool.
J. Inf. Technol. Educ. Innov. Pract., 2008

Comparative analysis of five protein-protein interaction corpora.
BMC Bioinform., 2008

All-paths graph kernel for protein-protein interaction extraction with evaluation of cross-corpus learning.
BMC Bioinform., 2008

Detecting biological associations between genes based on the theory of phase synchronization.
Biosyst., 2008

A Sparse Regularized Least-Squares Preference Learning Algorithm.
Proceedings of the Tenth Scandinavian Conference on Artificial Intelligence, 2008

Efficient AUC Maximization with Regularized Least-Squares.
Proceedings of the Tenth Scandinavian Conference on Artificial Intelligence, 2008

Automatic assessment of program visualization exercises.
Proceedings of the 8th International Conference on Computing Education Research, 2008

Utterance-level normalization for relative articulation rate analysis.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Investigating perception of places of articulation in sign and speech.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Correlation of utterance length and segmental duration in Finnish is questionable.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Define and Visualize Your First Programming Language.
Proceedings of the 8th IEEE International Conference on Advanced Learning Technologies, 2008

A Graph Kernel for Protein-Protein Interaction Extraction.
Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing, 2008

2007
Applying language technology to nursing documents: Pros and cons with a focus on ethics.
Int. J. Medical Informatics, 2007

Towards automated classification of intensive care nursing narratives.
Int. J. Medical Informatics, 2007

BioInfer: a corpus for information extraction in the biomedical domain.
BMC Bioinform., 2007

Locality Kernels for Protein Classification.
Proceedings of the Algorithms in Bioinformatics, 7th International Workshop, 2007

Role of Different Spectral Attributes in Vowel Categorization: the Case of Udmurt.
Proceedings of the 16th Nordic Conference of Computational Linguistics, 2007

Utterance-Initial Duration of Finnish Non-Plosive Consonants.
Proceedings of the 16th Nordic Conference of Computational Linguistics, 2007

On Estimating the Scale of National Deep Web.
Proceedings of the Database and Expert Systems Applications, 18th International Conference, 2007

On the unification of syntactic annotations under the Stanford dependency scheme: A case study on BioInfer and GENIA.
Proceedings of the Biological, translational, and clinical language processing, 2007

Evaluation of Protein Hydropathy Scales.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2007

2006
Evaluation of two dependency parsers on biomedical corpus targeted at protein-protein interactions.
Int. J. Medical Informatics, 2006

Editorial comments.
Comput. Sci. Educ., 2006

What about a simple language? Analyzing the difficulties in learning to program.
Comput. Sci. Educ., 2006

Lexical adaptation of link grammar to the biomedical sublanguage: a comparative evaluation of three approaches.
BMC Bioinform., 2006

Relevance Ranking of Intensive Care Nursing Narratives.
Proceedings of the Knowledge-Based Intelligent Information and Engineering Systems, 2006

Locality-Convolution Kernel and Its Application to Dependency Parse Ranking.
Proceedings of the Advances in Applied Artificial Intelligence, 2006

Implementing a Rule-Based Speech Synthesizer on a Mobile Platform.
Proceedings of the Advances in Natural Language Processing, 2006

Segmental Duration in Utterance-Initial Environment: Evidence from Finnish Speech Corpora.
Proceedings of the Advances in Natural Language Processing, 2006

Incorporating External Information in Bayesian Classifiers Via Linear Feature Transformations.
Proceedings of the Advances in Natural Language Processing, 2006

The Vowel Game: Continuous Real-Time Visualization for Pronunciation Learning with Vowel Charts.
Proceedings of the Advances in Natural Language Processing, 2006

Regular Approximation of Link Grammar.
Proceedings of the Advances in Natural Language Processing, 2006

2005
Multi-Perspective Study of Novice Learners Adopting the Visual Algorithm Simulation Exercise System TRAKLA2.
Informatics Educ., 2005

On the visualization of the DNA sequence and its nucleotide content.
SIGSAM Bull., 2005

Contextual weighting for Support Vector Machines in literature mining: an application to gene versus protein name disambiguation.
BMC Bioinform., 2005

Regularized Least-Squares for Parse Ranking.
Proceedings of the Advances in Intelligent Data Analysis VI, 2005

The Feasibility of Automatic Assessment and Feedback.
Proceedings of the Cognition and Exploratory Learning in Digital Age, 2005

Kernels Incorporating Word Positional Information in Natural Language Disambiguation Tasks.
Proceedings of the Eighteenth International Florida Artificial Intelligence Research Society Conference, 2005

Web Services for DNA Sequence Analysis.
Proceedings of the 16th International Workshop on Database and Expert Systems Applications (DEXA 2005), 2005

2004
New Techniques for Disambiguation in Natural Language and Their Application to Biological Text.
J. Mach. Learn. Res., 2004

Ontology-Based Feature Transformations: A Data-Driven Approach.
Proceedings of the Advances in Natural Language Processing, 4th International Conference, 2004

Extracting Protein-Protein Interaction Sentences by Applying Rough Set Data Analysis.
Proceedings of the Rough Sets and Current Trends in Computing, 2004

Analysis of Link Grammar on Biomedical Dependency Corpus Targeted at Protein-Protein Interactions.
Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications, 2004

2002
Mobile access to biological databases on the Internet.
IEEE Trans. Biomed. Eng., 2002

2001
BioWAP, mobile Internet service for bioinformatics.
Bioinform., 2001

1997
Representative noise-free complete-link classification with application to protein structures.
Pattern Recognit., 1997

1993
General formulation and evaluation of agglomerative clustering methods with metric and non-metric distances.
Pattern Recognit., 1993


  Loading...