Ulf Leser

Orcid: 0000-0003-2166-9582

Affiliations:
  • Humboldt University of Berlin, Germany


According to our database1, Ulf Leser authored at least 240 papers between 1997 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Beyond Silos: Next Steps in Research Data Management Report from the 2024 Spring Symposium of FG Datenbanken.
Datenbank-Spektrum, July, 2024

Raising the ClaSS of Streaming Time Series Segmentation.
Proc. VLDB Endow., April, 2024

preon: Fast and accurate entity normalization for drug names and cancer types in precision oncology.
Bioinform., March, 2024

Lotaru: Locally predicting workflow task runtimes for resource management on heterogeneous infrastructures.
Future Gener. Comput. Syst., January, 2024

Validity constraints for data analysis workflows.
Future Gener. Comput. Syst., 2024

Workflows Community Summit 2024: Future Trends and Challenges in Scientific Workflows.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2024

Discovering Leitmotifs in Multidimensional Time Series.
CoRR, 2024

HunFlair2 in a cross-corpus evaluation of biomedical named entity recognition and normalization tools.
CoRR, 2024

BELHD: Improving Biomedical Entity Linking with Homonoym Disambiguation.
CoRR, 2024

TLIMB - A Transfer Learning Framework for IMage Analysis of the Brain.
Proceedings of the Workshops of the EDBT/ICDT 2024 Joint Conference co-located with the EDBT/ICDT 2024 Joint Conference, 2024

Optimizing Job/Task Granularity for Metagenomic Workflows in Heterogeneous Cluster Infrastructures.
Proceedings of the Workshops of the EDBT/ICDT 2024 Joint Conference co-located with the EDBT/ICDT 2024 Joint Conference, 2024

CuttleFlow: Infrastructure-Specific Workflow Adaption for Improved Reusability.
Proceedings of the 20th IEEE International Conference on e-Science, 2024

Ponder: Online Prediction of Task Memory Requirements for Scientific Workflows.
Proceedings of the 20th IEEE International Conference on e-Science, 2024

HU-WBI at BioASQ12B Phase A: Exploring Rank Fusion of Dense Retrievers and Re-rankers.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024

2023
WEASEL 2.0: a random dilated dictionary transform for fast, accurate and memory constrained time series classification.
Mach. Learn., December, 2023

A mathematical programming approach for resource allocation of data analysis workflows on heterogeneous clusters.
J. Supercomput., November, 2023

PEDL+: protein-centered relation extraction from PubMed at your fingertip.
Bioinform., November, 2023

BELB: a biomedical entity linking benchmark.
Bioinform., October, 2023

ClaSP: parameter-free time series segmentation.
Data Min. Knowl. Discov., May, 2023

Common Workflow Scheduler Evaluation with Nextflow and Kubernetes.
Dataset, May, 2023

Common Workflow Scheduler for Kubernetes.
Dataset, May, 2023

Nextflow with the Common Workflow Scheduler Interface for Kubernetes.
Dataset, May, 2023

Integrating HPC, AI, and Workflows for Scientific Data Analysis (Dagstuhl Seminar 23352).
Dagstuhl Reports, 2023

The Common Workflow Scheduler Interface: Status Quo and Future Plans.
CoRR, 2023

Large Language Models to the Rescue: Reducing the Complexity in Scientific Workflow Development Using ChatGPT.
CoRR, 2023

Validity Constraints for Data Analysis Workflows.
CoRR, 2023


Applicability of Models Trained on Generated Clinical German Datasets on Out-domain Data.
Proceedings of the Lernen, 2023

Comparing Humans and Algorithms in Feature Ranking: A Case-Study in the Medical Domain.
Proceedings of the Lernen, 2023

Time Series Segmentation Applied to a New Data Set for Mobile Sensing of Human Activities.
Proceedings of the Workshops of the EDBT/ICDT 2023 Joint Conference, 2023

Contract-Driven Design of Scientific Data Analysis Workflows.
Proceedings of the 19th IEEE International Conference on e-Science, 2023

Design by Contract Revisited in the Context of Scientific Data Analysis Workflows.
Proceedings of the 19th IEEE International Conference on e-Science, 2023

How Workflow Engines Should Talk to Resource Managers: A Proposal for a Common Workflow Scheduling Interface.
Proceedings of the 23rd IEEE/ACM International Symposium on Cluster, 2023

Human Activity Segmentation Challenge @ ECML/PKDD'23.
Proceedings of the Advanced Analytics and Learning on Temporal Data, 2023

2022
Motiflets - Simple and Accurate Detection of Motifs in Time Series.
Proc. VLDB Endow., 2022

Motiflets - Fast and Accurate Detection of Motifs in Time Series.
CoRR, 2022

Chemical-protein relation extraction with ensembles of carefully tuned pretrained language models.
Database J. Biol. Databases Curation, 2022

RegEl corpus: identifying DNA regulatory elements in the scientific literature.
Database J. Biol. Databases Curation, 2022

Lotaru: Locally Estimating Runtimes of Scientific Workflow Tasks in Heterogeneous Clusters.
Proceedings of the SSDBM 2022: 34th International Conference on Scientific and Statistical Database Management, Copenhagen, Denmark, July 6, 2022

Reshi: Recommending Resources for Scientific Workflow Tasks on Heterogeneous Infrastructures.
Proceedings of the IEEE International Performance, 2022

BEEDS: Large-Scale Biomedical Event Extraction using Distant Supervision and Question Answering.
Proceedings of the 21st Workshop on Biomedical Language Processing, 2022

Window Size Selection in Unsupervised Time Series Analytics: A Review and Benchmark.
Proceedings of the Advanced Analytics and Learning on Temporal Data, 2022

2021
The Collaborative Research Center FONDA.
Datenbank-Spektrum, 2021

Workflows Community Summit: Advancing the State-of-the-art of Scientific Workflows Management Systems Research and Development.
CoRR, 2021

HunFlair: an easy-to-use tool for state-of-the-art biomedical named entity recognition.
Bioinform., 2021

Large-scale entity representation learning for biomedical relationship extraction.
Bioinform., 2021


Extend, don't rebuild: Phrasing conditional graph modification as autoregressive sequence labelling.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

FORCE on Nextflow: Scalable Analysis of Earth Observation Data on Commodity Clusters.
Proceedings of the CIKM 2021 Workshops co-located with 30th ACM International Conference on Information and Knowledge Management (CIKM 2021), 2021

ClaSP - Time Series Segmentation.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

WBI at MEDIQA 2021: Summarizing Consumer Health Questions with Generative Transformers.
Proceedings of the 20th Workshop on Biomedical Language Processing, 2021

Early Detection of Sexual Predators in Chats.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Finding k-shortest paths with limited overlap.
VLDB J., 2020

TEASER: early and accurate time series classification.
Data Min. Knowl. Discov., 2020

DeepTable: a permutation invariant neural network for table orientation classification.
Data Min. Knowl. Discov., 2020

Portability of Scientific Workflows in NGS Data Analysis: A Case Study.
CoRR, 2020

PEDL: extracting protein-protein associations using deep language models and distant supervision.
Bioinform., 2020

HUNER: improving biomedical NER with pretraining.
Bioinform., 2020

PatSeg: A Sequential Patent Segmentation Approach.
Big Data Res., 2020

TabSim: A Siamese Neural Network for Accurate Estimation of Table Similarity.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

Biomedical Event Extraction as Multi-turn Question Answering.
Proceedings of the 11th International Workshop on Health Text Mining and Information Analysis, 2020

2019
Predictive performance modeling for distributed batch processing using black box monitoring and machine learning.
Inf. Syst., 2019

VIST - a Variant-Information Search Tool for precision oncology.
BMC Bioinform., 2019

Feedback-Based Resource Allocation for Batch Scheduling of Scientific Workflows.
Proceedings of the 17th International Conference on High Performance Computing & Simulation, 2019

Learning Low-Wastage Memory Allocations for Scientific Workflows at IceCube.
Proceedings of the 17th International Conference on High Performance Computing & Simulation, 2019

BB-Tree: A Main-Memory Index Structure for Multidimensional Range Queries.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

BB-Tree: A practical and efficient main-memory index structure for multidimensional workloads.
Proceedings of the Advances in Database Technology, 2019

Classifying German Animal Experiment Summaries with Multi-lingual BERT at CLEF eHealth 2019 Task 1.
Proceedings of the Working Notes of CLEF 2019, 2019

Information Retrieval for Precision Oncology.
Proceedings of the Datenbanksysteme für Business, 2019

NLProlog: Reasoning with Weak Unification for Question Answering in Natural Language.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Graph Management in the Life Sciences.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Semantic Data Integration for Life Science Entities.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Set Similarity Joins on MapReduce: An Experimental Survey.
Proc. VLDB Endow., 2018

Variant information systems for precision oncology.
BMC Medical Informatics Decis. Mak., 2018

Column-wise compression of open relational data.
Inf. Sci., 2018

Finding k-Dissimilar Paths with Minimum Collective Length.
CoRR, 2018

Predictive Performance Modeling for Distributed Computing using Black-Box Monitoring and Machine Learning.
CoRR, 2018

Cross-lingual Candidate Search for Biomedical Concept Normalization.
CoRR, 2018

Multidimensional range queries on modern hardware.
Proceedings of the 30th International Conference on Scientific and Statistical Database Management, 2018

Dissecting Company Names using Sequence Labeling.
Proceedings of the Conference "Lernen, Wissen, Daten, Analysen", 2018

Exploiting Automatic Vectorization to Employ SPMD on SIMD Registers.
Proceedings of the 34th IEEE International Conference on Data Engineering Workshops, 2018

Finding <i>k</i>-dissimilar paths with minimum collective length.
Proceedings of the 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, 2018

Classifying Land Cover from Satellite Images Using Time Series Analytics.
Proceedings of the Workshops of the EDBT/ICDT 2018 Joint Conference (EDBT/ICDT 2018), 2018

POS: Online Learning for Memory-Aware Scheduling of Scientific Workflows.
Proceedings of the 14th IEEE International Conference on e-Science, 2018

WBI at CLEF eHealth 2018 Task 1: Language-independent ICD-10 Coding using Multi-lingual Embeddings and Recurrent Neural Networks.
Proceedings of the Working Notes of CLEF 2018, 2018

Identifying Key Sentences for Precision Oncology Using Semi-Supervised Learning.
Proceedings of the BioNLP 2018 workshop, Melbourne, Australia, July 19, 2018, 2018

2017
Computation semantics of the functional scientific workflow language Cuneiform.
J. Funct. Program., 2017

How to improve information extraction from German medical records.
it Inf. Technol., 2017

Optimization of Complex Dataflows with User-Defined Functions.
ACM Comput. Surv., 2017

Multivariate Time Series Classification with WEASEL+MUSE.
CoRR, 2017

Estimating genome-wide regulatory activity from multi-omics data sets using mathematical optimization.
BMC Syst. Biol., 2017

Deep learning with word embeddings improves biomedical named entity recognition.
Bioinform., 2017

Comparative assessment of differential network analysis methods.
Briefings Bioinform., 2017

Fine-Grained Opinion Mining from Mobile App Reviews with Word Embedding Features.
Proceedings of the Natural Language Processing and Information Systems, 2017

Exact and Approximate Algorithms for Finding k-Shortest Paths with Limited Overlap.
Proceedings of the 20th International Conference on Extending Database Technology, 2017

Hi-WAY: Execution of Scientific Workflows on Hadoop YARN.
Proceedings of the 20th International Conference on Extending Database Technology, 2017

Modeling Data Flow Execution in a Parallel Environment.
Proceedings of the Big Data Analytics and Knowledge Discovery, 2017

Multi-lingual ICD-10 Coding using a Hybrid rule-based and Supervised Classification Approach at CLEF eHealth 2017.
Proceedings of the Working Notes of CLEF 2017, 2017

Fast and Accurate Time Series Classification with WEASEL.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Benchmarking Univariate Time Series Classifiers.
Proceedings of the Datenbanksysteme für Business, 2017

2016
Fast Sampling-Based Whole-Genome Haplotype Block Recognition.
IEEE ACM Trans. Comput. Biol. Bioinform., 2016

graphANNIS: A Fast Query Engine for Deeply Annotated Linguistic Corpora.
J. Lang. Technol. Comput. Linguistics, 2016

Recognizing chemicals in patents: a comparative analysis.
J. Cheminformatics, 2016

Effective and efficient similarity search in scientific workflow repositories.
Future Gener. Comput. Syst., 2016

SETH detects and normalizes genetic variants in text.
Bioinform., 2016

SoFIA: a data integration framework for annotating high-throughput datasets.
Bioinform., 2016

Cache-Sensitive Skip List: Efficient Range Queries on Modern CPUs.
Proceedings of the Data Management on New Hardware, 2016

PIEJoin: Towards Parallel Set Containment Joins.
Proceedings of the 28th International Conference on Scientific and Statistical Database Management, 2016

Performance of Gene Name Recognition Tools on Patents.
Proceedings of the 7th International Symposium on Semantic Mining in Biomedicine, 2016

Potential and Pitfalls of Domain-Specific Information Extraction at Web Scale.
Proceedings of the 2016 International Conference on Management of Data, 2016

Graph n-grams for Scientific Workflow Similarity Search.
Proceedings of the Conference "Lernen, 2016

SCARE ― The Sentiment Corpus of App Reviews with Fine-grained Annotations in German.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2015
MRCSI: Compressing and Searching String Collections with Multiple References.
Proc. VLDB Endow., 2015

SAASFEE: Scalable Scientific Workflow Execution Engine.
Proc. VLDB Endow., 2015

SOFA: An extensible logical optimizer for UDF-heavy data flows.
Inf. Syst., 2015

DynamicCloudSim: Simulating heterogeneity in computational clouds.
Future Gener. Comput. Syst., 2015

Computer-assisted curation of a human regulatory core network from the biological literature.
Bioinform., 2015

BiobankCloud: A Platform for the Secure Storage, Sharing, and Processing of Large Biomedical Data Sets.
Proceedings of the Biomedical Data Management and Graph Online Querying, 2015

Alternative routing: k-shortest paths with limited overlap.
Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems, 2015

Massively Parallel Analysis of Similarity Matrices on Heterogeneous Hardware.
Proceedings of the Workshops of the EDBT/ICDT 2015 Joint Conference (EDBT/ICDT), 2015

Cuneiform: a Functional Language for Large Scale Scientific Data Analysis.
Proceedings of the Workshops of the EDBT/ICDT 2015 Joint Conference (EDBT/ICDT), 2015

2014
The Stratosphere platform for big data analytics.
VLDB J., 2014

State-of-the-art in string similarity search and join.
SIGMOD Rec., 2014

Similarity Search for Scientific Workflows.
Proc. VLDB Endow., 2014

CellFinder: a cell data repository.
Nucleic Acids Res., 2014

A survey on annotation tools for the biomedical literature.
Briefings Bioinform., 2014

Extracting and aggregating temporal events from text.
Proceedings of the 23rd International World Wide Web Conference, 2014

Versatile optimization of UDF-heavy data flows with sofa.
Proceedings of the International Conference on Management of Data, 2014

Next Generation Data Integration (for the Life Sciences).
Proceedings of the 26th GI-Workshop Grundlagen von Datenbanken, 2014

Layer Decomposition: An Effective Structure-Based Approach for Scientific Workflow Similarity.
Proceedings of the 10th IEEE International Conference on e-Science, 2014

RRCA: Ultra-Fast Multiple In-species Genome Alignments.
Proceedings of the Algorithms for Computational Biology - First International Conference, 2014

2013
FRESCO: Referential Compression of Highly Similar Sequences.
IEEE ACM Trans. Comput. Biol. Bioinform., 2013

RCSI: Scalable similarity search in thousand(s) of genomes.
Proc. VLDB Endow., 2013

Parallelization in Scientific Workflow Management Systems
CoRR, 2013

SOFA: An Extensible Logical Optimizer for UDF-heavy Dataflows.
CoRR, 2013

A detailed error analysis of 13 kernel methods for protein-protein interaction extraction.
BMC Bioinform., 2013

CELDA - an ontology for the comprehensive representation of cells in complex systems.
BMC Bioinform., 2013

Using ontologies to study cell transitions.
J. Biomed. Semant., 2013

Preliminary evaluation of the CellFinder literature curation pipeline for gene expression in kidney cells and anatomical parts.
Database J. Biol. Databases Curation, 2013

WBI-DDI: Drug-Drug Interaction Extraction using Majority Voting.
Proceedings of the 7th International Workshop on Semantic Evaluation, 2013

WBI-NER: The impact of domain-specific features on the performance of identifying and classifying mentions of drugs.
Proceedings of the 7th International Workshop on Semantic Evaluation, 2013

CPU and cache efficient management of memory-resident databases.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

OmixAnalyzer - A Web-Based System for Management and Analysis of High-Throughput Omics Data Sets.
Proceedings of the Data Integration in the Life Sciences - 9th International Conference, 2013

Experiences from Developing the Domain-Specific Entity Search Engine GeneView.
Proceedings of the Datenbanksysteme für Business, 2013

QGramProjector: Q-Gram Projection for Indexing Highly-Similar Strings.
Proceedings of the Advances in Databases and Information Systems, 2013

2012
GeneView: a comprehensive semantic search engine for PubMed.
Nucleic Acids Res., 2012

Improving data quality by source analysis.
ACM J. Data Inf. Qual., 2012

Data Management Challenges in Next Generation Sequencing.
Datenbank-Spektrum, 2012

ChemSpot: a hybrid system for chemical named entity recognition.
Bioinform., 2012

Tools for managing and analyzing microarray data.
Briefings Bioinform., 2012

Adaptive efficient compression of genomes.
Algorithms Mol. Biol., 2012

(Re)Use in Public Scientific Workflow Repositories.
Proceedings of the Scientific and Statistical Database Management, 2012

Regular Path Queries on Large Graphs.
Proceedings of the Scientific and Statistical Database Management, 2012

Efficient Similarity Search in Very Large String Sets.
Proceedings of the Scientific and Statistical Database Management, 2012

String Searching in Referentially Compressed Genomes.
Proceedings of the KDIR 2012 - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval, Barcelona, Spain, 4, 2012

InterOnto - Ranking Inter-Ontology Links.
Proceedings of the Data Integration in the Life Sciences - 8th International Conference, 2012

Discovering conditional inclusion dependencies.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
Search, adapt, and reuse: the future of scientific workflows.
SIGMOD Rec., 2011

Molecular Event Extraction from Link Grammar Parse Trees in the BioNLP'09 Shared Task.
Comput. Intell., 2011

BioCreative III interactive task: an overview.
BMC Bioinform., 2011

Next generation data integration for Life Sciences.
Proceedings of the 27th International Conference on Data Engineering, 2011

Pipa: custom integration of protein interactions and pathways.
Proceedings of the 41. Jahrestagung der Gesellschaft für Informatik, 2011

Scalable Sequence Similarity Search and Join in Main Memory on Multi-cores.
Proceedings of the Euro-Par 2011: Parallel Processing Workshops - CCPI, CGWS, HeteroPar, HiBB, HPCVirt, HPPC, HPSS, MDGS, ProPer, Resilience, UCHPC, VHPC, Bordeaux, France, August 29, 2011

Not all links are equal: Exploiting Dependency Types for the Extraction of Protein-Protein Interactions from Text.
Proceedings of the 2011 Workshop on Biomedical Natural Language Processing, 2011

2010
Deep Web Integration with VisQI.
Proc. VLDB Endow., 2010

A Comprehensive Benchmark of Kernel Methods to Extract Protein-Protein Interactions from Literature.
PLoS Comput. Biol., 2010

A fast and effective dependency graph kernel for PPI relation extraction.
BMC Bioinform., 2010

Species identification for gene name normalization.
BMC Bioinform., 2010

Simple tricks for improving pattern-based information extraction from the biomedical literature.
J. Biomed. Semant., 2010

Phenoclustering: online mining of cross-species phenotypes.
Bioinform., 2010

Graph-based concept identification and disambiguation for enterprise search.
Proceedings of the 19th International Conference on World Wide Web, 2010

Prefix Tree Indexing for Similarity Search and Similarity Joins on Genomic Data.
Proceedings of the Scientific and Statistical Database Management, 2010

Selecting Materialized Views for RDF Data.
Proceedings of the Current Trends in Web Engineering, 2010

An urban health risk analysis for Berlin: exploration and integration of spatio-temporal information on the urban environment.
Proceedings of the Integration of Environmental Information in Europe: Proceedings of the 24th International Conference on Informatics for Environmental Protection, 2010

Estimating Result Size and Execution Times for Graph Queries.
Proceedings of the Local Proceedings of the Fourteenth East-European Conference on Advances in Databases and Information Systems, 2010

2009
Graph Management in the Life Sciences.
Proceedings of the Encyclopedia of Database Systems, 2009

Semantic Data Integration for Life Science Entities.
Proceedings of the Encyclopedia of Database Systems, 2009

A Hierarchical Approach to Model Web Query Interfaces for Web Source Integration.
Proc. VLDB Endow., 2009

A Machine Learning Approach to Foreign Key Discovery.
Proceedings of the 12th International Workshop on the Web and Databases, 2009

SOA-Based Integration of Text Mining Services.
Proceedings of the 2009 IEEE Congress on Services, Part I, 2009

Graph-Based Ontology Construction from Heterogenous Evidences.
Proceedings of the Semantic Web - ISWC 2009, 8th International Semantic Web Conference, 2009

Exploiting scientific workflows for large-scale gene expression data analysis.
Proceedings of the 24th International Symposium on Computer and Information Sciences, 2009

High-performance information extraction with AliBaba.
Proceedings of the EDBT 2009, 2009

Molecular event extraction from Link Grammar parse trees.
Proceedings of the BioNLP 2009 Workshop Companion Volume for Shared Task, BioNLP@HLT-NAACL 2009, 2009

Word Sense Disambiguation in Biomedical Applications.
Proceedings of the Information Retrieval in Biomedicine, 2009

2008
A Flexible Framework for Integrating Annotations from Different Tools and Tag Sets.
Trait. Autom. des Langues, 2008

Integrating protein-protein interactions and text mining for protein function prediction.
BMC Bioinform., 2008

Mining phenotypes for gene function prediction.
BMC Bioinform., 2008

Querying Distributed RDF Data Sources with SPARQL.
Proceedings of the Semantic Web: Research and Applications, 2008

Mining Phenotypes for Protein Function Prediction.
Proceedings of the Ontologies and Text Mining for Life Sciences: Current Status and Future Perspectives, 24.03., 2008

08131 Executive Summary -- Ontologies and Text Mining for Life Sciences : Current Status and Future Perspectives.
Proceedings of the Ontologies and Text Mining for Life Sciences: Current Status and Future Perspectives, 24.03., 2008

2007
Fast and practical indexing and querying of very large graphs.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007

Efficiently Detecting Inclusion Dependencies.
Proceedings of the 23rd International Conference on Data Engineering, 2007

High-Precision Function Prediction using Conserved Interactions.
Proceedings of the German Conference on Bioinformatics, 2007

What's New? What's Certain? - Scoring Search Results in the Presence of Overlapping Data Sources.
Proceedings of the Data Integration in the Life Sciences, 4th International Workshop, 2007

Informationsintegration - Architekturen und Methoden zur Integration verteilter und heterogener Datenquellen.
dpunkt.verlag, 2007

2006
A structural keystone for drug design.
J. Integr. Bioinform., 2006

Adapters, shims, and glue - service interoperability for <i>in silico</i> experiments.
Bioinform., 2006

ALIBABA: PubMed as a graph.
Bioinform., 2006

Collecting a Large Corpus from all of Medline.
Proceedings of the Second International Symposium on Semantic Mining in Biomedicine, 2006

Efficiently Computing Inclusion Dependencies for Schema Discovery.
Proceedings of the 22nd International Conference on Data Engineering Workshops, 2006

Classification of Contradiction Patterns.
Proceedings of the Advances in Data Analysis, 2006

Implementing a Linguistic Query Language for Historic Texts.
Proceedings of the Current Trends in Database Technology - EDBT 2006, 2006

Describing differences between databases.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

Informationsintegration: Architekturen und Methoden zur Integration verteilter und heterogener Datenquellen.
dpunkt, ISBN: 3-89864-400-6, 2006

2005
Erkennen und Bereinigen von Datenfehlern in naturwissenschaftlichen Daten.
Datenbank-Spektrum, 2005

Columba: an integrated database of proteins, structures, and annotations.
BMC Bioinform., 2005

Systematic feature evaluation for gene name recognition.
BMC Bioinform., 2005

GandrKB--ontological microarray annotation and visualization.
Bioinform., 2005

What makes a gene name? Named entity recognition in the biomedical literature.
Briefings Bioinform., 2005

Optimizing syntax patterns for discovering protein-protein interactions.
Proceedings of the 2005 ACM Symposium on Applied Computing (SAC), 2005

A query language for biological networks.
Proceedings of the ECCB/JBI'05 Proceedings, Fourth European Conference on Computational Biology/Sixth Meeting of the Spanish Bioinformatics Network (Jornadas de BioInformática), Palacio de Congresos, Madrid, Spain, September 28, 2005

Querying Ontologies in Relational Database Systems.
Proceedings of the Data Integration in the Life Sciences, Second InternationalWorkshop, 2005

Integrating and Warehousing Liver Gene Expression Data and Related Biomedical Resources in GEDAW.
Proceedings of the Data Integration in the Life Sciences, Second InternationalWorkshop, 2005

(Almost) Hands-Off Information Integration for the Life Sciences.
Proceedings of the Second Biennial Conference on Innovative Data Systems Research, 2005

2004
Completeness of integrated information sources.
Inf. Syst., 2004

Mining for Patterns in Contradictory Data.
Proceedings of the IQIS 2004, 2004

Columba: Multidimensional Data Integration of Protein Annotations.
Proceedings of the Data Integration in the Life Sciences, First International Workshop, 2004


Cooperative Transaction Processing between Clients and Servers.
Proceedings of the Advances in Databases and Information Systems, 2004

2003
Integration molekularbiologischer Daten.
Datenbank-Spektrum, 2003

2000
Query planning in mediator based information systems.
PhD thesis, 2000

Query Planning with Information Quality Bounds.
Proceedings of the Flexible Query Answering Systems, 2000

Strategies for the Conceptual Design of Federated Information Systems.
Proceedings of the Engineering Federated Information Systems, 2000

1999
IXDB, an X chromosome integrated database (update).
Nucleic Acids Res., 1999

EDITtoTrEMBL: a distributed approach to high-quality automated protein sequence annotation.
Bioinform., 1999

A proposal for a standard CORBA interface for genome maps.
Bioinform., 1999

Quality-driven Integration of Heterogenous Information Systems.
Proceedings of the VLDB'99, 1999

Globale Anfragebearbeitung mit verteilten und heterogenen Datenquellen.
Proceedings of the Informatik '99, 1999

Density Scores for Cooperative Query Answering.
Proceedings of the 4. Workshop Föderierte Datenbanken, 1999

Integration durch Standards: Erfahrungen mit CORBA in Life Science Research.
Proceedings of the 4. Workshop Föderierte Datenbanken, 1999

Constructing IDL Views on Relational Databases.
Proceedings of the Advanced Information Systems Engineering, 1999

Designing a Global Information Resource for Molecular Biology.
Proceedings of the Datenbanksysteme in Büro, 1999

1998
IXDB, an X chromosome integrated database.
Nucleic Acids Res., 1998

Issues in developing integrated genomic databases and application to the human X chromosome.
Bioinform., 1998

Combining Heterogeneous Data Sources through Query Correspondence Assertions.
Proceedings of the CIKM'98 First Workshop on Web Information and Data Management (WIDM'98), 1998

Design Issues of Database Access in a CORBA Environment.
Proceedings of the Integration heterogener Softwaresysteme, IHS'98, 1998

Query Mediation for Heterogeneous Data Sources.
Proceedings of the 3. Workshop "Föderierte Datenbanken", 1998

1997
Ansätze zur Föderation moleklargenetischer Datenbanken.
Datenbank Rundbr., 1997

IXDB, an integrated database for genomic data of the Human X Chromosome.
Proceedings of the German Conference on Bioinformatics, 1997


  Loading...