2024
Beyond Silos: Next Steps in Research Data Management Report from the 2024 Spring Symposium of FG Datenbanken.
Datenbank-Spektrum, July, 2024
Raising the ClaSS of Streaming Time Series Segmentation.
Proc. VLDB Endow., April, 2024
preon: Fast and accurate entity normalization for drug names and cancer types in precision oncology.
Bioinform., March, 2024
Lotaru: Locally predicting workflow task runtimes for resource management on heterogeneous infrastructures.
Future Gener. Comput. Syst., January, 2024
Validity constraints for data analysis workflows.
,
,
,
,
,
,
,
,
,
,
,
,
,
Future Gener. Comput. Syst., 2024
Fast and Exact Similarity Search in less than a Blink of an Eye.
CoRR, 2024
Workflows Community Summit 2024: Future Trends and Challenges in Scientific Workflows.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Discovering Leitmotifs in Multidimensional Time Series.
CoRR, 2024
HunFlair2 in a cross-corpus evaluation of biomedical named entity recognition and normalization tools.
CoRR, 2024
BELHD: Improving Biomedical Entity Linking with Homonoym Disambiguation.
CoRR, 2024
TLIMB - A Transfer Learning Framework for IMage Analysis of the Brain.
Proceedings of the Workshops of the EDBT/ICDT 2024 Joint Conference co-located with the EDBT/ICDT 2024 Joint Conference, 2024
Optimizing Job/Task Granularity for Metagenomic Workflows in Heterogeneous Cluster Infrastructures.
Proceedings of the Workshops of the EDBT/ICDT 2024 Joint Conference co-located with the EDBT/ICDT 2024 Joint Conference, 2024
CuttleFlow: Infrastructure-Specific Workflow Adaption for Improved Reusability.
Proceedings of the 20th IEEE International Conference on e-Science, 2024
Ponder: Online Prediction of Task Memory Requirements for Scientific Workflows.
Proceedings of the 20th IEEE International Conference on e-Science, 2024
HU-WBI at BioASQ12B Phase A: Exploring Rank Fusion of Dense Retrievers and Re-rankers.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024
Multivariate Human Activity Segmentation: Systematic Benchmark with ClaSP.
Proceedings of the Advanced Analytics and Learning on Temporal Data, 2024
2023
WEASEL 2.0: a random dilated dictionary transform for fast, accurate and memory constrained time series classification.
Mach. Learn., December, 2023
A mathematical programming approach for resource allocation of data analysis workflows on heterogeneous clusters.
J. Supercomput., November, 2023
PEDL+: protein-centered relation extraction from PubMed at your fingertip.
Bioinform., November, 2023
BELB: a biomedical entity linking benchmark.
Bioinform., October, 2023
ClaSP: parameter-free time series segmentation.
Data Min. Knowl. Discov., May, 2023
Common Workflow Scheduler Evaluation with Nextflow and Kubernetes.
Dataset, May, 2023
Common Workflow Scheduler for Kubernetes.
Dataset, May, 2023
Nextflow with the Common Workflow Scheduler Interface for Kubernetes.
Dataset, May, 2023
Integrating HPC, AI, and Workflows for Scientific Data Analysis (Dagstuhl Seminar 23352).
Dagstuhl Reports, 2023
The Common Workflow Scheduler Interface: Status Quo and Future Plans.
CoRR, 2023
Large Language Models to the Rescue: Reducing the Complexity in Scientific Workflow Development Using ChatGPT.
CoRR, 2023
Validity Constraints for Data Analysis Workflows.
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
Novel Approaches Toward Scalable Composable Workflows in Hyper-Heterogeneous Computing Environments.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
Applicability of Models Trained on Generated Clinical German Datasets on Out-domain Data.
Proceedings of the Lernen, 2023
Comparing Humans and Algorithms in Feature Ranking: A Case-Study in the Medical Domain.
Proceedings of the Lernen, 2023
Time Series Segmentation Applied to a New Data Set for Mobile Sensing of Human Activities.
Proceedings of the Workshops of the EDBT/ICDT 2023 Joint Conference, 2023
Contract-Driven Design of Scientific Data Analysis Workflows.
Proceedings of the 19th IEEE International Conference on e-Science, 2023
Design by Contract Revisited in the Context of Scientific Data Analysis Workflows.
Proceedings of the 19th IEEE International Conference on e-Science, 2023
How Workflow Engines Should Talk to Resource Managers: A Proposal for a Common Workflow Scheduling Interface.
Proceedings of the 23rd IEEE/ACM International Symposium on Cluster, 2023
Human Activity Segmentation Challenge @ ECML/PKDD'23.
Proceedings of the Advanced Analytics and Learning on Temporal Data, 2023
2022
Motiflets - Simple and Accurate Detection of Motifs in Time Series.
Proc. VLDB Endow., 2022
Motiflets - Fast and Accurate Detection of Motifs in Time Series.
CoRR, 2022
Chemical-protein relation extraction with ensembles of carefully tuned pretrained language models.
Database J. Biol. Databases Curation, 2022
RegEl corpus: identifying DNA regulatory elements in the scientific literature.
Database J. Biol. Databases Curation, 2022
Lotaru: Locally Estimating Runtimes of Scientific Workflow Tasks in Heterogeneous Clusters.
Proceedings of the SSDBM 2022: 34th International Conference on Scientific and Statistical Database Management, Copenhagen, Denmark, July 6, 2022
Reshi: Recommending Resources for Scientific Workflow Tasks on Heterogeneous Infrastructures.
Proceedings of the IEEE International Performance, 2022
BEEDS: Large-Scale Biomedical Event Extraction using Distant Supervision and Question Answering.
Proceedings of the 21st Workshop on Biomedical Language Processing, 2022
Window Size Selection in Unsupervised Time Series Analytics: A Review and Benchmark.
Proceedings of the Advanced Analytics and Learning on Temporal Data, 2022
2021
The Collaborative Research Center FONDA.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Datenbank-Spektrum, 2021
Workflows Community Summit: Advancing the State-of-the-art of Scientific Workflows Management Systems Research and Development.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2021
HunFlair: an easy-to-use tool for state-of-the-art biomedical named entity recognition.
Bioinform., 2021
Large-scale entity representation learning for biomedical relationship extraction.
Bioinform., 2021
A Community Roadmap for Scientific Workflows Research and Development.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2021 IEEE Workshop on Workflows in Support of Large-Scale Science (WORKS), 2021
Extend, don't rebuild: Phrasing conditional graph modification as autoregressive sequence labelling.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
FORCE on Nextflow: Scalable Analysis of Earth Observation Data on Commodity Clusters.
Proceedings of the CIKM 2021 Workshops co-located with 30th ACM International Conference on Information and Knowledge Management (CIKM 2021), 2021
ClaSP - Time Series Segmentation.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021
WBI at MEDIQA 2021: Summarizing Consumer Health Questions with Generative Transformers.
Proceedings of the 20th Workshop on Biomedical Language Processing, 2021
Early Detection of Sexual Predators in Chats.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
2020
Finding k-shortest paths with limited overlap.
VLDB J., 2020
TEASER: early and accurate time series classification.
Data Min. Knowl. Discov., 2020
DeepTable: a permutation invariant neural network for table orientation classification.
Data Min. Knowl. Discov., 2020
Portability of Scientific Workflows in NGS Data Analysis: A Case Study.
CoRR, 2020
PEDL: extracting protein-protein associations using deep language models and distant supervision.
Bioinform., 2020
HUNER: improving biomedical NER with pretraining.
Bioinform., 2020
PatSeg: A Sequential Patent Segmentation Approach.
Big Data Res., 2020
TabSim: A Siamese Neural Network for Accurate Estimation of Table Similarity.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020
Biomedical Event Extraction as Multi-turn Question Answering.
Proceedings of the 11th International Workshop on Health Text Mining and Information Analysis, 2020
2019
Predictive performance modeling for distributed batch processing using black box monitoring and machine learning.
Inf. Syst., 2019
VIST - a Variant-Information Search Tool for precision oncology.
,
,
,
,
,
,
,
,
,
,
,
BMC Bioinform., 2019
Feedback-Based Resource Allocation for Batch Scheduling of Scientific Workflows.
Proceedings of the 17th International Conference on High Performance Computing & Simulation, 2019
Learning Low-Wastage Memory Allocations for Scientific Workflows at IceCube.
Proceedings of the 17th International Conference on High Performance Computing & Simulation, 2019
BB-Tree: A Main-Memory Index Structure for Multidimensional Range Queries.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019
BB-Tree: A practical and efficient main-memory index structure for multidimensional workloads.
Proceedings of the Advances in Database Technology, 2019
Classifying German Animal Experiment Summaries with Multi-lingual BERT at CLEF eHealth 2019 Task 1.
Proceedings of the Working Notes of CLEF 2019, 2019
Information Retrieval for Precision Oncology.
Proceedings of the Datenbanksysteme für Business, 2019
NLProlog: Reasoning with Weak Unification for Question Answering in Natural Language.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
2018
Graph Management in the Life Sciences.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018
Semantic Data Integration for Life Science Entities.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018
Set Similarity Joins on MapReduce: An Experimental Survey.
Proc. VLDB Endow., 2018
Variant information systems for precision oncology.
BMC Medical Informatics Decis. Mak., 2018
Column-wise compression of open relational data.
Inf. Sci., 2018
Finding k-Dissimilar Paths with Minimum Collective Length.
CoRR, 2018
Predictive Performance Modeling for Distributed Computing using Black-Box Monitoring and Machine Learning.
CoRR, 2018
Cross-lingual Candidate Search for Biomedical Concept Normalization.
CoRR, 2018
Multidimensional range queries on modern hardware.
Proceedings of the 30th International Conference on Scientific and Statistical Database Management, 2018
Dissecting Company Names using Sequence Labeling.
Proceedings of the Conference "Lernen, Wissen, Daten, Analysen", 2018
Exploiting Automatic Vectorization to Employ SPMD on SIMD Registers.
Proceedings of the 34th IEEE International Conference on Data Engineering Workshops, 2018
Finding <i>k</i>-dissimilar paths with minimum collective length.
Proceedings of the 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, 2018
Classifying Land Cover from Satellite Images Using Time Series Analytics.
Proceedings of the Workshops of the EDBT/ICDT 2018 Joint Conference (EDBT/ICDT 2018), 2018
POS: Online Learning for Memory-Aware Scheduling of Scientific Workflows.
Proceedings of the 14th IEEE International Conference on e-Science, 2018
WBI at CLEF eHealth 2018 Task 1: Language-independent ICD-10 Coding using Multi-lingual Embeddings and Recurrent Neural Networks.
Proceedings of the Working Notes of CLEF 2018, 2018
Identifying Key Sentences for Precision Oncology Using Semi-Supervised Learning.
Proceedings of the BioNLP 2018 workshop, Melbourne, Australia, July 19, 2018, 2018
2017
Computation semantics of the functional scientific workflow language Cuneiform.
J. Funct. Program., 2017
How to improve information extraction from German medical records.
it Inf. Technol., 2017
Optimization of Complex Dataflows with User-Defined Functions.
ACM Comput. Surv., 2017
Multivariate Time Series Classification with WEASEL+MUSE.
CoRR, 2017
Estimating genome-wide regulatory activity from multi-omics data sets using mathematical optimization.
BMC Syst. Biol., 2017
Deep learning with word embeddings improves biomedical named entity recognition.
Bioinform., 2017
Comparative assessment of differential network analysis methods.
Briefings Bioinform., 2017
Fine-Grained Opinion Mining from Mobile App Reviews with Word Embedding Features.
Proceedings of the Natural Language Processing and Information Systems, 2017
Exact and Approximate Algorithms for Finding k-Shortest Paths with Limited Overlap.
Proceedings of the 20th International Conference on Extending Database Technology, 2017
Hi-WAY: Execution of Scientific Workflows on Hadoop YARN.
Proceedings of the 20th International Conference on Extending Database Technology, 2017
Modeling Data Flow Execution in a Parallel Environment.
Proceedings of the Big Data Analytics and Knowledge Discovery, 2017
Multi-lingual ICD-10 Coding using a Hybrid rule-based and Supervised Classification Approach at CLEF eHealth 2017.
Proceedings of the Working Notes of CLEF 2017, 2017
Fast and Accurate Time Series Classification with WEASEL.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017
Benchmarking Univariate Time Series Classifiers.
Proceedings of the Datenbanksysteme für Business, 2017
2016
Fast Sampling-Based Whole-Genome Haplotype Block Recognition.
IEEE ACM Trans. Comput. Biol. Bioinform., 2016
graphANNIS: A Fast Query Engine for Deeply Annotated Linguistic Corpora.
J. Lang. Technol. Comput. Linguistics, 2016
Recognizing chemicals in patents: a comparative analysis.
J. Cheminformatics, 2016
Effective and efficient similarity search in scientific workflow repositories.
Future Gener. Comput. Syst., 2016
SETH detects and normalizes genetic variants in text.
Bioinform., 2016
SoFIA: a data integration framework for annotating high-throughput datasets.
Bioinform., 2016
Cache-Sensitive Skip List: Efficient Range Queries on Modern CPUs.
Proceedings of the Data Management on New Hardware, 2016
PIEJoin: Towards Parallel Set Containment Joins.
Proceedings of the 28th International Conference on Scientific and Statistical Database Management, 2016
Performance of Gene Name Recognition Tools on Patents.
Proceedings of the 7th International Symposium on Semantic Mining in Biomedicine, 2016
Potential and Pitfalls of Domain-Specific Information Extraction at Web Scale.
Proceedings of the 2016 International Conference on Management of Data, 2016
Graph n-grams for Scientific Workflow Similarity Search.
Proceedings of the Conference "Lernen, 2016
SCARE ― The Sentiment Corpus of App Reviews with Fine-grained Annotations in German.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016
2015
MRCSI: Compressing and Searching String Collections with Multiple References.
Proc. VLDB Endow., 2015
SAASFEE: Scalable Scientific Workflow Execution Engine.
Proc. VLDB Endow., 2015
SOFA: An extensible logical optimizer for UDF-heavy data flows.
Inf. Syst., 2015
DynamicCloudSim: Simulating heterogeneity in computational clouds.
Future Gener. Comput. Syst., 2015
Computer-assisted curation of a human regulatory core network from the biological literature.
Bioinform., 2015
BiobankCloud: A Platform for the Secure Storage, Sharing, and Processing of Large Biomedical Data Sets.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Biomedical Data Management and Graph Online Querying, 2015
Alternative routing: k-shortest paths with limited overlap.
Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems, 2015
Massively Parallel Analysis of Similarity Matrices on Heterogeneous Hardware.
Proceedings of the Workshops of the EDBT/ICDT 2015 Joint Conference (EDBT/ICDT), 2015
Cuneiform: a Functional Language for Large Scale Scientific Data Analysis.
Proceedings of the Workshops of the EDBT/ICDT 2015 Joint Conference (EDBT/ICDT), 2015
2014
The Stratosphere platform for big data analytics.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
VLDB J., 2014
State-of-the-art in string similarity search and join.
,
,
,
,
,
,
,
,
,
,
SIGMOD Rec., 2014
Similarity Search for Scientific Workflows.
Proc. VLDB Endow., 2014
CellFinder: a cell data repository.
Nucleic Acids Res., 2014
A survey on annotation tools for the biomedical literature.
Briefings Bioinform., 2014
Extracting and aggregating temporal events from text.
Proceedings of the 23rd International World Wide Web Conference, 2014
Versatile optimization of UDF-heavy data flows with sofa.
Proceedings of the International Conference on Management of Data, 2014
Next Generation Data Integration (for the Life Sciences).
Proceedings of the 26th GI-Workshop Grundlagen von Datenbanken, 2014
Layer Decomposition: An Effective Structure-Based Approach for Scientific Workflow Similarity.
Proceedings of the 10th IEEE International Conference on e-Science, 2014
RRCA: Ultra-Fast Multiple In-species Genome Alignments.
Proceedings of the Algorithms for Computational Biology - First International Conference, 2014
2013
FRESCO: Referential Compression of Highly Similar Sequences.
IEEE ACM Trans. Comput. Biol. Bioinform., 2013
RCSI: Scalable similarity search in thousand(s) of genomes.
Proc. VLDB Endow., 2013
Parallelization in Scientific Workflow Management Systems
CoRR, 2013
SOFA: An Extensible Logical Optimizer for UDF-heavy Dataflows.
CoRR, 2013
A detailed error analysis of 13 kernel methods for protein-protein interaction extraction.
BMC Bioinform., 2013
CELDA - an ontology for the comprehensive representation of cells in complex systems.
BMC Bioinform., 2013
Using ontologies to study cell transitions.
J. Biomed. Semant., 2013
Preliminary evaluation of the CellFinder literature curation pipeline for gene expression in kidney cells and anatomical parts.
Database J. Biol. Databases Curation, 2013
WBI-DDI: Drug-Drug Interaction Extraction using Majority Voting.
Proceedings of the 7th International Workshop on Semantic Evaluation, 2013
WBI-NER: The impact of domain-specific features on the performance of identifying and classifying mentions of drugs.
Proceedings of the 7th International Workshop on Semantic Evaluation, 2013
CPU and cache efficient management of memory-resident databases.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013
OmixAnalyzer - A Web-Based System for Management and Analysis of High-Throughput Omics Data Sets.
Proceedings of the Data Integration in the Life Sciences - 9th International Conference, 2013
Experiences from Developing the Domain-Specific Entity Search Engine GeneView.
Proceedings of the Datenbanksysteme für Business, 2013
QGramProjector: Q-Gram Projection for Indexing Highly-Similar Strings.
Proceedings of the Advances in Databases and Information Systems, 2013
2012
GeneView: a comprehensive semantic search engine for PubMed.
Nucleic Acids Res., 2012
Improving data quality by source analysis.
ACM J. Data Inf. Qual., 2012
Data Management Challenges in Next Generation Sequencing.
Datenbank-Spektrum, 2012
ChemSpot: a hybrid system for chemical named entity recognition.
Bioinform., 2012
Tools for managing and analyzing microarray data.
Briefings Bioinform., 2012
Adaptive efficient compression of genomes.
Algorithms Mol. Biol., 2012
(Re)Use in Public Scientific Workflow Repositories.
Proceedings of the Scientific and Statistical Database Management, 2012
Regular Path Queries on Large Graphs.
Proceedings of the Scientific and Statistical Database Management, 2012
Efficient Similarity Search in Very Large String Sets.
Proceedings of the Scientific and Statistical Database Management, 2012
String Searching in Referentially Compressed Genomes.
Proceedings of the KDIR 2012 - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval, Barcelona, Spain, 4, 2012
InterOnto - Ranking Inter-Ontology Links.
Proceedings of the Data Integration in the Life Sciences - 8th International Conference, 2012
Discovering conditional inclusion dependencies.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012
2011
Search, adapt, and reuse: the future of scientific workflows.
SIGMOD Rec., 2011
Molecular Event Extraction from Link Grammar Parse Trees in the BioNLP'09 Shared Task.
Comput. Intell., 2011
BioCreative III interactive task: an overview.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
BMC Bioinform., 2011
Next generation data integration for Life Sciences.
Proceedings of the 27th International Conference on Data Engineering, 2011
Pipa: custom integration of protein interactions and pathways.
Proceedings of the 41. Jahrestagung der Gesellschaft für Informatik, 2011
Scalable Sequence Similarity Search and Join in Main Memory on Multi-cores.
Proceedings of the Euro-Par 2011: Parallel Processing Workshops - CCPI, CGWS, HeteroPar, HiBB, HPCVirt, HPPC, HPSS, MDGS, ProPer, Resilience, UCHPC, VHPC, Bordeaux, France, August 29, 2011
Not all links are equal: Exploiting Dependency Types for the Extraction of Protein-Protein Interactions from Text.
Proceedings of the 2011 Workshop on Biomedical Natural Language Processing, 2011
2010
Deep Web Integration with VisQI.
Proc. VLDB Endow., 2010
A Comprehensive Benchmark of Kernel Methods to Extract Protein-Protein Interactions from Literature.
PLoS Comput. Biol., 2010
A fast and effective dependency graph kernel for PPI relation extraction.
BMC Bioinform., 2010
Species identification for gene name normalization.
BMC Bioinform., 2010
Simple tricks for improving pattern-based information extraction from the biomedical literature.
J. Biomed. Semant., 2010
Phenoclustering: online mining of cross-species phenotypes.
Bioinform., 2010
Graph-based concept identification and disambiguation for enterprise search.
Proceedings of the 19th International Conference on World Wide Web, 2010
Prefix Tree Indexing for Similarity Search and Similarity Joins on Genomic Data.
Proceedings of the Scientific and Statistical Database Management, 2010
Selecting Materialized Views for RDF Data.
Proceedings of the Current Trends in Web Engineering, 2010
An urban health risk analysis for Berlin: exploration and integration of spatio-temporal information on the urban environment.
Proceedings of the Integration of Environmental Information in Europe: Proceedings of the 24th International Conference on Informatics for Environmental Protection, 2010
Estimating Result Size and Execution Times for Graph Queries.
Proceedings of the Local Proceedings of the Fourteenth East-European Conference on Advances in Databases and Information Systems, 2010
2009
Graph Management in the Life Sciences.
Proceedings of the Encyclopedia of Database Systems, 2009
Semantic Data Integration for Life Science Entities.
Proceedings of the Encyclopedia of Database Systems, 2009
A Hierarchical Approach to Model Web Query Interfaces for Web Source Integration.
Proc. VLDB Endow., 2009
A Machine Learning Approach to Foreign Key Discovery.
Proceedings of the 12th International Workshop on the Web and Databases, 2009
SOA-Based Integration of Text Mining Services.
Proceedings of the 2009 IEEE Congress on Services, Part I, 2009
Graph-Based Ontology Construction from Heterogenous Evidences.
Proceedings of the Semantic Web - ISWC 2009, 8th International Semantic Web Conference, 2009
Exploiting scientific workflows for large-scale gene expression data analysis.
Proceedings of the 24th International Symposium on Computer and Information Sciences, 2009
High-performance information extraction with AliBaba.
Proceedings of the EDBT 2009, 2009
Molecular event extraction from Link Grammar parse trees.
Proceedings of the BioNLP 2009 Workshop Companion Volume for Shared Task, BioNLP@HLT-NAACL 2009, 2009
Word Sense Disambiguation in Biomedical Applications.
Proceedings of the Information Retrieval in Biomedicine, 2009
2008
A Flexible Framework for Integrating Annotations from Different Tools and Tag Sets.
Trait. Autom. des Langues, 2008
Integrating protein-protein interactions and text mining for protein function prediction.
BMC Bioinform., 2008
Mining phenotypes for gene function prediction.
BMC Bioinform., 2008
Querying Distributed RDF Data Sources with SPARQL.
Proceedings of the Semantic Web: Research and Applications, 2008
Mining Phenotypes for Protein Function Prediction.
Proceedings of the Ontologies and Text Mining for Life Sciences: Current Status and Future Perspectives, 24.03., 2008
08131 Executive Summary -- Ontologies and Text Mining for Life Sciences : Current Status and Future Perspectives.
Proceedings of the Ontologies and Text Mining for Life Sciences: Current Status and Future Perspectives, 24.03., 2008
2007
Fast and practical indexing and querying of very large graphs.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007
Efficiently Detecting Inclusion Dependencies.
Proceedings of the 23rd International Conference on Data Engineering, 2007
High-Precision Function Prediction using Conserved Interactions.
Proceedings of the German Conference on Bioinformatics, 2007
What's New? What's Certain? - Scoring Search Results in the Presence of Overlapping Data Sources.
Proceedings of the Data Integration in the Life Sciences, 4th International Workshop, 2007
Informationsintegration - Architekturen und Methoden zur Integration verteilter und heterogener Datenquellen.
dpunkt.verlag, 2007
2006
A structural keystone for drug design.
J. Integr. Bioinform., 2006
Adapters, shims, and glue - service interoperability for <i>in silico</i> experiments.
Bioinform., 2006
ALIBABA: PubMed as a graph.
Bioinform., 2006
Collecting a Large Corpus from all of Medline.
Proceedings of the Second International Symposium on Semantic Mining in Biomedicine, 2006
Efficiently Computing Inclusion Dependencies for Schema Discovery.
Proceedings of the 22nd International Conference on Data Engineering Workshops, 2006
Classification of Contradiction Patterns.
Proceedings of the Advances in Data Analysis, 2006
Implementing a Linguistic Query Language for Historic Texts.
Proceedings of the Current Trends in Database Technology - EDBT 2006, 2006
Describing differences between databases.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006
Informationsintegration: Architekturen und Methoden zur Integration verteilter und heterogener Datenquellen.
dpunkt, ISBN: 3-89864-400-6, 2006
2005
Erkennen und Bereinigen von Datenfehlern in naturwissenschaftlichen Daten.
Datenbank-Spektrum, 2005
Columba: an integrated database of proteins, structures, and annotations.
BMC Bioinform., 2005
Systematic feature evaluation for gene name recognition.
BMC Bioinform., 2005
GandrKB--ontological microarray annotation and visualization.
Bioinform., 2005
What makes a gene name? Named entity recognition in the biomedical literature.
Briefings Bioinform., 2005
Optimizing syntax patterns for discovering protein-protein interactions.
Proceedings of the 2005 ACM Symposium on Applied Computing (SAC), 2005
A query language for biological networks.
Proceedings of the ECCB/JBI'05 Proceedings, Fourth European Conference on Computational Biology/Sixth Meeting of the Spanish Bioinformatics Network (Jornadas de BioInformática), Palacio de Congresos, Madrid, Spain, September 28, 2005
Querying Ontologies in Relational Database Systems.
Proceedings of the Data Integration in the Life Sciences, Second InternationalWorkshop, 2005
Integrating and Warehousing Liver Gene Expression Data and Related Biomedical Resources in GEDAW.
Proceedings of the Data Integration in the Life Sciences, Second InternationalWorkshop, 2005
(Almost) Hands-Off Information Integration for the Life Sciences.
Proceedings of the Second Biennial Conference on Innovative Data Systems Research, 2005
2004
Completeness of integrated information sources.
Inf. Syst., 2004
Mining for Patterns in Contradictory Data.
Proceedings of the IQIS 2004, 2004
Columba: Multidimensional Data Integration of Protein Annotations.
Proceedings of the Data Integration in the Life Sciences, First International Workshop, 2004
Data Mining: The Next Generation.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Perspectives Workshop: Data Mining: The Next Generation, 11.07., 2004
Cooperative Transaction Processing between Clients and Servers.
Proceedings of the Advances in Databases and Information Systems, 2004
2003
Integration molekularbiologischer Daten.
Datenbank-Spektrum, 2003
2000
Query planning in mediator based information systems.
PhD thesis, 2000
Query Planning with Information Quality Bounds.
Proceedings of the Flexible Query Answering Systems, 2000
Strategies for the Conceptual Design of Federated Information Systems.
Proceedings of the Engineering Federated Information Systems, 2000
1999
IXDB, an X chromosome integrated database (update).
Nucleic Acids Res., 1999
EDITtoTrEMBL: a distributed approach to high-quality automated protein sequence annotation.
Bioinform., 1999
A proposal for a standard CORBA interface for genome maps.
Bioinform., 1999
Quality-driven Integration of Heterogenous Information Systems.
Proceedings of the VLDB'99, 1999
Globale Anfragebearbeitung mit verteilten und heterogenen Datenquellen.
Proceedings of the Informatik '99, 1999
Density Scores for Cooperative Query Answering.
Proceedings of the 4. Workshop Föderierte Datenbanken, 1999
Integration durch Standards: Erfahrungen mit CORBA in Life Science Research.
Proceedings of the 4. Workshop Föderierte Datenbanken, 1999
Constructing IDL Views on Relational Databases.
Proceedings of the Advanced Information Systems Engineering, 1999
Designing a Global Information Resource for Molecular Biology.
Proceedings of the Datenbanksysteme in Büro, 1999
1998
IXDB, an X chromosome integrated database.
Nucleic Acids Res., 1998
Issues in developing integrated genomic databases and application to the human X chromosome.
Bioinform., 1998
Combining Heterogeneous Data Sources through Query Correspondence Assertions.
Proceedings of the CIKM'98 First Workshop on Web Information and Data Management (WIDM'98), 1998
Design Issues of Database Access in a CORBA Environment.
Proceedings of the Integration heterogener Softwaresysteme, IHS'98, 1998
Query Mediation for Heterogeneous Data Sources.
Proceedings of the 3. Workshop "Föderierte Datenbanken", 1998
1997
Ansätze zur Föderation moleklargenetischer Datenbanken.
Datenbank Rundbr., 1997
IXDB, an integrated database for genomic data of the Human X Chromosome.
Proceedings of the German Conference on Bioinformatics, 1997