Paolo Missier

ORCID: 0000-0002-0978-2446

Affiliations:
  • Newcastle University, School of Computing Science, UK
  • University of Manchester, UK (former)


According to our database, Paolo Missier authored at least 172 papers between 1995 and 2024.

Bibliography

2024
Experience: A Comparative Analysis of Multivariate Time-Series Generative Models: A Case Study on Human Activity Data.
ACM J. Data Inf. Qual., September, 2024

Supporting Better Insights of Data Science Pipelines with Fine-grained Provenance.
ACM Trans. Database Syst., June, 2024

Fair and Private Data Preprocessing through Microaggregation.
ACM Trans. Knowl. Discov. Data, April, 2024

Validity constraints for data analysis workflows.
Future Gener. Comput. Syst., 2024

From why-provenance to why+provenance: Towards addressing deep data explanations in Data-Centric AI.
Proceedings of the 32nd Symposium of Advanced Database Systems, 2024

Design and Development of a Provenance Capture Platform for Data Science.
Proceedings of the 40th International Conference on Data Engineering, ICDE 2024, 2024

2023
ConvBoost: Boosting ConvNets for Sensor-based Activity Recognition.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2023

Handling Overlapping Asymmetric Datasets - A Twice Penalized P-Spline Approach.
CoRR, 2023

On Training Strategies for LSTMs in Sensor-Based Human Activity Recognition.
Proceedings of the IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events, 2023

Preprocessing Matters: Automated Pipeline Selection for Fair Classification.
Proceedings of the Modeling Decisions for Artificial Intelligence, 2023

Interpretable and robust hospital readmission predictions from Electronic Health Records.
Proceedings of the IEEE International Conference on Big Data, 2023

2022
DPDS: Assisting Data Science with Data Provenance.
Proc. VLDB Endow., 2022

Knowledge-Driven Data Ecosystems Toward Data Transparency.
ACM J. Data Inf. Qual., 2022

Curating a longitudinal research resource using linked primary care EHR data - a UK Biobank case study.
J. Am. Medical Informatics Assoc., 2022

Real-world data mining meets clinical practice: Research challenges and perspective.
Frontiers Big Data, 2022

Simple Yet Surprisingly Effective Training Strategies for LSTMs in Sensor-Based Human Activity Recognition.
CoRR, 2022

Benchmark time series data sets for PyTorch - the torchtime package.
CoRR, 2022

Data-Driven, AI-Based Clinical Practice: Experiences, Challenges, and Research Directions.
Proceedings of the 30th Italian Symposium on Advanced Database Systems, 2022

Tracking trajectories of multiple long-term conditions using dynamic patient-cluster associations.
Proceedings of the IEEE International Conference on Big Data, 2022

2021
A customisable pipeline for the semi-automated discovery of online activists and social campaigns on Twitter.
World Wide Web, 2021

Knowledge-driven Data Ecosystems Towards Data Transparency.
CoRR, 2021

Latency of Trading Transactions in Brokered IoT Data Marketplace in Ethereum.
Proceedings of the 2021 IEEE SmartWorld, 2021

Fine-grained Provenance for High-quality Data Science (Discussion Paper).
Proceedings of the 29th Italian Symposium on Advanced Database Systems, 2021

An HMM-ensemble approach to predict severity progression of ICU treatment for hospitalized COVID-19 patients.
Proceedings of the 20th IEEE International Conference on Machine Learning and Applications, 2021

Optimising Fairness Through Parametrised Data Sampling.
Proceedings of the 24th International Conference on Extending Database Technology, 2021

2020
Capturing and querying fine-grained provenance of preprocessing pipelines in data science.
Proc. VLDB Endow., 2020

Abstracting PROV provenance graphs: A validity-preserving approach.
Future Gener. Comput. Syst., 2020

Technologies for Trustworthy Machine Learning: A Survey in a Socio-Technical Context.
CoRR, 2020

Building Rule Hierarchies for Efficient Logical Rule Learning from Knowledge Graphs.
CoRR, 2020

Efficient Rule Learning with Template Saturation for Knowledge Graph Completion.
CoRR, 2020

Data-driven vs knowledge-driven inference of health outcomes in the ageing population: a case study.
Proceedings of the Workshops of the EDBT/ICDT 2020 Joint Conference, 2020

Predicting Respiratory Failure in Patients with COVID-19 pneumonia: a case study from Northern Italy.
Proceedings of the First International AAI4H, 2020

2019
Toward trusted open data and services.
Internet Technol. Lett., 2019

Why-Diff: Exploiting Provenance to Understand Outcome Differences From Non-Identical Reproduced Workflows.
IEEE Access, 2019

Communication in Emergency Management through Data Integration and Trust: an introduction to the CEM-DIT system.
Proceedings of the 16th International Conference on Information Systems for Crisis Response and Management, 2019

A Customisable Pipeline for Continuously Harvesting Socially-Minded Twitter Users.
Proceedings of the Web Engineering - 19th International Conference, 2019

Toward a Decentralized, Trust-Less Marketplace for Brokered IoT Data Trading Using Blockchain.
Proceedings of the IEEE International Conference on Blockchain, 2019

Efficient Re-Computation of Big Data Analytics Processes in the Presence of Changes: Computational Framework, Reference Architecture, and Applications.
Proceedings of the 2019 IEEE International Congress on Big Data, 2019

2018
Provenance Standards.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Report on the First International Workshop on Incremental Re-computation: Provenance and Beyond.
SIGMOD Rec., 2018

Editorial: Special Issue on Improving the Veracity and Value of Big Data.
ACM J. Data Inf. Qual., 2018

VazaDengue: An information system for preventing and combating mosquito-borne diseases with social networks.
Inf. Syst., 2018

Selective and Recurring Re-computation of Big Data Analytics Tasks: Insights from a Genomics Case Study.
Big Data Res., 2018

Design and Evaluation of a Genomics Variant Analysis Pipeline Using GATK Spark Tools.
Proceedings of the 26th Italian Symposium on Advanced Database Systems, 2018

Leveraging Blockchain to Enable Smart-Health Applications.
Proceedings of the 4th IEEE International Forum on Research and Technology for Society and Industry, 2018

Informing decision makers: facilitating communication and trust for decision makers during crises.
Proceedings of the 15th International Conference on Information Systems for Crisis Response and Management, 2018

Versioned-PROV: A PROV Extension to Support Mutable Data Entities.
Proceedings of the Provenance and Annotation of Data and Processes, 2018

Provenance Annotation and Analysis to Support Process Re-computation.
Proceedings of the Provenance and Annotation of Data and Processes, 2018

Analyzing Social Network Images with Deep Learning Models to Fight Zika Virus.
Proceedings of the Image Analysis and Recognition - 15th International Conference, 2018

Loom: Query-aware Partitioning of Online Graphs.
Proceedings of the 21st International Conference on Extending Database Technology, 2018

Exploiting Execution Provenance to Explain Difference Between Two Data-Intensive Computations.
Proceedings of the 14th IEEE International Conference on e-Science, 2018

2017
Revealing the Detailed Lineage of Script Outputs Using Hybrid Provenance.
Int. J. Digit. Curation, 2017

TAPER: query-aware, partition-enhancement for large, heterogenous graphs.
Distributed Parallel Databases, 2017

Mind my value: a decentralized infrastructure for fair and trusted IoT data trading.
Proceedings of the Seventh International Conference on the Internet of Things, 2017

Adaptive Incremental Learning for Statistical Relational Models Using Gradient-Based Boosting.
Proceedings of the Late Breaking Papers of the 27th International Conference on Inductive Logic Programming, 2017

Recruiting from the Network: Discovering Twitter Users Who Can Help Combat Zika Epidemics.
Proceedings of the Web Engineering - 17th International Conference, 2017

Preserving the Value of Large Scale Data Analytics over Time Through Selective Re-computation.
Proceedings of the Data Analytics - 31st British International Conference on Databases, 2017

Why-Diff: Explaining differences amongst similar workflow runs by exploiting scientific metadata.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

2016
Data trajectories: tracking reuse of published data for transitive credit attribution.
Int. J. Digit. Curation, 2016

Scalable and efficient whole-exome data processing using workflows on the cloud.
Future Gener. Comput. Syst., 2016

The lifecycle of provenance metadata and its associated challenges and opportunities.
CoRR, 2016

Provenance and data differencing for workflow reproducibility analysis.
Concurr. Comput. Pract. Exp., 2016

The Data, They Are A-Changin'.
Proceedings of the 8th USENIX Workshop on the Theory and Practice of Provenance, 2016

Clustering provenance facilitating provenance exploration through data abstraction.
Proceedings of the Workshop on Human-In-the-Loop Data Analytics, 2016

Analyzing Provenance Across Heterogeneous Provenance Graphs.
Proceedings of the Provenance and Annotation of Data and Processes, 2016

DataONE: A Data Federation with Provenance Support.
Proceedings of the Provenance and Annotation of Data and Processes, 2016

Tracking Dengue Epidemics Using Twitter Content Classification and Topic Modelling.
Proceedings of the Current Trends in Web Engineering, 2016

Workload-aware Streaming Graph Partitioning.
Proceedings of the Workshops of the EDBT/ICDT 2016 Joint Conference, 2016

Facilitating reproducible research by investigating computational metadata.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

2015
Corrigendum to the Special Issue Editorial in JDIQ Volume 5, Issue 3.
ACM J. Data Inf. Qual., 2015

Editorial.
ACM J. Data Inf. Qual., 2015

Recent advances in Scalable Workflow Enactment Engines and Technologies.
Future Gener. Comput. Syst., 2015

Access control and view generation for provenance graphs.
Future Gener. Comput. Syst., 2015

YesWorkflow: A User-Oriented, Language-Independent Tool for Recovering Workflow Information from Scripts.
CoRR, 2015

SVI: A Simple Single-Nucleotide Human Variant Interpretation Tool for Clinical Use.
Proceedings of the Data Integration in the Life Sciences - 11th International Conference, 2015

Bootstrapping Personalised Human Activity Recognition Models Using Online Active Learning.
Proceedings of the 15th IEEE International Conference on Computer and Information Technology, 2015

2014
The PBase Scientific Workflow Provenance Repository.
Int. J. Digit. Curation, 2014

Tweet My Street: A Cross-Disciplinary Collaboration for the Analysis of Local Twitter Data.
Future Internet, 2014

Distilling structure in Taverna scientific workflows: a refactoring approach.
BMC Bioinform., 2014

UP & DOWN: Improving Provenance Precision by Combining Workflow- and Trace-Level Information.
Proceedings of the 6th Workshop on the Theory and Practice of Provenance, 2014

DistillFlow: removing redundancy in scientific workflows.
Proceedings of the Conference on Scientific and Statistical Database Management, 2014

A Platform for Analysing Stream and Historic Data with Efficient and Scalable Design Patterns.
Proceedings of the 2014 IEEE World Congress on Services, 2014

Scaling Whole Exome sequencing using workflows on the cloud.
Proceedings of the 22nd Italian Symposium on Advanced Database Systems, 2014

Measuring the impact of cognitive distractions on driving performance using time series analysis.
Proceedings of the 17th International IEEE Conference on Intelligent Transportation Systems, 2014

ProvAbs: Model, Policy, and Tooling for Abstracting PROV Graphs.
Proceedings of the Provenance and Annotation of Data and Processes, 2014

ProvGen: Generating Synthetic PROV Graphs with Predictable Structure.
Proceedings of the Provenance and Annotation of Data and Processes, 2014

Provenance Storage, Querying, and Visualization in PBase.
Proceedings of the Provenance and Annotation of Data and Processes, 2014

Provenance-Based Searching and Ranking for Scientific Workflows.
Proceedings of the Provenance and Annotation of Data and Processes, 2014

On strategies for budget-based online annotation in human activity recognition.
Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2014

From Scripted HPC-Based NGS Pipelines to Workflows on the Cloud.
Proceedings of the 14th IEEE/ACM International Symposium on Cluster, 2014

2013
Report from the second workshop on scalable workflow enactment engines and technology (SWEET'13).
SIGMOD Rec., 2013

Preface.
Fundam. Informaticae, 2013

Why linked data is not enough for scientists.
Future Gener. Comput. Syst., 2013

D-PROV: Extending the PROV Provenance Model with Workflow Structure.
Proceedings of the 5th Workshop on the Theory and Practice of Provenance, 2013

Extracting PROV provenance traces from Wikipedia history pages.
Proceedings of the Joint 2013 EDBT/ICDT Conferences, 2013

The W3C PROV family of specifications for modelling provenance metadata.
Proceedings of the Joint 2013 EDBT/ICDT Conferences, 2013

Measuring Data Completeness for Microbial Genomics Database.
Proceedings of the Intelligent Information and Database Systems - 5th Asian Conference, 2013

Reference Architectures to Measure Data Completeness across Integrated Databases.
Proceedings of the Intelligent Information and Database Systems - 5th Asian Conference, 2013

2012
Report from the first workshop on scalable workflow enactment engines and technology (SWEET'12).
SIGMOD Rec., 2012

Golden Trail: Retrieving the Data History that Matters from a Comprehensive Provenance Repository.
Int. J. Digit. Curation, 2012

Predicting the Execution Time of Workflow Activities Based on Their Input Features.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Modelling Provenance Using Structured Occurrence Networks.
Proceedings of the Provenance and Annotation of Data and Processes, 2012

A PROV Encoding for Provenance Analysis Using Deductive Rules.
Proceedings of the Provenance and Annotation of Data and Processes, 2012

Detecting Duplicate Records in Scientific Workflow Results.
Proceedings of the Provenance and Annotation of Data and Processes, 2012


2011
eScience.
Proceedings of the Handbook of Semantic Web Technologies, 2011

Guest Editorial: Scientific Workflows, Provenance and Their Applications.
Int. J. Comput. Their Appl., 2011

Extending Semantic Provenance into the Web of Data.
IEEE Internet Comput., 2011

The Open Provenance Model core specification (v1.1).
Future Gener. Comput. Syst., 2011

Workflows to open provenance graphs, round-trip.
Future Gener. Comput. Syst., 2011

Simulating Taverna workflows using stochastic process algebras.
Concurr. Comput. Pract. Exp., 2011

Incremental Workflow Improvement Through Analysis of Its Data Provenance.
Proceedings of the 3rd Workshop on the Theory and Practice of Provenance, 2011

Achieving reproducibility by combining provenance with service and workflow versioning.
Proceedings of the WORKS'11, 2011

Towards the Preservation of Scientific Workflows.
Proceedings of the 8th International Conference on Digital Preservation, 2011

An experimental workflow development platform for historical document digitisation and analysis.
Proceedings of the 2011 Workshop on Historical Document Imaging and Processing, 2011

2010
A formal semantics for the Taverna 2 workflow model.
J. Comput. Syst. Sci., 2010

A comparison of using Taverna and BPEL in building scientific workflows: the case of caGrid.
Concurr. Comput. Pract. Exp., 2010

Taverna, Reloaded.
Proceedings of the Scientific and Statistical Database Management, 2010

Functional Units: Abstractions for Web Service Annotations.
Proceedings of the 6th World Congress on Services, 2010

Workflows for Information Integration in the Life Sciences.
Proceedings of the Search Computing, 2010

A Linked Data Approach to Sharing Workflows and Workflow Results.
Proceedings of the Leveraging Applications of Formal Methods, Verification, and Validation, 2010

Janus: From Workflows to Semantic Provenance and Linked Open Data.
Proceedings of the Provenance and Annotation of Data and Processes, 2010

Understanding Collaborative Studies through Interoperable Workflow Provenance.
Proceedings of the Provenance and Annotation of Data and Processes, 2010

Fine-grained and efficient lineage querying of collection-based workflow provenance.
Proceedings of the EDBT 2010, 2010


ERGOT: A Semantic-Based System for Service Discovery in Distributed Infrastructures.
Proceedings of the 10th IEEE/ACM International Conference on Cluster, 2010

2009
Incorporating Domain-Specific Information Quality Constraints into Database Queries.
ACM J. Data Inf. Qual., 2009

The data playground: An intuitive workflow specification environment.
Future Gener. Comput. Syst., 2009

Introduction.
Proceedings of the First International Workshop on the role of Semantic Web in Provenance Management (SWPM 2009), 2009

Semantically Annotated Provenance in the Life Science Grid.
Proceedings of the First International Workshop on the role of Semantic Web in Provenance Management (SWPM 2009), 2009

Data Quality support to on-the-fly data integration using Adaptive Query Processing.
Proceedings of the Seventeenth Italian Symposium on Advanced Database Systems, 2009

Combining DHTs and SONs for Semantic-Based Service Discovery.
Proceedings of the Ninth International Conference on Intelligent Systems Design and Applications, 2009

Time-completeness trade-offs in record linkage using adaptive query processing.
Proceedings of the EDBT 2009, 2009

Medical image processing workflow support on the EGEE grid with taverna.
Proceedings of the Twenty-Second IEEE International Symposium on Computer-Based Medical Systems, 2009

2008
Brokering infrastructure for minimum cost data procurement based on quality-quantity models.
Decis. Support Syst., 2008

An ontology-based approach to handling information quality in e-Science.
Concurr. Comput. Pract. Exp., 2008

Information quality in proteomics.
Briefings Bioinform., 2008

Model-Driven Component Generation for Families of Completeness.
Proceedings of the International Workshop on Quality in Databases and Management of Uncertain Data, 2008

Exploiting Provenance to Make Sense of Automated Decisions in Scientific Workflows.
Proceedings of the Provenance and Annotation of Data and Processes, 2008

Data Lineage Model for Taverna Workflows with Lightweight Annotation Requirements.
Proceedings of the Provenance and Annotation of Data and Processes, 2008

Building Scientific Workflow with Taverna and BPEL: A Comparative Study in caGrid.
Proceedings of the Service-Oriented Computing - ICSOC 2008 Workshops, 2008

2007
Requirements and Services for Metadata Management.
IEEE Internet Comput., 2007

Managing information quality in e-science: the qurator workbench.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007

Metadata Management in S-OGSA.
Proceedings of the Computational Science - ICCS 2007, 7th International Conference, Beijing, China, May 27, 2007

Grid metadata management: Requirements and architecture.
Proceedings of the 8th IEEE/ACM International Conference on Grid Computing (GRID 2007), 2007

Taverna Workflows: Syntax and Semantics.
Proceedings of the Third International Conference on e-Science and Grid Computing, 2007

Accelerating Disease Gene Identification Through Integrated SNP Data Analysis.
Proceedings of the Data Integration in the Life Sciences, 4th International Workshop, 2007

2006
An overview of S-OGSA: A Reference Semantic Grid Architecture.
J. Web Semant., 2006

Quality Views: Capturing and Exploiting the User Perspective on Data Quality.
Proceedings of the 32nd International Conference on Very Large Data Bases, 2006

Managing Information Quality in e-Science Using Semantic Web Technology.
Proceedings of the Semantic Web: Research and Applications, 2006

Towards the Management of Information Quality in Proteomics.
Proceedings of the 19th IEEE International Symposium on Computer-Based Medical Systems (CBMS 2006), 2006

2005
Clustering Web pages based on their structure.
Data Knowl. Eng., 2005

Data Quality at a Glance.
Datenbank-Spektrum, 2005

Provider issues in quality-constrained data provisioning.
Proceedings of the IQIS 2005, 2005

Managing Information Quality in e-Science: A Case Study in Proteomics.
Proceedings of the Perspectives in Conceptual Modeling, 2005

Semantic Support for Meta-Scheduling in Grids.
Proceedings of the First Workshop on Knowledge and Data Management in GRIDs, 2005

Architectural Patterns for the Semantic Grid.
Proceedings of the First Workshop on Knowledge and Data Management in GRIDs, 2005

2004
An Automatic Data Grabber for Large Web Sites.
(e)Proceedings of the Thirtieth International Conference on Very Large Data Bases, VLDB 2004, Toronto, Canada, August 31, 2004

Ontology-Based Question Answering in a Federation of University Sites: The MOSES Case Study.
Proceedings of the Natural Language Processing and Information Systems, 2004

QoS in Multichannel IS: The MAIS Approach.
Proceedings of the Engineering Advanced Web Applications: Proceedings of Workshops in connection with the 4th International Conference on Web Engineering (ICWE 2004), 2004

A formulation of the Data Quality Optimization Problem in Cooperative Information Systems.
Proceedings of the CAiSE'04 Workshops in connection with The 16th Conference on Advanced Information Systems Engineering, 2004

2003
Improving Data Quality in Practice: A Case Study in the Italian Public Administration.
Distributed Parallel Databases, 2003

Fine-grain web site structure discovery.
Proceedings of the Fifth ACM CIKM International Workshop on Web Information and Data Management (WIDM 2003), 2003

A model for Information Quality management in Cooperative Information Systems.
Proceedings of the Eleventh Italian Symposium on Advanced Database Systems, 2003

A Multidimensional Model for Information Quality in Cooperative Information Systems.
Proceedings of the Eighth International Conference on Information Quality (ICIQ 2003), 2003

The Service to Businesses Project: Improving Government-to-Business Relationships in Italy.
Proceedings of the Electronic Government, Second International Conference, 2003

2001
CitiTime: a system for rapid creation of portable next-generation telephony services.
Comput. Networks, 2001

2000
Java call control, coordination, and transactions.
IEEE Commun. Mag., 2000

Telcordia's Database Reconciliation and Data Quality Analysis Tool.
Proceedings of the VLDB 2000, 2000

A Knowledge-Based Decision Support Workbench for Advanced Ecommerce.
Proceedings of the 2000 Academia / Industry Working Conference on Research Challenges (AIWoRC 2000), 2000

1999
A Framework for Analyzing Virtual Enterprise Infrastructure.
Proceedings of the Ninth International Workshop on Research Issues on Data Engineering: Information Technology for Virtual Enterprises, 1999

1995
Extending a Multidatabase Manipulation Language to Resolve Schema and Data Conflicts.
Proceedings of the Database Applications Semantics, Proceedings of the Sixth IFIP TC-2 Working Conference on Data Semantics (DS-6), Stone Mountain, Atlanta, Georgia, USA, May 30, 1995

