Daniel S. Katz

Orcid: 0000-0001-5934-7525

Affiliations:
  • University of Illinois Urbana-Champaign, School of Information Sciences, IL, USA
  • Argonne National Laboratory, IL, USA (2009-2021)
  • University of Chicago, Computation Institute (CI), IL, USA (2009-2016)
  • Louisiana State University, Department of Electrical and Computer Engineering , (2006-2013)
  • Northwestern University, Evanston, IL, USA (PhD)


According to our database1, Daniel S. Katz authored at least 209 papers between 1992 and 2024.

Collaborative distances:
  • Dijkstra number2 of three.
  • Erdős number3 of two.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
FAIR-USE4OS: Guidelines for creating impactful open-source software.
PLoS Comput. Biol., 2024

Special issue on software citation, indexing, and discoverability.
PeerJ Comput. Sci., 2024

Research Software Engineering: Bridging Knowledge Gaps (Dagstuhl Seminar 24161).
Dagstuhl Reports, 2024

Thoughts on Learning Human and Programming Languages.
Comput. Sci. Eng., 2024

Training Next Generation AI Users and Developers at NCSA.
CoRR, 2024

Cycling on the Freeway: The Perilous State of Open Source Neuroscience Software.
CoRR, 2024

FAIR-USE4OS: From open source to Open Source.
CoRR, 2024

Enabling Remote Management of FaaS Endpoints with Globus Compute Multi-User Endpoints.
Proceedings of the Practice and Experience in Advanced Research Computing 2024: Human Powered Computing, 2024

A benchmark suite and performance analysis of user-space provenance collectors.
Proceedings of the 2nd ACM Conference on Reproducibility and Replicability, 2024

2023
FAIR AI models in high energy physics.
Mach. Learn. Sci. Technol., December, 2023

Global Impact Modelling Software is Unsustainable (Dataset and analysis script).
Dataset, December, 2023





Workflow Registry Tester.
Dataset, June, 2023



Training Next-Generation Artificial Intelligence Users and Developers at NCSA.
Comput. Sci. Eng., 2023

Leveraging Large Language Models to Build and Execute Computational Workflows.
CoRR, 2023

Transitioning ECP Software Technology into a Foundation for Sustainable Research Software.
CoRR, 2023

An Open Community-Driven Model For Sustainable Research Software: Sustainable Research Software Institute.
CoRR, 2023

Wanted: standards for automatic reproducibility of computational experiments.
CoRR, 2023

The Changing Role of RSEs over the Lifetime of Parsl.
CoRR, 2023

Workflows Community Summit 2022: A Roadmap Revolution.
CoRR, 2023

Fine-grained Policy-driven I/O Sharing for Burst Buffers.
Proceedings of the International Conference for High Performance Computing, 2023


Research Software Engineering in 2030.
Proceedings of the 19th IEEE International Conference on e-Science, 2023

Automatic Reproduction of Workflows in the Snakemake Workflow Catalog and nf-core Registries.
Proceedings of the 2023 ACM Conference on Reproducibility and Replicability, 2023

2022



$f$funcX: Federated Function as a Service for Science.
IEEE Trans. Parallel Distributed Syst., 2022

A survey of the state of the practice for research software in the United States.
PeerJ Comput. Sci., 2022

Giving Research Software Engineers a Larger Stage Through the Better Scientific Software Fellowship.
Comput. Sci. Eng., 2022

Overcoming Challenges to Continuous Integration in HPC.
Comput. Sci. Eng., 2022

Research Software Engineers: Career Entry Points and Training Gaps.
Comput. Sci. Eng., 2022

Giving RSEs a Larger Stage through the Better Scientific Software Fellowship.
CoRR, 2022

FAIR for AI: An interdisciplinary, international, inclusive, and diverse community building perspective.
CoRR, 2022

funcX: Federated Function as a Service for Science.
CoRR, 2022

Extended Abstract: Productive Parallel Programming with Parsl.
CoRR, 2022

2021

URSSI Community Survey 2018 Raw Data.
Dataset, September, 2021




The Four Pillars of Research Software Engineering.
IEEE Softw., 2021

Taking a fresh look at FAIR for research software.
Patterns, 2021

Software citation.
Inf. Serv. Use, 2021

Software Must be Recognised as an Important Output of Scholarly Research.
Int. J. Digit. Curation, 2021

Understanding the multifaceted geospatial software ecosystem: a survey approach.
Int. J. Geogr. Inf. Sci., 2021

Software Training in HEP.
Comput. Softw. Big Sci., 2021

A FAIR and AI-ready Higgs Boson Decay Dataset.
CoRR, 2021

Workflows Community Summit: Advancing the State-of-the-art of Scientific Workflows Management Systems Research and Development.
CoRR, 2021

Workflows Community Summit: Bringing the Scientific Workflows Community Together.
CoRR, 2021

A Fresh Look at FAIR for Research Software.
CoRR, 2021


Addressing Research Software Sustainability via Institutes.
Proceedings of the 1st IEEE/ACM International Workshop on Body of Knowledge for Software Sustainability, 2021

Research Software Sustainability and Citation.
Proceedings of the 1st IEEE/ACM International Workshop on Body of Knowledge for Software Sustainability, 2021

Sustaining Research Software via Research Software Engineers and Professional Associations.
Proceedings of the 1st IEEE/ACM International Workshop on Body of Knowledge for Software Sustainability, 2021

Research Software Sustainability: Lessons Learned at NCSA.
Proceedings of the 54th Hawaii International Conference on System Sciences, 2021

Extreme Scale Survey Simulation with Python Workflows.
Proceedings of the 17th IEEE International Conference on eScience, 2021

Federated Function as a Service for eScience.
Proceedings of the 17th IEEE International Conference on eScience, 2021

Working Towards Understanding the Role of FAIR for Machine Learning.
Proceedings of the DaMaLOS, 2021

2020
The Research Software Alliance (ReSA) and the community landscape.
Dataset, March, 2020

Convergence of artificial intelligence and high performance computing on NSF-supported cyberinfrastructure.
J. Big Data, 2020

The challenges of theory-software translation.
F1000Research, 2020

Software and Data Citation.
Comput. Sci. Eng., 2020

Nine Best Practices for Research Software Registries and Repositories: A Concise Guide.
CoRR, 2020

Confluence of Artificial Intelligence and High Performance Computing for Accelerated, Scalable and Reproducible Gravitational Wave Detection.
CoRR, 2020

Software Sustainability & High Energy Physics.
CoRR, 2020

Toward Interoperable Cyberinfrastructure: Common Descriptions for Computational Resources and Applications.
Proceedings of the PEARC '20: Practice and Experience in Advanced Research Computing, 2020

Research software engineers community workshop.
Proceedings of the PEARC Companion '20: Practice and Experience in Advanced Research Computing, 2020


2019
Supporting High-Performance and High-Throughput Computing for Experimental Science.
Comput. Softw. Big Sci., December, 2019


Enforcing public data archiving policies in academic publishing: A study of ecology journals.
Big Data Soc., January, 2019

The global impact of science gateways, virtual research environments and virtual laboratories.
Future Gener. Comput. Syst., 2019

Introduction to Accelerating Scientific Discovery With Reusable Software.
Comput. Sci. Eng., 2019

Community Organizations: Changing the Culture in Which Research Software Is Developed and Sustained.
Comput. Sci. Eng., 2019

Enabling real-time multi-messenger astrophysics discoveries with deep learning.
CoRR, 2019

Theory-Software Translation: Research Challenges and Future Directions.
CoRR, 2019

Software Citation Implementation Challenges.
CoRR, 2019

Sustaining Research Software: an SC18 Panel.
CoRR, 2019

Deep Learning for Multi-Messenger Astrophysics: A Gateway for Discovery in the Big Data Era.
CoRR, 2019

Scalable Parallel Programming in Python with Parsl.
Proceedings of the Practice and Experience in Advanced Research Computing on Rise of the Machines (learning), 2019

Citation and Research Objects: Toward Active Research Objects.
Proceedings of Workshop on Research Objects (RO2019), 2019

Research software development & management in universities: case studies from Manchester's RSDS group, Illinois' NCSA, and Notre Dame's CRC.
Proceedings of the 14th International Workshop on Software Engineering for Science, 2019

Parsl: Pervasive Parallel Programming in Python.
Proceedings of the 28th International Symposium on High-Performance Parallel and Distributed Computing, 2019

Quantifying the Impact of Memory Errors in Deep Learning.
Proceedings of the 2019 IEEE International Conference on Cluster Computing, 2019

2018
research-software/resosuma-data: 0.4.1.
Dataset, October, 2018

research-software/resosuma-data: 0.4.0.
Dataset, September, 2018

research-software/resosuma-data: 0.3.0.
Dataset, July, 2018


Journal of Open Source Software (JOSS): design and first-year review.
PeerJ Comput. Sci., 2018

The principles of tomorrow's university.
F1000Research, 2018

Publish Your Software: Introducing the Journal of Open Source Software (JOSS).
Comput. Sci. Eng., 2018

Conceptualization of a US Research Software Sustainability Institute (URSSI).
Comput. Sci. Eng., 2018

The State of Sustainable Research Software: Results from the Workshop on Sustainable Software for Science: Practice and Experiences (WSSSPE5.1).
CoRR, 2018

BioWorkbench: A High-Performance Framework for Managing and Analyzing Bioinformatics Experiments.
CoRR, 2018

Parsl: Scalable Parallel Scripting in Python.
Proceedings of the 10th International Workshop on Science Gateways, 2018

Software Citation in Theory and Practice.
Proceedings of the Mathematical Software - ICMS 2018, 2018

Mapping the Research Software Sustainability Space.
Proceedings of the 14th IEEE International Conference on e-Science, 2018

Building a Sustainable Structure for Research Software Engineering Activities.
Proceedings of the 14th IEEE International Conference on e-Science, 2018

2017
Leading-edge research in cluster, cloud, and grid computing: Best papers from the IEEE/ACM CCGrid 2015 conference.
Future Gener. Comput. Syst., 2017

A multi-disciplinary perspective on emergent and future innovations in peer review.
F1000Research, 2017

Four simple recommendations to encourage best practices in research software.
F1000Research, 2017

Engineering Academic Software (Dagstuhl Perspectives Workshop 16252).
Dagstuhl Manifestos, 2017

Streaming supercomputing needs workflow-enabled programming-in-the-large.
CoRR, 2017

Report on the Fourth Workshop on Sustainable Software for Science: Practice and Experiences (WSSSPE4).
CoRR, 2017

Report on the first workshop on negative and null results in eScience.
Concurr. Comput. Pract. Exp., 2017

Introducing distributed dynamic data-intensive (D3) science: Understanding applications and infrastructure.
Concurr. Comput. Pract. Exp., 2017

A social content delivery network for e-Science.
Concurr. Comput. Pract. Exp., 2017

Evaluating Distributed Execution of Workloads.
Proceedings of the 13th IEEE International Conference on e-Science, 2017

Understanding Software in Research: Initial Results from Examining Nature and a Call for Collaboration.
Proceedings of the 13th IEEE International Conference on e-Science, 2017

BOSS-LDG: A Novel Computational Framework That Brings Together Blue Waters, Open Science Grid, Shifter and the LIGO Data Grid to Accelerate Gravitational Wave Discovery.
Proceedings of the 13th IEEE International Conference on e-Science, 2017

2016
Ten Simple Rules for Taking Advantage of Git and GitHub.
PLoS Comput. Biol., 2016

Software Citation Principles.
PeerJ Prepr., 2016

Software vs. data in the context of citation.
PeerJ Prepr., 2016

Application Skeleton: Generating Synthetic Applications for Infrastructure Research.
J. Open Source Softw., 2016

The Challenge and Promise of Software Citation for Credit, Identification, Discovery, and Reuse.
ACM J. Data Inf. Qual., 2016

eScience today and tomorrow - Part 2.
Future Gener. Comput. Syst., 2016

eScience today and tomorrow.
Future Gener. Comput. Syst., 2016

Application skeletons: Construction and use in eScience.
Future Gener. Comput. Syst., 2016

Analysis of Distributed Execution of Workloads.
CoRR, 2016

Report on the Third Workshop on Sustainable Software for Science: Practice and Experiences (WSSSPE3).
CoRR, 2016

Integrating Abstractions to Enhance the Execution of Distributed Applications.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Evaluating Online Global Recovery with Fenix Using Application-Aware In-Memory Checkpointing Techniques.
Proceedings of the 45th International Conference on Parallel Processing Workshops, 2016

2015
The Case for Workflow-Aware Storage: An Opportunity Study.
J. Grid Comput., 2015

Looking at Software Sustainability and Productivity Challenges from NSF.
CoRR, 2015

Report on the Second Workshop on Sustainable Software for Science: Practice and Experiences (WSSSPE2).
CoRR, 2015

Interlanguage parallel scripting for distributed-memory scientific computing.
Proceedings of the 10th Workshop on Workflows in Support of Large-Scale Science, 2015

Porting Ordinary Applications to Blue Gene/Q Supercomputers.
Proceedings of the 11th IEEE International Conference on e-Science, 2015

Toward Interlanguage Parallel Scripting for Distributed-Memory Scientific Computing.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

2014
Special issue on eScience infrastructure and applications.
Future Gener. Comput. Syst., 2014

Implementing Transitive Credit with JSON-LD.
CoRR, 2014

Summary of the First Workshop on Sustainable Software for Science: Practice and Experiences (WSSSPE1).
CoRR, 2014

Second Workshop on on Sustainable Software for Science: Practice and Experiences (WSSSPE2): Submission, Peer-Review and Sorting Process, and Results.
CoRR, 2014

Challenges in Selecting Software to be Reused.
CoRR, 2014

Standing Together for Reproducibility in Large-Scale Computing: Report on reproducibility@XSEDE.
CoRR, 2014

DA-TC: a novel application execution model in multicluster systems.
Clust. Comput., 2014

Exploring Automatic, Online Failure Recovery for Scientific Applications at Extreme Scales.
Proceedings of the International Conference for High Performance Computing, 2014

Evaluating storage systems for scientific data in the cloud.
Proceedings of the ScienceCloud'14, 2014

Design and evaluation of the gemtc framework for GPU-enabled many-task computing.
Proceedings of the 23rd International Symposium on High-Performance Parallel and Distributed Computing, 2014

Using Application Skeletons to Improve eScience Infrastructure.
Proceedings of the 10th IEEE International Conference on e-Science, 2014

On Replica Placement in a Social CDN for e-Science.
Proceedings of the 10th IEEE International Conference on e-Science, 2014

2013
JETS: Language and System Support for Many-Parallel-Task Workflows.
J. Grid Comput., 2013

Turbine: A Distributed-memory Dataflow Engine for High Performance Many-task Applications.
Fundam. Informaticae, 2013

Recent advances in e-Science.
Future Gener. Comput. Syst., 2013

Reusability in Science: From Initial User Engagement to Dissemination of Results.
CoRR, 2013

First Workshop on on Sustainable Software for Science: Practice and Experiences (WSSSPE): Submission and Peer-Review Process, and Results.
CoRR, 2013

Distributed computing practice for large-scale science and engineering applications.
Concurr. Comput. Pract. Exp., 2013

Parallelizing the execution of sequential scripts.
Proceedings of the International Conference for High Performance Computing, 2013

Swift/T: scalable data flow programming for many-task applications.
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2013

MTC envelope: defining the capability of large scale computers in the context of parallel scripting applications.
Proceedings of the 22nd International Symposium on High-Performance Parallel and Distributed Computing, 2013

Constructing a Social Content Delivery Network for eScience.
Proceedings of the 9th IEEE International Conference on eScience, 2013

Swift/T: Large-Scale Application Composition via Distributed-Memory Dataflow Processing.
Proceedings of the 13th IEEE/ACM International Symposium on Cluster, 2013

2012
Grid Computing: The Next Decade -- Report and Summary
CoRR, 2012

Survey and Analysis of Production Distributed Computing Infrastructures
CoRR, 2012

Many-Task Computing and Blue Waters
CoRR, 2012

Turbine: a distributed-memory dataflow engine for extreme-scale many-task applications.
Proceedings of the 1st ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies, 2012

Design and analysis of data management in scalable parallel scripting.
Proceedings of the SC Conference on High Performance Computing Networking, 2012

A Social Content Delivery Network for Scientific Cooperation: Vision, Design, and Architecture.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Job and data clustering for aggregate use of multiple production cyberinfrastructures.
Proceedings of the DIDC'12, 2012

Topic 1: Support Tools and Environments.
Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012

Pilot abstractions for compute, data, and network.
Proceedings of the 8th IEEE International Conference on E-Science, 2012

A Workflow-Aware Storage System: An Opportunity Study.
Proceedings of the 12th IEEE/ACM International Symposium on Cluster, 2012

2011
Swift: A language for distributed parallel scripting.
Parallel Comput., 2011

Many-Task Computing Tools for Multiscale Modeling
CoRR, 2011

Data-intensive CyberShake computations on an opportunistic cyberinfrastructure.
Proceedings of the 2011 TeraGrid Conference - Extreme Digital Discovery, 2011

AME: an anyscale many-task computing engine.
Proceedings of the WORKS'11, 2011

Panel: many-task computing meets exascales.
Proceedings of the 2011 ACM International Workshop on Many Task Computing on Grids and Supercomputers, 2011

Cyberinfrastructure Usage Modalities on the TeraGrid.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

2010
Special Section: Grid computing, high-performance and distributed applications.
Future Gener. Comput. Syst., 2010

Collaborative Astronomical Image Mosaics
CoRR, 2010

Special Issue: Grid Computing, High Performance and Distributed Application.
Concurr. Comput. Pract. Exp., 2010

Global-scale distributed I/O with ParaMEDIC.
Concurr. Comput. Pract. Exp., 2010

Scheduling many-task workloads on supercomputers: Dealing with trailing tasks.
Proceedings of the 3rd Workshop on Many-Task Computing on Grids and Supercomputers, 2010

Distributed Systems and Algorithms.
Proceedings of the Euro-Par 2010 - Parallel Processing, 16th International Euro-Par Conference, Ischia, Italy, August 31, 2010

2009
Montage: a grid portal and software toolkit for science-grade astronomical image mosaicking.
Int. J. Comput. Sci. Eng., 2009

Critical perspectives on large-scale distributed applications and production Grids.
Proceedings of the 2009 10th IEEE/ACM International Conference on Grid Computing, 2009

A Fresh Perspective on Developing and Executing DAG-Based Distributed Applications: A Case-Study of SAGA-Based Montage.
Proceedings of the Fifth International Conference on e-Science, 2009

An innovative application execution toolkit for multicluster grids.
Proceedings of the 2009 IEEE International Conference on Cluster Computing, August 31, 2009

2008
GADA 2008 PC Co-chairs' Message.
Proceedings of the On the Move to Meaningful Internet Systems: OTM 2008, 2008

Feature rich, enhanced grid portal for LONI.
Proceedings of the 15th ACM Mardi Gras conference: From lightweight mash-ups to lambda grids: Understanding the spectrum of distributed computing requirements, applications, tools, infrastructures, interoperability, and the incremental adoption of key capabilities, Baton Rouge, Louisiana, USA, January 29, 2008

Workflow task clustering for best effort systems with Pegasus.
Proceedings of the 15th ACM Mardi Gras conference: From lightweight mash-ups to lambda grids: Understanding the spectrum of distributed computing requirements, applications, tools, infrastructures, interoperability, and the incremental adoption of key capabilities, Baton Rouge, Louisiana, USA, January 29, 2008

Abstractions for Distributed Systems (DPA 2008).
Proceedings of the Euro-Par 2008 Workshops, 2008

2007
Optimizing workflow data footprint.
Sci. Program., 2007

GADA 2007 PC Co-chairs' Message.
Proceedings of the On the Move to Meaningful Internet Systems 2007: CoopIS, 2007

Generating Complex Astronomy Workflows.
Proceedings of the Workflows for e-Science, Scientific Workflows for Grids., 2007

2006
A Component Architecture for High-Performance Scientific Computing.
Int. J. High Perform. Comput. Appl., 2006

Data-Oriented Distributed Computing for Science: Reality and Possibilities.
Proceedings of the On the Move to Meaningful Internet Systems 2006: CoopIS, 2006

2005
Pegasus: A framework for mapping complex scientific workflows onto distributed systems.
Sci. Program., 2005

The Pegasus portal: web based grid computing.
Proceedings of the 2005 ACM Symposium on Applied Computing (SAC), 2005

A Comparison of Two Methods for Building Astronomical Image Mosaics on a Grid.
Proceedings of the 34th International Conference on Parallel Processing Workshops (ICPP 2005 Workshops), 2005

Message from the Chairs.
Proceedings of the 34th International Conference on Parallel Processing Workshops (ICPP 2005 Workshops), 2005

2004
Accessing and Visualizing Scientific Spatiotemporal Data.
Proceedings of the 16th International Conference on Scientific and Statistical Database Management (SSDBM 2004), 2004

Application-Level Fault Tolerance in the Orbital Thermal Imaging Spectrometer.
Proceedings of the 10th IEEE Pacific Rim International Symposium on Dependable Computing (PRDC 2004), 2004

2003
Tests and Tolerances for High-Performance Software-Implemented Fault Detection.
IEEE Trans. Computers, 2003

NASA Advances Robotic Space Exploration.
Computer, 2003

2001
High Performance Computing Systems for Autonomous Spaceborne Missions.
Int. J. High Perform. Comput. Appl., 2001

Embedded/Real-Time Systems.
Int. J. High Perform. Comput. Appl., 2001

Fault-Tolerant High-Performance Matrix Multiplication: Theory and Practice.
Proceedings of the 2001 International Conference on Dependable Systems and Networks (DSN 2001) (formerly: FTCS), 2001

2000
Software-Implemented Fault Detection for High-Performance Space Applications.
Proceedings of the 2000 International Conference on Dependable Systems and Networks (DSN 2000) (formerly FTCS-30 and DCCA-8), 2000

Demonstration of the Remote Exploration and Experimentation (REE) Fault-Tolerant Parallel-Processing Supercomputer for Spacecraft Onboard Scientific Data Processing.
Proceedings of the 2000 International Conference on Dependable Systems and Networks (DSN 2000) (formerly FTCS-30 and DCCA-8), 2000

Development of a Spaceborne Embedded Cluster.
Proceedings of the 2000 IEEE International Conference on Cluster Computing (CLUSTER 2000), November 28th, 2000

1999
Advances in Modeling the Generation of the Geomagnetic Field by the Use of Massively Parallel Computers and Profound Optimization.
Proceedings of the Ninth SIAM Conference on Parallel Processing for Scientific Computing, 1999

1997
Optimization of a Parallel Ocean General Circulation Model.
Proceedings of the ACM/IEEE Conference on Supercomputing, 1997

1992
Initial results for automated computational modeling of patient-specific electromagnetic hyperthermia.
IEEE Trans. Biomed. Eng., 1992


  Loading...