Bill Howe

ORCID: 0000-0001-8588-8472

Affiliations:
  • University of Washington, Information School, Seattle, WA, USA
  • Portland State University, USA (former)


According to our database, Bill Howe authored at least 149 papers between 2002 and 2024.

Bibliography

2024
Epistemic Parity: Reproducibility as an Evaluation Metric for Differential Privacy.
SIGMOD Rec., March, 2024

Reliable, Routable, and Reproducible: Collection of Pedestrian Pathways at Statewide Scale.
CoRR, 2024

ML-EAT: A Multilevel Embedding Association Test for Interpretable and Transparent Social Science.
CoRR, 2024

Representation Bias of Adolescents in AI: A Bilingual, Bicultural Study.
CoRR, 2024

Dataset Scale and Societal Consistency Mediate Facial Impression Bias in Vision-Language AI.
CoRR, 2024

Towards Zero-Shot Annotation of the Built Environment with Vision-Language Models (Vision Paper).
CoRR, 2024

The Art of Refusal: A Survey of Abstention in Large Language Models.
CoRR, 2024

PathwayBench: Assessing Routability of Pedestrian Pathway Networks Inferred from Multi-City Imagery.
CoRR, 2024

Geospatial Imputation of Urban Mobility Data with Self-Supervised Learning.
Proceedings of the 57th Hawaii International Conference on System Sciences, 2024

Does a Fair Model Produce Fair Explanations? Relating Distributive and Procedural Fairness.
Proceedings of the 57th Hawaii International Conference on System Sciences, 2024

SARN: Structurally-Aware Recurrent Network for Spatio-Temporal Disaggregation.
Proceedings of the 32nd ACM International Conference on Advances in Geographic Information Systems, 2024

Towards Zero-Shot Annotation of the Built Environment with Vision-Language Models.
Proceedings of the 32nd ACM International Conference on Advances in Geographic Information Systems, 2024

Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings.
Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, 2024

Characterizing LLM Abstention Behavior in Science QA with Context Perturbations.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Label-Efficient Group Robustness via Out-of-Distribution Concept Curation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
InfoVisDial: An Informative Visual Dialogue Dataset by Bridging Large Multimodal and Language Models.
CoRR, 2023

Top-down Green-ups: Satellite Sensing and Deep Models to Predict Buffelgrass Phenology.
CoRR, 2023

Urban Spatiotemporal Data Synthesis via Neural Disaggregation.
CoRR, 2023

Adapting to Skew: Imputing Spatiotemporal Urban Data with 3D Partial Convolutions and Biased Masking.
CoRR, 2023

Contrastive Language-Vision AI Models Pretrained on Web-Scraped Multimodal Data Exhibit Sexual Objectification Bias.
Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, 2023

AI is Reducing to Data Curation (Abstract).
Proceedings of the 15th Alberto Mendelzon International Workshop on Foundations of Data Management (AMW 2023), 2023

2022
Technical Perspective: Imperative or Functional Control Flow Handling: Why not the Best of Both Worlds?
SIGMOD Rec., 2022

The Many Facets of Data Equity.
ACM J. Data Inf. Qual., 2022

Epistemic Parity: Reproducibility as an Evaluation Metric for Differential Privacy.
CoRR, 2022

Responsible data management.
Commun. ACM, 2022

Technical perspective: Visualization search: from sketching to natural language.
Commun. ACM, 2022

Surj: Ontological Learning for Fast, Accurate, and Robust Hierarchical Multi-label Classification.
Proceedings of the Companion of The Web Conference 2022, Virtual Event / Lyon, France, April 25, 2022

Ontologue: Declarative Benchmark Construction for Ontological Multi-Label Classification.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Technical Perspective: From Sketching to Natural Language: Expressive Visual Querying for Accelerating Insight.
SIGMOD Rec., 2021

COVID-19 Brings Data Equity Challenges to the Fore.
Digit. Gov. Res. Pract., 2021

EquiTensors: Learning Fair Integrations of Heterogeneous Urban Data.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

2020
Database Repair Meets Algorithmic Fairness.
SIGMOD Rec., 2020

Advice from SIGMOD/PODS 2020.
SIGMOD Rec., 2020

SPORES: Sum-Product Optimization via Relational Equality Saturation for Large Scale Linear Algebra.
Proc. VLDB Endow., 2020

Responsible Data Management.
Proc. VLDB Endow., 2020

JECL: Joint Embedding and Cluster Learning for Image-Text Pairs.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Fairness-Aware Demand Prediction for New Mobility.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Formalizing Visualization Design Knowledge as Constraints: Actionable and Extensible Models in Draco.
IEEE Trans. Vis. Comput. Graph., 2019

Fairness in Practice: A Survey on Equity in Urban Mobility.
IEEE Data Eng. Bull., 2019

Nutritional Labels for Data and Models.
IEEE Data Eng. Bull., 2019

Data Management for Causal Algorithmic Fairness.
IEEE Data Eng. Bull., 2019

Delineating Knowledge Domains in the Scientific Literature Using Visual Information.
CoRR, 2019

In Defense of Synthetic Data.
CoRR, 2019

Capuchin: Causal Database Repair for Algorithmic Fairness.
CoRR, 2019

MultiDEC: Multi-Modal Clustering of Image-Caption Pairs.
CoRR, 2019

GraviTIE: Exploratory Analysis of Large-Scale Heterogeneous Image Collections.
Proceedings of the World Wide Web Conference, 2019

Interventional Fairness: Causal Database Repair for Algorithmic Fairness.
Proceedings of the 2019 International Conference on Management of Data, 2019

Identifying the Central Figure of a Scientific Paper.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

FairST: Equitable Spatial and Temporal Demand Prediction for New Mobility Systems.
Proceedings of the 27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, 2019

Beyond Open vs. Closed: Balancing Individual Privacy and Public Accountability in Data Sharing.
Proceedings of the Conference on Fairness, Accountability, and Transparency, 2019

MithraLabel: Flexible Dataset Nutritional Labels for Responsible Data Science.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Database-Agnostic Workload Management.
Proceedings of the 9th Biennial Conference on Innovative Data Systems Research, 2019

Falcon: Balancing Interactive Latency and Resolution Sensitivity for Scalable Linked Visualizations.
Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, 2019

2018
Viziometrics: Analyzing Visual Information in the Scientific Literature.
IEEE Trans. Big Data, 2018

Panel: A Debate on Data and Algorithmic Ethics.
Proc. VLDB Endow., 2018

The principles of tomorrow's university.
F1000Research, 2018

Privacy-Preserving Synthetic Datasets Over Weakly Constrained Domains.
CoRR, 2018

Query2Vec: NLP Meets Databases for Generalized Workload Analytics.
CoRR, 2018

MobilityMirror: Bias-Adjusted Transportation Datasets.
Proceedings of the Big Social Data and Urban Computing, 2018

A Nutritional Label for Rankings.
Proceedings of the 2018 International Conference on Management of Data, 2018

Special Session: A Technical Research Agenda in Data Ethics and Responsible Data Management.
Proceedings of the 2018 International Conference on Management of Data, 2018

EZLearn: Exploiting Organic Supervision in Automated Data Annotation.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

2017
Scalable and Efficient Flow-Based Community Detection for Large-Scale Graph Analysis.
ACM Trans. Knowl. Discov. Data, 2017

Synthetic Data for Social Good.
CoRR, 2017

EZLearn: Exploiting Organic Supervision in Large-Scale Data Annotation.
CoRR, 2017

Deep Mapping of the Visual Literature.
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

Fides: Towards a Platform for Responsible Data Science.
Proceedings of the 29th International Conference on Scientific and Statistical Database Management, 2017

DataSynthesizer: Privacy-Preserving Synthetic Datasets.
Proceedings of the 29th International Conference on Scientific and Statistical Database Management, 2017

LaraDB: A Minimalist Kernel for Linear and Relational Algebra Computation.
Proceedings of the 4th ACM SIGMOD Workshop on Algorithms and Systems for MapReduce and Beyond, 2017

Data Science Education: We're Missing the Boat, Again.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

PhyloParser: A Hybrid Algorithm for Extracting Phylogenies from Dendrograms.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

Profiling a GPU database implementation: a holistic view of GPU resource utilization on TPC-H queries.
Proceedings of the 13th International Workshop on Data Management on New Hardware, 2017

The Myria Big Data Management and Analytics System and Cloud Services.
Proceedings of the 8th Biennial Conference on Innovative Data Systems Research, 2017

Voyager 2: Augmenting Visual Analysis with Partial View Specifications.
Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, 2017

2016
Voyager: Exploratory Analysis via Faceted Browsing of Visualization Recommendations.
IEEE Trans. Vis. Comput. Graph., 2016

Lara: A Key-Value Algebra underlying Arrays and Relations.
CoRR, 2016

Scalable clustering algorithms for continuous environmental flow cytometry.
Bioinform., 2016

VizioMetrix: A Platform for Analyzing the Visual Information in Big Scholarly Data.
Proceedings of the 25th International Conference on World Wide Web, 2016

Towards a general-purpose query language for visualization recommendation.
Proceedings of the Workshop on Human-In-the-Loop Data Analytics, 2016

SQLShare: Results from a Multi-Year SQL-as-a-Service Experiment.
Proceedings of the 2016 International Conference on Management of Data, 2016

MusicDB: A Platform for Longitudinal Music Analytics.
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

High variety cloud databases.
Proceedings of the 32nd IEEE International Conference on Data Engineering Workshops, 2016

From NoSQL Accumulo to NewSQL Graphulo: Design and utility of graph algorithms inside a BigTable database.
Proceedings of the 2016 IEEE High Performance Extreme Computing Conference, 2016

2015
The BigDAWG Polystore System.
SIGMOD Rec., 2015

A Demonstration of the BigDAWG Polystore System.
Proc. VLDB Endow., 2015

Query-Based Data Pricing.
J. ACM, 2015

Perfopticon: Visual Query Analysis for Distributed Databases.
Comput. Graph. Forum, 2015

Gaussian Mixture Models Use-Case: In-Memory Analysis with Myria.
Proceedings of the 3rd VLDB Workshop on In-Memory Data Management and Analytics, 2015

Towards automated prediction of relationships among scientific datasets.
Proceedings of the 27th International Conference on Scientific and Statistical Database Management, 2015

GossipMap: a distributed community detection algorithm for billion-edge directed graphs.
Proceedings of the International Conference for High Performance Computing, 2015

Detecting and Dismantling Composite Visualizations in the Scientific Literature.
Proceedings of the Pattern Recognition Applications and Methods, 2015

Dismantling Composite Visualizations in the Scientific Literature.
Proceedings of the ICPRAM 2015, 2015

Big Data Science Needs Big Data Middleware.
Proceedings of the Seventh Biennial Conference on Innovative Data Systems Research, 2015

Time-Varying Clusters in Large-Scale Flow Cytometry.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
The database group at the University of Washington.
SIGMOD Rec., 2014

Helping scientists reconnect their datasets.
Proceedings of the Conference on Scientific and Statistical Database Management, 2014

Should we all be teaching "intro to data science" instead of "intro to databases"?
Proceedings of the International Conference on Management of Data, 2014

Demonstration of the Myria big data management service.
Proceedings of the International Conference on Management of Data, 2014

2013
Hadoop's Adolescence.
Proc. VLDB Endow., 2013

Managing Skew in Hadoop.
IEEE Data Eng. Bull., 2013

Collaborative Science Workflows in SQL.
Comput. Sci. Eng., 2013

Compiled Plans for In-Memory Path-Counting Queries.
Proceedings of the 1st International Workshop on In Memory Data Management and Analytics, 2013

Real-time collaborative analysis with (almost) pure SQL: a case study in biogeochemical oceanography.
Proceedings of the Conference on Scientific and Statistical Database Management, 2013

Education and career paths for data scientists.
Proceedings of the Conference on Scientific and Statistical Database Management, 2013

The power of data use management in action.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

Toward practical query pricing with QueryMarket.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

Massive scale cyber traffic analysis: a driver for graph database research.
Proceedings of the First International Workshop on Graph Data Management Experiences and Systems, 2013

Scalable Flow-Based Community Detection for Large-Scale Network Analysis.
Proceedings of the 13th IEEE International Conference on Data Mining Workshops, 2013

Stop That Query! The Need for Managing Data Use.
Proceedings of the Sixth Biennial Conference on Innovative Data Systems Research, 2013

A Discussion on Pricing Relational Data.
Proceedings of the In Search of Elegance in the Theory and Practice of Computation, 2013

2012
The HaLoop approach to large-scale iterative data analysis.
VLDB J., 2012

SkewTune in Action: Mitigating Skew in MapReduce Applications.
Proc. VLDB Endow., 2012

QueryMarket Demonstration: Pricing for Online Data Markets.
Proc. VLDB Endow., 2012

Advancing Declarative Query in the Long Tail of Science.
IEEE Data Eng. Bull., 2012

Virtual Appliances, Cloud Computing, and Reproducible Research.
Comput. Sci. Eng., 2012

SkewTune: mitigating skew in mapreduce applications.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

VizDeck: self-organizing dashboards for visual analytics.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Poster: Hadoop's Adolescence; A Comparative Workloads Analysis from Three Research Clusters.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Abstract: Hadoop's Adolescence; A Comparative Workloads Analysis from Three Research Clusters.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Optimizing Large-Scale Semi-Naïve Datalog Evaluation in Hadoop.
Proceedings of the Datalog in Academia and Industry - Second International Workshop, 2012

Designing good algorithms for MapReduce and beyond.
Proceedings of the ACM Symposium on Cloud Computing, SOCC '12, 2012

VizDeck: a card game metaphor for fast visual data exploration.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2012

2011
Data Markets in the Cloud: An Opportunity for the Database Community.
Proc. VLDB Endow., 2011

Database-as-a-Service for Long-Tail Science.
Proceedings of the Scientific and Statistical Database Management, 2011

Towards Efficient and Precise Queries over Ten Million Asteroid Trajectory Models.
Proceedings of the Scientific and Statistical Database Management, 2011

Automatic example queries for ad hoc databases.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

Parallel visualization on large clusters using MapReduce.
Proceedings of the IEEE Symposium on Large Data Analysis and Visualization, 2011

2010
HaLoop: Efficient Iterative Data Processing on Large Clusters.
Proc. VLDB Endow., 2010

Astronomy in the Cloud: Using MapReduce for Image Coaddition.
CoRR, 2010

Scalable Clustering Algorithm for N-Body Simulations in a Shared-Nothing Cluster.
Proceedings of the Scientific and Statistical Database Management, 2010

Client + Cloud: Evaluating Seamless Architectures for Visual Data Analytics in the Ocean Sciences.
Proceedings of the Scientific and Statistical Database Management, 2010

COVE: A Visual Environment for Multidisciplinary Ocean Science Collaboration.
Proceedings of the Sixth International Conference on e-Science, 2010

Skew-resistant parallel processing of feature-extracting scientific user-defined functions.
Proceedings of the 1st ACM Symposium on Cloud Computing, 2010

2009
Scientific Mashups: Runtime-Configurable Data Product Ensembles.
Proceedings of the Scientific and Statistical Database Management, 2009

Embracing Uncertainty in Large-Scale Computational Astrophysics.
Proceedings of the Third VLDB workshop on Management of Uncertain Data (MUD2009) in conjunction with VLDB 2009, 2009

Analyzing massive astrophysical datasets: Can Pig/Hadoop or a relational DBMS help?
Proceedings of the 2009 IEEE International Conference on Cluster Computing, August 31, 2009

2008
Scientific Exploration in the Era of Ocean Observatories.
Comput. Sci. Eng., 2008

Quarrying dataspaces: Schemaless profiling of unfamiliar information sources.
Proceedings of the 24th International Conference on Data Engineering Workshops, 2008

End-to-End eScience: Integrating Workflow, Query, Visualization, and Provenance at an Ocean Observatory.
Proceedings of the Fourth International Conference on e-Science, 2008

Scientific Mashups: Runtime-Configurable Data Product Ensembles.
Proceedings of the Fourth International Conference on e-Science, 2008

2007
Smoothing the ROI Curve for Scientific Data Management Applications.
Proceedings of the Third Biennial Conference on Innovative Data Systems Research, 2007

2006
Managing the Forecast Factory.
Proceedings of the 22nd International Conference on Data Engineering Workshops, 2006

2005
Algebraic manipulation of scientific datasets.
VLDB J., 2005

Retrofitting a Data Model to Existing Environmental Data.
Proceedings of the 17th International Conference on Scientific and Statistical Database Management, 2005

Querying and Visualizing Gridded Datasets for e-Science.
Proceedings of the 21st International Conference on Data Engineering, 2005

2004
Logical and Physical Data Independence for Native Scientific Data Repositories.
IEEE Data Eng. Bull., 2004

Emergent Semantics: Towards Self-Organizing Scientific Metadata.
Proceedings of the Semantics for Grid Databases, 2004

2002
Representing, Exploiting, and Extracting Metadata using Metadata++.
Proceedings of the 2002 Annual National Conference on Digital Government Research, 2002

