Aditya G. Parameswaran

Orcid: 0000-0002-4538-4752

Affiliations:
  • University of California, Berkeley, USA
  • University of Illinois, USA (former)


According to our database, Aditya G. Parameswaran authored at least 148 papers between 2008 and 2024.

Bibliography

2024
Dealing with Acronyms, Abbreviations, and Typos in Real-World Entity Matching.
Proc. VLDB Endow., August, 2024

SPADE: Synthesizing Data Quality Assertions for Large Language Model Pipelines.
Proc. VLDB Endow., August, 2024

"We Have No Idea How Models will Behave in Production until Production": How Engineers Operationalize Machine Learning.
Proc. ACM Hum. Comput. Interact., 2024

DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing.
CoRR, 2024

NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval.
CoRR, 2024

Flow with FlorDB: Incremental Context Maintenance for the Machine Learning Lifecycle.
CoRR, 2024

Towards Accurate and Efficient Document Analytics with Large Language Models.
CoRR, 2024

SPADE: Synthesizing Assertions for Large Language Model Pipelines.
CoRR, 2024

Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences.
Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology, 2024

Quilt: Custom UIs for Linking Unstructured Documents to Structured Datasets.
Adjunct Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology, 2024

Building Reactive Large Language Model Pipelines with Motion.
Companion of the 2024 International Conference on Management of Data, 2024

It Took Longer than I was Expecting: Why is Dataset Search Still so Hard?
Proceedings of the 2024 Workshop on Human-In-the-Loop Data Analytics, 2024

Inferring Visualization Intent from Conversation.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Revisiting Prompt Engineering via Declarative Crowdsourcing.
Proceedings of the 14th Conference on Innovative Data Systems Research, 2024

2023
Transactional Panorama: A Conceptual Framework for User Perception in Analytical Visual Interfaces.
Proc. VLDB Endow., 2023

Visualizing Spreadsheet Formula Graphs Compactly.
Proc. VLDB Endow., 2023

Moving Fast With Broken Data.
CoRR, 2023

Efficient and Compact Spreadsheet Formula Graphs.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Automatic and Precise Data Validation for Machine Learning.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

2022
Deconstructing Categorization in Visualization Recommendation: A Taxonomy and Comparative Study.
IEEE Trans. Vis. Comput. Graph., 2022

Towards Observability for Production Machine Learning Pipelines [Vision].
Proc. VLDB Endow., 2022

Bolt-on, Compact, and Rapid Program Slicing for Notebooks [Scalable Data Science].
Proc. VLDB Endow., 2022

Operationalizing Machine Learning: An Interview Study.
CoRR, 2022

Rethinking Streaming Machine Learning Evaluation.
CoRR, 2022

The Sky Above The Clouds.
CoRR, 2022

Leveraging Analysis History for Improved In Situ Visualization Recommendation.
Comput. Graph. Forum, 2022

Expressive querying for accelerating visual analytics.
Commun. ACM, 2022

Piloting Data Engineering at Berkeley.
Proceedings of the 1st ACM SIGMOD International Workshop on Data Systems Education: Bridging Education Practice with Education Research, 2022

Towards Observability for Machine Learning Pipelines.
Proceedings of the 12th Conference on Innovative Data Systems Research, 2022

2021
From Sketching to Natural Language: Expressive Visual Querying for Accelerating Insight.
SIGMOD Rec., 2021

NOAH: Interactive Spreadsheet Exploration with Dynamic Hierarchical Overviews.
Proc. VLDB Endow., 2021

Flexible Rule-Based Decomposition and Metadata Independence in Modin: A Parallel Dataframe System.
Proc. VLDB Endow., 2021

Fine-Grained Lineage for Safer Notebook Interactions.
Proc. VLDB Endow., 2021

Lux: Always-on Visualization Recommendations for Exploratory Dataframe Workflows.
Proc. VLDB Endow., 2021

Enhancing the Interactivity of Dataframe Queries by Leveraging Think Time.
IEEE Data Eng. Bull., 2021

Lux: Always-on Visualization Recommendations for Exploratory Data Science.
CoRR, 2021

Production Machine Learning Pipelines: Empirical Analysis and Optimization Opportunities.
Proceedings of SIGMOD '21: International Conference on Management of Data, 2021

Rapid Approximate Aggregation with Distribution-Sensitive Interval Guarantees.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Whither AutoML? Understanding the Role of Automation in Machine Learning Workflows.
Proceedings of CHI '21: CHI Conference on Human Factors in Computing Systems, 2021

2020
OrpheusDB: bolt-on versioning for relational databases (extended version).
VLDB J., 2020

You can't always sketch what you want: Understanding Sensemaking in Visual Query Systems.
IEEE Trans. Vis. Comput. Graph., 2020

Towards Scalable Dataframe Systems.
Proc. VLDB Endow., 2020

Three Lessons from Accelerating Scientific Insight Discovery via Visual Querying.
Patterns, 2020

Uncovering Effective Explanations for Interactive Genomic Data Analysis.
Patterns, 2020

Demystifying a Dark Art: Understanding Real-World Machine Learning Model Development.
CoRR, 2020

Democratizing Data Science and Lessons Learned Along the Way.
Proceedings of the VLDB 2020 PhD Workshop co-located with the 46th International Conference on Very Large Databases (VLDB 2020), ONLINE, August 31, 2020

ShapeSearch: A Flexible and Efficient System for Shape-based Exploration of Trendlines.
Proceedings of the 2020 International Conference on Management of Data, 2020

Benchmarking Spreadsheet Systems.
Proceedings of the 2020 International Conference on Management of Data, 2020

2019
Interactive Data Exploration with Smart Drill-Down.
IEEE Trans. Knowl. Data Eng., 2019

Enabling Data Science for the Majority.
Proc. VLDB Endow., 2019

A Human-in-the-loop Perspective on AutoML: Milestones and the Road Ahead.
IEEE Data Eng. Bull., 2019

SCATTERSEARCH: Visual Querying of Scatterplot Visualizations.
CoRR, 2019

An Exploratory User Study of Visual Causality Analysis.
Comput. Graph. Forum, 2019

Anti-Freeze for Large and Complex Spreadsheets: Asynchronous Formula Computation.
Proceedings of the 2019 International Conference on Management of Data, 2019

Avoiding drill-down fallacies with VisPilot: assisted exploration of data subsets.
Proceedings of the 24th International Conference on Intelligent User Interfaces, 2019

Faster, Higher, Stronger: Redesigning Spreadsheets for Scale.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

CRUX: Adaptive Querying for Efficient Crowdsourced Data Extraction.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

2018
Helix: Holistic Optimization for Accelerating Iterative Machine Learning.
Proc. VLDB Endow., 2018

Helix: Accelerating Human-in-the-loop Machine Learning.
Proc. VLDB Endow., 2018

ShapeSearch: Flexible Pattern-based Querying of Trend Line Visualizations.
Proc. VLDB Endow., 2018

Adaptive Sampling for Rapidly Matching Histograms.
Proc. VLDB Endow., 2018

Automatic email response suggestion for support departments within a university.
PeerJ Prepr., 2018

The Case for a Visual Discovery Assistant: A Holistic Solution for Accelerating Visual Data Exploration.
IEEE Data Eng. Bull., 2018

How Developers Iterate on Machine Learning Workflows - A Survey of the Applied Machine Learning Literature.
CoRR, 2018

Low-Norm Graph Embedding.
CoRR, 2018

Towards a Theory of Data-Diff: Optimal Synthesis of Succinct Data Modification Scripts.
CoRR, 2018

DataDiff: User-Interpretable Data Transformation Summaries for Collaborative Data Analysis.
Proceedings of the 2018 International Conference on Management of Data, 2018

Accelerating Human-in-the-loop Machine Learning: Challenges and Opportunities.
Proceedings of the Second Workshop on Data Management for End-To-End Machine Learning, 2018

Optimally Leveraging Density and Locality for Exploratory Browsing and Sampling.
Proceedings of the Workshop on Human-In-the-Loop Data Analytics, 2018

Navigating the Data Lake with DATAMARAN: Automatically Extracting Structure from Log Datasets.
Proceedings of the 2018 International Conference on Management of Data, 2018

Towards a Holistic Integration of Spreadsheets with Databases: A Scalable Storage Engine for Presentational Data Management.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

Aggregating Crowdsourced Image Segmentations.
Proceedings of the HCOMP 2018 Works in Progress and Demonstration Papers Track of the sixth AAAI Conference on Human Computation and Crowdsourcing (HCOMP 2018), 2018

Holistic Crowd-Powered Sorting via AID: Optimizing for Accuracies, Inconsistencies, and Difficulties.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Characterizing Scalability Issues in Spreadsheet Software using Online Forums.
Proceedings of the Extended Abstracts of the 2018 CHI Conference on Human Factors in Computing Systems, 2018

2017
I've Seen "Enough": Incrementally Improving Visualizations to Support Rapid Decision Making.
Proc. VLDB Endow., 2017

Understanding Workers, Developing Effective Tasks, and Enhancing Marketplace Dynamics: A Study of a Large Crowdsourcing Marketplace.
Proc. VLDB Endow., 2017

OrpheusDB: Bolt-on Versioning for Relational Databases.
Proc. VLDB Endow., 2017

Letter from the 2017 IEEE TCDE Early Career Award Winner.
IEEE Data Eng. Bull., 2017

Accelerating Scientific Data Exploration via Visual Query Systems.
CoRR, 2017

OrpheusDB: A Lightweight Approach to Relational Dataset Versioning.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

SLiMFast: Guaranteed Results for Data Fusion and Source Reliability.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

Fast-Forwarding to Desired Visualizations with Zenvisage.
Proceedings of the 8th Biennial Conference on Innovative Data Systems Research, 2017

On the Interpretability of Conditional Probability Estimates in the Agnostic Setting.
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

2016
Challenges in Data Crowdsourcing.
IEEE Trans. Knowl. Data Eng., 2016

Towards Visualization Recommendation Systems.
SIGMOD Rec., 2016

Effortless Data Exploration with zenvisage: An Expressive and Interactive Visual Analytics System.
Proc. VLDB Endow., 2016

Decibel: The Relational Dataset Branching System.
Proc. VLDB Endow., 2016

Optimizing Open-Ended Crowdsourcing: The Next Frontier in Crowdsourced Data Management.
IEEE Data Eng. Bull., 2016

zenvisage: Effortless Visual Data Exploration.
CoRR, 2016

Speedy Browsing and Sampling with NeedleTail.
CoRR, 2016

It's just a matter of perspective(s): Crowd-Powered Consensus Organization of Corpora.
CoRR, 2016

Towards Globally Optimal Crowdsourcing Quality Management: The Uniform Worker Setting.
Proceedings of the 2016 International Conference on Management of Data, 2016

Squish: Near-Optimal Compression for Archival of Relational Datasets.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

FacetGist: Collective Extraction of Document Facets in Large Technical Corpora.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

2015
SEEDB: Efficient Data-Driven Visualization Recommendations to Support Visual Analytics.
Proc. VLDB Endow., 2015

Rapid Sampling for Visualizations with Ordering Guarantees.
Proc. VLDB Endow., 2015

Smart Drill-Down: A New Data Exploration Operator.
Proc. VLDB Endow., 2015

Principles of Dataset Versioning: Exploring the Recreation/Storage Tradeoff.
Proc. VLDB Endow., 2015

Collaborative Data Analytics with DataHub.
Proc. VLDB Endow., 2015

DATASPREAD: Unifying Databases and Spreadsheets.
Proc. VLDB Endow., 2015

Crowdsourced Data Management: Industry and Academic Perspectives.
Found. Trends Databases, 2015

Globally Optimal Crowdsourcing Quality Management: Full technical report.
CoRR, 2015

CrowdGather: Entity Extraction over Structured Domains.
CoRR, 2015

Exploiting Features for Data Source Quality Estimation.
CoRR, 2015

On the Uniform Convergence of Consistent Confidence Measures.
CoRR, 2015

GeoHashViz: interactive analytics for mapping spatiotemporal diffusion of Twitter hashtags.
Proceedings of the 2015 XSEDE Conference: Scientific Advancements Enabled by Enhanced Cyberinfrastructure, St. Louis, MO, USA, July 26, 2015

Towards a Unified Query Language for Provenance and Versioning.
Proceedings of the 7th USENIX Workshop on the Theory and Practice of Provenance, 2015

Exploiting Correlations for Expensive Predicate Evaluation.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

Debiasing Crowdsourced Batches.
Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Comprehensive and reliable crowd assessment algorithms.
Proceedings of the 31st IEEE International Conference on Data Engineering, 2015

Surpassing Humans and Computers with JELLYBEAN: Crowd-Vision-Hybrid Counting Algorithms.
Proceedings of the Third AAAI Conference on Human Computation and Crowdsourcing, 2015

DataHub: Collaborative Data Science & Dataset Version Management at Scale.
Proceedings of the Seventh Biennial Conference on Innovative Data Systems Research, 2015

2014
SEEDB: Automatically Generating Query Visualizations.
Proc. VLDB Endow., 2014

Optimal Crowd-Powered Rating and Filtering Algorithms.
Proc. VLDB Endow., 2014

Finish Them!: Pricing Algorithms for Human Computation.
Proc. VLDB Endow., 2014

Smart Drill Down.
CoRR, 2014

Indexing Cost Sensitive Prediction.
CoRR, 2014

DataSift: a crowd-powered search toolkit.
Proceedings of the International Conference on Management of Data, 2014

Crowd-powered find algorithms.
Proceedings of the IEEE 30th International Conference on Data Engineering, Chicago, 2014

Optimal Worker Quality and Answer Estimates in Crowd-Powered Filtering and Rating.
Proceedings of the Second AAAI Conference on Human Computation and Crowdsourcing, 2014

2013
Human-powered data management.
PhD thesis, 2013

Active Sampling for Entity Matching with Guarantees.
ACM Trans. Knowl. Discov. Data, 2013

Evaluating the crowd with confidence.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

DataSift: An Expressive and Accurate Crowd-Powered Search Toolkit.
Proceedings of the First AAAI Conference on Human Computation and Crowdsourcing, 2013

Efficient parsing-based search over structured data.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

2012
An overview of the deco system: data model and query language; query processing and optimization.
SIGMOD Rec., 2012

Deco: A System for Declarative Crowdsourcing.
Proc. VLDB Endow., 2012

An interview with Surajit Chaudhuri.
XRDS, 2012

Big data.
XRDS, 2012

CrowdScreen: algorithms for filtering data with humans.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

So who won?: dynamic max discovery with the crowd.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Minimizing Uncertainty in Pipelines.
Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012, December 3-6, 2012

Active sampling for entity matching.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

Fuzzy Joins Using MapReduce.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

Deco: declarative crowdsourcing.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
Recommendation systems with complex constraints: A course recommendation perspective.
ACM Trans. Inf. Syst., 2011

Human-assisted graph search: it's okay to ask questions.
Proc. VLDB Endow., 2011

Optimal Schemes for Robust Web Extraction.
Proc. VLDB Endow., 2011

Information seeking: convergence of search, recommendations, and advertising.
Commun. ACM, 2011

Answering Queries using Humans, Algorithms and Databases.
Proceedings of the Fifth Biennial Conference on Innovative Data Systems Research, 2011

2010
Towards The Web of Concepts: Extracting Concepts from Large Datasets.
Proc. VLDB Endow., 2010

Recsplorer: recommendation algorithms based on precedence mining.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

Synthesizing view definitions from data.
Proceedings of the International Conference on Database Theory (ICDT), 2010

Evaluating, combining and generalizing recommendations with prerequisites.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

2009
Social sites research through CourseRank.
SIGMOD Rec., 2009

Recommendations with prerequisites.
Proceedings of the 2009 ACM Conference on Recommender Systems, 2009

Blogs as Predictors of Movie Success.
Proceedings of the Third International Conference on Weblogs and Social Media, 2009

2008
Robust construction of the three-dimensional flow complex.
Proceedings of the 24th ACM Symposium on Computational Geometry, 2008

