David Dao

According to our database1, David Dao authored at least 20 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
OAM-TCD: A globally diverse dataset of high-resolution tree cover maps.
CoRR, 2024

Data Debugging with Shapley Importance over Machine Learning Pipelines.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Advancing Algorithms and Applications for Data Valuation in Machine Learning.
PhD thesis, 2023

GEO-Bench: Toward Foundation Models for Earth Monitoring.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
Data Debugging with Shapley Importance over End-to-End Machine Learning Pipelines.
CoRR, 2022

ReforesTree: A Dataset for Estimating Tropical Forest Carbon Stock with Deep Learning and Aerial Imagery.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
RumbleML: program the lakehouse with JSONiq.
CoRR, 2021

Toward Foundation Models for Earth Monitoring: Proposal for a Climate Change Benchmark.
CoRR, 2021

Tackling the Overestimation of Forest Carbon with Deep Learning and Aerial Imagery.
CoRR, 2021

Challenges in KDD and ML for Sustainable Development.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Scalability vs. Utility: Do We Have To Sacrifice One for the Other in Data Importance Quantification?
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Ease.ML: A Lifecycle Management System for Machine Learning.
Proceedings of the 11th Conference on Innovative Data Systems Research, 2021

2020
TrueBranch: Metric Learning-based Verification of Forest Conservation Projects.
CoRR, 2020

2019
Efficient Task-Specific Data Valuation for Nearest Neighbor Algorithms.
Proc. VLDB Endow., 2019

Data Capsule: A New Paradigm for Automatic Compliance with Data Privacy Regulations.
Proceedings of the Heterogeneous Data Management, Polystores, and Analytics for Healthcare, 2019

Towards Efficient Data Valuation Based on the Shapley Value.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018
A Demonstration of Sterling: A Privacy-Preserving Data Marketplace.
Proc. VLDB Endow., 2018

DataBright: Towards a Global Exchange for Decentralized Data Ownership and Trusted Computation.
CoRR, 2018

Emotional prosody analysis on human voices.
Proceedings of the IEEE 8th Annual Computing and Communication Workshop and Conference, 2018

2016
CellProfiler Analyst: interactive data exploration, analysis and classification of large biological image sets.
Bioinform., 2016


  Loading...