2020
SparkFuzz: searching correctness regressions in modern query engines.
Proceedings of the 8th International Workshop on Testing Database Systems, 2020

2016
Apache Spark: a unified engine for big data processing.
Commun. ACM, 2016

2015
Scaling Spark in the Real World: Performance and Usability.
Proc. VLDB Endow., 2015

2013
Shark: SQL and rich analytics at scale.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

The Case for Tiny Tasks in Compute Clusters.
Proceedings of the 14th Workshop on Hot Topics in Operating Systems, 2013