Reynold Xin

According to our database, Reynold Xin authored at least 36 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.



Adaptive and Robust Query Execution for Lakehouses At Scale.
Proc. VLDB Endow., August, 2024



It Takes a Community: The Open-source Challenge.
ACM Queue, 2021

Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics.
Proceedings of the 11th Conference on Innovative Data Systems Research, 2021

Delta Lake: High-Performance ACID Table Storage over Cloud Object Stores.
Proc. VLDB Endow., 2020

SparkFuzz: searching correctness regressions in modern query engines.
Proceedings of the 8th International Workshop on Testing Database Systems, 2020

Go with the Flow: Graphs, Streaming and Relational Computations over Distributed Dataflow.
PhD thesis, 2018

Structured Streaming: A Declarative API for Real-Time Applications in Apache Spark.
Proceedings of the 2018 International Conference on Management of Data, 2018

Spark and Scala (keynote).
Proceedings of the 8th ACM SIGPLAN International Symposium on Scala, 2017

MLlib: Machine Learning in Apache Spark.
J. Mach. Learn. Res., 2016

Apache Spark: a unified engine for big data processing.
Commun. ACM, 2016

SparkR: Scaling R Programs with Spark.
Proceedings of the 2016 International Conference on Management of Data, 2016

Introduction to Spark 2.0 for Database Researchers.
Proceedings of the 2016 International Conference on Management of Data, 2016

GraphFrames: an integrated API for mixing graph and relational queries.
Proceedings of the Fourth International Workshop on Graph Data Management Experiences and Systems, Redwood Shores, CA, USA, June 24, 2016

Scaling Spark in the Real World: Performance and Usability.
Proc. VLDB Endow., 2015

Spark SQL: Relational Data Processing in Spark.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

A Partitioning Framework for Aggressive Data Skipping.
Proc. VLDB Endow., 2014

GraphX: Unifying Data-Parallel and Graph-Parallel Analytics.
CoRR, 2014

Fine-grained partitioning for aggressive data skipping.
Proceedings of the International Conference on Management of Data, 2014

GraphX: Graph Processing in a Distributed Dataflow Framework.
Proceedings of the 11th USENIX Symposium on Operating Systems Design and Implementation, 2014

Publishing bibliographic data on the Semantic Web using BibBase.
Semantic Web, 2013

Shark: SQL and rich analytics at scale.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

GraphX: a resilient distributed graph system on Spark.
Proceedings of the First International Workshop on Graph Data Management Experiences and Systems, 2013

The Case for Tiny Tasks in Compute Clusters.
Proceedings of the 14th Workshop on Hot Topics in Operating Systems, 2013

Beating State-of-the-art By -10000%.
Proceedings of the Sixth Biennial Conference on Innovative Data Systems Research, 2013

The End of an Architectural Era for Analytical Databases.
Adv. Math. Commun., 2012

Improving Data Management Applications Using Microtask Platforms.
Adv. Math. Commun., 2012

Finding related tables.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Shark: fast data analysis using coarse-grained distributed memory.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

CrowdDB: Query Processing with the VLDB Crowd.
Proc. VLDB Endow., 2011

CrowdDB: answering queries with crowdsourcing.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

MEET DB2: Automated Database Migration Evaluation.
Proc. VLDB Endow., 2010

Publishing Bibliographic Data on the Semantic Web using BibBase.
Proceedings of the ISWC 2010 Posters & Demonstrations Track: Collected Abstracts, 2010

BibBase triplified.
Proceedings of the 6th International Conference on Semantic Systems, 2010

Linkage Query Writer.
Proc. VLDB Endow., 2009
