Shivaram Venkataraman

PhD thesis, 2017

Hemingway: Modeling Distributed Optimization Algorithms.

Xinghao Pan

Zizheng Tai

Joseph Gonzalez

CoRR, 2017

Occupy the Cloud: Distributed Computing for the 99%.

Eric Jonas

CoRR, 2017

Drizzle: Fast and Adaptable Stream Processing at Scale.

Proceedings of the 26th Symposium on Operating Systems Principles, 2017

CherryPick: Adaptively Unearthing the Best Cloud Configurations for Big Data Analytics.

Omid Alipourfard

Hongqiang Harry Liu

Jianshu Chen

Minlan Yu

Ming Zhang

Proceedings of the 14th USENIX Symposium on Networked Systems Design and Implementation, 2017

Breaking Locality Accelerates Block Gauss-Seidel.

Stephen Tu

Proceedings of the 34th International Conference on Machine Learning, 2017

KeystoneML: Optimizing Pipelines for Large-Scale Advanced Analytics.

Evan Randall Sparks

Tomer Kaftan

Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

Occupy the cloud: distributed computing for the 99%.

Eric Jonas

Qifan Pu

Proceedings of the 2017 Symposium on Cloud Computing, SoCC 2017, Santa Clara, CA, USA, 2017

2016

MLlib: Machine Learning in Apache Spark.

J. Mach. Learn. Res., 2016

Large Scale Kernel Learning using Block Coordinate Descent.

Stephen Tu

Rebecca Roelofs

CoRR, 2016

Apache Spark: a unified engine for big data processing.

Commun. ACM, 2016

SparkR: Scaling R Programs with Spark.

Proceedings of the 2016 International Conference on Management of Data, 2016

Ernest: Efficient Performance Prediction for Large-Scale Advanced Analytics.

Proceedings of the 13th USENIX Symposium on Networked Systems Design and Implementation, 2016

Matrix Computations and Optimization in Apache Spark.

Evan Randall Sparks

Aaron Staple

Matei Zaharia

Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

2015

linalg: Matrix Computations in Apache Spark.

Evan Randall Sparks

Alexander Ulanov

Matei Zaharia

CoRR, 2015

2014

Quantifying eventual consistency with PBS.

Peter Bailis

Joseph M. Hellerstein

Commun. ACM, 2014

Record Placement Based on Data Skew Using Solid State Drives.

Jun Suzuki

Sameer Agarwal

Proceedings of the Big Data Benchmarks, Performance Optimization, and Emerging Hardware, 2014

The Power of Choice in Data-Aware Cluster Scheduling.

Aurojit Panda

Ganesh Ananthanarayanan

Proceedings of the 11th USENIX Symposium on Operating Systems Design and Implementation, 2014

2013

PBS at work: advancing data management with consistency metrics.

Peter Bailis

Joseph M. Hellerstein

Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

The Case for Tiny Tasks in Compute Clusters.

Kay Ousterhout

Aurojit Panda

Josh Rosen

Proceedings of the 14th Workshop on Hot Topics in Operating Systems, 2013

Presto: distributed machine learning and graph processing with sparse matrices.

Proceedings of the Eighth Eurosys Conference 2013, 2013

2012

Probabilistically Bounded Staleness for Practical Partial Quorums.

Peter Bailis

Joseph M. Hellerstein

Proc. VLDB Endow., 2012

Sweet Storage SLOs with Frosting.

Andrew Wang

Sara Alspaugh

Randy H. Katz

Proceedings of the 4th USENIX Workshop on Hot Topics in Cloud Computing, 2012

Using R for Iterative and Incremental Processing.

Indrajit Roy

Alvin AuYoung

Robert S. Schreiber

Proceedings of the 4th USENIX Workshop on Hot Topics in Cloud Computing, 2012

Cake: enabling high-level SLOs on shared storage systems.

Andrew Wang

Sara Alspaugh

Randy H. Katz

Proceedings of the ACM Symposium on Cloud Computing, SOCC '12, 2012

2011

Characterizing Data Structures for Volatile Forensics.

Ellick Chan

Proceedings of the 2011 IEEE Sixth International Workshop on Systematic Approaches to Digital Forensic Engineering, 2011

Consistent and Durable Data Structures for Non-Volatile Byte-Addressable Memory.

Parthasarathy Ranganathan

Niraj Tolia

Roy H. Campbell

Proceedings of the 9th USENIX Conference on File and Storage Technologies, 2011

2010

Scaling eCGA model building via data-intensive computing.

Abhishek Verma

Xavier Llorà

David E. Goldberg

Roy H. Campbell

Proceedings of the IEEE Congress on Evolutionary Computation, 2010

Forenscope: a framework for live forensics.

Ellick Chan