Farzad Khorasani

According to our database¹, Farzad Khorasani authored at least 14 papers between 2014 and 2019.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2019

High Performance Multilevel Graph Partitioning on GPU.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on High Performance Computing & Simulation, 2019

CORF: Coalescing Operand Register File for GPUs.

[BibT_eX]

[DOI]

Hodjat Asghari Esfeden

Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, 2019

2018

In-Register Parameter Caching for Dynamic Neural Nets with Virtual Persistent Processor Specialization.

[BibT_eX]

[DOI]

Farzad Khorasani

Hodjat Asghari Esfeden

Nael B. Abu-Ghazaleh

Vivek Sarkar

Proceedings of the 51st Annual IEEE/ACM International Symposium on Microarchitecture, 2018

RegMutex: Inter-Warp GPU Register Time-Sharing.

[BibT_eX]

[DOI]

Farzad Khorasani

Hodjat Asghari Esfeden

Amin Farmahini Farahani

Nuwan Jayasena

Vivek Sarkar

Proceedings of the 45th ACM/IEEE Annual International Symposium on Computer Architecture, 2018

2017

Enabling Work-Efficiency for High Performance Vertex-Centric Graph Analytics on GPUs.

[BibT_eX]

[DOI]

Proceedings of the Seventh Workshop on Irregular Applications: Architectures and Algorithms, 2017

Dyna: toward a self-optimizing declarative language for machine learning applications.

[BibT_eX]

[DOI]

Tim Vieira

Matthew Francis-Landau

Nathaniel Wesley Filardo

Farzad Khorasani

Jason Eisner

Proceedings of the 1st ACM SIGPLAN International Workshop on Machine Learning and Programming Languages, 2017

2016

High Performance Vertex-Centric Graph Analytics on GPUs.

[BibT_eX]

[DOI]

Farzad Khorasani

PhD thesis, 2016

Eliminating Intra-Warp Load Imbalance in Irregular Nested Patterns via Collaborative Task Engagement.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

CuMAS: Data Transfer Aware Multi-Application Scheduling for Shared GPUs.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Conference on Supercomputing, 2016

2015

Efficient warp execution in presence of divergence with collaborative context collection.

[BibT_eX]

[DOI]

Farzad Khorasani

Rajiv Gupta

Laxmi N. Bhuyan

Proceedings of the 48th International Symposium on Microarchitecture, 2015

Scalable SIMD-Efficient Graph Processing on GPUs.

[BibT_eX]

[DOI]

Farzad Khorasani

Rajiv Gupta

Laxmi N. Bhuyan

Proceedings of the 2015 International Conference on Parallel Architectures and Compilation, 2015

Stadium Hashing: Scalable and Flexible Hashing on GPUs.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Conference on Parallel Architectures and Compilation, 2015

2014

LightPlay: Efficient Replay with GPUs.

[BibT_eX]

[DOI]

Proceedings of the Languages and Compilers for Parallel Computing, 2014

CuSha: vertex-centric graph processing on GPUs.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Symposium on High-Performance Parallel and Distributed Computing, 2014

Farzad Khorasani

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...