Farzad Khorasani

According to our database1, Farzad Khorasani authored at least 14 papers between 2014 and 2019.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2019
High Performance Multilevel Graph Partitioning on GPU.
Proceedings of the 17th International Conference on High Performance Computing & Simulation, 2019

CORF: Coalescing Operand Register File for GPUs.
Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, 2019

2018
In-Register Parameter Caching for Dynamic Neural Nets with Virtual Persistent Processor Specialization.
Proceedings of the 51st Annual IEEE/ACM International Symposium on Microarchitecture, 2018

RegMutex: Inter-Warp GPU Register Time-Sharing.
Proceedings of the 45th ACM/IEEE Annual International Symposium on Computer Architecture, 2018

2017
Enabling Work-Efficiency for High Performance Vertex-Centric Graph Analytics on GPUs.
Proceedings of the Seventh Workshop on Irregular Applications: Architectures and Algorithms, 2017

Dyna: toward a self-optimizing declarative language for machine learning applications.
Proceedings of the 1st ACM SIGPLAN International Workshop on Machine Learning and Programming Languages, 2017

2016
High Performance Vertex-Centric Graph Analytics on GPUs.
PhD thesis, 2016

Eliminating Intra-Warp Load Imbalance in Irregular Nested Patterns via Collaborative Task Engagement.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

CuMAS: Data Transfer Aware Multi-Application Scheduling for Shared GPUs.
Proceedings of the 2016 International Conference on Supercomputing, 2016

2015
Efficient warp execution in presence of divergence with collaborative context collection.
Proceedings of the 48th International Symposium on Microarchitecture, 2015

Scalable SIMD-Efficient Graph Processing on GPUs.
Proceedings of the 2015 International Conference on Parallel Architectures and Compilation, 2015

Stadium Hashing: Scalable and Flexible Hashing on GPUs.
Proceedings of the 2015 International Conference on Parallel Architectures and Compilation, 2015

2014
LightPlay: Efficient Replay with GPUs.
Proceedings of the Languages and Compilers for Parallel Computing, 2014

CuSha: vertex-centric graph processing on GPUs.
Proceedings of the 23rd International Symposium on High-Performance Parallel and Distributed Computing, 2014


  Loading...