Mayank Daga

According to our database, Mayank Daga authored at least 15 papers between 2011 and 2019.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2019
MIOpen: An Open Source Library For Deep Learning Primitives.
CoRR, 2019

2016
On the Acceleration of Graph500: Characterizing PCIe Overheads with Multi-GPUs.
Proceedings of the High Performance Computing for Computational Science - VECPAR 2016, 2016

Implementing directed acyclic graphs with the heterogeneous system architecture.
Proceedings of the 9th Annual Workshop on General Purpose Processing using Graphics Processing Unit, 2016

clSPARSE: A Vendor-Optimized Open-Source Sparse BLAS Library.
Proceedings of the 4th International Workshop on OpenCL, 2016

Multiscale Approximation with Graphical Processing Units for Multiplicative Speedup in Molecular Dynamics.
Proceedings of the 7th ACM International Conference on Bioinformatics, 2016

2015
On the Performance, Energy, and Power of Data-Access Methods in Heterogeneous Computing Systems.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

Exploring Parallel Programming Models for Heterogeneous Computing Systems.
Proceedings of the 2015 IEEE International Symposium on Workload Characterization, 2015

Structural Agnostic SpMV: Adapting CSR-Adaptive for Irregular Matrices.
Proceedings of the 22nd IEEE International Conference on High Performance Computing, 2015

2014
Efficient Sparse Matrix-Vector Multiplication on GPUs Using the CSR Storage Format.
Proceedings of the International Conference for High Performance Computing, 2014

Efficient breadth-first search on a heterogeneous processor.
Proceedings of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), 2014

2012
Multi-dimensional characterization of electrostatic surface potential computation on graphics processors.
BMC Bioinform., 2012

Exploiting Coarse-Grained Parallelism in B+ Tree Searches on an APU.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

2011
Architecture-Aware Mapping and Optimization on a 1600-Core GPU.
Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, 2011

Towards accelerating molecular modeling via multi-scale approximation on a GPU.
Proceedings of the IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences, 2011

Bounding the effect of partition camping in GPU kernels.
Proceedings of the 8th Conference on Computing Frontiers, 2011

