Venmugil Elango

Orcid: 0000-0002-7031-9020

According to our database, Venmugil Elango authored at least 12 papers between 2013 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
Microscaling Data Formats for Deep Learning.
CoRR, 2023

Shared Microexponents: A Little Shifting Goes a Long Way.
CoRR, 2023


2021
Pase: Parallelization Strategies for Efficient DNN Training.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

2018
Diesel: DSL for linear algebra and neural net computations on GPUs.
Proceedings of the 2nd ACM SIGPLAN International Workshop on Machine Learning and Programming Languages, 2018

2015
Distributed memory code generation for mixed Irregular/Regular computations.
Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015

On Characterizing the Data Access Complexity of Programs.
Proceedings of the 42nd Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, 2015

2014
On Using the Roofline Model with Lower Bounds on Data Movement.
ACM Trans. Archit. Code Optim., 2014

Spatial adaptive sampling in multiscale simulation.
Comput. Phys. Commun., 2014

On characterizing the data movement complexity of computational DAGs for parallel execution.
Proceedings of the 26th ACM Symposium on Parallelism in Algorithms and Architectures, 2014

2013
Beyond reuse distance analysis: Dynamic analysis for characterization of data locality potential.
ACM Trans. Archit. Code Optim., 2013

Accelerating Strassen-Winograd's matrix multiplication algorithm on GPUs.
Proceedings of the 20th Annual International Conference on High Performance Computing, 2013

