Saurav Muralidharan

Orcid: 0000-0003-4024-3958

According to our database1, Saurav Muralidharan authored at least 20 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation.
CoRR, 2024

MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models.
CoRR, 2024

LLM Pruning and Distillation in Practice: The Minitron Approach.
CoRR, 2024

Compact Language Models via Pruning and Knowledge Distillation.
CoRR, 2024

Flextron: Many-in-One Flexible Large Language Model.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
The Sparsity Roofline: Understanding the Hardware Limits of Sparse Neural Networks.
CoRR, 2023

Understanding the Effect of the Long Tail on Neural Network Compression.
CoRR, 2023

Uniform Sparsity in Deep Neural Networks.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

HighLight: Efficient and Flexible DNN Acceleration with Hierarchical Structured Sparsity.
Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture, 2023

2022
Efficient Sparsely Activated Transformers.
CoRR, 2022

2020
A Programmable Approach to Neural Network Compression.
IEEE Micro, 2020

Reliable Model Compression via Label-Preservation-Aware Loss Functions.
CoRR, 2020

2019
A Programmable Approach to Model Compression.
CoRR, 2019

2016
Abstractions and Strategies for Adaptive Programming.
PhD thesis, 2016

Designing a Tunable Nested Data-Parallel Programming System.
ACM Trans. Archit. Code Optim., 2016

Architecture-Adaptive Code Variant Tuning.
Proceedings of the Twenty-First International Conference on Architectural Support for Programming Languages and Operating Systems, 2016

2015
A collection-oriented programming model for performance portability.
Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015

2014
Nitro: A Framework for Adaptive Code Variant Tuning.
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

2013
Towards making autotuning mainstream.
Int. J. High Perform. Comput. Appl., 2013

2009
Galaxia: A Semi-decentralized System for Implementing Secure-Group P2P Networks.
Proceedings of the First International Conference on Networks and Communications, 2009


  Loading...