Saleh Ashkboos

Orcid: 0000-0001-6115-6779

According to our database1, Saleh Ashkboos authored at least 27 papers between 2017 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

2017
2018
2019
2020
2021
2022
2023
2024
2025
0
1
2
3
4
5
6
7
8
9
1
3
2
3
1
1
2
5
1
3
3
1
1

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
HALO: Hadamard-Assisted Lower-Precision Optimization for LLMs.
CoRR, January, 2025

2024
EfQAT: An Efficient Framework for Quantization-Aware Training.
CoRR, 2024

Computational Bottlenecks of Training Small-scale Large Language Models.
CoRR, 2024

QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs.
CoRR, 2024

Arrow Matrix Decomposition: A Novel Approach for Communication-Efficient Sparse Matrix Multiplication.
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2024

QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

SliceGPT: Compress Large Language Models by Deleting Rows and Columns.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

QUIK: Towards End-to-end 4-Bit Inference on Generative Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023

Towards End-to-end 4-Bit Inference on Generative Large Language Models.
CoRR, 2023

STen: Productive and Efficient Sparsity in PyTorch.
CoRR, 2023

OPTQ: Accurate Quantization for Generative Pre-trained Transformers.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers.
CoRR, 2022

ENS-10: A Dataset For Post-Processing Ensemble Weather Forecast.
CoRR, 2022

The spatial computer: A model for energy-efficient parallel computation.
CoRR, 2022

ProbGraph: High-Performance and High-Accuracy Graph Mining with Probabilistic Set Representations.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

ENS-10: A Dataset For Post-Processing Ensemble Weather Forecasts.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Motif Prediction with Graph Neural Networks.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

2021
Multi-way sparsest cut problem on trees with a control on the number of parts and outliers.
Discret. Appl. Math., 2021

Flare: flexible in-network allreduce.
Proceedings of the International Conference for High Performance Computing, 2021

New Bounds For Distributed Mean Estimation and Variance Reduction.
Proceedings of the 9th International Conference on Learning Representations, 2021

Degree-based Feature Is All You Need: Science4Cast Report.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

2020
Distributed Mean Estimation with Optimal Error Bounds.
CoRR, 2020

2019
SparCML: high-performance sparse communication for machine learning.
Proceedings of the International Conference for High Performance Computing, 2019

2017
An Efficient Parallel Data Clustering Algorithm Using Isoperimetric Number of Trees.
CoRR, 2017

Minimum Cuts of Distance-Regular Digraphs.
Electron. J. Comb., 2017


  Loading...