Saleh Ashkboos

Orcid: 0000-0001-6115-6779

According to our database, Saleh Ashkboos authored at least 27 papers between 2017 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2025

HALO: Hadamard-Assisted Lower-Precision Optimization for LLMs.

CoRR, January, 2025

2024

EfQAT: An Efficient Framework for Quantization-Aware Training.

Saleh Ashkboos

Bram Verhoef

Torsten Hoefler

Evangelos Eleftheriou

Martino Dazzi

CoRR, 2024

Computational Bottlenecks of Training Small-scale Large Language Models.

Saleh Ashkboos

Iman Mirzadeh

Keivan Alizadeh

Mohammad Hossein Sekhavat

Moin Nabi

Mehrdad Farajtabar

Fartash Faghri

CoRR, 2024

QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs.

Saleh Ashkboos

Amirkeivan Mohtashami

CoRR, 2024

Arrow Matrix Decomposition: A Novel Approach for Communication-Efficient Sparse Matrix Multiplication.

Lukas Gianinazzi

Alexandros Nikolaos Ziogas

Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2024

QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs.

Saleh Ashkboos

Amirkeivan Mohtashami

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

SliceGPT: Compress Large Language Models by Deleting Rows and Columns.

Saleh Ashkboos

Maximilian L. Croci

Marcelo Gennari Do Nascimento

Torsten Hoefler

James Hensman

Proceedings of the Twelfth International Conference on Learning Representations, 2024

QUIK: Towards End-to-end 4-Bit Inference on Generative Large Language Models.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023

Arrow Matrix Decompositions.

Lukas Gianinazzi

Alexandros Nikolaos Ziogas

Dataset, April, 2023

Towards End-to-end 4-Bit Inference on Generative Large Language Models.

CoRR, 2023

STen: Productive and Efficient Sparsity in PyTorch.

CoRR, 2023

OPTQ: Accurate Quantization for Generative Pre-trained Transformers.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers.

CoRR, 2022

ENS-10: A Dataset For Post-Processing Ensemble Weather Forecast.

CoRR, 2022

The spatial computer: A model for energy-efficient parallel computation.

CoRR, 2022

ProbGraph: High-Performance and High-Accuracy Graph Mining with Probabilistic Set Representations.

Raghavendra Kanakagiri

Proceedings of the SC22: International Conference for High Performance Computing, 2022

ENS-10: A Dataset For Post-Processing Ensemble Weather Forecasts.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Motif Prediction with Graph Neural Networks.

Raghavendra Kanakagiri

Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

2021

Multi-way sparsest cut problem on trees with a control on the number of parts and outliers.

Ramin Javadi

Saleh Ashkboos

Discret. Appl. Math., 2021

Flare: flexible in-network allreduce.

Daniele De Sensi

Salvatore Di Girolamo

Saleh Ashkboos

Shigang Li

Torsten Hoefler

Proceedings of the International Conference for High Performance Computing, 2021

New Bounds For Distributed Mean Estimation and Variance Reduction.

Peter Davies

Vijaykrishna Gurunanthan

Niusha Moshrefi

Saleh Ashkboos

Dan Alistarh

Proceedings of the 9th International Conference on Learning Representations, 2021

Degree-based Feature Is All You Need: Science4Cast Report.

Milad Aghajohari

Mohammad Sadegh Akhondzadeh

Saleh Ashkboos

Kamran Chitsaz

Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

2020

Distributed Mean Estimation with Optimal Error Bounds.

Dan Alistarh

Saleh Ashkboos

Peter Davies

CoRR, 2020

2019

SparCML: high-performance sparse communication for machine learning.

Proceedings of the International Conference for High Performance Computing, 2019

2017

An Efficient Parallel Data Clustering Algorithm Using Isoperimetric Number of Trees.

Ramin Javadi

Saleh Ashkboos

CoRR, 2017

Minimum Cuts of Distance-Regular Digraphs.

Electron. J. Comb., 2017
