Arash Bakhtiari
According to our database1,
Arash Bakhtiari
authored at least 9 papers
between 2016 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
On csauthors.net:
Bibliography
2024
CoRR, 2024
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design.
CoRR, 2024
DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference.
CoRR, 2024
Quant-LLM: Accelerating the Serving of Large Language Models via FP6-Centric Algorithm-System Co-Design on Modern GPUs.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024
2023
ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks.
CoRR, 2023
2020
Larq Compute Engine: Design, Benchmark, and Deploy State-of-the-Art Binarized Neural Networks.
CoRR, 2020
2017
High Order Adaptive Semi-Lagrangian/Volume-Integral Methods for Parallel Advection-Diffusion Simulations (Adaptive Semi-Lagrangian/Volumenintegral-Methode Hoher Ordnung für Parallele Advektions-Diffusions Simulationen)
PhD thesis, 2017
A Holistic Scalable Implementation Approach of the Lattice Boltzmann Method for CPU/GPU Heterogeneous Clusters.
Comput., 2017
2016
A parallel arbitrary-order accurate AMR algorithm for the scalar advection-diffusion equation.
Proceedings of the International Conference for High Performance Computing, 2016