Nawras Alnaasan

ORCID: 0000-0002-3638-4144

According to our database, Nawras Alnaasan authored at least 12 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
HINT: Designing Cache-Efficient MPI_Alltoall using Hybrid Memory Copy Ordering and Non-Temporal Instructions.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

Characterizing Communication in Distributed Parameter-Efficient Fine-Tuning for Large Language Models.
Proceedings of the IEEE Symposium on High-Performance Interconnects, 2024

Accelerating Large Language Model Training with Hybrid GPU-based Compression.
Proceedings of the 24th IEEE International Symposium on Cluster, 2024

2023
Performance Characterization of using Quantization for DNN Inference on Edge Devices: Extended Version.
CoRR, 2023

Performance Characterization of Using Quantization for DNN Inference on Edge Devices.
Proceedings of the 7th IEEE International Conference on Fog and Edge Computing, 2023

Designing In-network Computing Aware Reduction Collectives in MPI.
Proceedings of the IEEE Symposium on High-Performance Interconnects, 2023

Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference.
Proceedings of the 30th IEEE International Conference on High Performance Computing, 2023

HARVEST: High-Performance Artificial Vision Framework for Expert Labeling using Semi-Supervised Training.
Proceedings of the IEEE International Conference on Big Data, 2023

2022
Optimizing Distributed DNN Training Using CPUs and BlueField-2 DPUs.
IEEE Micro, 2022

OMB-Py: Python Micro-Benchmarks for Evaluating Performance of MPI Libraries on HPC Systems.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022

AccDP: Accelerated Data-Parallel Distributed DNN Training for Modern GPU-Based HPC Clusters.
Proceedings of the 29th IEEE International Conference on High Performance Computing, 2022

2021
Accelerating CPU-based Distributed DNN Training on Modern HPC Clusters using BlueField-2 DPUs.
Proceedings of the IEEE Symposium on High-Performance Interconnects, 2021

