Murali Emani
ORCID: 0000-0002-6279-0007
Affiliations:
- Argonne National Laboratory, IL, USA
According to our database, Murali Emani authored at least 54 papers between 2013 and 2024.
Links
Online presence:
- on orcid.org
- on dl.acm.org
Bibliography
2024
Proc. ACM Meas. Anal. Comput. Syst., 2024
LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators.
CoRR, 2024
Centimani: Enabling Fast AI Accelerator Selection for DNN Training with a Novel Performance Predictor.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024
MProt-DPO: Breaking the ExaFLOPS Barrier for Multimodal Protein Design Workflows with Direct Preference Optimization.
Proceedings of the International Conference for High Performance Computing, 2024
Toward a Holistic Performance Evaluation of Large Language Models Across Diverse AI Accelerators.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024
WActiGrad: Structured Pruning for Efficient Finetuning and Inference of Large Language Models on AI Accelerators.
Proceedings of the Euro-Par 2024: Parallel Processing, 2024
A Multi-Level, Multi-Scale Visual Analytics Approach to Assessment of Multifidelity HPC Systems.
Proceedings of the 24th IEEE International Symposium on Cluster, 2024
2023
J. Syst. Archit., November, 2023
Int. J. High Perform. Comput. Appl., November, 2023
DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies.
CoRR, 2023
CoRR, 2023
IEEE Access, 2023
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
Transfer Learning Across Heterogeneous Features For Efficient Tensor Program Generation.
Proceedings of the 2nd International Workshop on Extreme Heterogeneity Solutions, 2023
Proceedings of the OpenMP: Advanced Task-Based, Device and Compiler Programming, 2023
Proceedings of the Euro-Par 2023: Parallel Processing - 29th International Conference on Parallel and Distributed Computing, Limassol, Cyprus, August 28, 2023
2022
Intelligent resolution: Integrating Cryo-EM with AI-driven multi-resolution simulations to observe the severe acute respiratory syndrome coronavirus-2 replication-transcription machinery in action.
Int. J. High Perform. Comput. Appl., 2022
FAIR for AI: An interdisciplinary, international, inclusive, and diverse community building perspective.
CoRR, 2022
IEEE Access, 2022
Making Machine Learning Datasets and Models FAIR for HPC: A Methodology and Case Study.
Proceedings of the Fourth International Conference on Transdisciplinary AI, 2022
Proceedings of the High Performance Computing. ISC High Performance 2022 International Workshops - Hamburg, Germany, May 29, 2022
Proceedings of the IEEE/ACM International Workshop on Performance Modeling, 2022
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022
Proceedings of the IEEE/ACM International Workshop on HPC User Support Tools, 2022
Proceedings of the HPDC '22: The 31st International Symposium on High-Performance Parallel and Distributed Computing, Minneapolis, MN, USA, 27 June 2022, 2022
Finding Reusable Machine Learning Components to Build Programming Language Processing Pipelines.
Proceedings of the Software Architecture. ECSA 2022 Tracks and Workshops, 2022
Proceedings of the Sixth IEEE/ACM International Workshop on Software Correctness for HPC Applications, 2022
Towards neural architecture-aware exploration of compiler optimizations in a deep learning graph compiler.
Proceedings of the CF '22: 19th ACM International Conference on Computing Frontiers, Turin, Italy, May 17, 2022
Proceedings of the 22nd IEEE International Symposium on Cluster, 2022
2021
Accelerating Scientific Applications With SambaNova Reconfigurable Dataflow Architecture.
Comput. Sci. Eng., 2021
MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems.
CoRR, 2021
Stream-AI-MD: streaming AI-driven adaptive molecular simulations for heterogeneous computing platforms.
Proceedings of the PASC '21: Platform for Advanced Scientific Computing Conference, 2021
Proceedings of the IEEE/ACM Workshop on Machine Learning in High Performance Computing Environments, 2021
HPC Ontology: Towards a Unified Ontology for Managing Training Datasets and AI Models for High-Performance Computing.
Proceedings of the IEEE/ACM Workshop on Machine Learning in High Performance Computing Environments, 2021
MLPerf™ HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems.
Proceedings of the IEEE/ACM Workshop on Machine Learning in High Performance Computing Environments, 2021
2020
EReinit: Scalable and efficient fault-tolerance for bulk-synchronous MPI applications.
Concurr. Comput. Pract. Exp., 2020
2019
Proceedings of the 2019 IEEE/ACM Workshop on Memory Centric High Performance Computing, 2019
Proceedings of the 3rd IEEE/ACM Industry/University Joint International Workshop on Data-center Automation, 2019
2018
Proceedings of the Workshop on Memory Centric High Performance Computing, 2018
Proceedings of the 2018 IEEE/ACM Performance Modeling, 2018
Proceedings of the 25th European MPI Users' Group Meeting, 2018
Proceedings of the 32nd International Conference on Supercomputing, 2018
2016
Proceedings of the Languages and Compilers for Parallel Computing, 2016
Integrating Algorithmic Parameters into Benchmarking and Design Space Exploration in 3D Scene Understanding.
Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, 2016
2015
PhD thesis, 2015
Celebrating diversity: a mixture of experts approach for runtime mapping in dynamic environments.
Proceedings of the 36th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2015
2014
Change Detection Based Parallelism Mapping: Exploiting Offline Models and Online Adaptation.
Proceedings of the Languages and Compilers for Parallel Computing, 2014
2013
Proceedings of the 2013 IEEE/ACM International Symposium on Code Generation and Optimization, 2013