Raghu Prabhakar
Orcid: 0000-0003-0230-4377
According to our database, Raghu Prabhakar authored at least 26 papers between 2012 and 2024.
Bibliography
2024
Composition of Experts: A Modular Compound AI System Leveraging Large Language Models.
CoRR, 2024
Kernel Looping: Eliminating Synchronization Boundaries for Peak Inference Performance.
CoRR, 2024
SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts.
CoRR, 2024
SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts.
Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2024
SambaNova SN40L RDU: Breaking the Barrier of Trillion+ Parameter Scale Gen AI Computing.
Proceedings of the 36th IEEE Hot Chips Symposium, 2024
2023
Proceedings of the IEEE Custom Integrated Circuits Conference, 2023
2022
Proceedings of the IEEE International Solid-State Circuits Conference, 2022
(CGRA4HPC) 2022 Invited Speaker: Pushing the Boundaries of HPC with the Integration of AI.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022
2021
Proceedings of the MICRO '21: 54th Annual IEEE/ACM International Symposium on Microarchitecture, 2021
Proceedings of the IEEE Hot Chips 33 Symposium, 2021
2019
Proceedings of the Performance Evaluation and Benchmarking for the Era of Cloud(s), 2019
Proceedings of the 46th International Symposium on Computer Architecture, 2019
2018
PhD thesis, 2018
Proceedings of the 39th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2018
2017
Proceedings of the 44th Annual International Symposium on Computer Architecture, 2017
2016
Proceedings of the 43rd ACM/IEEE Annual International Symposium on Computer Architecture, 2016
Proceedings of the Twenty-First International Conference on Architectural Support for Programming Languages and Operating Systems, 2016
2012
Proceedings of the Languages and Compilers for Parallel Computing, 2012
Proceedings of the International Symposium on Physical Design, 2012
Proceedings of the International Symposium on Low Power Electronics and Design, 2012
CUDA-For-Clusters: A System for Efficient Execution of CUDA Kernels on Multi-core Clusters.
Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012
Proceedings of the 17th Asia and South Pacific Design Automation Conference, 2012