Benjamin Klenk

Orcid: 0000-0001-7657-3049

According to our database1, Benjamin Klenk authored at least 13 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Towards a Standardized Representation for Deep Learning Collective Algorithms.
Proceedings of the IEEE Symposium on High-Performance Interconnects, 2024

2022
A Case For Intra-rack Resource Disaggregation in HPC.
ACM Trans. Archit. Code Optim., 2022

2021
SiP-ML: high-bandwidth optical network interconnects for machine learning training.
Proceedings of the ACM SIGCOMM 2021 Conference, Virtual Event, USA, August 23-27, 2021., 2021

2020
Why Data Science and Machine Learning Need Silicon Photonics.
Proceedings of the Optical Fiber Communications Conference and Exhibition, 2020

An In-Network Architecture for Accelerating Shared-Memory Multiprocessor Collectives.
Proceedings of the 47th ACM/IEEE Annual International Symposium on Computer Architecture, 2020

2018
Communication architectures for scalable GPU-centric computing systems.
PhD thesis, 2018

2017
An Overview of MPI Characteristics of Exascale Proxy Applications.
Proceedings of the High Performance Computing - 32nd International Conference, 2017

Relaxations for High-Performance Message Passing on Massively Parallel SIMT Processors.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

2016
Analyzing GPU-controlled communication with dynamic parallelism in terms of performance and energy.
Parallel Comput., 2016

2015
Analyzing communication models for distributed thread-collaborative processors in terms of energy and time.
Proceedings of the 2015 IEEE International Symposium on Performance Analysis of Systems and Software, 2015

2014
Energy-efficient stencil computations on distributed GPUs using dynamic parallelism and GPU-controlled communication.
Proceedings of the 2nd International Workshop on Energy Efficient Supercomputing, 2014

Analyzing Put/Get APIs for Thread-Collaborative Processors.
Proceedings of the 43rd International Conference on Parallel Processing Workshops, 2014

Energy-Efficient Collective Reduce and Allreduce Operations on Distributed GPUs.
Proceedings of the 14th IEEE/ACM International Symposium on Cluster, 2014


  Loading...