We stand with Ukraine

Min Si

Orcid: 0000-0002-0208-096X

According to our database, Min Si authored at least 36 papers between 2012 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2024

Accelerating Communication in Deep Learning Recommendation Model Training with Dual-Level Adaptive Lossy Compression.

Ching-Hsiang Chu

Proceedings of the International Conference for High Performance Computing, 2024

2023

Special issue on new trends in high-performance computing: Software systems and applications.

Sunita Chandrasekaran

Softw. Pract. Exp., 2023

Software-Hardware Co-design of Heterogeneous SmartNIC System for Recommendation Models Inference and Training.

Martin C. Herbordt

Proceedings of the 37th International Conference on Supercomputing, 2023

Accelerating MPI Collectives with Process-in-Process-based Multi-object Techniques.

Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing, 2023

PiP-MColl: Process-in-Process-based Multi-object MPI Collectives.

Proceedings of the IEEE International Conference on Cluster Computing, 2023

2022

Guest Editorial.

Antonio J. Peña

IEEE Trans. Parallel Distributed Syst., 2022

Special Issue on Hot Interconnects.

IEEE Micro, 2022

Special Issue on Programming Models and Applications for Multicores and Manycores 2020.

Concurr. Comput. Pract. Exp., 2022

Special issue on programming models and applications for multicores and manycores 2019-2020.

Concurr. Comput. Pract. Exp., 2022

2021

Guest Editorial.

IEEE Trans. Parallel Distributed Syst., 2021

Dynamic scaling for low-precision learning.

Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

OpenSHMEM over MPI as a Performance Contender: Thorough Analysis and Optimizations.

Jeff R. Hammond

Proceedings of the OpenSHMEM and Related Technologies. OpenSHMEM in the Era of Exascale and Smart Networks, 2021

A FACT-based Approach: Making Machine Learning Collective Autotuning Feasible on Exascale Systems.

Michael Wilkins, Nikos Hardavellas

Proceedings of the Workshop on Exascale MPI, 2021

Daps: A Dynamic Asynchronous Progress Stealing Model for MPI Communication.

Proceedings of the IEEE International Conference on Cluster Computing, 2021

2020

Guest editorial: Special Issue on Applications and System Software for Hybrid Exascale Systems.

Antonio J. Peña

Parallel Comput., 2020

CAB-MPI: exploring interprocess work-stealing towards balanced MPI communication.

Proceedings of the International Conference for High Performance Computing, 2020

Workshop 8: AsHES Accelerators and Hybrid Exascale Systems.

Simon Garcia De Gonzalo

Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

2019

Scalable Deep Learning via I/O Analysis and Optimization.

ACM Trans. Parallel Comput., 2019

Parallel programming models and systems software for high-end computing (P2S2 2018).

Parallel Comput., 2019

International workshop on programming models and applications for multicores and manycores (PMAM 2018).

Parallel Comput., 2019

Software combining to mitigate multithreaded MPI contention.

Abdelhalim Amer, Michael Blocksome, Michael Chuvelev, Jeff R. Hammond, Shintaro Iwasaki, Kenneth J. Raffenetti, Mikhail Shiryaev, Sagar Thapaliya

Proceedings of the ACM International Conference on Supercomputing, 2019

2018

Dynamic Adaptable Asynchronous Progress Model for MPI RMA Multiphase Applications.

Antonio J. Peña, Jeff R. Hammond, Masamichi Takagi, Yutaka Ishikawa

IEEE Trans. Parallel Distributed Syst., 2018

Introduction to AsHES 2018.

Sunita Chandrasekaran, Antonio J. Peña

Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

Process-in-process: techniques for practical address-space sharing.

Masamichi Takagi, Yutaka Ishikawa

Proceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing, 2018

2017

Why is MPI so slow?: analyzing the fundamental limits in implementing MPI-3.1.

Proceedings of the International Conference for High Performance Computing, 2017

Parallel I/O Optimizations for Scalable Deep Learning.

Proceedings of the 23rd IEEE International Conference on Parallel and Distributed Systems, 2017

Process-Based Asynchronous Progress Model for MPI Point-to-Point Communication.

Proceedings of the 19th IEEE International Conference on High Performance Computing and Communications; 15th IEEE International Conference on Smart City; 3rd IEEE International Conference on Data Science and Systems, 2017

Towards Scalable Deep Learning via I/O Analysis and Optimization.

Proceedings of the 19th IEEE International Conference on High Performance Computing and Communications; 15th IEEE International Conference on Smart City; 3rd IEEE International Conference on Data Science and Systems, 2017

2015

Casper: An Asynchronous Progress Model for MPI RMA on Many-Core Architectures.

Antonio J. Peña, Jeff R. Hammond, Masamichi Takagi, Yutaka Ishikawa

Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

Scaling NWChem with Efficient and Portable Asynchronous Communication in MPI RMA.

Antonio J. Peña, Jeff R. Hammond, Yutaka Ishikawa

Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015

Techniques for Enabling Highly Efficient Message Passing on Many-Core Architectures.

Yutaka Ishikawa

Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015

2014

MT-MPI: multithreaded MPI for many-core environments.

Antonio J. Peña, Masamichi Takagi, Yutaka Ishikawa

Proceedings of the 2014 International Conference on Supercomputing, 2014

2013

Direct MPI Library for Intel Xeon Phi Co-Processors.

Yutaka Ishikawa, Masamichi Takagi

Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

2012

Poster: An MPI Library implementing Direct Communication for Many-Core Based Accelerators.

Yutaka Ishikawa

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Abstract: An MPI Library implementing Direct Communication for Many-Core Based Accelerators.

Yutaka Ishikawa

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Design of Direct Communication Facility for Many-Core Based Accelerators.

Yutaka Ishikawa

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012
