Min Si

Orcid: 0000-0002-0208-096X

According to our database1, Min Si authored at least 36 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Accelerating Communication in Deep Learning Recommendation Model Training with Dual-Level Adaptive Lossy Compression.
CoRR, 2024

2023
Special issue on new trends in high-performance computing: Software systems and applications.
Softw. Pract. Exp., 2023

Software-Hardware Co-design of Heterogeneous SmartNIC System for Recommendation Models Inference and Training.
Proceedings of the 37th International Conference on Supercomputing, 2023

Accelerating MPI Collectives with Process-in-Process-based Multi-object Techniques.
Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing, 2023

PiP-MColl: Process-in-Process-based Multi-object MPI Collectives.
Proceedings of the IEEE International Conference on Cluster Computing, 2023

2022
Guest Editorial.
IEEE Trans. Parallel Distributed Syst., 2022

Special Issue on Hot Interconnects.
IEEE Micro, 2022

Special Issue on Programming Models and Applications for Multicores and Manycores 2020.
Concurr. Comput. Pract. Exp., 2022

Special issue on programming models and applications for multicores and manycores 2019-2020.
Concurr. Comput. Pract. Exp., 2022

2021
Guest Editorial.
IEEE Trans. Parallel Distributed Syst., 2021

Dynamic scaling for low-precision learning.
Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

OpenSHMEM over MPI as a Performance Contender: Thorough Analysis and Optimizations.
Proceedings of the OpenSHMEM and Related Technologies. OpenSHMEM in the Era of Exascale and Smart Networks, 2021

A FACT-based Approach: Making Machine Learning Collective Autotuning Feasible on Exascale Systems.
Proceedings of the Workshop on Exascale MPI, 2021

Daps: A Dynamic Asynchronous Progress Stealing Model for MPI Communication.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

2020
Guest editorial: Special Issue on Applications and System Software for Hybrid Exascale Systems.
Parallel Comput., 2020

CAB-MPI: exploring interprocess work-stealing towards balanced MPI communication.
Proceedings of the International Conference for High Performance Computing, 2020

Workshop 8: AsHES Accelerators and Hybrid Exascale Systems.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

2019
Scalable Deep Learning via I/O Analysis and Optimization.
ACM Trans. Parallel Comput., 2019

Parallel programming models and systems software for high-end computing (P2S2 2018).
Parallel Comput., 2019

International workshop on programming models and applications for multicores and manycores (PMAM 2018).
Parallel Comput., 2019

Software combining to mitigate multithreaded MPI contention.
Proceedings of the ACM International Conference on Supercomputing, 2019

2018
Dynamic Adaptable Asynchronous Progress Model for MPI RMA Multiphase Applications.
IEEE Trans. Parallel Distributed Syst., 2018

Introduction to AsHES 2018.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

Process-in-process: techniques for practical address-space sharing.
Proceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing, 2018

2017

Parallel I/O Optimizations for Scalable Deep Learning.
Proceedings of the 23rd IEEE International Conference on Parallel and Distributed Systems, 2017

Process-Based Asynchronous Progress Model for MPI Point-to-Point Communication.
Proceedings of the 19th IEEE International Conference on High Performance Computing and Communications; 15th IEEE International Conference on Smart City; 3rd IEEE International Conference on Data Science and Systems, 2017

Towards Scalable Deep Learning via I/O Analysis and Optimization.
Proceedings of the 19th IEEE International Conference on High Performance Computing and Communications; 15th IEEE International Conference on Smart City; 3rd IEEE International Conference on Data Science and Systems, 2017

2015
Casper: An Asynchronous Progress Model for MPI RMA on Many-Core Architectures.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

Scaling NWChem with Efficient and Portable Asynchronous Communication in MPI RMA.
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015

Techniques for Enabling Highly Efficient Message Passing on Many-Core Architectures.
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015

2014
MT-MPI: multithreaded MPI for many-core environments.
Proceedings of the 2014 International Conference on Supercomputing, 2014

2013
Direct MPI Library for Intel Xeon Phi Co-Processors.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

2012
Poster: An MPI Library implementing Direct Communication for Many-Core Based Accelerators.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Abstract: An MPI Library implementing Direct Communication for Many-Core Based Accelerators.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Design of Direct Communication Facility for Many-Core Based Accelerators.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012


  Loading...