José Monsalve Diaz

Orcid: 0000-0001-6875-1685

Affiliations:
  • University of Delaware, USA


According to our database1, José Monsalve Diaz authored at least 28 papers between 2012 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
ComPile: A Large IR Dataset from Production Sources.
CoRR, 2023

On Memory Codelets: Prefetching, Recoding, Moving and Streaming Data.
CoRR, 2023

Implementation of Dataflow Software Pipelining for Codelet Model.
Proceedings of the 2023 ACM/SPEC International Conference on Performance Engineering, 2023

DEMAC: A Platform for Education in High-performance Computing, Bridging the Gap Between Users and Hardware.
Proceedings of the Workshop on Computer Architecture Education, 2023

Memory Transfer Decomposition: Exploring Smart Data Movement Through Architecture-Aware Strategies.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

A gem5 Implementation of the Sequential Codelet Model: Reducing Overhead and Expanding the Software Memory Interface.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Towards Fault Tolerance and Resilience in the Sequential Codelet Model.
Proceedings of the High Performance Computing - 10th Latin American Conference, 2023

2022
Chiplets and the Codelet Model.
CoRR, 2022

The SuperCodelet architecture.
Proceedings of the ExHET@PPoPP 2022: Proceedings of the 1st International Workshop on Extreme Heterogeneity Solutions, 2022

Automatic Asynchronous Execution of Synchronously Offloaded OpenMP Target Regions.
Proceedings of the Eighth IEEE/ACM Workshop on the LLVM Compiler Infrastructure in HPC, 2022

Co-Designing an OpenMP GPU Runtime and Optimizations for Near-Zero Overhead Execution.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

Efficient Execution of OpenMP on GPUs.
Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2022

2021
swFLOW: A large-scale distributed framework for deep learning on Sunway TaihuLight supercomputer.
Inf. Sci., 2021

2020
DEMAC: A Modular Platform for HW-SW Co-Design.
Proceedings of the Fourth IEEE/ACM Annual Workshop on Emerging Parallel and Distributed Runtime Systems and Middleware, 2020

CODIR: Towards an MLIR Codelet Model Dialect.
Proceedings of the Fourth IEEE/ACM Annual Workshop on Emerging Parallel and Distributed Runtime Systems and Middleware, 2020

2019
Analysis of OpenMP 4.5 Offloading in Implementations: Correctness and Overhead.
Parallel Comput., 2019

The TRegion Interface and Compiler Optimizations for OpenMP Target Regions.
Proceedings of the OpenMP: Conquering the Full Hardware Spectrum, 2019

swFLOW: A Dataflow Deep Learning Framework on Sunway TaihuLight Supercomputer.
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019

Toward A High-Performance Emulation Platformfor Brain-Inspired Intelligent SystemsExploring Dataflow-Based Execution Model and Beyond.
Proceedings of the 43rd IEEE Annual Computer Software and Applications Conference, 2019

2018
OpenMP 4.5 Validation and Verification Suite for Device Offload.
Proceedings of the Evolving OpenMP for Evolving Architectures, 2018

Evaluating Support for OpenMP Offload Features.
Proceedings of the 47th International Conference on Parallel Processing, 2018

2016
Resource Management for Running HPC Applications in Container Clouds.
Proceedings of the High Performance Computing - 31st International Conference, 2016

Energy Avoiding Matrix Multiply.
Proceedings of the Languages and Compilers for Parallel Computing, 2016

The Importance of Efficient Fine-Grain Synchronization for Many-Core Systems.
Proceedings of the Languages and Compilers for Parallel Computing, 2016

2015
Improving MPSoC reliability through adapting runtime task schedule based on time-correlated fault behavior.
Proceedings of the 2015 Design, Automation & Test in Europe Conference & Exhibition, 2015

Dynamic CPU Resource Allocation in Containerized Cloud Environments.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

2014
Integration Scheme for Modular Snake Robot Software Components.
Proceedings of the Modelling and Simulation for Autonomous Systems, 2014

2012
Simulation and control integrated framework for modular snake robots locomotion research.
Proceedings of the IEEE/SICE International Symposium on System Integration, 2012


  Loading...