Jia Deng

Int. J. Model. Identif. Control., 2024

Unleashing CPU Potential for Executing GPU Programs Through Compiler/Runtime Optimizations.

[DOI]

Ruobing Han

Hyesoon Kim

Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024

Enabling Fine-Grained Incremental Builds by Making Compiler Stateful.

[DOI]

Ruobing Han

Hyesoon Kim

Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2024

2023

Concrete Type Inference for Code Optimization using Machine Learning with SMT Solving.

[DOI]

Proc. ACM Program. Lang., October, 2023

HIPLZ: Enabling performance portability for exascale systems.

[DOI]

Concurr. Comput. Pract. Exp., 2023

2022

Study on the law of the structure parameters influence on thermal deformation of magnetic poles of magnetic-liquid double suspension bearing.

[DOI]

Int. J. Comput. Appl. Technol., 2022

2021

Attention Mechanism-Based CNN-LSTM Model for Wind Turbine Fault Prediction Using SSN Ontology Annotation.

[DOI]

Wirel. Commun. Mob. Comput., 2021

Study on temperature rise and thermal deformation of rotor caused by eddy current loss of magnetic-liquid double suspension bearing.

[DOI]

Int. J. Model. Identif. Control., 2021

Identifying Behavior Dispatchers for Malware Analysis.

[DOI]

Proceedings of the ASIA CCS '21: ACM Asia Conference on Computer and Communications Security, 2021

2020

Advanced Graph-Based Deep Learning for Probabilistic Type Inference.

[DOI]

Fangke Ye

CoRR, 2020

OmpMemOpt: Optimized Memory Movement for Heterogeneous Computing.

[DOI]

Prithayan Barua

Proceedings of the Euro-Par 2020: Parallel Processing, 2020

2018

Compile-Time Library Call Detection Using CAASCADE and XALT.

[DOI]

Proceedings of the High Performance Computing, 2018

Detecting MPI usage anomalies via partial program symbolic execution.

[DOI]

Fangke Ye

Proceedings of the International Conference for High Performance Computing, 2018

Parallel sparse flow-sensitive points-to analysis.

[DOI]

Michael G. Burke

Proceedings of the 27th International Conference on Compiler Construction, 2018

2015

Finding Tizen security bugs through whole-system static analysis.

[DOI]

CoRR, 2015

LLVM-based communication optimizations for PGAS programs.

[DOI]

Proceedings of the Second Workshop on the LLVM Compiler Infrastructure in HPC, 2015

Parallelizing a discrete event simulation application using the Habanero-Java multicore library.

[DOI]

Wei-Cheng Xiao

Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores, 2015

A Composable Deadlock-Free Approach to Object-Based Isolation.

[DOI]

Shams Imam

Proceedings of the Euro-Par 2015: Parallel Processing, 2015

2014

Inter-iteration Scalar Replacement Using Array SSA Form.

[DOI]

Proceedings of the Compiler Construction - 23rd International Conference, 2014

2013

A Transformation Framework for Optimizing Task-Parallel Programs.

[DOI]

ACM Trans. Program. Lang. Syst., 2013

A decoupled non-SSA global register allocation using bipartite liveness graphs.

[DOI]

ACM Trans. Archit. Code Optim., 2013

Accelerating Habanero-Java programs with OpenCL generation.

[DOI]

Proceedings of the 2013 International Conference on Principles and Practices of Programming on the Java Platform: Virtual Machines, 2013

Isolation for nested task parallelism.

[DOI]

Proceedings of the 2013 ACM SIGPLAN International Conference on Object Oriented Programming Systems Languages & Applications, 2013

Speculative Execution of Parallel Programs with Precise Exception Semantics on GPUs.

[DOI]

Proceedings of the Languages and Compilers for Parallel Computing, 2013

Compiler-Driven Data Layout Transformation for Heterogeneous Platforms.

[DOI]

Proceedings of the Euro-Par 2013: Parallel Processing Workshops, 2013

Interprocedural strength reduction of critical sections in explicitly-parallel programs.

[DOI]

Proceedings of the 22nd International Conference on Parallel Architectures and Compilation Techniques, 2013

2012

Efficient data race detection for async-finish parallelism.

[DOI]

Formal Methods Syst. Des., 2012

Scalable and precise dynamic datarace detection for structured parallelism.

[DOI]

Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, 2012

Finish Accumulators: An Efficient Reduction Construct for Dynamic Task Parallelism.

[DOI]

Proceedings of the Languages and Compilers for Parallel Computing, 2012

Practical Permissions for Race-Free Parallelism.

[DOI]

Proceedings of the ECOOP 2012 - Object-Oriented Programming, 2012

2011

Permission Regions for Race-Free Parallelism.

[DOI]

Proceedings of the Runtime Verification - Second International Conference, 2011

Habanero-Java: the new adventures of old X10.

[DOI]

Proceedings of the 9th International Conference on Principles and Practice of Programming in Java, 2011

Intermediate language extensions for parallelism.

[DOI]

Proceedings of the SPLASH'11 Workshops, 2011

Delegated isolation.

[DOI]

Proceedings of the 26th Annual ACM SIGPLAN Conference on Object-Oriented Programming, 2011

The design and implementation of the habanero-java parallel programming language.

[DOI]

Proceedings of the Companion to the 26th Annual ACM SIGPLAN Conference on Object-Oriented Programming, 2011

Communication Optimizations for Distributed-Memory X10 Programs.

[DOI]

Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

2010

Efficient Selection of Vector Instructions Using Dynamic Programming.

[DOI]

Proceedings of the 43rd Annual IEEE/ACM International Symposium on Microarchitecture, 2010

SLAW: A scalable locality-aware adaptive work-stealing scheduler.

[DOI]

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Reducing task creation and termination overhead in explicitly parallel programs.

[DOI]

Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques, 2010

Automatic vector instruction selection for dynamic compilation.

[DOI]