An Evaluation of Vectorization and Cache Reuse Tradeoffs on Modern CPUs.
Proceedings of the 9th International Workshop on Programming Models and Applications for Multicores and Manycores, 2018
CUDAAdvisor: LLVM-based runtime profiling for modern GPUs.
Proceedings of the 2018 International Symposium on Code Generation and Optimization, 2018
Energy optimization for mobile video streaming via an aggregate model.
Multimedia Tools and Applications, 2017
Determining driver phone use leveraging smartphone sensors.
Multimedia Tools and Applications, 2016
Characterizing emerging heterogeneous memory.
Proceedings of the 2016 ACM SIGPLAN International Symposium on Memory Management, Santa Barbara, CA, USA, June 14, 2016
Automating performance bottleneck detection using search-based application profiling.
Proceedings of the 2015 International Symposium on Software Testing and Analysis, 2015