Think Before You Attribute: Improving the Performance of LLMs Attribution Systems.
CoRR, May, 2025
Tadashi: Enabling AI-Based Automated Code Generation With Guaranteed Correctness.
CoRR, 2024
On the Scalability of Computing Genomic Diversity Using SparkLeBLAST: A Feasibility Study.
Proceedings of the IEEE High Performance Extreme Computing Conference, 2024
Evaluation of Vectorization Methods on Arm SVE Using the Exo Language.
Proceedings of the IEEE International Conference on Cluster Computing, 2024
At the Locus of Performance: Quantifying the Effects of Copious 3D-Stacked Cache on HPC Workloads.
ACM Trans. Archit. Code Optim., December, 2023
At the Locus of Performance: A Case Study in Enhancing CPUs with Copious 3D-Stacked Cache.
CoRR, 2022
Why Globally Re-shuffle? Revisiting Data Shuffling in Large Scale Deep Learning.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022
Matrix Engines for High Performance Computing: A Paragon of Performance or Grasping at Straws?
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021
Diamond matrix powers kernels.
Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2020
Sieving in primality testing and factorization.
PhD thesis, 2015
Cache optimized linear sieve.
CoRR, 2011