Aparna Chandramowlishwaran
ORCID: 0000-0003-0840-4192
According to our database, Aparna Chandramowlishwaran authored at least 44 papers between 2008 and 2024.
Online presence:
on orcid.org
on dl.acm.org
On csauthors.net:
BERN-NN-IBF: Enhancing Neural Network Bound Propagation Through Implicit Bernstein Form and Optimized Tensor Operations.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., November, 2024
Breaking Boundaries: Distributed Domain Decomposition with Scalable Physics-Informed Neural PDE Solvers.
Proceedings of the International Conference for High Performance Computing, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 52nd International Conference on Parallel Processing, 2023
Proceedings of the SC22: International Conference for High Performance Computing, 2022
IEEE Trans. Parallel Distributed Syst., 2021
Proc. ACM Meas. Anal. Comput. Syst., 2021
J. Comput. Sci., 2021
Int. J. High Perform. Comput. Appl., 2021
Train Once and Use Forever: Solving Boundary Value Problems in Unseen Domains with Pre-trained Deep Learning Models.
CoRR, 2021
SURFNet: Super-Resolution of Turbulent Flows with Transfer Learning using Small Datasets.
Proceedings of the 30th International Conference on Parallel Architectures and Compilation Techniques, 2021
SIGMETRICS Perform. Evaluation Rev., 2020
Brief Announcement: On the Limits of Parallelizing Convolutional Neural Networks on GPUs.
CoRR, 2020
Proceedings of the SPAA '20: 32nd ACM Symposium on Parallelism in Algorithms and Architectures, 2020
Proceedings of the International Conference for High Performance Computing, 2020
Proceedings of the ICS '20: 2020 International Conference on Supercomputing, 2020
Proceedings of the ICS '20: 2020 International Conference on Supercomputing, 2020
Proc. ACM Meas. Anal. Comput. Syst., 2019
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019
Proceedings of the ACM International Conference on Supercomputing, 2019
Proceedings of the 48th International Conference on Parallel Processing, 2019
Towards Portable Online Prediction of Network Utilization Using MPI-Level Monitoring.
Proceedings of the Euro-Par 2019: Parallel Processing, 2019
Roofline Guided Design and Analysis of a Multi-stencil CFD Solver for Multicore Performance.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018
Proceedings of the 24th IEEE International Conference on Parallel and Distributed Systems, 2018
Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 2018
Proceedings of the Euro-Par 2017: Parallel Processing - 23rd International Conference on Parallel and Distributed Computing, Santiago de Compostela, Spain, August 28, 2017
cudaCR: An In-Kernel Application-Level Checkpoint/Restart Scheme for CUDA-Enabled GPUs.
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017
Proceedings of the 23rd IEEE International Conference on High Performance Computing, 2016
A CPU-GPU Hybrid Implementation and Model-Driven Scheduling of the Fast Multipole Method.
Proceedings of the Seventh Workshop on General Purpose Processing Using GPUs, 2014
Commun. ACM, 2012
Brief announcement: towards a communication optimal fast multipole method and its implications at exascale.
Proceedings of the 24th ACM Symposium on Parallelism in Algorithms and Architectures, 2012
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012
Proceedings of the 3rd USENIX Workshop on Hot Topics in Parallelism, 2011
Petascale Direct Numerical Simulation of Blood Flow on 200K Cores and Heterogeneous Architectures.
Proceedings of the Conference on High Performance Computing Networking, 2010
Diagnosis, Tuning, and Redesign for Multicore Performance: A Case Study of the Fast Multipole Method.
Proceedings of the Conference on High Performance Computing Networking, 2010
Applying the concurrent collections programming model to asynchronous parallel dense linear algebra.
Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2010
Optimizing and tuning the fast multipole method for state-of-the-art multicore architectures.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010
Performance evaluation of concurrent collections on high-performance multicore computing systems.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010
Declarative aspects of memory management in the concurrent collections parallel programming model.
Proceedings of the POPL 2009 Workshop on Declarative Aspects of Multicore Programming, 2009
On the Design of Fast Pseudo-Random Number Generators for the Cell Broadband Engine and an Application to Risk Analysis.
Proceedings of the 2008 International Conference on Parallel Processing, 2008