Anne C. Elster

ORCID: 0000-0002-7087-6193

According to our database, Anne C. Elster authored at least 66 papers between 1988 and 2024.

Bibliography

2024
Scheduling Languages: A Past, Present, and Future Taxonomy.
CoRR, 2024

Message from the HCW 2024 General Co-Chairs.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

HCW 2024 Preface and Committee List.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

2023
A Distributed Integrated Feature Selection Scheme for Column Subset Selection.
IEEE Trans. Knowl. Data Eng., March, 2023

Towards a Benchmarking Suite for Kernel Tuners.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

Message from the HCW 2023 Technical Program Committee Co-Chairs.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

(De/Re)-Compositions Expressed Systematically via MDH-Based Schedules.
Proceedings of the 32nd ACM SIGPLAN International Conference on Compiler Construction, 2023

2022
Quantum Computing and High-Performance Computing: Compilation Stack Similarities.
Comput. Sci. Eng., 2022

Nvidia Hopper GPU and Grace CPU Highlights.
Comput. Sci. Eng., 2022

Analyzing Search Techniques for Autotuning Image-based GPU Kernels: The Impact of Sample Sizes.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022

GPU-FAST-PROCLUS: A Fast GPU-parallelized Approach to Projected Clustering.
Proceedings of the 25th International Conference on Extending Database Technology, 2022

2021
FPGAs in the Cloud.
Comput. Sci. Eng., 2021

Accelerating Scientific Applications With SambaNova Reconfigurable Dataflow Architecture.
Comput. Sci. Eng., 2021

The European Factor: From ARM to Atos.
Comput. Sci. Eng., 2021

LS-CAT: A Large-Scale CUDA AutoTuning Dataset.
CoRR, 2021

Autotuning Benchmarking Techniques: A Roofline Model Case Study.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2021

2020
High-Level Synthesis-Based Approach for Accelerating Scientific Codes on FPGAs.
Comput. Sci. Eng., 2020

2019
Using Heterogeneous Graph Nodes (HGNs) to Minimize Overall Graph Execution Time in Heterogeneous Distributed Systems Modeling.
Proceedings of the 27th Euromicro International Conference on Parallel, 2019

2018
Energy monitoring as an essential building block towards sustainable ultrascale systems.
Sustain. Comput. Informatics Syst., 2018

ImageCL: Language and source-to-source compiler for performance portability, load balancing, and scalability prediction on heterogeneous systems.
Concurr. Comput. Pract. Exp., 2018

Quasi Spin Images.
Proceedings of the 31st Norsk Informatikkonferanse, 2018

Container-Based Virtualization for Heterogeneous HPC Clouds: Insights from the EU H2020 CloudLightning Project.
Proceedings of the 2018 IEEE International Conference on Cloud Engineering, 2018

2017
Machine learning-based auto-tuning for enhanced performance portability of OpenCL applications.
Concurr. Comput. Pract. Exp., 2017

GPU Power Modeling of HPC Applications for the Simulation of Heterogeneous Clouds.
Proceedings of the Parallel Processing and Applied Mathematics, 2017

2016
Characterizing numascale clusters with GPUs: MPI-based and GPU interconnect benchmarks.
Proceedings of the International Conference on High Performance Computing & Simulation, 2016

ImageCL: An image processing language for performance portability on heterogeneous systems.
Proceedings of the International Conference on High Performance Computing & Simulation, 2016

2015
Medical image segmentation on GPUs - A comprehensive review.
Medical Image Anal., 2015

Real-time gradient vector flow on GPUs using OpenCL.
J. Real Time Image Process., 2015

Noise Modeler: An Interactive Editor and Library for Procedural Terrains via Continuous Generation and Compilation of GPU Shaders.
Proceedings of the Entertainment Computing - ICEC 2015 - 14th International Conference, 2015

Machine Learning Based Auto-Tuning for Enhanced OpenCL Performance Portability.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

2014
Summary of the First Workshop on Sustainable Software for Science: Practice and Experiences (WSSSPE1).
CoRR, 2014

GPU accelerated segmentation and centerline extraction of tubular structures from medical images.
Int. J. Comput. Assist. Radiol. Surg., 2014

Register Caching for Stencil Computations on GPUs.
Proceedings of the 16th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing, 2014

2013
Software for Science: Some Personal Reflections.
CoRR, 2013

GPU-Accelerated Visualization of Scattered Point Data.
IEEE Access, 2013

Organizing Visual Data in Structured Layout by Maximizing Similarity-Proximity Correlation.
Proceedings of the Advances in Visual Computing - 9th International Symposium, 2013

2012
Population Dynamics P Systems on CUDA.
Proceedings of the Computational Methods in Systems Biology, 2012

2011
Optimized Barriers for Heterogeneous Systems Using MPI.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Bandwidth Reduction through Multithreaded Compression of Seismic Images.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

2010
Automatic Run-time Parallelization and Transformation of I/O.
Proceedings of the Conference on High Performance Computing Networking, 2010

Run-Time Analysis and Instrumentation for Communication Overlap Potential.
Proceedings of the Recent Advances in the Message Passing Interface, 2010

Fast GPU-Based Fluid Simulations Using SPH.
Proceedings of the Applied Parallel and Scientific Computing, 2010

Performance modeling of heterogeneous systems.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Real-Time Ray Tracing Using Nvidia OptiX.
Proceedings of the 31st Annual Conference of the European Association for Computer Graphics, 2010

2009
Modelling Multi-GPU Systems.
Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

Throughput Computing on Future GPUs.
Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

Parallel Computing with GPUs.
Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

Porous Rock Simulations and Lattice Boltzmann on GPUs.
Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

Linear optimization on modern GPUs.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

A super-efficient adaptable bit-reversal algorithm for multithreaded architectures.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

2008
Latency Impact on Spin-Lock Algorithms for Modern Shared Memory Multiprocessors.
Scalable Comput. Pract. Exp., 2008

2006
Parallel Methods for Real-Time Visualization of Snow.
Proceedings of the Applied Parallel Computing. State of the Art in Scientific Computing, 2006

Automatic and Transparent Optimizations of an Application's MPI Communication.
Proceedings of the Applied Parallel Computing. State of the Art in Scientific Computing, 2006

A Load Balancing Strategy for Computations on Large, Read-Only Data Sets.
Proceedings of the Applied Parallel Computing. State of the Art in Scientific Computing, 2006

HPC Environments - Visualization and Parallelization Tools: Minisymposium Abstract.
Proceedings of the Applied Parallel Computing. State of the Art in Scientific Computing, 2006

2004
Online Task Scheduling on Heterogeneous Clusters: An Experimental Study.
Proceedings of the Applied Parallel Computing, 2004

2003
Real-time Visualization of Smoke through Parallelization.
Proceedings of the Parallel Computing: Software Technology, 2003

Cluster Computing as a Teaching Tool.
Proceedings of the Parallel Computing: Software Technology, 2003

2002
Fast MPI Broadcasts through Reliable Multicasting.
Proceedings of the Applied Parallel Computing Advanced Scientific Computing, 2002

High-Performance Computing: Past, Present, and Future.
Proceedings of the Applied Parallel Computing Advanced Scientific Computing, 2002

1989
Fault-Tolerant Matrix Operations on Hypercube Multiprocessors.
Proceedings of the International Conference on Parallel Processing, 1989

Fast bit-reversal algorithms.
Proceedings of the IEEE International Conference on Acoustics, 1989

1988
Block-matrix operations using orthogonal trees.
Proceedings of the Third Conference on Hypercube Concurrent Computers and Applications, 1988

