Michael Garland

Orcid: 0000-0001-6093-7602

According to our database, Michael Garland authored at least 97 papers between 1996 and 2024.

Bibliography

2024
Exploring Data Layout for Sparse Tensor Times Dense Matrix on GPUs.
ACM Trans. Archit. Code Optim., March, 2024

Automatic Tracing in Task-Based Runtime Systems.
CoRR, 2024

Composing Distributed Computations Through Task and Kernel Fusion.
CoRR, 2024

2023
Accelerated Encrypted Execution of General-Purpose Applications.
IACR Cryptol. ePrint Arch., 2023

CODAG: Characterizing and Optimizing Decompression Algorithms for GPUs.
CoRR, 2023

ArctyrEX: Accelerated Encrypted Execution of General-Purpose Applications.
CoRR, 2023

Understanding the Effect of the Long Tail on Neural Network Compression.
CoRR, 2023

Legate Sparse: Distributed Sparse Computing in Python.
Proceedings of the International Conference for High Performance Computing, 2023

Stream-K: Work-Centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU.
Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2023

Visibility Algorithms for Dynamic Dependence Analysis and Distributed Coherence.
Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2023

GPU-Initiated On-Demand High-Throughput Storage Access in the BaM System Architecture.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

Graphene: An IR for Optimized Tensor Computations on GPUs.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
GPU-Initiated On-Demand High-Throughput Storage Access in the BaM System Architecture.
Dataset, October, 2022

Efficient Sparsely Activated Transformers.
CoRR, 2022

BaM: A Case for Enabling Fine-grain High Throughput GPU-Orchestrated Access to Storage.
CoRR, 2022

2021
Supercomputing in Python With Legate.
Comput. Sci. Eng., 2021

Scaling implicit parallelism via dynamic control replication.
Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

2020
A Programmable Approach to Neural Network Compression.
IEEE Micro, 2020

Reliable Model Compression via Label-Preservation-Aware Loss Functions.
CoRR, 2020

2019
A Programmable Approach to Model Compression.
CoRR, 2019

GPU-Accelerated Atari Emulation for Reinforcement Learning.
CoRR, 2019

Legate NumPy: accelerated and distributed array computing.
Proceedings of the International Conference for High Performance Computing, 2019

Throughput-oriented GPU memory allocation.
Proceedings of the 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2019

2018
A Block-Oriented, Parallel and Collective Approach to Sparse Indefinite Preconditioning on GPUs.
Proceedings of the 8th IEEE/ACM Workshop on Irregular Applications: Architectures and Algorithms, 2018

Dynamic tracing: memoization of task graphs for dynamic task-based runtimes.
Proceedings of the International Conference for High Performance Computing, 2018

2017
AdaBatch: Adaptive Batch Sizes for Training Deep Neural Networks.
CoRR, 2017

Parallel Depth-First Search for Directed Acyclic Graphs.
Proceedings of the Seventh Workshop on Irregular Applications: Architectures and Algorithms, 2017

2016
Designing a Tunable Nested Data-Parallel Programming System.
ACM Trans. Archit. Code Optim., 2016

Merge-based parallel sparse matrix-vector multiplication.
Proceedings of the International Conference for High Performance Computing, 2016

Merge-based sparse matrix-vector multiplication (SpMV) using the CSR storage format.
Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2016

Architecture-Adaptive Code Variant Tuning.
Proceedings of the Twenty-First International Conference on Architectural Support for Programming Languages and Operating Systems, 2016

2015
High-Performance and Scalable GPU Graph Traversal.
ACM Trans. Parallel Comput., 2015

A collection-oriented programming model for performance portability.
Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015

Optimizing Sparse Matrix Operations on GPUs Using Merge Path.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

Parallel Methods for Verifying the Consistency of Weakly-Ordered Architectures.
Proceedings of the 2015 International Conference on Parallel Architectures and Compilation, 2015

2014
A decomposition for in-place matrix transposition.
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2014

Nitro: A Framework for Adaptive Code Variant Tuning.
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

Work-Efficient Parallel GPU Methods for Single-Source Shortest Paths.
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

Red Fox: An Execution Environment for Relational Query Processing on GPUs.
Proceedings of the 12th Annual IEEE/ACM International Symposium on Code Generation and Optimization, 2014

2013
Guest Editors' Introduction: Special Section on the ACM SIGGRAPH Symposium on Interactive 3D Graphics and Games (I3D 2012).
IEEE Trans. Vis. Comput. Graph., 2013

2012
Designing a unified programming model for heterogeneous machines.
Proceedings of the SC Conference on High Performance Computing Networking, 2012

Scalable GPU graph traversal.
Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2012

Scalable Manycore Computing with CUDA.
Fundamentals of Multicore Software Development, 2012

2011
NVIDIA GPU.
Proceedings of the Encyclopedia of Parallel Computing, 2011

GPUs and the Future of Parallel Computing.
IEEE Micro, 2011

Social Network Clustering and Visualization using Hierarchical Edge Bundles.
Comput. Graph. Forum, 2011

Copperhead: compiling an embedded data parallel language.
Proceedings of the 16th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2011

2010
Understanding throughput-oriented architectures.
Commun. ACM, 2010

Parallel computing with CUDA.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Sparse Matrix-Vector Multiplication on Multicore and Accelerators.
Proceedings of the Scientific Computing with Multicore and Accelerators, 2010

Efficient Parallel Scan Algorithms for Manycore GPUs.
Proceedings of the Scientific Computing with Multicore and Accelerators, 2010

2009
Solving Computational Problems with GPU Computing.
Comput. Sci. Eng., 2009

Fast BVH Construction on GPUs.
Comput. Graph. Forum, 2009

MLS-based scalar fields over triangle meshes and their application in mesh processing.
Proceedings of the 2009 Symposium on Interactive 3D Graphics, 2009

Implementing sparse matrix-vector multiplication on throughput-oriented processors.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2009

Designing efficient sorting algorithms for manycore GPUs.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

2008
On the Visualization of Social and other Scale-Free Networks.
IEEE Trans. Vis. Comput. Graph., 2008

Free-form motion processing.
ACM Trans. Graph., 2008

Scalable Parallel Programming with CUDA.
ACM Queue, 2008

Parallel Computing Experiences with CUDA.
IEEE Micro, 2008

Rapid Multipole Graph Drawing on the GPU.
Proceedings of the Graph Drawing, 16th International Symposium, 2008

Sparse matrix computations on manycore GPU's.
Proceedings of the 45th Design Automation Conference, 2008

2007
Iterative Methods for Improving Mesh Parameterizations.
Proceedings of the 2007 International Conference on Shape Modeling and Applications (SMI 2007), 2007

2006
Interactive Point-Based Rendering of Higher-Order Tetrahedral Data.
IEEE Trans. Vis. Comput. Graph., 2006

Visual Exploration of Complex Time-Varying Graphs.
IEEE Trans. Vis. Comput. Graph., 2006

Editing arbitrarily deforming surface animations.
ACM Trans. Graph., 2006

Spectral surface quadrangulation.
ACM Trans. Graph., 2006

Interactive Multiresolution Editing and Display of Large Terrains.
Comput. Graph. Forum, 2006

2005
A Multiresolution Representation for Massive Meshes.
IEEE Trans. Vis. Comput. Graph., 2005

Sketching mesh deformations.
ACM Trans. Graph., 2005

Quadric-based simplification in any dimension.
ACM Trans. Graph., 2005

Harmonic functions for quadrilateral remeshing of arbitrary manifolds.
Comput. Aided Geom. Des., 2005

Curvature Maps for Local Shape Comparison.
Proceedings of the 2005 International Conference on Shape Modeling and Applications (SMI 2005), 2005

Surfacing by numbers.
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2005

Progressive multiresolution meshes for deforming surfaces.
Proceedings of the 2005 ACM SIGGRAPH/Eurographics Symposium on Computer Animation, 2005

Interactive material replacement in photographs.
Proceedings of the Graphics Interface 2005 Conference, 2005

2004
Jump map-based interactive texture synthesis.
ACM Trans. Graph., 2004

Fair morse functions for extracting the topological structure of a surface mesh.
ACM Trans. Graph., 2004

Pixel-Exact Rendering of Spacetime Finite Element Solutions.
Proceedings of the 15th IEEE Visualization Conference, 2004

Similarity-based surface modelling using geodesic fans.
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2004

Mesh Modelling with Curve Analogies.
Proceedings of the 12th Pacific Conference on Computer Graphics and Applications, 2004

Mining scale-free networks using geodesic clustering.
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

Spacetime meshing with adaptive refinement and coarsening.
Proceedings of the 20th ACM Symposium on Computational Geometry, 2004

2003
User-guided simplification.
Proceedings of the 2003 Symposium on Interactive 3D Graphics, 2003

Interactive Texture Synthesis on Surfaces using Jump Maps.
Proceedings of the 14th Eurographics Workshop on Rendering Techniques, 2003

2002
Permission grids: practical, error-bounded simplification.
ACM Trans. Graph., 2002

A Multiphase Approach to Efficient Surface Simplification.
Proceedings of the 13th IEEE Visualization Conference, 2002

Towards Real-Time Texture Synthesis with the Jump Map.
Proceedings of the 13th Eurographics Workshop on Rendering Techniques, 2002

2001
Efficient Adaptive Simplification of Massive Meshes.
Proceedings of the 12th IEEE Visualization Conference, 2001

Hierarchical face clustering on polygonal surfaces.
Proceedings of the 2001 Symposium on Interactive 3D Graphics, 2001

1999
Quadric-Based Polygonal Surface Simplification.
PhD thesis, 1999

Optimal triangulation and quadric-based surface simplification.
Comput. Geom., 1999

Face Cluster Radiosity.
Proceedings of the Rendering Techniques '99, 1999

Multiresolution Modeling: Survey and Future Opportunities.
Proceedings of the 20th Annual Conference of the European Association for Computer Graphics, 1999

1998
Simplifying surfaces with color and texture using quadric error metrics.
Proceedings of the 9th IEEE Visualization Conference, 1998

1997
Surface simplification using quadric error metrics.
Proceedings of the 24th Annual Conference on Computer Graphics and Interactive Techniques, 1997

1996
Fast and flexible polygonization of height fields.
Proceedings of the ACM SIGGRAPH 96 Visual Proceedings: The art and interdisciplinary programs of SIGGRAPH 1996, 1996

