Proceedings of the ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022, 2022

GPUPool: A Holistic Approach to Fine-Grained GPU Sharing in the Cloud.

[BibT_eX]

[DOI]

Xiaodan Serina Tan

Pavel Golikov

Nandita Vijaykumar

Gennady Pekhimenko

Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2022

VoxelCache: Accelerating Online Mapping in Robotics and 3D Reconstruction Tasks.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2022

2021

DAMOV: A New Methodology and Benchmark Suite for Evaluating Data Movement Bottlenecks.

[BibT_eX]

[DOI]

IEEE Access, 2021

QUAC-TRNG: High-Throughput True Random Number Generation Using Quadruple Row Activation in Commodity DRAM Chips.

[BibT_eX]

[DOI]

Ataberk Olgun

Minesh Patel

Abdullah Giray Yaglikçi

Proceedings of the 48th ACM/IEEE Annual International Symposium on Computer Architecture, 2021

SynCron: Efficient Synchronization Support for Near-Data-Processing Architectures.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2021

2020

Rethinking Divide and Conquer - Towards Holistic Interfaces of the Computing Stack.

[BibT_eX]

[DOI]

Schahram Dustdar

Onur Mutlu

Nandita Vijaykumar

IEEE Internet Comput., 2020

Echo: Compiler-based GPU Memory Footprint Reduction for LSTM RNN Training.

[BibT_eX]

[DOI]

Bojian Zheng

Nandita Vijaykumar

Gennady Pekhimenko

Proceedings of the 47th ACM/IEEE Annual International Symposium on Computer Architecture, 2020

SysScale: Exploiting Multi-domain Dynamic Voltage and Frequency Scaling for Energy Efficient Mobile Processors.

[BibT_eX]

[DOI]

Jawad Haj-Yahya

Mohammed Alser

Jeremie S. Kim

Abdullah Giray Yaglikçi

Nandita Vijaykumar

Efraim Rotem

Onur Mutlu

Proceedings of the 47th ACM/IEEE Annual International Symposium on Computer Architecture, 2020

2019

Enhancing Programmability, Portability, and Performance with Rich Cross-Layer Abstractions.

[BibT_eX]

[DOI]

Nandita Vijaykumar

CoRR, 2019

SMASH: Co-designing Software Compression and Hardware-Accelerated Indexing for Efficient Sparse Matrix Operations.

[BibT_eX]

[DOI]

Konstantinos Kanellopoulos

Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, 2019

CROW: a low-cost substrate for improving DRAM performance, energy efficiency, and reliability.

[BibT_eX]

[DOI]

Hasan Hassan

Minesh Patel

Jeremie S. Kim

Abdullah Giray Yaglikçi

Proceedings of the 46th International Symposium on Computer Architecture, 2019

2018

EcoRNN: Fused LSTM RNN Implementation with Data Layout Optimization.

[BibT_eX]

[DOI]

CoRR, 2018

Exploiting Row-Level Temporal Locality in DRAM to Reduce the Memory Access Latency.

[BibT_eX]

[DOI]

CoRR, 2018

SoftMC: Practical DRAM Characterization Using an FPGA-Based Infrastructure.

[BibT_eX]

[DOI]

CoRR, 2018

Decoupling GPU Programming Models from Resource Management for Enhanced Programming Ease, Portability, and Performance.

[BibT_eX]

[DOI]

CoRR, 2018

Zorua: Enhancing Programming Ease, Portability, and Performance in GPUs by Decoupling Programming Models from Resource Management.

[BibT_eX]

[DOI]

CoRR, 2018

A Case for Richer Cross-Layer Abstractions: Bridging the Semantic Gap with Expressive Memory.

[BibT_eX]

[DOI]

Proceedings of the 45th ACM/IEEE Annual International Symposium on Computer Architecture, 2018

The Locality Descriptor: A Holistic Cross-Layer Abstraction to Express Data Locality In GPUs.

[BibT_eX]

[DOI]

Proceedings of the 45th ACM/IEEE Annual International Symposium on Computer Architecture, 2018

2017

Gaia: Geo-Distributed Machine Learning Approaching LAN Speeds.

[BibT_eX]

[DOI]

Proceedings of the 14th USENIX Symposium on Networked Systems Design and Implementation, 2017

SoftMC: A Flexible and Practical Open-Source Infrastructure for Enabling Experimental DRAM Studies.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Symposium on High Performance Computer Architecture, 2017

2016

A Framework for Accelerating Bottlenecks in GPU Execution with Assist Warps.

[BibT_eX]

[DOI]

Rachata Ausavarungnirun

CoRR, 2016

Zorua: A holistic approach to resource virtualization in GPUs.

[BibT_eX]

[DOI]

Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016

Transparent Offloading and Mapping (TOM): Enabling Programmer-Transparent Near-Data Processing in GPU Systems.

[BibT_eX]

[DOI]

Proceedings of the 43rd ACM/IEEE Annual International Symposium on Computer Architecture, 2016

Accelerating pointer chasing in 3D-stacked memory: Challenges, mechanisms, evaluation.

[BibT_eX]

[DOI]

Proceedings of the 34th IEEE International Conference on Computer Design, 2016

A case for toggle-aware compression for GPU systems.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Symposium on High Performance Computer Architecture, 2016

ChargeCache: Reducing DRAM latency by exploiting row access locality.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Symposium on High Performance Computer Architecture, 2016

2015

A case for core-assisted bottleneck acceleration in GPUs: enabling flexible data compression with assist warps.

[BibT_eX]

[DOI]

Rachata Ausavarungnirun

Proceedings of the 42nd Annual International Symposium on Computer Architecture, 2015

Nandita Vijaykumar

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...