Guoyang Chen

According to our database1, Guoyang Chen authored at least 26 papers between 2014 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
A Classical Architecture For Digital Quantum Computers.
CoRR, 2023

GIM: Versatile GNN Acceleration with Reconfigurable Processing-in-Memory.
Proceedings of the 41st IEEE International Conference on Computer Design, 2023

2022
Towards Execution-Efficient LSTMs via Hardware-Guided Grow-and-Prune Paradigm.
IEEE Trans. Emerg. Top. Comput., 2022

2021
EGEMM-TC: accelerating scientific computing on tensor cores with extended precision.
Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

Enabling energy-efficient DNN training on hybrid GPU-FPGA accelerators.
Proceedings of the ICS '21: 2021 International Conference on Supercomputing, 2021

Simple Augmentation Goes a Long Way: ADRL for DNN Quantization.
Proceedings of the 9th International Conference on Learning Representations, 2021

PIM-DL: Boosting DNN Inference on Digital Processing In-Memory Architectures via Data Layout Optimizations.
Proceedings of the 30th International Conference on Parallel Architectures and Compilation Techniques, 2021

2020
iPIM: Programmable In-Memory Image Processing Accelerator Using Near-Bank Architecture.
Proceedings of the 47th ACM/IEEE Annual International Symposium on Computer Architecture, 2020

Regularized Training and Tight Certification for Randomized Smoothed Classifier with Provable Robustness.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Sionnx: Automatic Unit Test Generator for ONNX Conformance.
CoRR, 2019

Hardware-Guided Symbiotic Training for Compact, Accurate, yet Execution-Efficient LSTM.
CoRR, 2019

Parallel Training via Computation Graph Transformation.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

2018
Footprint modeling of cache associativity and granularity.
Proceedings of the International Symposium on Memory Systems, 2018

2017
Optimizing Data Placement on GPU Memory: A Portable Approach.
IEEE Trans. Computers, 2017

EffiSha: A Software Framework for Enabling Effficient Preemptive Scheduling of GPU.
Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2017

Efficient support of position independence on non-volatile memory.
Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, 2017

Sweet KNN: An Efficient KNN on GPU through Reconciliation between Redundancy Removal and Regularity.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

2016
Data-centric combinatorial optimization of parallel code.
Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2016

Coherence-Free Multiview: Enabling Reference-Discerning Data Placement on GPU.
Proceedings of the 2016 International Conference on Supercomputing, 2016

Towards Ontology-Based Program Analysis.
Proceedings of the 30th European Conference on Object-Oriented Programming, 2016

OpenCL-based erasure coding on heterogeneous architectures.
Proceedings of the 27th IEEE International Conference on Application-specific Systems, 2016

2015
Enabling Portable Optimizations of Data Placement on GPU.
IEEE Micro, 2015

Free launch: optimizing GPU dynamic kernel launches through thread reuse.
Proceedings of the 48th International Symposium on Microarchitecture, 2015

Enabling and Exploiting Flexible Task Assignment on GPU through SM-Centric Program Transformations.
Proceedings of the 29th ACM on International Conference on Supercomputing, 2015

2014
PORPLE: An Extensible Optimizer for Portable Data Placement on GPU.
Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture, 2014

SM-centric transformation: circumventing hardware restrictions for flexible GPU scheduling.
Proceedings of the International Conference on Parallel Architectures and Compilation, 2014


  Loading...