Sheng-Chun Kao

Orcid: 0000-0001-7928-9027

According to our database1, Sheng-Chun Kao authored at least 21 papers between 2018 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
NonGEMM Bench: Understanding the Performance Horizon of the Latest ML Workloads with NonGEMM Workloads.
CoRR, 2024

Progressive Gradient Flow for Robust N: M Sparsity Training in Transformers.
CoRR, 2024

2023
JaxPruner: A concise library for sparsity research.
CoRR, 2023

FLAT: An Optimized Dataflow for Mitigating Attention Bottlenecks.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
Domain-aware Genetic Algorithms for Hardware and Mapping Optimization for Efficient DNN Acceleration.
PhD thesis, 2022

A Formalism of DNN Accelerator Flexibility.
Proc. ACM Meas. Anal. Comput. Syst., 2022

Training Recipe for N: M Structured Sparsity with Decaying Pruning Mask.
CoRR, 2022

DNNFuser: Generative Pre-Trained Transformer as a Generalized Mapper for Layer Fusion in DNN Accelerators.
CoRR, 2022

Demystifying Map Space Exploration for NPUs.
Proceedings of the IEEE International Symposium on Workload Characterization, 2022

MAGMA: An Optimization Framework for Mapping Multiple DNNs on Multiple Accelerator Cores.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2022

DiGamma: Domain-aware Genetic Algorithm for HW-Mapping Co-optimization for DNN Accelerators.
Proceedings of the 2022 Design, Automation & Test in Europe Conference & Exhibition, 2022

2021
ATTACC the Quadratic Bottleneck of Attention Layers.
CoRR, 2021

Domain-specific Genetic Algorithm for Multi-tenant DNNAccelerator Scheduling.
CoRR, 2021

E3: A HW/SW Co-design Neuroevolution Platform for Autonomous Learning in Edge Device.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2021

Extending Sparse Tensor Accelerators to Support Multiple Compression Formats.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

2020
Conditional Neural Architecture Search.
CoRR, 2020

Generative Design of Hardware-aware DNNs.
CoRR, 2020

ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement Learning.
Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

GAMMA: Automating the HW Mapping of DNN Models on Accelerators via Genetic Algorithm.
Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2020

2019
Reinforcement learning based interconnection routing for adaptive traffic optimization.
Proceedings of the 13th IEEE/ACM International Symposium on Networks-on-Chip, 2019

2018
Dynamically Updatable Ternary Segmented Aging Bloom Filter for OpenFlow-Compliant Low-Power Packet Processing.
IEEE/ACM Trans. Netw., 2018


  Loading...