Hezi Zhang

Gian Giacomo Guerreschi

Proceedings of the ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18, 2022

EQC: ensembled quantum computing for variational quantum algorithms.

[DOI]

Samuel Alexander Stein

Proceedings of the ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18, 2022

INSPIRE: in-storage private information retrieval via protocol and architecture co-design.

[DOI]

Proceedings of the ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18, 2022

Shfl-BW: accelerating deep neural network inference with tensor-core aware weight pruning.

[DOI]

Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

Heuristic adaptability to input dynamics for SpMM on CPUs.

[DOI]

Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

DOTA: detect and omit weak attentions for scalable transformer acceleration.

[DOI]

Proceedings of the ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022, 2022

Paulihedral: a generalized block-wise compiler optimization framework for Quantum simulation kernels.

[DOI]

Proceedings of the ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022, 2022

2021

Effective and Efficient Batch Normalization Using a Few Uncorrelated Data for Statistics Estimation.

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2021

Reuse-centric k-means configuration.

[DOI]

Inf. Syst., 2021

ZEN: Efficient Zero-Knowledge Proofs for Neural Networks.

[DOI]

IACR Cryptol. ePrint Arch., 2021

Attacking Point Cloud Segmentation with Color-only Perturbation.

[DOI]

CoRR, 2021

TC-GNN: Accelerating Sparse Graph Neural Network Computation Via Dense Tensor Core on GPUs.

[DOI]

CoRR, 2021

Towards Efficient Ansatz Architecture for Variational Quantum Algorithms.

[DOI]

CoRR, 2021

Mapping Surface Code to Superconducting Quantum Processors.

[DOI]

Anbang Wu

Hezi Zhang

Gian Giacomo Guerreschi

CoRR, 2021

QECV: Quantum Error Correction Verification.

[DOI]

Anbang Wu

Hezi Zhang

Gian Giacomo Guerreschi

CoRR, 2021

Mitigating Noise-Induced Gradient Vanishing in Variational Quantum Algorithm Training.

[DOI]

CoRR, 2021

QGTC: Accelerating Quantized GNN via GPU Tensor Core.

[DOI]

CoRR, 2021

Transformer Acceleration with Dynamic Sparse Attention.

[DOI]

CoRR, 2021

Efficient Sparse Matrix Kernels based on Adaptive Workload-Balancing and Parallel-Reduction.

[DOI]

CoRR, 2021

MPU: Towards Bandwidth-abundant SIMT Processor via Near-bank Computing.

[DOI]

CoRR, 2021

Palleon: A Runtime System for Efficient Video Processing toward Dynamic Class Skew.

[DOI]

Proceedings of the 2021 USENIX Annual Technical Conference, 2021

APNN-TC: accelerating arbitrary precision neural networks on ampere GPU tensor cores.

[DOI]

Proceedings of the International Conference for High Performance Computing, 2021

Efficient tensor core-based GPU kernels for structured sparsity under reduced precision.

[DOI]

Proceedings of the International Conference for High Performance Computing, 2021

EGEMM-TC: accelerating scientific computing on tensor cores with extended precision.

[DOI]

Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs.

[DOI]

Proceedings of the 15th USENIX Symposium on Operating Systems Design and Implementation, 2021

On the Co-Design of Quantum Software and Hardware.

[DOI]

Proceedings of the NANOCOM '21: The Eighth Annual ACM International Conference on Nanoscale Computing and Communication, Virtual Event, Italy, September 7, 2021

ENMC: Extreme Near-Memory Classification via Approximate Screening.

[DOI]

Proceedings of the MICRO '21: 54th Annual IEEE/ACM International Symposium on Microarchitecture, 2021

Improving Streaming Graph Processing Performance using Input Knowledge.

[DOI]

Proceedings of the MICRO '21: 54th Annual IEEE/ACM International Symposium on Microarchitecture, 2021

DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.

[DOI]

Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

Overcoming the Memory Hierarchy Inefficiencies in Graph Processing Applications.

[DOI]

Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2021

Saga: Sparse Adversarial Attack on EEG-Based Brain Computer Interface.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

An Efficient Quantitative Approach for Optimizing Convolutional Neural Networks.

[DOI]

Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

TiAcc: Triangle-inequality based Hardware Accelerator for K-means on FPGAs.

[DOI]

Proceedings of the 21st IEEE/ACM International Symposium on Cluster, 2021

UAG: Uncertainty-aware Attention Graph Neural Network for Defending Adversarial Attacks.

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Projection-based runtime assertions for testing and debugging Quantum programs.

[DOI]

Proc. ACM Program. Lang., 2020

Rethinking the performance comparison between SNNS and ANNS.

[DOI]

Neural Networks, 2020

Tianjic: A Unified and Scalable Chip Bridging Spike-Based and Continuous Neural Computation.

[DOI]

IEEE J. Solid State Circuits, 2020

A novel ensemble pruning approach based on information exchange glowworm swarm optimization and complementarity measure.

[DOI]

J. Intell. Fuzzy Syst., 2020

Rubik: A Hierarchical Architecture for Efficient Graph Learning.

[DOI]

CoRR, 2020

Uncertainty-aware Attention Graph Neural Network for Defending Adversarial Attacks.

[DOI]

CoRR, 2020

Scalable Adversarial Attack on Graph Neural Networks with Alternating Direction Method of Multipliers.

[DOI]

CoRR, 2020

Optimizing Convolutional Neural Network Architecture via Information Field.

[DOI]

CoRR, 2020

GNNAdvisor: An Efficient Runtime System for GNN Acceleration on GPUs.

[DOI]

CoRR, 2020

Domain-adversarial multi-task framework for novel therapeutic property prediction of compounds.

[DOI]

Bioinform., 2020

A Close Look at Multi-tenant Parallel CNN Inference for Autonomous Driving.

[DOI]

Proceedings of the Network and Parallel Computing, 2020

DUET: Boosting Deep Neural Network Efficiency on Dual-Module Architecture.

[DOI]

Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

iPIM: Programmable In-Memory Image Processing Accelerator Using Near-Bank Architecture.

[DOI]

Proceedings of the 47th ACM/IEEE Annual International Symposium on Computer Architecture, 2020

SGQuant: Squeezing the Last Bit on Graph Neural Networks with Specialized Quantization.

[DOI]

Proceedings of the 32nd IEEE International Conference on Tools with Artificial Intelligence, 2020

Boosting Deep Neural Network Efficiency with Dual-Module Inference.

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

Eliminating Redundant Computation in Noisy Quantum Computing Simulation.

[DOI]

Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

Towards Efficient Superconducting Quantum Processor Architecture Design.

[DOI]

Proceedings of the ASPLOS '20: Architectural Support for Programming Languages and Operating Systems, 2020

DeepSniffer: A DNN Model Extraction Framework Based on Learning Architectural Hints.

[DOI]

Proceedings of the ASPLOS '20: Architectural Support for Programming Languages and Operating Systems, 2020

Weighted-Sampling Audio Adversarial Example Attack.

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

DASM: Data-Streaming-Based Computing in Nonvolatile Memory Architecture for Embedded System.

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2019

Poq: Projection-based Runtime Assertions for Debugging on a Quantum Computer.

[DOI]

CoRR, 2019

AccD: A Compiler-based Framework for Accelerating Distance-related Algorithms on CPU-FPGA Platforms.

[DOI]

CoRR, 2019

SANQ: A Simulation Framework for Architecting Noisy Intermediate-Scale Quantum Computing System.

[DOI]

CoRR, 2019

Neural Network Model Extraction Attacks in Edge Devices by Hearing Architectural Hints.

[DOI]

CoRR, 2019

Adversarial attack on Speech-to-Text Recognition Models.

[DOI]

Xiaolei Liu

Kun Wan

CoRR, 2019

Reconciling Feature-Reuse and Overfitting in DenseNet with Specialized Dropout.

[DOI]

Proceedings of the 31st IEEE International Conference on Tools with Artificial Intelligence, 2019

Dynamic Sparse Graph for Efficient Deep Learning.

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

KPynq: A Work-Efficient Triangle-Inequality Based K-Means on FPGA.

[DOI]

Proceedings of the 27th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2019

Tackling the Qubit Mapping Problem for NISQ-Era Quantum Devices.

[DOI]

Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, 2019

2018

Penetrating the Fog: the Path to Efficient CNN Models.

[DOI]

CoRR, 2018

Domain-Adversarial Multi-Task Framework for Novel Therapeutic Property Prediction of Compounds.

[DOI]

CoRR, 2018

Reconciling Feature-Reuse and Overfitting in DenseNet with Specialized Dropout.

[DOI]

CoRR, 2018

In-memory multiplication engine with SOT-MRAM based stochastic computing.

[DOI]

CoRR, 2018

SECS: Efficient Deep Stream Processing via Class Skew Dichotomy.

[DOI]

CoRR, 2018

Challenges Towards Deploying Data Intensive Scientific Applications on Extreme Heterogeneity Supercomputers.

[DOI]

CoRR, 2018

Reuse-Centric K-Means Configuration.

[DOI]

Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

2017

GLORE: generalized loop redundancy elimination upon LER-notation.

[DOI]

Xipeng Shen

Proc. ACM Program. Lang., 2017

Generalizations of the theory and deployment of triangular inequality for compiler-based strength reduction.

[DOI]

Proceedings of the 38th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2017

Sweet KNN: An Efficient KNN on GPU through Reconciliation between Redundancy Removal and Regularity.

[DOI]

Guoyang Chen

Xipeng Shen

Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

2015

TOP: A Framework for Enabling Algorithmic Optimizations for Distance-Related Problems.

[DOI]

Proc. VLDB Endow., 2015

Autotuning algorithmic choice for input sensitivity.

[DOI]