Long Zheng

ORCID: 0000-0001-7903-2061

Affiliations:

Huazhong University of Science and Technology, School of Computer Science and Technology, Wuhan, China (PhD 2016)

According to our database, Long Zheng authored at least 74 papers between 2014 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2024

ARCHER: a ReRAM-based accelerator for compressed recommendation systems.

Frontiers Comput. Sci., October, 2024

L-FNNG: Accelerating Large-Scale KNN Graph Construction on CPU-FPGA Heterogeneous Platform.

ACM Trans. Reconfigurable Technol. Syst., September, 2024

CPSAA: Accelerating Sparse Attention Using Crossbar-Based Processing-In-Memory Architecture.

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., June, 2024

PhGraph: A High-Performance ReRAM-Based Accelerator for Hypergraph Applications.

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., May, 2024

An Efficient GCNs Accelerator Using 3D-Stacked Processing-in-Memory Architectures.

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., May, 2024

A heterogeneous 3-D stacked PIM accelerator for GCN-based recommender systems.

CCF Trans. High Perform. Comput., April, 2024

Minimal Context-Switching Data Race Detection with Dataflow Tracking.

J. Comput. Sci. Technol., March, 2024

Towards High-Performance Graph Processing: From a Hardware/Software Co-Design Perspective.

J. Comput. Sci. Technol., March, 2024

A Scalable, Efficient, and Robust Dynamic Memory Management Library for HLS-based FPGAs.

Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024

Enabling Efficient Large Recommendation Model Training with Near CXL Memory Processing.

Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024

High-Performance and Resource-Efficient Dynamic Memory Management in High-Level Synthesis.

Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024

SpaHet: A Software/Hardware Co-design for Accelerating Heterogeneous-Sparsity based Sparse Matrix Multiplication.

Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024

Towards Redundancy-Free Recommendation Model Training via Reusable-aware Near-Memory Processing.

Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024

2023

Accelerating Loop-Oriented RTL Simulation With Code Instrumentation.

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., December, 2023

Accelerating Graph Convolutional Networks Through a PIM-Accelerated Approach.

IEEE Trans. Computers, September, 2023

PDAS: Improving network pruning based on Progressive Differentiable Architecture Search for DNNs.

Future Gener. Comput. Syst., 2023

Cyclosa: Redundancy-Free Graph Pattern Mining via Set Dataflow.

Proceedings of the 2023 USENIX Annual Technical Conference, 2023

Accelerating Personalized Recommendation with Cross-level Near-Memory Processing.

Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023

MetaNMP: Leveraging Cartesian-Like Product to Accelerate HGNNs with Near-Memory Processing.

Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023

GraphMetaP: Efficient MetaPath Generation for Dynamic Heterogeneous Graph Models.

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

AFaVS: Accurate Yet Fast Version Switching for Graph Processing Systems.

Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

SMOG: Accelerating Subgraph Matching on GPUs.

Proceedings of the IEEE High Performance Extreme Computing Conference, 2023

FNNG: A High-Performance FPGA-based Accelerator for K-Nearest Neighbor Graph Construction.

Proceedings of the 2023 ACM/SIGDA International Symposium on Field Programmable Gate Arrays, 2023

MeG²: In-Memory Acceleration for Genome Graphs Analysis.

Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

2022

A Flexible Yet Efficient DNN Pruning Approach for Crossbar-Based Processing-in-Memory Architectures.

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

ReaDy: A ReRAM-Based Processing-in-Memory Accelerator for Dynamic Graph Convolutional Networks.

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

An Effective 2-Dimension Graph Partitioning for Work Stealing Assisted Graph Processing on Multi-FPGAs.

IEEE Trans. Big Data, 2022

ReCSA: a dedicated sort accelerator using ReRAM-based content addressable memory.

Frontiers Comput. Sci., 2022

GraphFly: Efficient Asynchronous Streaming Graphs Processing via Dependency-Flow.

Proceedings of the SC22: International Conference for High Performance Computing, 2022

A Data-Centric Accelerator for High-Performance Hypergraph Processing.

Proceedings of the 55th IEEE/ACM International Symposium on Microarchitecture, 2022

A General Offloading Approach for Near-DRAM Processing-In-Memory Architectures.

Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

An Efficient Graph Accelerator with Distributed On-Chip Memory Hierarchy.

Proceedings of the Algorithms and Architectures for Parallel Processing, 2022

Towards Fast GPU-based Sparse DNN Inference: A Hybrid Compute Model.

Proceedings of the IEEE High Performance Extreme Computing Conference, 2022

Accelerating Sparse Deep Neural Network Inference Using GPU Tensor Cores.

Proceedings of the IEEE High Performance Extreme Computing Conference, 2022

ScalaGraph: A Scalable Accelerator for Massively Parallel Graph Processing.

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2022

Hardware-Accelerated Hypergraph Processing with Chain-Driven Scheduling.

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2022

Accelerating Graph Convolutional Networks Using Crossbar-based Processing-In-Memory Architectures.

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2022

ReSMA: accelerating approximate string matching using ReRAM-based content addressable memory.

Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

2021

Efficient Graph Processing with Invalid Update Filtration.

IEEE Trans. Big Data, 2021

FDGLib: A Communication Library for Efficient Large-Scale Graph Processing in FPGA-Accelerated Data Centers.

J. Comput. Sci. Technol., 2021

Editorial for the special issue on high performance distributed computing.

CCF Trans. High Perform. Comput., 2021

Fast Sparse Deep Neural Network Inference with Flexible SpMM Optimization Space Exploration.

Proceedings of the 2021 IEEE High Performance Extreme Computing Conference, 2021

Productive High-Performance k-Truss Decomposition on GPU Using Linear Algebra.

Proceedings of the 2021 IEEE High Performance Extreme Computing Conference, 2021

GraSU: A Fast Graph Update Library for FPGA-based Dynamic Graph Processing.

Proceedings of the FPGA '21: The 2021 ACM/SIGDA International Symposium on Field Programmable Gate Arrays, Virtual Event, USA, February 28, 2021

SumPA: Efficient Pattern-Centric Graph Mining with Pattern Abstraction.

Proceedings of the 30th International Conference on Parallel Architectures and Compilation Techniques, 2021

2020

ReSQM: Accelerating Database Operations Using ReRAM-Based Content Addressable Memory.

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020

A Conflict-free Scheduler for High-performance Graph Processing on Multi-pipeline FPGAs.

ACM Trans. Archit. Code Optim., 2020

Efficient FPGA-based graph processing with hybrid pull-push computational model.

Frontiers Comput. Sci., 2020

Dynamic cluster strategy for hierarchical rollback-recovery protocols in MPI HPC applications.

Concurr. Comput. Pract. Exp., 2020

Effective runtime scheduling for high-performance graph processing on heterogeneous dataflow architecture.

CCF Trans. High Perform. Comput., 2020

ReGra: Accelerating Graph Traversal Applications Using ReRAM With Lower Communication Cost.

IEEE Access, 2020

Scaph: Scalable GPU-Accelerated Graph Processing with Value-Driven Differential Scheduling.

Proceedings of the 2020 USENIX Annual Technical Conference, 2020

A Locality-Aware Energy-Efficient Accelerator for Graph Mining Applications.

Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

A Heterogeneous PIM Hardware-Software Co-Design for Energy-Efficient Graph Processing.

Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

Spara: An Energy-Efficient ReRAM-Based Accelerator for Sparse Graph Analytics Applications.

Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

2019

Efficient Time-Evolving Stream Processing at Scale.

IEEE Trans. Parallel Distributed Syst., 2019

Supporting Superpages and Lightweight Page Migration in Hybrid Memory Systems.

ACM Trans. Archit. Code Optim., 2019

A Survey on Graph Processing Accelerators: Challenges and Opportunities.

J. Comput. Sci. Technol., 2019

Fast Triangle Counting on GPU.

Proceedings of the 2019 IEEE High Performance Extreme Computing Conference, 2019

RAGra: Leveraging Monolithic 3D ReRAM for Massively-Parallel Graph Processing.

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2019

2018

Scalable Data Race Detection for Lock-Intensive Programs with Pending Period Representation.

IEEE Trans. Parallel Distributed Syst., 2018

DigHR: precise dynamic detection of hidden races with weak causal relation analysis.

J. Supercomput., 2018

Efficient and Scalable Graph Parallel Processing With Symbolic Execution.

Long Zheng, Xiaofei Liao, Hai Jin

ACM Trans. Archit. Code Optim., 2018

Scalable concurrency debugging with distributed graph processing.

Proceedings of the 2018 International Symposium on Code Generation and Optimization, 2018

An efficient graph accelerator with parallel data conflict management.

Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques, 2018

Towards concurrency race debugging: an integrated approach for constraint solving and dynamic slicing.

Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques, 2018

2017

Exploiting the Parallelism Between Conflicting Critical Sections with Partial Reversion.

IEEE Trans. Parallel Distributed Syst., 2017

Hardware/software cooperative caching for hybrid DRAM/NVM memory architectures.

Proceedings of the International Conference on Supercomputing, 2017

Towards Dataflow-Based Graph Accelerator.

Proceedings of the 37th IEEE International Conference on Distributed Computing Systems, 2017

2016

A Performance Debugging Framework for Unnecessary Lock Contentions with Record/Replay Techniques.

IEEE Trans. Parallel Distributed Syst., 2016

Automatic Security Bug Classification: A Compile-Time Approach.

Proceedings of the 22nd IEEE International Conference on Parallel and Distributed Systems, 2016

2015

Understanding and identifying latent data races cross-thread interleaving.

Frontiers Comput. Sci., 2015

On performance debugging of unnecessary lock contentions on multicore processors: a replay-based approach.

Proceedings of the 13th Annual IEEE/ACM International Symposium on Code Generation and Optimization, 2015

2014

esDMT: Efficient and scalable deterministic multithreading through memory isolation.

Proceedings of the 20th IEEE International Conference on Parallel and Distributed Systems, 2014
