Feng Zhang

Orcid: 0000-0003-1983-7321

Affiliations:
  • Renmin University of China, Key Laboratory of Data Engineering and Knowledge Engineering, Beijing, China
  • Tsinghua University, Department of Computer Science and Technology, Beijing, China (former)


According to our database1, Feng Zhang authored at least 80 papers between 2000 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
GPU-based butterfly counting.
VLDB J., September, 2024

Data-Aware Adaptive Compression for Stream Processing.
IEEE Trans. Knowl. Data Eng., September, 2024

Enabling Adaptive Sampling for Intra-Window Join: Simultaneously Optimizing Quantity and Quality.
Proc. ACM Manag. Data, September, 2024

HocoPG: A Database System with Homomorphic Compression for Text Processing.
Proc. VLDB Endow., August, 2024

TDSQL: Tencent Distributed Database System.
Proc. VLDB Endow., August, 2024

Graph-Centric Performance Analysis for Large-Scale Parallel Applications.
IEEE Trans. Parallel Distributed Syst., July, 2024

Enabling Window-Based Monotonic Graph Analytics with Reusable Transitional Results for Pattern-Consistent Queries.
Proc. VLDB Endow., July, 2024

G-Learned Index: Enabling Efficient Learned Index on GPU.
IEEE Trans. Parallel Distributed Syst., June, 2024

Compressed Data Direct Computing for Databases.
IEEE Trans. Knowl. Data Eng., May, 2024

Improving Graph Compression for Efficient Resource-Constrained Graph Analytics.
Proc. VLDB Endow., May, 2024

Compressed data direct computing for Chinese dataset on DCU.
CCF Trans. High Perform. Comput., April, 2024

Binary-Stochasticity-Enabled Highly Efficient Neuromorphic Deep Learning Achieves Better-than-Software Accuracy.
Adv. Intell. Syst., January, 2024

Autonomous driving system: A comprehensive survey.
Expert Syst. Appl., 2024

F-TADOC: FPGA-Based Text Analytics Directly on Compression with HLS.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

Enabling Efficient NVM-Based Text Analytics without Decompression.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

2023
Optimizing DNNs With Partially Equivalent Transformations and Automated Corrections.
IEEE Trans. Computers, December, 2023

Homomorphic Compression: Making Text Processing on Compression Unlimited.
Proc. ACM Manag. Data, December, 2023

Enabling Efficient Random Access to Hierarchically Compressed Text Data on Diverse GPU Platforms.
IEEE Trans. Parallel Distributed Syst., October, 2023

Expanding the Edge: Enabling Efficient Winograd CNN Inference With Deep Reuse on Edge Device.
IEEE Trans. Knowl. Data Eng., October, 2023

BladeDISC: Optimizing Dynamic Shape Machine Learning Workloads via Compiler Approach.
Proc. ACM Manag. Data, September, 2023

CompressGraph: Efficient Parallel Graph Analytics with Rule-Based Compression.
Proc. ACM Manag. Data, 2023

CompressStreamDB: Fine-Grained Adaptive Stream Processing without Decompression.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

EdgeNN: Efficient Neural Network Inference for CPU-GPU Integrated Edge Devices.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Efficient Anomaly Detection in Property Graphs.
Proceedings of the Database Systems for Advanced Applications, 2023

RECom: A Compiler Approach to Accelerating Recommendation Model Inference with Massive Embedding Columns.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

Space-Efficient TREC for Enabling Deep Learning on Microcontrollers.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
YuenyeungSpTRSV.
Dataset, May, 2022

Payment behavior prediction on shared parking lots with TR-GCN.
VLDB J., 2022

POCLib: A High-Performance Framework for Enabling Near Orthogonal Processing on Compression.
IEEE Trans. Parallel Distributed Syst., 2022

Detecting Performance Variance for Parallel Applications Without Source Code.
IEEE Trans. Parallel Distributed Syst., 2022

Leveraging Code Snippets to Detect Variations in the Performance of HPC Systems.
IEEE Trans. Parallel Distributed Syst., 2022

Exploring Data Analytics Without Decompression on Embedded GPU Systems.
IEEE Trans. Parallel Distributed Syst., 2022

G-SLIDE: A GPU-Based Sub-Linear Deep Learning Engine via LSH Sparsification.
IEEE Trans. Parallel Distributed Syst., 2022

Exploring Query Processing on CPU-GPU Integrated Edge Device.
IEEE Trans. Parallel Distributed Syst., 2022

Periodic Weather-Aware LSTM With Event Mechanism for Parking Behavior Prediction.
IEEE Trans. Knowl. Data Eng., 2022

Efficient Load-Balanced Butterfly Counting on GPU.
Proc. VLDB Endow., 2022

An Adaptive Elastic Multi-model Big Data Analysis and Information Extraction System.
Data Sci. Eng., 2022

DREW: Efficient Winograd CNN Inference with Deep Reuse.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

An In-depth Analysis of Subflow Degradation for Multi-path TCP on High Speed Rails.
Proceedings of the 23rd IEEE International Symposium on a World of Wireless, 2022

CompressDB: Enabling Efficient Compressed Data Direct Processing for Various Databases.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Optimizing Random Access to Hierarchically-Compressed Data on GPU.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

Taming the Big Data Monster: Managing Petabytes of Data with Multi-Model Databases.
Proceedings of the 2022 IEEE 34th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), 2022

TREC: Transient Redundancy Elimination-based Convolution.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
TADOC: Text analytics directly on compression.
VLDB J., 2021

Fine-Grained Multi-Query Stream Processing on Integrated Architectures.
IEEE Trans. Parallel Distributed Syst., 2021

iMLBench: A Machine Learning Benchmark Suite for CPU-GPU Integrated Architectures.
IEEE Trans. Parallel Distributed Syst., 2021

YuenyeungSpTRSV: A Thread-Level and Warp-Level Fusion Synchronization-Free Sparse Triangular Solve.
IEEE Trans. Parallel Distributed Syst., 2021

An Efficient Parallel Secure Machine Learning Framework on GPUs.
IEEE Trans. Parallel Distributed Syst., 2021

DTransE: Distributed Translating Embedding for Knowledge Graph.
IEEE Trans. Parallel Distributed Syst., 2021

Automatic Irregularity-Aware Fine-Grained Workload Partitioning on Integrated Architectures.
IEEE Trans. Knowl. Data Eng., 2021

Preface.
J. Comput. Sci. Technol., 2021

Exploring deep reuse in winograd CNN inference.
Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

G-TADOC: Enabling Efficient GPU-Based Text Analytics without Decompression.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

FineQuery: Fine-Grained Query Processing on CPU-GPU Integrated Architectures.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

2020
Modeling Analysis and Cost-Performance Ratio Optimization of Virtual Machine Scheduling in Cloud Computing.
IEEE Trans. Parallel Distributed Syst., 2020

FineStream: Fine-Grained Window-Based Stream Processing on CPU-GPU Integrated Architectures.
Proceedings of the 2020 USENIX Annual Technical Conference, 2020

Payment Behavior Prediction and Statistical Analysis for Shared Parking Lots.
Proceedings of the Network and Parallel Computing, 2020

PewLSTM: Periodic LSTM with Weather-Aware Gating Mechanism for Parking Behavior Prediction.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

CapelliniSpTRSV: A Thread-Level Synchronization-Free Sparse Triangular Solve on GPUs.
Proceedings of the ICPP 2020: 49th International Conference on Parallel Processing, 2020

ParSecureML: An Efficient Parallel Secure Machine Learning Framework on GPUs.
Proceedings of the ICPP 2020: 49th International Conference on Parallel Processing, 2020

Towards Concurrent Stateful Stream Processing on Multicore Processors.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

Enabling Efficient Random Access to Hierarchically-Compressed Data.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

Exploration of TransE in a Distributed Environment.
Proceedings of the 40th IEEE International Conference on Distributed Computing Systems, 2020

2019
Hardware-Conscious Stream Processing: A Survey.
SIGMOD Rec., 2019

Guest Editorial: Special Issue on Network and Parallel Computing for Emerging Architectures and Applications.
Int. J. Parallel Program., 2019

Scaling Stream Processing with Transactional State Management on Multicores.
CoRR, 2019

Performance evaluation and analysis of sparse matrix and graph kernels on heterogeneous processors.
CCF Trans. High Perform. Comput., 2019

Statistical Analysis and Prediction of Parking Behavior.
Proceedings of the Network and Parallel Computing, 2019

Distributed Join Algorithms on Multi-CPU Clusters with GPUDirect RDMA.
Proceedings of the 48th International Conference on Parallel Processing, 2019

Parallel Hybrid Join Algorithm on GPU.
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019

2018
An adaptive breadth-first search algorithm on integrated architectures.
J. Supercomput., 2018

Efficient Document Analytics on Compressed Data: Method, Challenges, Algorithms, Insights.
Proc. VLDB Endow., 2018

Zwift: A Programming Framework for High Performance Text Analytics on Compressed Data.
Proceedings of the 32nd International Conference on Supercomputing, 2018

2017
Understanding Co-Running Behaviors on Integrated CPU/GPU Architectures.
IEEE Trans. Parallel Distributed Syst., 2017

FinePar: irregularity-aware fine-grained workload partitioning on integrated architectures.
Proceedings of the 2017 International Symposium on Code Generation and Optimization, 2017

2016
Characterizing and optimizing TPC-C workloads on large-scale systems using SSD arrays.
Sci. China Inf. Sci., 2016

2015
To Co-run, or Not to Co-run: A Performance Study on Integrated Architectures.
Proceedings of the 23rd IEEE International Symposium on Modeling, 2015

2002
A New Statistical Approach to Personal Name Extraction.
Proceedings of the Machine Learning, 2002

2001
Web mining for Web image retrieval.
J. Assoc. Inf. Sci. Technol., 2001

2000
Chinese Pinyin Input Method For Mobile Phone.
Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000


  Loading...