Pengfei Chen

Orcid: 0000-0003-0972-6900

Affiliations:
  • Sun Yat-sen University, School of Data and Computer Science, Guangzhou, China
  • Xi'an Jiaotong University, Department of Computer Science, China (PhD 2016)


According to our database, Pengfei Chen authored at least 76 papers between 2013 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
DeepCAT+: A Low-Cost and Transferrable Online Configuration Auto-Tuning Approach for Big Data Frameworks.
IEEE Trans. Parallel Distributed Syst., November, 2024

HyperTuner: a cross-layer multi-objective hyperparameter auto-tuning framework for data analytic services.
J. Supercomput., August, 2024

Network shortcut in data plane of service mesh with eBPF.
J. Netw. Comput. Appl., February, 2024

Graph neural network based robust anomaly detection at service level in SDN driven microservice system.
Comput. Networks, February, 2024

MicroFI: Non-Intrusive and Prioritized Request-Level Fault Injection for Microservice Applications.
IEEE Trans. Dependable Secur. Comput., 2024

ChangeRCA: Finding Root Causes from Software Changes in Large Online Systems.
Proc. ACM Softw. Eng., 2024

TraStrainer: Adaptive Sampling for Distributed Traces with System Runtime State.
Proc. ACM Softw. Eng., 2024

A Survey on Failure Analysis and Fault Injection in AI Systems.
CoRR, 2024

FaaSRCA: Full Lifecycle Root Cause Analysis for Serverless Applications.
Proceedings of the 35th IEEE International Symposium on Software Reliability Engineering, 2024

A Bayesian LSTM Based Active Anomaly Detection Service for Large Online Systems.
Proceedings of the 15th Asia-Pacific Symposium on Internetware, 2024

CTuner: Automatic NoSQL Database Tuning with Causal Reinforcement Learning.
Proceedings of the 15th Asia-Pacific Symposium on Internetware, 2024

LogShrink: Effective Log Compression by Leveraging Commonality and Variability of Log Data.
Proceedings of the 46th IEEE/ACM International Conference on Software Engineering, 2024

DashChef: A Metric Recommendation Service for Online Systems Using Graph Learning.
Proceedings of the Engineering of Complex Computer Systems - 28th International Conference, 2024

Real-Time Intrusion Detection and Prevention with Neural Network in Kernel Using eBPF.
Proceedings of the 54th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2024

2023
TraceRank: Abnormal service localization with dis-aggregated end-to-end tracing data in cloud native systems.
J. Softw. Evol. Process., October, 2023

ATConf: auto-tuning high dimensional configuration parameters for big data processing frameworks.
Clust. Comput., October, 2023

TurBO: A cost-efficient configuration-based auto-tuning approach for cluster-based big data frameworks.
J. Parallel Distributed Comput., July, 2023

A Spatiotemporal Deep Learning Approach for Unsupervised Anomaly Detection in Cloud Systems.
IEEE Trans. Neural Networks Learn. Syst., April, 2023

FaaSDeliver: Cost-Efficient and QoS-Aware Function Delivery in Computing Continuum.
IEEE Trans. Serv. Comput., 2023

SwissLog: Robust Anomaly Detection and Localization for Interleaved Unstructured Logs.
IEEE Trans. Dependable Secur. Comput., 2023

Nahida: In-Band Distributed Tracing with eBPF.
CoRR, 2023

Nezha: Interpretable Fine-Grained Root Causes Analysis for Microservices on Multi-modal Observability Data.
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

DiagConfig: Configuration Diagnosis of Performance Violations in Configurable Software Systems.
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

EFTuner: A Bi-Objective Configuration Parameter Auto-Tuning Method Towards Energy-Efficient Big Data Processing.
Proceedings of the 14th Asia-Pacific Symposium on Internetware, 2023

LogReducer: Identify and Reduce Log Hotspots in Kernel on the Fly.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023

DeepPower: Deep Reinforcement Learning based Power Management for Latency Critical Applications in Multi-core Systems.
Proceedings of the 52nd International Conference on Parallel Processing, 2023

MARS: Fault Localization in Programmable Networking Systems with Low-cost In-Band Network Telemetry.
Proceedings of the 52nd International Conference on Parallel Processing, 2023

2022
Microscaler: Cost-Effective Scaling for Microservice Applications in the Cloud With an Online Learning Approach.
IEEE Trans. Cloud Comput., 2022

JointConf: Jointly autotuning configuration parameters for modularized graph databases.
J. Softw. Evol. Process., 2022

A Semi-Supervised VAE Based Active Anomaly Detection Framework in Multivariate Time Series for Online Systems.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

A Transferable Time Series Forecasting Service Using Deep Transformer Model for Online Systems.
Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering, 2022

Graph based Incident Extraction and Diagnosis in Large-Scale Online Systems.
Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering, 2022

Going through the Life Cycle of Faults in Clouds: Guidelines on Fault Handling.
Proceedings of the IEEE 33rd International Symposium on Software Reliability Engineering, 2022

Share or Not Share? Towards the Practicability of Deep Models for Unsupervised Anomaly Detection in Modern Online Systems.
Proceedings of the IEEE 33rd International Symposium on Software Reliability Engineering, 2022

TS-InvarNet: Anomaly Detection and Localization based on Tempo-spatial KPI Invariants in Distributed Services.
Proceedings of the IEEE International Conference on Web Services, 2022

MicroSketch: Lightweight and Adaptive Sketch Based Performance Issue Detection and Localization in Microservice Systems.
Proceedings of the Service-Oriented Computing - 20th International Conference, 2022

DeepCAT: A Cost-Efficient Online Configuration Auto-Tuning Approach for Big Data Frameworks.
Proceedings of the 51st International Conference on Parallel Processing, 2022

Active-MTSAD: Multivariate Time Series Anomaly Detection With Active Learning.
Proceedings of the 52nd Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2022

2021
Function delivery network: Extending serverless computing for heterogeneous platforms.
Softw. Pract. Exp., 2021

MicroRank: End-to-End Latency Issue Localization with Extended Spectrum Analysis in Microservice Environments.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Sieve: Attention-based Sampling of End-to-End Trace Data in Distributed Microservice Systems.
Proceedings of the 2021 IEEE International Conference on Web Services, 2021

Poster: Function Delivery Network: Extending Serverless to Heterogeneous Computing.
Proceedings of the 41st IEEE International Conference on Distributed Computing Systems, 2021

T-Rank: A Lightweight Spectrum based Fault Localization Approach for Microservice Systems.
Proceedings of the 21st IEEE/ACM International Symposium on Cluster, 2021

2020
Hdconfigor: Automatically Tuning High Dimensional Configuration Parameters for Log Search Engines.
IEEE Access, 2020

A Framework of Virtual War Room and Matrix Sketch-Based Streaming Anomaly Detection for Microservice Systems.
IEEE Access, 2020

AutoMAP: Diagnose Your Microservice-based Web Applications Automatically.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

SwissLog: Robust and Unified Deep Learning Based Log Anomaly Detection for Diverse Faults.
Proceedings of the 31st IEEE International Symposium on Software Reliability Engineering, 2020

A Learning-based Dynamic Load Balancing Approach for Microservice Systems in Multi-cloud Environment.
Proceedings of the 26th IEEE International Conference on Parallel and Distributed Systems, 2020

ConfAdvisor: An Automatic Configuration Tuning Framework for NoSQL Database Benchmarking with a Black-box Approach.
Proceedings of the Benchmarking, Measuring, and Optimizing, 2020

2019
CauseInfer: Automated End-to-End Performance Diagnosis with Hierarchical Causality Graph in Cloud Environment.
IEEE Trans. Serv. Comput., 2019

Microscaler: Automatic Scaling for Microservices with an Online Learning Approach.
Proceedings of the 2019 IEEE International Conference on Web Services, 2019

Nebula: A Blockchain Based Decentralized Sharing Computing Platform.
Proceedings of the Blockchain and Trustworthy Systems - First International Conference, 2019

2018
ARF-Predictor: Effective Prediction of Aging-Related Failure Using Entropy.
IEEE Trans. Dependable Secur. Comput., 2018

Opportunities and Challenges Towards Cognitive IT Service Management in Real World.
Proceedings of the IEEE Symposium on Service-Oriented System Engineering, 2018

Microscope: Pinpoint Performance Issues with Causal Graphs in Micro-service Environments.
Proceedings of the Service-Oriented Computing - 16th International Conference, 2018

On Anomaly Detection and Root Cause Analysis of Microservice Systems.
Proceedings of the Service-Oriented Computing - ICSOC 2018 Workshops, 2018

CloudRanger: Root Cause Identification for Cloud Native Systems.
Proceedings of the 18th IEEE/ACM International Symposium on Cluster, 2018

2017
InvarNet-X: A Black-Box Invariant-Based Approach to Diagnosing Big Data Systems.
IEEE Trans. Emerg. Top. Comput., 2017

A cost-effective strategy for Cloud system maintenance.
Comput. Electr. Eng., 2017

An Approach for Anomaly Diagnosis Based on Hybrid Graph Model with Logs for Distributed Services.
Proceedings of the 2017 IEEE International Conference on Web Services, 2017

LogDC: Problem Diagnosis for Declartively-Deployed Cloud Applications with Log.
Proceedings of the 14th IEEE International Conference on e-Business Engineering, 2017

DriftInsight: Detecting Anomalous Behaviors in Large-Scale Cloud Platform.
Proceedings of the 2017 IEEE 10th International Conference on Cloud Computing (CLOUD), 2017

LogSed: Anomaly Diagnosis through Mining Time-Weighted Control Flow Graph in Logs.
Proceedings of the 2017 IEEE 10th International Conference on Cloud Computing (CLOUD), 2017

2016
A Heuristic Time Sharing Policy for Backup Resources in Cloud System.
KSII Trans. Internet Inf. Syst., 2016

TaskInsight: A Fine-Grained Performance Anomaly Detection and Problem Locating System.
Proceedings of the 9th IEEE International Conference on Cloud Computing, 2016

Optimizing Backup Resources in the Cloud.
Proceedings of the 9th IEEE International Conference on Cloud Computing, 2016

2015
Making Availability as a Service in the Clouds.
CoRR, 2015

CHAOS: Accurate and Realtime Detection of Aging-Oriented Failure Using Entropy.
CoRR, 2015

A Novel Approach to Improving Resource Utilization for IaaS.
Proceedings of the 12th Web Information System and Application Conference, 2015

2014
An Automatic Framework for Detecting and Characterizing Performance Degradation of Software Systems.
IEEE Trans. Reliab., 2014

Bio-inspired Mechanism and Model Exploration of Software Aging.
CoRR, 2014

InvarNet-X: A Comprehensive Invariant Based Approach for Performance Diagnosis in Big Data Platform.
Proceedings of the Big Data Benchmarks, Performance Optimization, and Emerging Hardware, 2014

CauseInfer: Automatic and distributed performance diagnosis with hierarchical causality graph in large distributed systems.
Proceedings of the 2014 IEEE Conference on Computer Communications, 2014

Multiplexing of Backup VMs Based on Greedy Policy.
Proceedings of the 11th Web Information System and Application Conference, 2014

2013
Multi-scale Entropy: One Metric of Software Aging.
Proceedings of the Seventh IEEE International Symposium on Service-Oriented System Engineering, 2013

An ensemble MIC-based approach for performance diagnosis in big data platform.
Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

