Dan Pei
Orcid: 0000-0002-5113-838XAffiliations:
- Tsinghua University, Beijing, China
- AT&T Inc (former)
According to our database1,
Dan Pei
authored at least 202 papers
between 1999 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
Privacy-preserving MTS anomaly detection for network devices through federated learning.
Inf. Sci., 2025
2024
Proc. VLDB Endow., August, 2024
Diagnosing Performance Issues for Large-Scale Microservice Systems With Heterogeneous Graph.
IEEE Trans. Serv. Comput., 2024
J. Netw. Comput. Appl., 2024
A Scenario-Oriented Benchmark for Assessing AIOps Algorithms in Microservice Management.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Supervised Fine-Tuning for Unsupervised KPI Anomaly Detection for Mobile Web Systems.
Proceedings of the ACM on Web Conference 2024, 2024
Revisiting VAE for Unsupervised Time Series Anomaly Detection: A Frequency Perspective.
Proceedings of the ACM on Web Conference 2024, 2024
Illuminating the Gray Zone: Non-intrusive Gray Failure Localization in Server Operating Systems.
Proceedings of the Companion Proceedings of the 32nd ACM International Conference on the Foundations of Software Engineering, 2024
Proceedings of the Companion Proceedings of the 32nd ACM International Conference on the Foundations of Software Engineering, 2024
Proceedings of the Companion Proceedings of the 32nd ACM International Conference on the Foundations of Software Engineering, 2024
Chain-of-Event: Interpretable Root Cause Analysis for Microservices through Automatically Learning Weighted Event Causal Graph.
Proceedings of the Companion Proceedings of the 32nd ACM International Conference on the Foundations of Software Engineering, 2024
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024
Microservice Root Cause Analysis With Limited Observability Through Intervention Recognition in the Latent Space.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024
Guardian of the Resiliency: Detecting Erroneous Software Changes Before They Make Your Microservice System Less Fault-Resilient.
Proceedings of the 32nd IEEE/ACM International Symposium on Quality of Service, 2024
Enhanced Fine-Tuning of Lightweight Domain-Specific Q&A Model Based on Large Language Models.
Proceedings of the 35th IEEE International Symposium on Software Reliability Engineering, 2024
Auto-PIP: Real-time Identification of Critical Performance Inflection Points in Software Stress Testing.
Proceedings of the 35th IEEE International Symposium on Software Reliability Engineering, 2024
LabelEase: A Semi-Automatic Tool for Efficient and Accurate Trace Labeling in Microservices.
Proceedings of the 35th IEEE International Symposium on Software Reliability Engineering, 2024
Proceedings of the 35th IEEE International Symposium on Software Reliability Engineering, 2024
Multivariate Time Series Anomaly Detection based on Pre-trained Models with Dual-Attention Mechanism.
Proceedings of the 35th IEEE International Symposium on Software Reliability Engineering, 2024
TimeSeriesBench: An Industrial-Grade Benchmark for Time Series Anomaly Detection Models.
Proceedings of the 35th IEEE International Symposium on Software Reliability Engineering, 2024
Proceedings of the 35th IEEE International Symposium on Software Reliability Engineering, 2024
Causality Enhanced Graph Representation Learning for Alert-Based Root Cause Analysis.
Proceedings of the 24th IEEE International Symposium on Cluster, 2024
2023
IEEE Trans. Computers, October, 2023
IEEE Trans. Netw. Serv. Manag., September, 2023
Generic and robust root cause localization for multi-dimensional data in online service systems.
J. Syst. Softw., September, 2023
J. Syst. Softw., September, 2023
IEEE Trans. Serv. Comput., 2023
Robust Anomaly Clue Localization of Multi-Dimensional Derived Measure for Online Video Services.
IEEE Trans. Serv. Comput., 2023
IEEE Trans. Serv. Comput., 2023
Lindorm TSDB: A Cloud-native Time-series Database for Large-scale Monitoring Systems.
Proc. VLDB Endow., 2023
CoRR, 2023
CMDiagnostor: An Ambiguity-Aware Root Cause Localization Approach Based on Call Metric Data.
Proceedings of the ACM Web Conference 2023, 2023
Proceedings of the ACM Web Conference 2023, 2023
From Point-wise to Group-wise: A Fast and Accurate Microservice Trace Anomaly Detection Approach.
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023
Proceedings of the 34th IEEE International Symposium on Software Reliability Engineering, 2023
Proceedings of the 34th IEEE International Symposium on Software Reliability Engineering, 2023
Proceedings of the 34th IEEE International Symposium on Software Reliability Engineering, 2023
Efficient Multivariate Time Series Anomaly Detection Through Transfer Learning for Large-Scale Web Services.
Proceedings of the IEEE International Conference on Web Services, 2023
2022
Detecting Outlier Machine Instances Through Gaussian Mixture Variational Autoencoder With One Dimensional CNN.
IEEE Trans. Computers, 2022
Online malicious domain name detection with partial labels for large-scale dependable systems.
J. Syst. Softw., 2022
Efficient KPI Anomaly Detection Through Transfer Learning for Large-Scale Web Services.
IEEE J. Sel. Areas Commun., 2022
Situation-Aware Multivariate Time Series Anomaly Detection Through Active Learning and Contrast VAE-Based Models in Large Distributed Systems.
IEEE J. Sel. Areas Commun., 2022
Actionable and Interpretable Fault Localization for Recurring Failures in Online Service Systems.
CoRR, 2022
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022
Actionable and interpretable fault localization for recurring failures in online service systems.
Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2022
Causal Inference-Based Root Cause Analysis for Online Service Systems with Intervention Recognition.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022
Identifying Erroneous Software Changes through Self-Supervised Contrastive Learning on Time Series Data.
Proceedings of the IEEE 33rd International Symposium on Software Reliability Engineering, 2022
Proceedings of the IEEE 33rd International Symposium on Software Reliability Engineering, 2022
Proceedings of the Database and Expert Systems Applications, 2022
Generic and Robust Performance Diagnosis via Causal Inference for OLTP Database Systems.
Proceedings of the 22nd IEEE International Symposium on Cluster, 2022
2021
IEEE Trans. Netw. Serv. Manag., 2021
CoRR, 2021
DockerMock: Pre-Build Detection of Dockerfile Faults through Mocking Instruction Execution.
CoRR, 2021
Proceedings of the 2021 USENIX Annual Technical Conference, 2021
An empirical investigation of practical log anomaly detection for online service systems.
Proceedings of the ESEC/FSE '21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2021
Identifying bad software changes via multimodal anomaly detection for online service systems.
Proceedings of the ESEC/FSE '21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2021
Multivariate Time Series Anomaly Detection and Interpretation using Hierarchical Inter-Metric and Temporal Embedding.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021
Proceedings of the 29th IEEE/ACM International Symposium on Quality of Service, 2021
Proceedings of the 32nd IEEE International Symposium on Software Reliability Engineering, 2021
Proceedings of the 32nd IEEE International Symposium on Software Reliability Engineering, 2021
CTF: Anomaly Detection in High-Dimensional Time Series with Coarse-to-Fine Model Transfer.
Proceedings of the 40th IEEE Conference on Computer Communications, 2021
2020
Proc. VLDB Endow., 2020
IEEE Access, 2020
Proceedings of the ESEC/FSE '20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2020
A Practical Machine Learning-Based Framework to Detect DNS Covert Communication in Enterprises.
Proceedings of the Security and Privacy in Communication Networks, 2020
Proceedings of the 28th IEEE/ACM International Symposium on Quality of Service, 2020
Unsupervised Detection of Microservice Trace Anomalies through Service-Level Deep Bayesian Networks.
Proceedings of the 31st IEEE International Symposium on Software Reliability Engineering, 2020
LogTransfer: Cross-System Log Anomaly Detection for Software Systems with Transfer Learning.
Proceedings of the 31st IEEE International Symposium on Software Reliability Engineering, 2020
Proceedings of the 39th IEEE International Performance Computing and Communications Conference, 2020
Proceedings of the 39th IEEE Conference on Computer Communications, 2020
ZeroWall: Detecting Zero-Day Web Attacks through Encoder-Decoder Recurrent Neural Networks.
Proceedings of the 39th IEEE Conference on Computer Communications, 2020
Unsupervised Clustering through Gaussian Mixture Variational AutoEncoder with Non-Reparameterized Variational Inference and Std Annealing.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020
Proceedings of the ICSE-SEIP 2020: 42nd International Conference on Software Engineering, Software Engineering in Practice, Seoul, South Korea, 27 June, 2020
Proceedings of the Neural Information Processing - 27th International Conference, 2020
Proceedings of the Neural Information Processing - 27th International Conference, 2020
Proceedings of the 29th International Conference on Computer Communications and Networks, 2020
Proceedings of the 29th International Conference on Computer Communications and Networks, 2020
Proceedings of the 29th International Conference on Computer Communications and Networks, 2020
2019
IEEE Trans. Netw. Serv. Manag., 2019
Walking Without Friends: Publishing Anonymized Trajectory Dataset Without Leaking Social Relationships.
IEEE Trans. Netw. Serv. Manag., 2019
Dynamic TCP Initial Windows and Congestion Control Schemes Through Reinforcement Learning.
IEEE J. Sel. Areas Commun., 2019
On the Necessity and Effectiveness of Learning the Prior of Variational Auto-Encoder.
CoRR, 2019
Proceedings of the 13th ACM Conference on Recommender Systems, 2019
Robust Anomaly Detection for Multivariate Time Series through Stochastic Recurrent Neural Network.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019
Proceedings of the International Symposium on Quality of Service, 2019
FluxRank: A Widely-Deployable Framework to Automatically Localizing Root Cause Machines for Software Service Failure Mitigation.
Proceedings of the 30th IEEE International Symposium on Software Reliability Engineering, 2019
Proceedings of the 30th IEEE International Symposium on Software Reliability Engineering, 2019
Proceedings of the 2019 IEEE Conference on Computer Communications, 2019
Proceedings of the 2019 IEEE Conference on Computer Communications, 2019
LogAnomaly: Unsupervised Detection of Sequential and Quantitative Anomalies in Unstructured Logs.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Causal Analysis of the Unsatisfying Experience in Realtime Mobile Multiplayer Games in the Wild.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019
Collaborative learning between cloud and end devices: an empirical study on location prediction.
Proceedings of the 4th ACM/IEEE Symposium on Edge Computing, 2019
2018
IEEE Trans. Serv. Comput., 2018
IEEE/ACM Trans. Netw., 2018
Proc. ACM Meas. Anal. Comput. Syst., 2018
无线网络用户的Wi-Fi指纹匿名化研究 (Study on Wi-Fi Fingerprint Anonymization for Users in Wireless Networks).
计算机科学, 2018
IEEE Access, 2018
Unsupervised Anomaly Detection via Variational Auto-Encoder for Seasonal KPIs in Web Applications.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018
Reducing Web Latency Through Dynamically Setting TCP Initial Window with Reinforcement Learning.
Proceedings of the 26th IEEE/ACM International Symposium on Quality of Service, 2018
Proceedings of the 26th IEEE/ACM International Symposium on Quality of Service, 2018
Proceedings of the 26th IEEE/ACM International Symposium on Quality of Service, 2018
Proceedings of the 29th IEEE International Symposium on Software Reliability Engineering, 2018
The Frame Latency of Personalized Livestreaming Can Be Significantly Slowed Down by WiFi.
Proceedings of the 37th IEEE International Performance Computing and Communications Conference, 2018
Robust and Unsupervised KPI Anomaly Detection Based on Conditional Variational Autoencoder.
Proceedings of the 37th IEEE International Performance Computing and Communications Conference, 2018
Rapid Deployment of Anomaly Detection Models for Large Number of Emerging KPI Streams.
Proceedings of the 37th IEEE International Performance Computing and Communications Conference, 2018
Continuous delivery of personalized assessment and feedback in agile software engineering projects.
Proceedings of the 40th International Conference on Software Engineering: Software Engineering Education and Training, 2018
Proceedings of the 2018 IEEE International Conference on Communications, 2018
The DevOps Lab Platform for Managing Diversified Projects in Educating Agile Software Engineering.
Proceedings of the IEEE Frontiers in Education Conference, 2018
2017
𝔽<sup>2</sup> Tree: Rapid Failure Recovery for Routing in Production Data Center Networks.
IEEE/ACM Trans. Netw., 2017
IEEE Trans. Inf. Forensics Secur., 2017
Frontiers Comput. Sci., 2017
Syslog processing for switch failure diagnosis and prediction in datacenter networks.
Proceedings of the 25th IEEE/ACM International Symposium on Quality of Service, 2017
Proceedings of the 25th IEEE/ACM International Symposium on Quality of Service, 2017
Proceedings of the 25th IEEE/ACM International Symposium on Quality of Service, 2017
Proceedings of the 36th IEEE International Performance Computing and Communications Conference, 2017
Proceedings of the 2017 IEEE Conference on Computer Communications, 2017
Proceedings of the 26th International Conference on Computer Communication and Networks, 2017
Proceedings of the Adjunct Proceedings of the 2017 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2017 ACM International Symposium on Wearable Computers, 2017
2016
J. Commun. Networks, 2016
Fast and Cautious: Leveraging Multi-path Diversity for Transport Loss Recovery in Data Centers.
Proceedings of the 2016 USENIX Annual Technical Conference, 2016
Proceedings of the 2016 IFIP Networking Conference, 2016
Proceedings of the 3rd International on Workshop on Physical Analytics, 2016
Proceedings of the 14th Annual International Conference on Mobile Systems, 2016
Understanding the Impact of AP Density on WiFi Performance Through Real-World Deployment.
Proceedings of the IEEE International Symposium on Local and Metropolitan Area Networks, 2016
Proceedings of the 24th IEEE/ACM International Symposium on Quality of Service, 2016
M<sup>3</sup>: Practical and reliable multi-layer video multicast over multi-rate Wi-Fi network.
Proceedings of the 24th IEEE/ACM International Symposium on Quality of Service, 2016
Proceedings of the 35th IEEE International Performance Computing and Communications Conference, 2016
Proceedings of the 35th Annual IEEE International Conference on Computer Communications, 2016
Proceedings of the 35th Annual IEEE International Conference on Computer Communications, 2016
Proceedings of the 25th International Conference on Computer Communication and Networks, 2016
Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2016
2015
Alleviating flow interference in data center networks through fine-grained switch queue management.
Comput. Networks, 2015
Proceedings of the 34th IEEE International Performance Computing and Communications Conference, 2015
Proceedings of the 34th IEEE International Performance Computing and Communications Conference, 2015
Proceedings of the 2015 IEEE Conference on Computer Communications, 2015
Opprentice: Towards Practical and Automatic Anomaly Detection Through Machine Learning.
Proceedings of the 2015 ACM Internet Measurement Conference, 2015
Proceedings of the 44th International Conference on Parallel Processing, 2015
Rewiring 2 Links Is Enough: Accelerating Failure Recovery in Production Data Center Networks.
Proceedings of the 35th IEEE International Conference on Distributed Computing Systems, 2015
Proceedings of the 24th International Conference on Computer Communication and Networks, 2015
Rapid and robust impact assessment of software changes in large internet-based services.
Proceedings of the 11th ACM Conference on Emerging Networking Experiments and Technologies, 2015
2014
Where the Sidewalk Ends: Extending the Internet AS Graph Using Traceroutes from P2P Users.
IEEE Trans. Computers, 2014
Multi-AS cooperative incoming traffic engineering in a transit-edge separate internet.
Comput. Networks, 2014
Proceedings of the Network and Parallel Computing, 2014
Proceedings of the 2014 IFIP Networking Conference, Trondheim, 2014
Proceedings of the IEEE 39th Conference on Local Computer Networks, 2014
Proceedings of the 23rd International Conference on Computer Communication and Networks, 2014
Proceedings of the 10th International Conference on Network and Service Management, 2014
2012
G-RCA: a generic root cause analysis platform for service quality management in large IP networks.
IEEE/ACM Trans. Netw., 2012
Proceedings of the IEEE INFOCOM 2012, Orlando, FL, USA, March 25-30, 2012, 2012
Proceedings of the IEEE INFOCOM 2012, Orlando, FL, USA, March 25-30, 2012, 2012
2011
Proceedings of the SIGMETRICS 2011, 2011
Proceedings of the INFOCOM 2011. 30th IEEE International Conference on Computer Communications, 2011
Proceedings of the Algorithms - ESA 2011, 2011
2010
IEEE/ACM Trans. Netw., 2010
Proceedings of the 10th ACM SIGCOMM Internet Measurement Conference, 2010
Proceedings of the 18th annual IEEE International Conference on Network Protocols, 2010
2009
Proceedings of the 18th International Conference on World Wide Web, 2009
Proceedings of the 18th USENIX Security Symposium, 2009
Proceedings of the Passive and Active Network Measurement, 10th International Conference, 2009
Towards Efficient Large-Scale VPN Monitoring and Diagnosis under Operational Constraints.
Proceedings of the INFOCOM 2009. 28th IEEE International Conference on Computer Communications, 2009
Darkstar: Using exploratory data mining to raise the bar on network reliability and performance.
Proceedings of the 7th International Workshop on Design of Reliable Communication Networks, 2009
2008
In search of the elusive ground truth: the internet's as-level connectivity structure.
Proceedings of the 2008 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, 2008
Proceedings of the 2008 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, 2008
Proceedings of the ACM SIGCOMM 2008 Conference on Applications, 2008
2007
Proceedings of the 2007 USENIX Annual Technical Conference, 2007
Proceedings of the ACM SIGCOMM 2007 Conference on Applications, 2007
Proceedings of the INFOCOM 2007. 26th IEEE International Conference on Computer Communications, 2007
2006
Comput. Networks, 2006
Proceedings of the 15th USENIX Security Symposium, Vancouver, BC, Canada, July 31, 2006
Proceedings of the 6th ACM SIGCOMM Internet Measurement Conference, 2006
Proceedings of the 6th ACM SIGCOMM Internet Measurement Conference, 2006
2005
Comput. Networks, 2005
Proceedings of the 25th International Conference on Distributed Computing Systems (ICDCS 2005), 2005
Proceedings of the 14th International Conference On Computer Communications and Networks, 2005
2004
Proceedings of the 24th International Conference on Distributed Computing Systems (ICDCS 2004), 2004
2003
IEEE Trans. Parallel Distributed Syst., 2003
Proceedings of the LANC '03 IFIP / ACM Latin American Networking Conference 2003, 2003
Proceedings of the 23rd International Conference on Distributed Computing Systems (ICDCS 2003), 2003
Proceedings of the Global Telecommunications Conference, 2003
Proceedings of the Self-Managing Distributed Systems, 2003
Proceedings of the 2003 International Conference on Dependable Systems and Networks (DSN 2003), 2003
Proceedings of the 3rd DARPA Information Survivability Conference and Exposition (DISCEX-III 2003), 2003
2002
Proceedings of the Proceedings IEEE INFOCOM 2002, 2002
Proceedings of the 2nd ACM SIGCOMM Internet Measurement Workshop, 2002
Proceedings of the 2002 International Conference on Dependable Systems and Networks (DSN 2002), 2002
2001
Proceedings of the 1st ACM SIGCOMM Internet Measurement Workshop, 2001
1999
ACM SIGOPS Oper. Syst. Rev., 1999