2025
KVCache Cache in the Wild: Characterizing and Optimizing KVCache Cache at a Large Cloud Provider.
CoRR, June, 2025
OSS-UAgent: An Agent-based Usability Evaluation Framework for Open Source Software.
CoRR, May, 2025
SRDiffusion: Accelerate Video Diffusion Inference via Sketching-Rendering Cooperation.
,
,
,
,
,
,
,
,
,
,
CoRR, May, 2025
ServeGen: Workload Characterization and Generation of Large Language Model Serving in Production.
CoRR, May, 2025
A Graph-native Optimization Framework for Complex Graph Queries.
CoRR, March, 2025
Wan: Open and Advanced Large-Scale Video Generative Models.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, March, 2025
A Survey of Distributed Graph Algorithms on Massive Graphs.
ACM Comput. Surv., February, 2025
Static Batching of Irregular Workloads on GPUs: Framework and Application to Efficient MoE Model Inference.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, January, 2025
Qwen2.5-1M Technical Report.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, January, 2025
A Modular Graph-Native Query Optimization Framework.
Proceedings of the Companion of the 2025 International Conference on Management of Data, 2025
Helios: Efficient Distributed Dynamic Graph Sampling for Online GNN Inference.
,
,
,
,
,
,
,
,
,
,
Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2025
Moko: Marrying Python with Big Data Systems.
Proceedings of the Twentieth European Conference on Computer Systems, 2025
AsymKV: Enabling 1-Bit Quantization of KV Cache with Layer-Wise Asymmetric Quantization Configurations.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
2024
A Roadmap to Graph Analytics.
SIGMOD Rec., December, 2024
LSMGraph: A High-Performance Dynamic Graph Storage System with Multi-Level CSR.
,
,
,
,
,
,
,
,
,
,
,
Proc. ACM Manag. Data, December, 2024
Towards a Converged Relational-Graph Optimization Framework.
Proc. ACM Manag. Data, December, 2024
Towards Efficient Graph Processing in Geo-Distributed Data Centers.
IEEE Trans. Parallel Distributed Syst., November, 2024
GraphAr: An Efficient Storage Scheme for Graph Data in Data Lakes.
Proc. VLDB Endow., November, 2024
GastCoCo: Graph Storage and Coroutine-Based Prefetch Co-Design for Dynamic Graph Processing.
Proc. VLDB Endow., September, 2024
Ingress: an automated incremental graph processing system.
,
,
,
,
,
,
,
,
,
,
VLDB J., May, 2024
Linking Entities across Relations and Graphs.
ACM Trans. Database Syst., March, 2024
XGNN: Boosting Multi-GPU GNN Training via Global GNN Memory Store.
Proc. VLDB Endow., January, 2024
Exact Acceleration of Subgraph Graph Neural Networks by Eliminating Computation Redundancy.
CoRR, 2024
A Graph-Native Query Optimization Framework.
CoRR, 2024
Unicron: Economizing Self-Healing LLM Training at Scale.
CoRR, 2024
GraphScope Flex: LEGO-like Graph Computing Stack.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Companion of the 2024 International Conference on Management of Data, 2024
The Future of Graph Analytics.
Proceedings of the Companion of the 2024 International Conference on Management of Data, 2024
2023
RAGraph: A Region-Aware Framework for Geo-Distributed Graph Processing.
Proc. VLDB Endow., November, 2023
Application-driven graph partitioning.
VLDB J., January, 2023
Vineyard: Optimizing Data Sharing in Data-Intensive Analytics.
Proc. ACM Manag. Data, 2023
GraphScope Flex: LEGO-like Graph Computing Stack.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
Enhancing Data Lakes with GraphAr: Efficient Graph Data Management with a Specialized Storage Scheme.
CoRR, 2023
LON-GNN: Spectral GNNs with Learnable Orthonormal Basis.
CoRR, 2023
Legion: Automatically Pushing the Envelope of Multi-GPU System for Billion-Scale GNN Training.
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2023 USENIX Annual Technical Conference, 2023
Bridging the Gap between Relational OLTP and Graph-based OLAP.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2023 USENIX Annual Technical Conference, 2023
GLogS: Interactive Graph Pattern Matching Query At Large Scale.
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2023 USENIX Annual Technical Conference, 2023
The Linked Data Benchmark Council (LDBC): Driving Competition and Collaboration in the Graph Data Management Space.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Performance Evaluation and Benchmarking: 15th TPC Technology Conference, 2023
Layph: Making Change Propagation Constraint in Incremental Graph Processing by Layering Graph.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023
Efficient Multi-GPU Graph Processing with Remote Work Stealing.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023
Flash: A Framework for Programming Distributed Graph Processing Algorithms.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023
2022
Banyan: A Scoped Dataflow Engine for Graph Query Service.
Proc. VLDB Endow., 2022
ABC: Attributed Bipartite Co-clustering.
Proc. VLDB Endow., 2022
Asymmetrical cross-modal influence on neural encoding of auditory and visual features in natural scenes.
NeuroImage, 2022
DMCS : Density Modularity based Community Search.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022
Linking Entities across Relations and Graphs.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022
GNNLab: a factored system for sample-based GNN training over GPUs.
Proceedings of the EuroSys '22: Seventeenth European Conference on Computer Systems, Rennes, France, April 5, 2022
2021
GraphScope: A One-Stop Large Graph Processing System.
,
,
,
,
,
,
,
,
,
,
Proc. VLDB Endow., 2021
Automating Incremental Graph Processing with Flexible Memoization.
Proc. VLDB Endow., 2021
GraphScope: A Unified Engine For Big Graph Processing.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proc. VLDB Endow., 2021
Incrementalizing Graph Algorithms.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021
FlexGraph: a flexible and efficient distributed framework for GNN training.
Proceedings of the EuroSys '21: Sixteenth European Conference on Computer Systems, 2021
2020
Adaptive Asynchronous Parallelization of Graph Algorithms.
ACM Trans. Database Syst., 2020
Application Driven Graph Partitioning.
Proceedings of the 2020 International Conference on Management of Data, 2020
"I can't name it, but I can perceive it" Conceptual and Operational Design of "Tactile Accuracy" Assisting Tactile Image Cognition.
Proceedings of the ASSETS '20: The 22nd International ACM SIGACCESS Conference on Computers and Accessibility, 2020
2018
Parallelizing Sequential Graph Computations.
ACM Trans. Database Syst., 2018
From Think Parallel to Think Sequential.
SIGMOD Rec., 2018
Joint Spatio-Temporal Domain for Adaptive Kronecker Compressive Sensing.
J. Inf. Hiding Multim. Signal Process., 2018
Adaptive Asynchronous Parallelization of Graph Algorithms.
Proceedings of the 2018 International Conference on Management of Data, 2018
The Effect of Static and Dynamic Gesture Presentation on the Recognition of Two Manipulation Gestures.
Proceedings of the Human-Computer Interaction. Interaction Technologies, 2018
2017
GRAPE: Parallelizing Sequential Graph Computations.
Proc. VLDB Endow., 2017
GRAPE: Conducting Parallel Graph Computations without Developing Parallel Algorithms.
IEEE Data Eng. Bull., 2017
Parallelizing Sequential Graph Computations.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017
2014
Incremental Detection of Inconsistencies in Distributed Data.
IEEE Trans. Knowl. Data Eng., 2014
Bounded Conjunctive Queries.
Proc. VLDB Endow., 2014
The IUPHAR/BPS Guide to PHARMACOLOGY: an expert-driven knowledgebase of drug targets and their ligands.
,
,
,
,
,
,
,
,
,
,
,
,
,
Nucleic Acids Res., 2014
Interaction between Record Matching and Data Repairing.
ACM J. Data Inf. Qual., 2014
Conflict resolution with data currency and consistency.
ACM J. Data Inf. Qual., 2014
2013
Improving data quality : data consistency, deduplication, currency and accuracy.
PhD thesis, 2013
Determining the relative accuracy of attributes.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013
Inferring data currency and consistency for conflict resolution.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013
Data Quality Problems beyond Consistency and Deduplication.
Proceedings of the In Search of Elegance in the Theory and Practice of Computation, 2013
2011
CerFix: A System for Cleaning Data with Certain Fixes.
Proc. VLDB Endow., 2011
Interaction between record matching and data repairing.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011
2010
Towards Certain Fixes with Editing Rules and Master Data.
Proc. VLDB Endow., 2010
2009
PKUNEI - A Knowledge-Based Approach for Chinese Product Named Entity Semantic Identification.
Proceedings of the Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy, 2009
A Novel Method of Automobiles' Chinese Nickname Recognition.
Proceedings of the Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy, 2009