Yongwei Wu

Orcid: 0000-0002-6651-7032

Affiliations:
  • Tsinghua University, Department of Computer Science and Technology, Tsinghua National Laboratory for Information Science and Technology, Beijing, China


According to our database1, Yongwei Wu authored at least 161 papers between 2003 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
xMeta: SSD-HDD-hybrid Optimization for Metadata Maintenance of Cloud-scale Object Storage.
ACM Trans. Archit. Code Optim., June, 2024

Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving.
CoRR, 2024

Efficient and Economic Large Language Model Inference with Attention Offloading.
CoRR, 2024

Scaling Up Memory Disaggregated Applications with SMART.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
Spectrophotometer Design Using Single-Grating, Single-Sensor, Double-Beam Spectroscope.
IEEE Trans. Instrum. Meas., 2023

Explore Data Placement Algorithm for Balanced Recovery Load Distribution.
Proceedings of the 2023 USENIX Annual Technical Conference, 2023

Partial Failure Resilient Memory Management System for (CXL-based) Distributed Shared Memory.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023

Falcon: Fast OLTP Engine for Persistent Cache and Non-Volatile Memory.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023

Multi-Objective Optimization for Floating Point Mix-Precision Tuning.
Proceedings of the IEEE/ACM International Symposium on Low Power Electronics and Design, 2023

TEA: A General-Purpose Temporal Graph Random Walk Engine.
Proceedings of the Eighteenth European Conference on Computer Systems, 2023

NosWalker: A Decoupled Architecture for Out-of-Core Random Walk Processing.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

Achieving Sub-second Pairwise Query over Evolving Graphs.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
A Survey of Storage Systems in the RDMA Era.
IEEE Trans. Parallel Distributed Syst., 2022

SeqDLM: A Sequencer-Based Distributed Lock Manager for Efficient Shared File Access in a Parallel File System.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

TeGraph: A Novel General-Purpose Temporal Graph Computing Engine.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Improving Information Literacy of Engineering Doctorate Based on Team Role Model.
Proceedings of the Computer Science and Education - 17th International Conference, 2022

libcrpm: improving the checkpoint performance of NVM.
Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

T-GCN: A Sampling Based Streaming Graph Neural Network System with Hybrid Architecture.
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2022

2021
3-D Partitioning for Large-Scale Graph Processing.
IEEE Trans. Computers, 2021

Mixer: Efficiently Understanding and Retrieving Visual Content at Web-Scale.
Proc. VLDB Endow., 2021

Random Walks on Huge Graphs at Cache Efficiency.
Proceedings of the SOSP '21: ACM SIGOPS 28th Symposium on Operating Systems Principles, 2021

Geometric Partitioning: Explore the Boundary of Optimal Erasure Code Repair.
Proceedings of the SOSP '21: ACM SIGOPS 28th Symposium on Operating Systems Principles, 2021

ROART: Range-query Optimized Persistent ART.
Proceedings of the 19th USENIX Conference on File and Storage Technologies, 2021

Thinking More about RDMA Memory Semantics.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

2020
AsymNVM: An Efficient Framework for Implementing Persistent Data Structures on Asymmetric NVM Architecture.
Proceedings of the ASPLOS '20: Architectural Support for Programming Languages and Operating Systems, 2020

2019
RF-RPC: Remote Fetching RPC Paradigm for RDMA-Enabled Network.
IEEE Trans. Parallel Distributed Syst., 2019

Clip: A Disk I/O Focused Parallel Out-of-Core Graph Processing System.
IEEE Trans. Parallel Distributed Syst., 2019

Building Scalable NVM-based B+tree with HTM.
Proceedings of the 48th International Conference on Parallel Processing, 2019

X-RDMA: Effective RDMA Middleware in Large-scale Production Environments.
Proceedings of the 2019 IEEE International Conference on Cluster Computing, 2019

2018
DudeTx: Durable Transactions Made Decoupled.
ACM Trans. Storage, 2018

Principal Component Analysis Based Filtering for Scalable, High Precision k-NN Search.
IEEE Trans. Computers, 2018

Accelerating MapReduce on Commodity Clusters: An SSD-Empowered Approach.
IEEE Trans. Big Data, 2018

An Efficient Framework for Implementing Persist Data Structures on Remote NVM.
CoRR, 2018

Enabling Edge Intelligence for Activity Recognition in Smart Homes.
Proceedings of the 15th IEEE International Conference on Mobile Ad Hoc and Sensor Systems, 2018

ReGraph: A Graph Processing Framework that Alternately Shrinks and Repartitions the Graph.
Proceedings of the 32nd International Conference on Supercomputing, 2018

GraphP: Reducing Communication for PIM-Based Graph Processing with Efficient Data Partition.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2018

Low latency RNN inference with cellular batching.
Proceedings of the Thirteenth EuroSys Conference, 2018

Wonderland: A Novel Abstraction-Based Out-Of-Core Graph Processing System.
Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 2018

2017
High Performance Graph Processing with Locality Oriented Design.
IEEE Trans. Computers, 2017

Evolution of Cloud Operating System: From Technology to Ecosystem.
J. Comput. Sci. Technol., 2017

Squeezing out All the Value of Loaded Data: An Out-of-core Graph Processing System with Reduced Disk I/O.
Proceedings of the 2017 USENIX Annual Technical Conference, 2017

RFP: When RPC is Faster than Server-Bypass with RDMA.
Proceedings of the Twelfth European Conference on Computer Systems, 2017

HybridFS - A High Performance and Balanced File System Framework with Multiple Distributed File Systems.
Proceedings of the 41st IEEE Annual Computer Software and Applications Conference, 2017

DudeTM: Building Durable Transactions with Decoupling for Persistent Memory.
Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems, 2017

2016
A Lightweight System for Detecting and Tolerating Concurrency Bugs.
IEEE Trans. Software Eng., 2016

Cloud Performance Modeling with Benchmark Evaluation of Elastic Scaling Strategies.
IEEE Trans. Parallel Distributed Syst., 2016

Heads-Join: Efficient Earth Mover's Distance Similarity Joins on Hadoop.
IEEE Trans. Parallel Distributed Syst., 2016

Top-k Spatio-Textual Similarity Join.
IEEE Trans. Knowl. Data Eng., 2016

Systematic Data Placement Optimization in Multi-Cloud Storage for Complex Requirements.
IEEE Trans. Computers, 2016

<i>MARS</i>: Mobile Application Relaunching Speed-Up through Flash-Aware Page Swapping.
IEEE Trans. Computers, 2016

Measuring and Optimizing Distributed Array Programs.
Proc. VLDB Endow., 2016

Exploring the Hidden Dimension in Graph Processing.
Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation, 2016

PCAF: Scalable, High Precision k-NN Search Using Principal Component Analysis Based Filtering.
Proceedings of the 45th International Conference on Parallel Processing, 2016

PROAR: A Weak Consistency Model for Ceph.
Proceedings of the 22nd IEEE International Conference on Parallel and Distributed Systems, 2016

I/O-Conscious and Prediction-Enabled Virtual Machines Scheduling.
Proceedings of the 2016 IEEE International Conference on Computer and Information Technology, 2016

2015
Cloud Storage over Multiple Data Centers.
Proceedings of the Handbook on Data Centers, 2015

Response Time Based Optimal Web Service Selection.
IEEE Trans. Parallel Distributed Syst., 2015

Sliding Mode Congestion Control for Data Center Ethernet Networks.
IEEE Trans. Computers, 2015

RFP: A Remote Fetching Paradigm for RDMA-Accelerated Systems.
CoRR, 2015

Associative Big Data Sharing in Community Clouds: The MeePo Approach.
IEEE Cloud Comput., 2015

Fixing, preventing, and recovering from concurrency bugs.
Sci. China Inf. Sci., 2015

A Customized Schema Design Framework for Multi-tenant Database.
Proceedings of the Web-Age Information Management - 16th International Conference, 2015

Region-aware Top-k Similarity Search.
Proceedings of the Web-Age Information Management - 16th International Conference, 2015

A Sampling-Based Framework for Crowdsourced Select Query with Multiple Predicates.
Proceedings of the Web-Age Information Management - 16th International Conference, 2015

Memory-Centric Data Storage for Mobile Systems.
Proceedings of the 2015 USENIX Annual Technical Conference, 2015

ThyNVM: enabling software-transparent crash consistency in persistent memory systems.
Proceedings of the 48th International Symposium on Microarchitecture, 2015

Flexible Desktop Application Management and Its Influence on Green Computing.
Proceedings of the 44th International Conference on Parallel Processing Workshops, 2015

Flex: Flexible and Energy Efficient Scheduling for Big Data Storage.
Proceedings of the 44th International Conference on Parallel Processing Workshops, 2015

What Is Wrong with the Transmission? A Comprehensive Study on Message Passing Related Bugs.
Proceedings of the 44th International Conference on Parallel Processing, 2015

Parallel Training GBRT Based on KMeans Histogram Approximation for Big Data.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2015

Bidding for Highly Available Services with Low Price in Spot Instance Market.
Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing, 2015

2014
Liquid: A Scalable Deduplication File System for Virtual Machine Images.
IEEE Trans. Parallel Distributed Syst., 2014

Guarantee Strict Fairness and UtilizePrediction Better in Parallel Job Scheduling.
IEEE Trans. Parallel Distributed Syst., 2014

Modeling of Distributed File Systems for Practical Performance Analysis.
IEEE Trans. Parallel Distributed Syst., 2014

NO2: Speeding up Parallel Processing of Massive Compute-Intensive Tasks.
IEEE Trans. Computers, 2014

Quatrain: Accelerating Data Aggregation between Multiple Layers.
IEEE Trans. Computers, 2014

Analysis of Backward Congestion Notification with Delay for Enhanced Ethernet Networks.
IEEE Trans. Computers, 2014

Granary: A sharing oriented distributed storage system.
Future Gener. Comput. Syst., 2014

AI: a lightweight system for tolerating concurrency bugs.
Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering, (FSE-22), Hong Kong, China, November 16, 2014

SepStore: Data Storage Accelerator for Distributed File Systems by Separating Small Files from Large Files.
Proceedings of the Internet of Vehicles - Technologies and Services, 2014

When paxos meets erasure code: reduce network and storage cost in state machine replication.
Proceedings of the 23rd International Symposium on High-Performance Parallel and Distributed Computing, 2014

2013
Attribute-Aware Data Aggregation Using Potential-Based Dynamic Routing in Wireless Sensor Networks.
IEEE Trans. Parallel Distributed Syst., 2013

TopCluster: A hybrid cluster model to support dynamic deployment in Grid.
J. Comput. Syst. Sci., 2013

A survey on reliability in distributed systems.
J. Comput. Syst. Sci., 2013

Human Dynamics Revealed through Log Analytics in a Cloud Computing Environment.
Proceedings of the Web-Age Information Management - 14th International Conference, 2013

Probabilistic QoS Analysis of Web Services.
Proceedings of the Network and Parallel Computing - 10th IFIP International Conference, 2013

2012
Job failures in high performance computing systems: A large-scale empirical study.
Comput. Math. Appl., 2012

Task optimization based on CPU pipeline technique in a multicore system.
Comput. Math. Appl., 2012

Droplet: A Distributed Solution of Data Deduplication.
Proceedings of the 13th ACM/IEEE International Conference on Grid Computing, 2012

µLibCloud: Providing High Available and Uniform Accessing to Multiple Cloud Storages.
Proceedings of the 13th ACM/IEEE International Conference on Grid Computing, 2012

Improving the Effective IO Throughput by Adaptive Read-Ahead Strategy for Private Cloud Storage Service.
Proceedings of the Seventh ChinaGrid Annual Conference, ChinaGrid 2012, Beijing, 2012

Improving the System Capacity by Client Cooperation in Distributed File Service.
Proceedings of the Seventh ChinaGrid Annual Conference, ChinaGrid 2012, Beijing, 2012

EasyDeploy: Automatic Application Deployment in Virtual Clusters.
Proceedings of the Seventh ChinaGrid Annual Conference, ChinaGrid 2012, Beijing, 2012

2011
Automatically constructing trusted cluster computing environment.
J. Supercomput., 2011

Optimization of sub-query processing in distributed data integration systems.
J. Netw. Comput. Appl., 2011

An Intelligent Capacity Planning Model for Cloud Market.
J. Internet Serv. Inf. Secur., 2011

Metadata changes in large file systems: a metadata querying perspective.
Comput. Syst. Sci. Eng., 2011

Optimizing write operation on replica in data grid.
Sci. China Inf. Sci., 2011

Location-Aware MapReduce in Virtual Cloud.
Proceedings of the International Conference on Parallel Processing, 2011

Making Service Granularity Right: An Assistant Approach Based on Business Process Analysis.
Proceedings of the Sixth Chinagrid Annual Conference, ChinaGrid 2011, Dalian, Liaoning, 2011

2010
Adaptive Workload Prediction of Grid Performance in Confidence Windows.
IEEE Trans. Parallel Distributed Syst., 2010

An adaptive task-level fault-tolerant approach to Grid.
J. Supercomput., 2010

Distributed bandwidth allocation based on alternating evolution algorithm.
J. Parallel Distributed Comput., 2010

VDB-MR: MapReduce-based distributed data integration using virtual database.
Future Gener. Comput. Syst., 2010

Improving grid performance by dynamically deploying applications.
Concurr. Comput. Pract. Exp., 2010

Service-oriented execution model supporting data sharing and adaptive query processing.
Clust. Comput., 2010

A Knowledge-based Continuous Double Auction Model for Cloud Market.
Proceedings of the Sixth International Conference on Semantics Knowledge and Grid, 2010

DABGPM: A Double Auction Bayesian Game-Based Pricing Model in Cloud Market.
Proceedings of the Network and Parallel Computing, IFIP International Conference, 2010

GridTDK: A Grid Transaction Development Kit.
Proceedings of the 13th International Conference on Network-Based Information Systems, 2010

PV-EASY: a strict fairness guaranteed and prediction enabled scheduler in parallel job scheduling.
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, 2010

Task Partition Comparison between Multi-core System and GPU.
Proceedings of the Fifth Annual ChinaGrid Conference, ChinaGrid 2010, Guangzhou, 2010

Enabling Cloud Storage to Support Traditional Applications.
Proceedings of the Fifth Annual ChinaGrid Conference, ChinaGrid 2010, Guangzhou, 2010

2009
ALR-MIN: A Replacement Strategy to Reduce Overhead during Dynamic Deployment of Applications in Grid.
Proceedings of the International Conference on Scalable Computing and Communications / Eighth International Conference on Embedded Computing, 2009

Campus Cloud for Data Storage and Sharing.
Proceedings of the Eighth International Conference on Grid and Cooperative Computing, 2009

Optimization of Data Retrievals in Processing Data Integration Queries.
Proceedings of the Fourth International Conference on Frontier of Computer Science and Technology, 2009

CampusWare: An Easy-to-Use, Efficient and Portable Grid Middleware for Compute-Intensive Applications.
Proceedings of the Fourth ChinaGrid Annual Conference, ChinaGrid 2009, Yantai, Shandong, 2009

2008
Grid-Enabled Workflow Management System Based On BPEL.
Int. J. High Perform. Comput. Appl., 2008

End-to-End Congestion Control for High Speed Networks Based on Population Ecology Models.
Proceedings of the 28th IEEE International Conference on Distributed Computing Systems (ICDCS 2008), 2008

VDM: Virtual Database Management for Distributed Databases and File Systems.
Proceedings of the Seventh International Conference on Grid and Cooperative Computing, 2008

ZettaDS: A Light-weight Distributed Storage System for Cluster.
Proceedings of the Third ChinaGrid Annual Conference, ChinaGrid 2008, Dunhuang, Gansu, 2008

Optimizing Communications in Processing Data Integration Queries.
Proceedings of the Third ChinaGrid Annual Conference, ChinaGrid 2008, Dunhuang, Gansu, 2008

Impact of Clustered Demands on Performance of Replication Strategies in Data Grid Systems.
Proceedings of the Third ChinaGrid Annual Conference, ChinaGrid 2008, Dunhuang, Gansu, 2008

Adaptive Hybrid Model for Long Term Load Prediction in Computational Grid.
Proceedings of the 8th IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2008), 2008

2007
Grid middleware in China.
Int. J. Web Grid Serv., 2007

Parallel programming over ChinaGrid.
Int. J. Web Grid Serv., 2007

An analytical model for performance evaluation in a computational grid.
Proceedings of the CHINA HPC 2007, 2007

Load prediction using hybrid model for computational grid.
Proceedings of the 8th IEEE/ACM International Conference on Grid Computing (GRID 2007), 2007

Dynamic Data Replication based on Local Optimization Principle in Data Grid.
Proceedings of the Grid and Cooperative Computing, 2007

Adapting to Application Workflow in Processing Data Integration Queries.
Proceedings of the Grid and Cooperative Computing, 2007

A Component Based Interoperability Solution over Existing Grid Middleware.
Proceedings of the Grid and Cooperative Computing, 2007

Component Based Legacy Program Executing over Grid.
Proceedings of the Grid and Cooperative Computing, 2007

2006
Overlapping Communication and Computation in MPI by Multithreading.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications & Conference on Real-Time Computing Systems and Applications, 2006

Scheduling divisible loads in the dynamic heterogeneous grid environment.
Proceedings of the 1st International Conference on Scalable Information Systems, 2006

Intelligent Decision Making for Agreement-based Grid Resource Management.
Proceedings of the Interdisciplinary and Multidisciplinary Research in Computer Science, 2006

On Interoperability: The Execution Management Perspective Based on ChinaGrid Support Platform*.
Proceedings of the Grid and Cooperative Computing Workshops, 2006

Execution Management in ChinaGrid Supporting Platform.
Proceedings of the Grid and Cooperative Computing, 2006

Grid Enabled Data Integration Framework for Bioinformatics Research.
Proceedings of the Grid and Cooperative Computing Workshops, 2006

General Running Service: An Execution Framework for Executing Legacy Program on Grid.
Proceedings of the Grid and Cooperative Computing Workshops, 2006

Hierarchical Replica Location Service Based on Hybrid Overlay Platform.
Proceedings of the Grid and Cooperative Computing Workshops, 2006

Grid Programming Environment over ChinaGrid Support Platform.
Proceedings of the Grid and Cooperative Computing Workshops, 2006

UGE4B: An Universal Grid Environment for Bioinformatics Research.
Proceedings of the Grid and Cooperative Computing Workshops, 2006

BSM: A scheduling algorithm for dynamic jobs based on economics theory.
Proceedings of the Grid and Cooperative Computing, 2006

Analysis of the Bioinformatics Grid Technique Applications in China.
Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2006), 2006

DPGS: A Distributed Programmable Grid System.
Proceedings of the Advanced Web and Network Technologies, and Applications, 2006

2005
CGSV: An Adaptable Stream-Integrated Grid Monitoring System.
Proceedings of the Network and Parallel Computing, IFIP International Conference, 2005

FleMA: A Flexible Measurement Architecture for ChinaGrid.
Proceedings of the Parallel and Distributed Processing and Applications, 2005

Introduction to ChinaGrid Support Platform.
Proceedings of the Parallel and Distributed Processing and Applications, 2005

Parallel Algorithm and Implementation for Realtime Dynamic Simulation of Power System.
Proceedings of the 34th International Conference on Parallel Processing (ICPP 2005), 2005

Grid Developing Environment in CGSP System.
Proceedings of the Advanced Parallel Processing Technologies, 6th International Workshop, 2005

CGSP: An Extensible and Reconfigurable Grid Framework.
Proceedings of the Advanced Parallel Processing Technologies, 6th International Workshop, 2005

2004
Grid Computing in China.
J. Grid Comput., 2004

Lookup-Ring: Building Efficient Lookups for High Dynamic Peer-to-Peer Overlays.
Proceedings of the Network and Parallel Computing, IFIP International Conference, 2004

Paramecium: Assembling Raw Nodes into Composite Cells.
Proceedings of the Network and Parallel Computing, IFIP International Conference, 2004

Efficiently Rationing Resources for Grid and P2P Computing.
Proceedings of the Network and Parallel Computing, IFIP International Conference, 2004

DisCAS: A Distributed-Parallel Computer Algebra System.
Proceedings of the Computational Science, 2004

An Accounting and QoS Model for Grid Computing.
Proceedings of the Grid and Cooperative Computing, 2004

Efficient Search Using Adaptive Metadata Spreading in Peer-to-Peer Networks.
Proceedings of the Grid and Cooperative Computing, 2004

A Fine-grained Parallel Programming Model for Grid Computing.
Proceedings of the 2004 IEEE International Conference on Services Computing (SCC 2004), 2004

2003
Grid Computing Pool and Its Framework.
Proceedings of the 32nd International Conference on Parallel Processing Workshops (ICPP 2003 Workshops), 2003

Coarse-Grained Distributed Parallel Programming Interface for Grid Computing.
Proceedings of the Grid and Cooperative Computing, Second International Workshop, 2003

Improving Availability of P2P Storage Systems.
Proceedings of the Advanced Parallel Programming Technologies, 5th International Workshop, 2003


  Loading...