Haibo Chen

Orcid: 0000-0002-9720-0361

Affiliations:
  • Shanghai Jiao Tong University, Institute of Parallel and Distributed Systems, China
  • Fudan University, Parallel Processing Institute, China (former)


According to our database, Haibo Chen authored at least 248 papers between 2006 and 2024.

Bibliography

2024
Locality-Preserving Graph Traversal With Split Live Migration.
IEEE Trans. Parallel Distributed Syst., October, 2024

Ad Hoc Transactions through the Looking Glass: An Empirical Study of Application-Level Transactions in Web Applications.
ACM Trans. Database Syst., March, 2024

WeBridge: Synthesizing Stored Procedures for Large-Scale Real-World Web Applications.
Proc. ACM Manag. Data, February, 2024

Toward an SGX-Friendly Java Runtime.
IEEE Trans. Computers, January, 2024

AsyncFS: Metadata Updates Made Asynchronous for Distributed Filesystems with In-Network Coordination.
CoRR, 2024

PowerInfer-2: Fast Large Language Model Inference on a Smartphone.
CoRR, 2024

Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters.
CoRR, 2024

PARALLELGPUOS: A Concurrent OS-level GPU Checkpoint and Restore System using Validated Speculation.
CoRR, 2024

Characterizing the Dilemma of Performance and Index Size in Billion-Scale Vector Search and Breaking It with Second-Tier Memory.
CoRR, 2024

Jiagu: Optimizing Serverless Computing Resource Utilization with Harmonized Efficiency and Practicability.
CoRR, 2024

Characterizing Network Requirements for GPU API Remoting in AI Applications.
CoRR, 2024

Harmonizing Efficiency and Practicability: Optimizing Resource Utilization in Serverless Computing with Jiagu.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024

CPC: Flexible, Secure, and Efficient CVM Maintenance with Confidential Procedure Calls.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024

Brief Announcement: Work Stealing through Partial Asynchronous Delegation.
Proceedings of the 36th ACM Symposium on Parallelism in Algorithms and Architectures, 2024

PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU.
Proceedings of the ACM SIGOPS 30th Symposium on Operating Systems Principles, 2024

Using Dynamically Layered Definite Releases for Verifying the RefFS File System.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

Fast and Scalable In-network Lock Management Using Lock Fission.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

Microkernel Goes General: Performance and Compatibility in the HongMeng Production Microkernel.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

sNPU: Trusted Execution Environments on Integrated NPUs.
Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024

Optimizing File Systems on Heterogeneous Memory by Integrating DRAM Cache with Virtual Memory Management.
Proceedings of the 22nd USENIX Conference on File and Storage Technologies, 2024

Serialization/Deserialization-free State Transfer in Serverless Workflows.
Proceedings of the Nineteenth European Conference on Computer Systems, 2024

Characterization and Reclamation of Frozen Garbage in Managed FaaS Workloads.
Proceedings of the Nineteenth European Conference on Computer Systems, 2024

Jade: A High-throughput Concurrent Copying Garbage Collector.
Proceedings of the Nineteenth European Conference on Computer Systems, 2024

sIOPMP: Scalable and Efficient I/O Protection for TEEs.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

Seraph: A Performance-Cost Aware Tuner for Training Reinforcement Learning Model on Serverless Computing.
Proceedings of the 15th ACM SIGOPS Asia-Pacific Workshop on Systems, 2024

2023
Proving Query Equivalence Using Linear Integer Arithmetic.
Proc. ACM Manag. Data, December, 2023

Flock: Towards Multitasking Virtual Machines for Function-as-a-Service.
IEEE Trans. Computers, November, 2023

Understanding and Mitigating Twin Function Misuses in Operating System Kernel.
IEEE Trans. Computers, August, 2023

Hawkeye: Eliminating Kernel Address Leakage in Normal Data Flows.
IEEE Trans. Dependable Secur. Comput., 2023

Ad Hoc Transactions: What They Are and Why We Should Care.
SIGMOD Rec., 2023

Fine-Grained Re-Execution for Efficient Batched Commit of Distributed Transactions.
Proc. VLDB Endow., 2023

Software Engineering for OpenHarmony: A Research Roadmap.
CoRR, 2023

Transactional Indexes on (RDMA or CXL-based) Disaggregated Memory with Repairable Transaction.
CoRR, 2023

Bridging the Gap between Relational OLTP and Graph-based OLAP.
Proceedings of the 2023 USENIX Annual Technical Conference, 2023

Bifrost: Analysis and Optimization of Network I/O Tax in Confidential Virtual Machines.
Proceedings of the 2023 USENIX Annual Technical Conference, 2023

TreeSLS: A Whole-system Persistent Microkernel with Tree-structured State Checkpoint on NVM.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023

UGACHE: A Unified GPU Cache for Embedding-based Deep Learning.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023

No Provisioned Concurrency: Fast RDMA-codesigned Remote Fork for Serverless Computing.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

Characterizing Off-path SmartNIC for Accelerating Distributed Systems.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

BWoS: Formally Verified Block-based Work Stealing for Parallel Processing.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

Encrypted Databases Made Secure Yet Maintainable.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

Automated Verification of Idempotence for Stateful Serverless Applications.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

Security and Performance in the Delegated User-level Virtualization.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

Accelerating Extra Dimensional Page Walks for Confidential Computing.
Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture, 2023

ISA-Grid: Architecture of Fine-grained Privilege Control for Instructions and Registers.
Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023

Database Deadlock Diagnosis for Large-Scale ORM-Based Web Applications.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Efficient Distributed Secure Memory with Migratable Merkle Tree.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2023

The Gap Between Serverless Research and Real-world Systems.
Proceedings of the 2023 ACM Symposium on Cloud Computing, SoCC 2023, 2023

CPS: A Cooperative Para-virtualized Scheduling Framework for Manycore Machines.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

AtoMig: Automatically Migrating Millions Lines of Code from TSO to WMM.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

BeeHive: Sub-second Elasticity for Web Services with Semi-FaaS Execution.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

Microsecond-scale Datacenter Computing with Modern Networking.
Proceedings of the ACM Turing Award Celebration Conference - China 2023, 2023

Heterogeneous Serverless Computing.
Proceedings of the ACM Turing Award Celebration Conference - China 2023, 2023

2022
Wukong+G: Fast and Concurrent RDF Query Processing Using RDMA-Assisted GPU Graph Exploration.
IEEE Trans. Parallel Distributed Syst., 2022

DrTM+B: Replication-Driven Live Reconfiguration for Fast and General Distributed Transaction Processing.
IEEE Trans. Parallel Distributed Syst., 2022

The Concurrent Learned Indexes for Multicore Data Storage.
ACM Trans. Storage, 2022

Colony: A Privileged Trusted Execution Environment With Extensibility.
IEEE Trans. Computers, 2022

General and Fast Inter-Process Communication via Bypassing Privileged Software.
IEEE Trans. Computers, 2022

Gating PatternPyramid for diversified image style transfer.
J. Electronic Imaging, 2022

Unified Enclave Abstraction and Secure Enclave Migration on Heterogeneous Security Architectures.
J. Comput. Sci. Technol., 2022

Reducing Transaction Processing Latency in Hardware Transactional Memory-based Database with Non-volatile Memory.
Int. J. Softw. Informatics, 2022

Memory Optimization System for SGXv2 Trusted Execution Environment.
Int. J. Softw. Informatics, 2022

A Comprehensive Study on Off-path SmartNIC.
CoRR, 2022

Booting 10K Serverless Functions within One Second via RDMA-based Remote Fork.
CoRR, 2022

DuVisor: a User-level Hypervisor Through Delegated Virtualization.
CoRR, 2022

In pursuit of a trusted computing base: the journey through virtualization (keynote).
Proceedings of the VEE '22: 18th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments, 2022

A Hardware-Software Co-design for Efficient Intra-Enclave Isolation.
Proceedings of the 31st USENIX Security Symposium, 2022

KRCORE: A Microsecond-scale RDMA Control Plane for Elastic Computing.
Proceedings of the 2022 USENIX Annual Technical Conference, 2022

BBQ: A Block-based Bounded Queue for Exchanging Data and Profiling.
Proceedings of the 2022 USENIX Annual Technical Conference, 2022

Zero-Change Object Transmission for Distributed Big Data Analytics.
Proceedings of the 2022 USENIX Annual Technical Conference, 2022

EPK: Scalable and Efficient Memory Protection Keys.
Proceedings of the 2022 USENIX Annual Technical Conference, 2022

WeTune: Automatic Discovery and Verification of Query Rewrite Rules.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Ad Hoc Transactions in Web Applications: The Good, the Bad, and the Ugly.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Asymmetry-aware scalable locking.
Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022

Microsecond-scale Preemption for Concurrent GPU-accelerated DNN Inferences.
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022

MT^2: Memory Bandwidth Regulation on Hybrid NVM/DRAM Platforms.
Proceedings of the 20th USENIX Conference on File and Storage Technologies, 2022

HTMFS: Strong Consistency Comes for Free with Hardware Transactional Memory in Persistent Memory File Systems.
Proceedings of the 20th USENIX Conference on File and Storage Technologies, 2022

Affine Loop Invariant Generation via Matrix Algebra.
Proceedings of the Computer Aided Verification - 34th International Conference, 2022

Serverless computing on heterogeneous computers.
Proceedings of the ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022

2021
XStore: Fast RDMA-Based Ordered Key-Value Store Using Remote Learned Cache.
ACM Trans. Storage, 2021

Boosting Inter-process Communication with Architectural Support.
ACM Trans. Comput. Syst., 2021

Enclavisor: A Hardware-Software Co-Design for Enclaves on Untrusted Cloud.
IEEE Trans. Computers, 2021

Wukong: A Distributed Framework for Fast and Concurrent Graph Querying.
ACM SIGOPS Oper. Syst. Rev., 2021

Revisiting Persistent Indexing Structures on Intel Optane DC Persistent Memory.
J. Comput. Sci. Technol., 2021

Accelerator Virtualization Framework Based on Inter-VM Exitless Communication.
Int. J. Softw. Informatics, 2021

VSync: Push-Button Verification and Optimization for Synchronization Primitives on Weak Memory Models (Technical Report).
CoRR, 2021

TZ-Container: protecting container from untrusted OS with ARM TrustZone.
Sci. China Inf. Sci., 2021

Characterizing and Optimizing Remote Persistent Memory with RDMA and NVM.
Proceedings of the 2021 USENIX Annual Technical Conference, 2021

TwinVisor: Hardware-isolated Confidential Virtual Machines for ARM.
Proceedings of the SOSP '21: ACM SIGOPS 28th Symposium on Operating Systems Principles, 2021

CLoF: A Compositional Lock Framework for Multi-level NUMA Systems.
Proceedings of the SOSP '21: ACM SIGOPS 28th Symposium on Operating Systems Principles, 2021

Polyjuice: High-Performance Transactions via Learned Concurrency Control.
Proceedings of the 15th USENIX Symposium on Operating Systems Design and Implementation, 2021

Retrofitting High Availability Mechanism to Tame Hybrid Transaction/Analytical Processing.
Proceedings of the 15th USENIX Symposium on Operating Systems Design and Implementation, 2021

Bringing Decentralized Search to Decentralized Services.
Proceedings of the 15th USENIX Symposium on Operating Systems Design and Implementation, 2021

Scalable Memory Protection in the PENGLAI Enclave.
Proceedings of the 15th USENIX Symposium on Operating Systems Design and Implementation, 2021

Unifying Timestamp with Transaction Ordering for MVCC with Decentralized Scalar Timestamp.
Proceedings of the 18th USENIX Symposium on Networked Systems Design and Implementation, 2021

Confidential Serverless Made Efficient with Plug-In Enclaves.
Proceedings of the 48th ACM/IEEE Annual International Symposium on Computer Architecture, 2021

Bridging the performance gap for copy-based garbage collectors atop non-volatile memory.
Proceedings of the EuroSys '21: Sixteenth European Conference on Computer Systems, 2021

Third-Eye: Practical and Context-Aware Inference of Causal Relationship Violations in Commodity Kernels.
Proceedings of the Detection of Intrusions and Malware, and Vulnerability Assessment, 2021

VSync: push-button verification and optimization for synchronization primitives on weak memory models.
Proceedings of the ASPLOS '21: 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2021

2020
CPU Elasticity to Mitigate Cross-VM Runtime Monitoring.
IEEE Trans. Dependable Secur. Comput., 2020

Secure and Efficient Control Data Isolation with Register-Based Data Cloaking.
IEEE Trans. Computers, 2020

Optimistic Transaction Processing in Deterministic Database.
J. Comput. Sci. Technol., 2020

GCPersist: an efficient GC-assisted lazy persistency framework for resilient Java applications on NVM.
Proceedings of the VEE '20: 16th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments, 2020

(Mostly) Exitless VM Protection from Untrusted Hypervisor through Disaggregated Nested Virtualization.
Proceedings of the 29th USENIX Security Symposium, 2020

Platinum: A CPU-Efficient Concurrent Garbage Collector for Tail-Reduction of Interactive Services.
Proceedings of the 2020 USENIX Annual Technical Conference, 2020

Harmonizing Performance and Isolation in Microkernels with Efficient Intra-kernel Isolation and Communication.
Proceedings of the 2020 USENIX Annual Technical Conference, 2020

XIndex: a scalable learned index for multicore data storage.
Proceedings of the PPoPP '20: 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2020

No barrier in the road: a comprehensive study and optimization of ARM barriers.
Proceedings of the PPoPP '20: 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2020

Fast RDMA-based Ordered Key-Value Store using Remote Learned Cache.
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

Characterizing serverless platforms with serverlessbench.
Proceedings of the SoCC '20: ACM Symposium on Cloud Computing, 2020

Catalyzer: Sub-millisecond Startup for Serverless Computing with Initialization-less Booting.
Proceedings of the ASPLOS '20: Architectural Support for Programming Languages and Operating Systems, 2020

On the precision of precise event based sampling.
Proceedings of the APSys '20: 11th ACM SIGOPS Asia-Pacific Workshop on Systems, 2020

SIndex: a scalable learned index for string keys.
Proceedings of the APSys '20: 11th ACM SIGOPS Asia-Pacific Workshop on Systems, 2020

2019
Learn-as-you-go with Megh: Efficient Live Migration of Virtual Machines.
IEEE Trans. Parallel Distributed Syst., 2019

Scaling out NUMA-Aware Applications with RDMA-Based Distributed Shared Memory.
J. Comput. Sci. Technol., 2019

DTrace: fine-grained and efficient data integrity checking with hardware instruction tracing.
Cybersecur., 2019

Learned Indexes for Dynamic Workloads.
CoRR, 2019

Taming Distrust in the Decentralized Internet with PIXIU.
CoRR, 2019

TEEv: virtualizing trusted execution environments on mobile platforms.
Proceedings of the 15th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments, 2019

ScissorGC: scalable and efficient compaction for Java full garbage collection.
Proceedings of the 15th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments, 2019

Pragh: Locality-preserving Graph Traversal with Split Live Migration.
Proceedings of the 2019 USENIX Annual Technical Conference, 2019

Pisces: A Scalable and Efficient Persistent Transactional Memory.
Proceedings of the 2019 USENIX Annual Technical Conference, 2019

EROFS: A Compression-friendly Readonly File System for Resource-scarce Devices.
Proceedings of the 2019 USENIX Annual Technical Conference, 2019

Using concurrent relational logic with helpers for verifying the AtomFS file system.
Proceedings of the 27th ACM Symposium on Operating Systems Principles, 2019

Performance and protection in the ZoFS user-space NVM file system.
Proceedings of the 27th ACM Symposium on Operating Systems Principles, 2019

On the Parallels between Paxos and Raft, and how to Port Optimizations.
Proceedings of the 2019 ACM Symposium on Principles of Distributed Computing, 2019

XPC: architectural support for secure and efficient cross process call.
Proceedings of the 46th International Symposium on Computer Architecture, 2019

SkyBridge: Fast and Secure Inter-Process Communication for Microkernels.
Proceedings of the Fourteenth EuroSys Conference 2019, Dresden, Germany, March 25-28, 2019

2018
Eunomia: Scaling Concurrent Index Structures Under Contention Using HTM.
IEEE Trans. Parallel Distributed Syst., 2018

Replication-Based Fault-Tolerance for Large-Scale Graph Processing.
IEEE Trans. Parallel Distributed Syst., 2018

PowerLyra: Differentiated Graph Computation and Partitioning on Skewed Graphs.
ACM Trans. Parallel Comput., 2018

Research on ARM TrustZone.
GetMobile Mob. Comput. Commun., 2018

ShadowEth: Private Smart Contract on Public Blockchain.
J. Comput. Sci. Technol., 2018

SplitPass: A Mutually Distrusting Two-Party Password Manager.
J. Comput. Sci. Technol., 2018

Asymmetric virtual machine replication for low latency and high available service.
Sci. China Inf. Sci., 2018

Cloud bursting for the world's largest consumer market.
Commun. ACM, 2018

Fast and Concurrent RDF Queries using RDMA-assisted GPU Graph Exploration.
Proceedings of the 2018 USENIX Annual Technical Conference, 2018

EPTI: Efficient Defence against Meltdown Attack for Unpatched VMs.
Proceedings of the 2018 USENIX Annual Technical Conference, 2018

Deconstructing RDMA-enabled Distributed Transactions: Hybrid is Better!
Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation, 2018

VButton: Practical Attestation of User-driven Operations in Mobile Apps.
Proceedings of the 16th Annual International Conference on Mobile Systems, 2018

Comprehensive VM Protection Against Untrusted Hypervisor Through Retrofitted AMD Memory Encryption.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2018

Leveraging Hardware Transactional Memory for Cache Side-Channel Defenses.
Proceedings of the 2018 on Asia Conference on Computer and Communications Security, 2018

Espresso: Brewing Java For More Non-Volatility with Non-volatile Memory.
Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 2018

Analysis and Optimizations of Java Full Garbage Collection.
Proceedings of the 9th Asia-Pacific Workshop on Systems, 2018

2017
VarCatcher: A Framework for Tackling Performance Variability of Parallel Workloads on Multi-Core.
IEEE Trans. Parallel Distributed Syst., 2017

Prophet: A Parallel Instruction-Oriented Many-Core Simulator.
IEEE Trans. Parallel Distributed Syst., 2017

Scalable Adaptive NUMA-Aware Lock.
IEEE Trans. Parallel Distributed Syst., 2017

Fence-Free Synchronization with Dynamically Serialized Synchronization Variables.
IEEE Trans. Parallel Distributed Syst., 2017

Efficient and Available In-Memory KV-Store with Hybrid Erasure Coding and Replication.
ACM Trans. Storage, 2017

Fast In-Memory Transaction Processing Using RDMA and HTM.
ACM Trans. Comput. Syst., 2017

Secure Outsourcing of Virtual Appliance.
IEEE Trans. Cloud Comput., 2017

A software cybernetics approach to self-tuning performance of on-line transaction processing systems.
J. Syst. Softw., 2017

Characterizing and optimizing Java-based HPC applications on Intel many-core architecture.
Sci. China Inf. Sci., 2017

vTZ: Virtualizing ARM TrustZone.
Proceedings of the 26th USENIX Security Symposium, 2017

Replication-driven Live Reconfiguration for Fast Distributed Transaction Processing.
Proceedings of the 2017 USENIX Annual Technical Conference, 2017

Soft Updates Made Simple and Fast on Non-volatile Memory.
Proceedings of the 2017 USENIX Annual Technical Conference, 2017

Sub-millisecond Stateful Stream Querying over Fast-evolving Linked Data.
Proceedings of the 26th Symposium on Operating Systems Principles, 2017

POSTER: Recovering Performance for Vector-based Machine Learning on Managed Runtime.
Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2017

Eunomia: Scaling Concurrent Search Trees under Contention Using HTM.
Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2017

Deconstructing Xen.
Proceedings of the 24th Annual Network and Distributed System Security Symposium, 2017

Transparent and Efficient CFI Enforcement with Intel Processor Trace.
Proceedings of the 2017 IEEE International Symposium on High Performance Computer Architecture, 2017

Secure Live Migration of SGX Enclaves on Untrusted Cloud.
Proceedings of the 47th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2017

Extracting More Intra-transaction Parallelism with Work Stealing for OLTP Workloads.
Proceedings of the 8th Asia-Pacific Workshop on Systems, Mumbai, India, September 2, 2017

2016
A Loosely-Coupled Full-System Multicore Simulation Framework.
IEEE Trans. Parallel Distributed Syst., 2016

Performance Analysis of Multimedia Retrieval Workloads Running on Multicores.
IEEE Trans. Parallel Distributed Syst., 2016

IMR: High-Performance Low-Cost Multi-Ring NoCs.
IEEE Trans. Parallel Distributed Syst., 2016

Fast Consensus Using Bounded Staleness for Scalable Read-Mostly Synchronization.
IEEE Trans. Parallel Distributed Syst., 2016

Deep Learning at Scale and at Ease.
ACM Trans. Multim. Comput. Commun. Appl., 2016

Hardware Support for Concurrent Detection of Multiple Concurrency Bugs on Fused CPU-GPU Architectures.
IEEE Trans. Computers, 2016

Parallelizing image feature extraction algorithms on multi-core platforms.
J. Parallel Distributed Comput., 2016

Deep Learning At Scale and At Ease.
CoRR, 2016

Petri net based test case generation for evolved specification.
Sci. China Inf. Sci., 2016

Performance Analysis and Optimization of Full Garbage Collection in Memory-hungry Environments.
Proceedings of the 12th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments, 2016

Scaling Multicore Databases via Constrained Parallel Execution.
Proceedings of the 2016 International Conference on Management of Data, 2016

Scalable adaptive NUMA-aware lock: combining local locking and remote locking for efficient concurrency.
Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2016

Fast and Concurrent RDF Queries with RDMA-Based Distributed Graph Exploration.
Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation, 2016

Efficient and Available In-memory KV-Store with Hybrid Erasure Coding and Replication.
Proceedings of the 14th USENIX Conference on File and Storage Technologies, 2016

Mitigating Sync Amplification for Copy-on-write Virtual Disk.
Proceedings of the 14th USENIX Conference on File and Storage Technologies, 2016

Fast and general distributed transactions using RDMA and HTM.
Proceedings of the Eleventh European Conference on Computer Systems, 2016

A Case for Virtualizing Persistent Memory.
Proceedings of the Seventh ACM Symposium on Cloud Computing, 2016

Rethinking Benchmarking for NVM-based File Systems.
Proceedings of the 7th ACM SIGOPS Asia-Pacific Workshop on Systems, 2016

2015
Multilevel Phase Analysis.
ACM Trans. Embed. Comput. Syst., 2015

Bipartite-Oriented Distributed Graph Partitioning for Big Learning.
J. Comput. Sci. Technol., 2015

Persistent Transactional Memory.
IEEE Comput. Archit. Lett., 2015

You Shouldn't Collect My Secrets: Thwarting Sensitive Keystroke Leakage in Mobile IME Apps.
Proceedings of the 24th USENIX Security Symposium, 2015

Fast in-memory transaction processing using RDMA and HTM.
Proceedings of the 25th Symposium on Operating Systems Principles, 2015

NUMA-aware graph-structured analytics.
Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015

SYNC or ASYNC: time to fuse for distributed graph-parallel computation.
Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015

Poster: TVisor - A Practical and Lightweight Mobile Red-Green Dual-OS Architecture.
Proceedings of the 13th Annual International Conference on Mobile Systems, 2015

AdAttester: Secure Online Mobile Advertisement Attestation Using TrustZone.
Proceedings of the 13th Annual International Conference on Mobile Systems, 2015

Reducing world switches in virtualized environment with flexible cross-world calls.
Proceedings of the 42nd Annual International Symposium on Computer Architecture, 2015

Open JDK Meets Xeon Phi: A Comprehensive Study of Java HPC on Intel Many-Core Architecture.
Proceedings of the 44th International Conference on Parallel Processing Workshops, 2015

TinMan: eliminating confidential mobile data exposure with security oriented offloading.
Proceedings of the Tenth European Conference on Computer Systems, 2015

PowerLyra: differentiated graph computation and partitioning on skewed graphs.
Proceedings of the Tenth European Conference on Computer Systems, 2015

Thwarting Memory Disclosure with Efficient Hypervisor-enforced Intra-domain Isolation.
Proceedings of the 22nd ACM SIGSAC Conference on Computer and Communications Security, 2015

2014
X10-FT: Transparent fault tolerance for APGAS language and runtime.
Parallel Comput., 2014

Architecting Flash-based Solid-State Drive for High-performance I/O Virtualization.
IEEE Comput. Archit. Lett., 2014

Scalable Read-mostly Synchronization Using Passive Reader-Writer Locks.
Proceedings of the 2014 USENIX Annual Technical Conference, 2014

Hydra: Efficient Detection of Multiple Concurrency Bugs on Fused CPU-GPU Architecture.
Proceedings of the 43rd International Conference on Parallel Processing, 2014

Computation and communication efficient graph processing with distributed immutable view.
Proceedings of the 23rd International Symposium on High-Performance Parallel and Distributed Computing, 2014

Concurrent and consistent virtual machine introspection with hardware transactional memory.
Proceedings of the 20th IEEE International Symposium on High Performance Computer Architecture, 2014

Greedy map generalization by iterative point removal.
Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, 2014

Using restricted transactional memory to build a scalable in-memory database.
Proceedings of the Ninth Eurosys Conference 2014, 2014

Replication-Based Fault-Tolerance for Large-Scale Graph Processing.
Proceedings of the 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2014

2013
Tiled-MapReduce: Efficient and Flexible MapReduce Processing on Multicore with Tiling.
ACM Trans. Archit. Code Optim., 2013

Security-Preserving Live Migration of Virtual Machines in the Cloud.
J. Netw. Syst. Manag., 2013

Parallelizing live migration of virtual machines.
Proceedings of the ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments (co-located with ASPLOS 2013), 2013

X10-FT: transparent fault tolerance for APGAS language and runtime.
Proceedings of the 2013 PPOPP International Workshop on Programming Models and Applications for Multicores and Manycores, 2013

Scalable deterministic replay in a parallel full-system emulator.
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2013

X10-PM: Transparent and Efficient Place Migration for APGAS Language.
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing, 2013

Architecture support for guest-transparent VM protection from untrusted hypervisor and physical attacks.
Proceedings of the 19th IEEE International Symposium on High Performance Computer Architecture, 2013

Point-polygon topological relationship query using hierarchical indices.
Proceedings of the 21st SIGSPATIAL International Conference on Advances in Geographic Information Systems, 2013

Multi-level phase analysis for sampling simulation.
Proceedings of the Design, Automation and Test in Europe, 2013

Opportunities and pitfalls of multi-core scaling using hardware transaction memory.
Proceedings of the Asia-Pacific Workshop on Systems, 2013

Schedule processes, not VCPUs.
Proceedings of the Asia-Pacific Workshop on Systems, 2013

2012
Mercury: Combining Performance with Dependability Using Self-Virtualization.
J. Comput. Sci. Technol., 2012

Revisiting Software Zero-Copy for Web-caching Applications with Twin Memory Allocation.
Proceedings of the 2012 USENIX Annual Technical Conference, 2012

A case for secure and scalable hypervisor using safe language.
Proceedings of the 2012 PPOPP International Workshop on Programming Models and Applications for Multicores and Manycores, 2012

Improving dynamic prediction accuracy through multi-level phase analysis.
Proceedings of the SIGPLAN/SIGBED Conference on Languages, 2012

Adaptive Pipeline Parallelism for Image Feature Extraction Algorithms.
Proceedings of the 41st International Conference on Parallel Processing, 2012

Defending against VM rollback attack.
Proceedings of the IEEE/IFIP International Conference on Dependable Systems and Networks Workshops, 2012

CFIMon: Detecting violation of control flow integrity using performance counters.
Proceedings of the IEEE/IFIP International Conference on Dependable Systems and Networks, 2012

TinyChecker: Transparent protection of VMs against hypervisor failures with nested virtualization.
Proceedings of the IEEE/IFIP International Conference on Dependable Systems and Networks Workshops, 2012

Transformer: a functional-driven cycle-accurate multicore simulator.
Proceedings of the 49th Annual Design Automation Conference 2012, 2012

A GPU-based high-throughput image retrieval algorithm.
Proceedings of the 5th Annual Workshop on General Purpose Processing with Graphics Processing Units, 2012

SSMalloc: a low-latency, locality-conscious memory allocator with stable performance scalability.
Proceedings of the Asia-Pacific Workshop on Systems, 2012

2011
Dynamic Software Updating Using a Relaxed Consistency Model.
IEEE Trans. Software Eng., 2011

ORDER: Object centRic DEterministic Replay for Java.
Proceedings of the 2011 USENIX Annual Technical Conference, 2011

CloudVisor: retrofitting protection of virtual machines in multi-tenant cloud with nested virtualization.
Proceedings of the 23rd ACM Symposium on Operating Systems Principles 2011, 2011

COREMU: a scalable and portable parallel full-system emulator.
Proceedings of the 16th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2011

A comprehensive analysis and parallelization of an image retrieval algorithm.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2011

A case for scaling applications to many-core with OS clustering.
Proceedings of the European Conference on Computer Systems, 2011

Limiting cache-based side-channel in multi-tenant cloud using dynamic page coloring.
Proceedings of the IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W 2011), 2011

Security breaches as PMU deviation: detecting and identifying security attacks using performance counters.
Proceedings of the APSys '11 Asia Pacific Workshop on Systems, 2011

Evaluating the Performance and Scalability of MapReduce Applications on X10.
Proceedings of the Advanced Parallel Processing Technologies - 9th International Symposium, 2011

A Hierarchical Approach to Maximizing MapReduce Efficiency.
Proceedings of the 2011 International Conference on Parallel Architectures and Compilation Techniques, 2011

2010
Optimizing crash dump in virtualized environments.
Proceedings of the 6th International Conference on Virtual Execution Environments, 2010

Why software hangs and what can be done with it.
Proceedings of the 2010 IEEE/IFIP International Conference on Dependable Systems and Networks, 2010

Tiled-MapReduce: optimizing resource usages of data-parallel applications on multicore with tiling.
Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques, 2010

2009
Control flow obfuscation with information flow tracking.
Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-42 2009), 2009

Evaluating SPLASH-2 Applications Using MapReduce.
Proceedings of the Advanced Parallel Processing Technologies, 8th International Symposium, 2009

2008
TPM-Performance Sensible Key Management Protocols for Service Provisioning in Cloud Computing.
Proceedings of the Security Protocols XVI, 2008

Corey: An Operating System for Many Cores.
Proceedings of the 8th USENIX Symposium on Operating Systems Design and Implementation, 2008

From Speculation to Security: Practical and Efficient Information Flow Tracking Using Speculative Hardware.
Proceedings of the 35th International Symposium on Computer Architecture (ISCA 2008), 2008

2007
Daonity - Grid security from two levels of virtualization.
Inf. Secur. Tech. Rep., 2007

Daonity: Protocol Solutions to Grid Security Using Hardware Strengthened Software Environment.
Proceedings of the Security Protocols, 2007

POLUS: A POwerful Live Updating System.
Proceedings of the 29th International Conference on Software Engineering (ICSE 2007), 2007

2006
Live updating operating systems using virtualization.
Proceedings of the 2nd International Conference on Virtual Execution Environments, 2006

