Xubin He

ORCID: 0000-0002-5071-2861

Affiliations:
  • Temple University, Department of Computer and Information Sciences, Philadelphia, PA, USA
  • Virginia Commonwealth University, Department of Electrical and Computer Engineering, Richmond, VA, USA
  • Tennessee Technological University, Department of Electrical and Computer Engineering, Cookeville, TN, USA
  • University of Rhode Island, Kingston, RI, USA (PhD 2002)


According to our database, Xubin He authored at least 154 papers between 2000 and 2024.

Bibliography

2024
AMOSL: Adaptive Modality-wise Structure Learning in Multi-view Graph Neural Networks For Enhanced Unified Representation.
CoRR, 2024

Exploit both SMART Attributes and NAND Flash Wear Characteristics to Effectively Forecast SSD-based Storage Failures in Clusters.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024

CauchyGCN: Preserving Local Smoothness in Graph Convolutional Networks via a Cauchy-Based Message-Passing Scheme and Clustering Analysis.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2024, 2024

2023
zPerf: A Statistical Gray-Box Approach to Performance Modeling and Extrapolation for Scientific Lossy Compression.
IEEE Trans. Computers, September, 2023

Exploring Memory Access Similarity to Improve Irregular Application Performance for Distributed Hybrid Memory Systems.
IEEE Trans. Parallel Distributed Syst., March, 2023

High-Ratio Lossy Compression: Exploring the Autoencoder to Compress Scientific Data.
IEEE Trans. Big Data, February, 2023

Boosting the Performance of Degraded Reads in RS-coded Distributed Storage Systems.
CoRR, 2023

Improving Progressive Retrieval for HPC Scientific Data using Deep Neural Network.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

GearDB: A GC-free Key-Value Store on HM-SMR Drives with Gear Compaction.
Proceedings of the ACM Turing Award Celebration Conference - China 2023, 2023

2022
Data Representation Aware of Damage to Extend the Lifetime of MLC NAND Flash Memory.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

Locality-based transfer learning on compression autoencoder for efficient scientific data lossy compression.
J. Netw. Comput. Appl., 2022

Alias-Chain: Improving Blockchain Scalability via Exploring Content Locality among Transactions.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

2021
Design and Evaluation of a Risk-Aware Failure Identification Scheme for Improved RAS in Erasure-Coded Data Centers.
IEEE Trans. Parallel Distributed Syst., 2021

Reducing the Training Overhead of the HPC Compression Autoencoder via Dataset Proportioning.
Proceedings of the IEEE International Conference on Networking, Architecture and Storage, 2021

2020
Minority Disk Failure Prediction Based on Transfer Learning in Large Data Centers of Heterogeneous Disk Systems.
IEEE Trans. Parallel Distributed Syst., 2020

Compression Ratio Modeling and Estimation across Error Bounds for Lossy Compression.
IEEE Trans. Parallel Distributed Syst., 2020

Understanding and analysis of B+ trees on NVM towards consistency and efficiency.
CCF Trans. High Perform. Comput., 2020

Editorial for the special issue on storage system and technology.
CCF Trans. High Perform. Comput., 2020

MatrixKV: Reducing Write Stalls and Write Amplification in LSM-tree Based KV Stores with Matrix Container in NVM.
Proceedings of the 2020 USENIX Annual Technical Conference, 2020

AZ-Recovery: An Efficient Crossing-AZ Recovery Scheme for Erasure Coded Cloud Storage Systems.
Proceedings of the International Symposium on Reliable Distributed Systems, 2020

EC-Fusion: An Efficient Hybrid Erasure Coding Framework to Improve Both Application and Recovery Performance in Cloud Storage Systems.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

StragglerHelper: Alleviating Straggling in Computing Clusters via Sharing Memory Access Patterns.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

A Rack-Aware Pipeline Repair Scheme for Erasure-Coded Distributed Storage Systems.
Proceedings of the ICPP 2020: 49th International Conference on Parallel Processing, 2020

2019
Improving Cache Performance for Large-Scale Photo Stores via Heuristic Prefetching Scheme.
IEEE Trans. Parallel Distributed Syst., 2019

SEALDB: An Efficient LSM-tree Based KV Store on SMR Drives with Sets and Dynamic Bands.
IEEE Trans. Parallel Distributed Syst., 2019

Can I/O Variability Be Reduced on QoS-Less HPC Storage Systems?
IEEE Trans. Computers, 2019

SCORE: A Novel Scheme to Efficiently Cache Overlong ECCs in NAND Flash Memory.
ACM Trans. Archit. Code Optim., 2019

An optimal checkpointing model with online OCI adjustment for stream processing applications.
Concurr. Comput. Pract. Exp., 2019

Exploring Transfer Learning to Reduce Training Overhead of HPC Data in Machine Learning.
Proceedings of the 2019 IEEE International Conference on Networking, 2019

AZ-Code: An Efficient Availability Zone Level Erasure Code to Provide High Fault Tolerance in Cloud Storage Systems.
Proceedings of the 35th Symposium on Mass Storage Systems and Technologies, 2019

Optimizing the Parity Check Matrix for Efficient Decoding of RS-Based Cloud Storage Systems.
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Transfer Learning based Failure Prediction for Minority Disks in Large Data Centers of Heterogeneous Disk Systems.
Proceedings of the 48th International Conference on Parallel Processing, 2019

GearDB: A GC-free Key-Value Store on HM-SMR Drives with Gear Compaction.
Proceedings of the 17th USENIX Conference on File and Storage Technologies, 2019

2018
Alleviating Memory Refresh Overhead via Data Compression for High Performance and Energy Efficiency.
IEEE Trans. Parallel Distributed Syst., 2018

Early Identification of Critical Blocks: Making Replicated Distributed Storage Systems Reliable Against Node Failures.
IEEE Trans. Parallel Distributed Syst., 2018

RAFI: Risk-Aware Failure Identification to Improve the RAS in Erasure-coded Data Centers.
Proceedings of the 2018 USENIX Annual Technical Conference, 2018

Exploiting Minipage-Level Mapping to Improve Write Efficiency of NAND Flash.
Proceedings of the 2018 IEEE International Conference on Networking, 2018

Reference-Counter Aware Deduplication in Erasure-Coded Distributed Storage System.
Proceedings of the 2018 IEEE International Conference on Networking, 2018

Chameleon: An Adaptive Wear Balancer for Flash Clusters.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

A Set-Aware Key-Value Store on Shingled Magnetic Recording Drives with Dynamic Band.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Understanding and Modeling Lossy Compression Schemes on HPC Scientific Data.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

A Cost-effective and Energy-efficient Architecture for Die-stacked DRAM/NVM Memory Systems.
Proceedings of the 37th IEEE International Performance Computing and Communications Conference, 2018

Demystifying Cache Policies for Photo Stores at Scale: A Tencent Case Study.
Proceedings of the 32nd International Conference on Supercomputing, 2018

OSPADA: One-Shot Programming Aware Data Allocation Policy to Improve 3D NAND Flash Read Performance.
Proceedings of the 36th IEEE International Conference on Computer Design, 2018

Exploring the Optimal Platform Configuration for Power-Constrained HPC Workflows.
Proceedings of the 27th International Conference on Computer Communication and Networks, 2018

DREAM: Data Representation Aware of Damage to Extend the Lifetime of MLC NAND Flash Memory.
Proceedings of the 10th USENIX Workshop on Hot Topics in Storage and File Systems, 2018

Envisioning an Information Assurance and Performance Infrastructure for the Internet of Things.
Proceedings of the 4th IEEE International Conference on Collaboration and Internet Computing, 2018

2017
Understanding and Alleviating the Impact of the Flash Address Translation on Solid State Devices.
ACM Trans. Storage, 2017

Building Efficient Key-Value Stores via a Lightweight Compaction Tree.
ACM Trans. Storage, 2017

A Program Interference Error Aware LDPC Scheme for Improving NAND Flash Decoding Performance.
ACM Trans. Embed. Comput. Syst., 2017

Resemblance and mergence based indexing for high performance data deduplication.
J. Syst. Softw., 2017

IOTune: A G-states Driver for Elastic Performance of Block Storage.
CoRR, 2017

Toward Managing HPC Burst Buffers Effectively: Draining Strategy to Regulate Bursty I/O Behavior.
Proceedings of the 25th IEEE International Symposium on Modeling, 2017

SELF: A High Performance and Bandwidth Efficient Approach to Exploiting Die-Stacked DRAM as Part of Memory.
Proceedings of the 25th IEEE International Symposium on Modeling, 2017

StoreRush: An Application-Level Approach to Harvesting Idle Storage in a Best Effort Environment.
Proceedings of the International Conference on Computational Science, 2017

Effective Running of End-to-End HPC Workflows on Emerging Heterogeneous Architectures.
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017

2016
Reducing Fragmentation for In-line Deduplication Backup Storage via Exploiting Backup History and Cache Knowledge.
IEEE Trans. Parallel Distributed Syst., 2016

H-Scale: A Fast Approach to Scale Disk Arrays via Hybrid Stripe Deployment.
ACM Trans. Storage, 2016

A Credit-Based Load-Balance-Aware CTA Scheduling Optimization Scheme in GPGPU.
Int. J. Parallel Program., 2016

Achieving High Reliability via Expediting the Repair of Critical Blocks in Replicated Storage Systems.
Proceedings of the 35th IEEE Symposium on Reliable Distributed Systems, 2016

CAR: A Compression-Aware Refresh Approach to Improve Memory Performance and Energy Efficiency.
Proceedings of the 2016 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Science, 2016

REAL: A retention error aware LDPC decoding scheme to improve NAND flash read performance.
Proceedings of the 32nd Symposium on Mass Storage Systems and Technologies, 2016

Improve Restore Speed in Deduplication Systems Using Segregated Cache.
Proceedings of the 24th IEEE International Symposium on Modeling, 2016

LAMS: A latency-aware memory scheduling policy for modern DRAM systems.
Proceedings of the 35th IEEE International Performance Computing and Communications Conference, 2016

Successor: Proactive cache warm-up of destination hosts in virtual machine migration contexts.
Proceedings of the 35th Annual IEEE International Conference on Computer Communications, 2016

RMD: A Resemblance and Mergence Based Approach for High Performance Deduplication.
Proceedings of the 45th International Conference on Parallel Processing, 2016

CoARC: Co-operative, Aggressive Recovery and Caching for Failures in Erasure Coded Hadoop.
Proceedings of the 45th International Conference on Parallel Processing, 2016

ROP: Alleviating Refresh Overheads via Reviving the Memory System in Frozen Cycles.
Proceedings of the 45th International Conference on Parallel Processing, 2016

Power-Capping Aware Checkpointing: On the Interplay Among Power-Capping, Temperature, Reliability, Performance, and Energy.
Proceedings of the 46th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2016

2015
De-Frag: an efficient scheme to improve deduplication performance via reducing data placement de-linearization.
Clust. Comput., 2015

FINGER: A novel erasure coding scheme using fine granularity blocks to improve Hadoop write and update performance.
Proceedings of the 10th IEEE International Conference on Networking, 2015

Alleviating DRAM Refresh Overhead Via Inter-rank Piggyback Caching.
Proceedings of the 23rd IEEE International Symposium on Modeling, 2015

A Stall-Aware Warp Scheduling for Dynamically Optimizing Thread-level Parallelism in GPGPUs.
Proceedings of the 29th ACM on International Conference on Supercomputing, 2015

Code 5-6: An Efficient MDS Array Coding Scheme to Accelerate Online RAID Level Migration.
Proceedings of the 44th International Conference on Parallel Processing, 2015

PPM: A Partitioned and Parallel Matrix Algorithm to Accelerate Encoding/Decoding Process of Asymmetric Parity Erasure Codes.
Proceedings of the 44th International Conference on Parallel Processing, 2015

Design Tradeoffs for Data Deduplication Performance in Backup Workloads.
Proceedings of the 13th USENIX Conference on File and Storage Technologies, 2015

An efficient page-level FTL to optimize address translation in flash memory.
Proceedings of the Tenth European Conference on Computer Systems, 2015

BPS: A Balanced Partial Stripe Write Scheme to Improve the Write Performance of RAID-6.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

2014
Hint-K: An Efficient Multilevel Cache Using K-Step Hints.
IEEE Trans. Parallel Distributed Syst., 2014

Reducing SSD access latency via NAND flash program and erase suspension.
J. Syst. Archit., 2014

DMVL: An I/O bandwidth dynamic allocation method for virtual networks.
J. Netw. Comput. Appl., 2014

FlexECC: Partially Relaxing ECC of MLC SSD for Better Cache Performance.
Proceedings of the 2014 USENIX Annual Technical Conference, 2014

Accelerating Restore and Garbage Collection in Deduplication-based Backup Systems via Exploiting Historical Information.
Proceedings of the 2014 USENIX Annual Technical Conference, 2014

Exploiting Decoding Computational Locality to Improve the I/O Performance of an XOR-Coded Storage Cluster under Concurrent Failures.
Proceedings of the 33rd IEEE International Symposium on Reliable Distributed Systems, 2014

Alleviating I/O interference via caching and rate-controlled prefetching without degrading migration performance.
Proceedings of the 9th Parallel Data Storage Workshop, 2014

Clique Migration: Affinity Grouping of Virtual Machines for Inter-cloud Live Migration.
Proceedings of the 9th IEEE International Conference on Networking, 2014

A hybrid erasure-coded ECC scheme to improve performance and reliability of solid state drives.
Proceedings of the IEEE 33rd International Performance Computing and Communications Conference, 2014

An aggressive worn-out flash block management scheme to alleviate SSD performance degradation.
Proceedings of the Ninth Eurosys Conference 2014, 2014

APR: A Novel Parallel Repacking Algorithm for Efficient GPGPU Parallel Code Transformation.
Proceedings of the Seventh Workshop on General Purpose Processing Using GPUs, 2014

2013
An Efficient Penalty-Aware Cache to Improve the Performance of Parity-Based Disk Arrays under Faulty Conditions.
IEEE Trans. Parallel Distributed Syst., 2013

Exploiting workload dynamics to improve SSD read latency via differentiated error correction codes.
ACM Trans. Design Autom. Electr. Syst., 2013

Optimisation schemes to improve hybrid co-scheduling for concurrent virtual machines.
Int. J. Parallel Emergent Distributed Syst., 2013

D-PALD: A Dynamic Power-Aware Load Dispatcher with Response Time Percentile Guarantee in Heterogeneous Clusters.
Proceedings of the IEEE Eighth International Conference on Networking, 2013

A novel I/O scheduler for SSD with improved performance and lifetime.
Proceedings of the IEEE 29th Symposium on Mass Storage Systems and Technologies, 2013

A Comprehensive Analysis of XOR-Based Erasure Codes Tolerating 3 or More Concurrent Failures.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

A Flexible Framework to Enhance RAID-6 Scalability via Exploiting the Similarities among MDS Codes.
Proceedings of the 42nd International Conference on Parallel Processing, 2013

2012
An adaptive write buffer management scheme for flash-based SSDs.
ACM Trans. Storage, 2012

Improving Cloud Survivability through Dependency based Virtual Machine Placement.
Proceedings of the SECRYPT 2012, 2012

Distributed Virtual Diskless Checkpointing: A Highly Fault Tolerant Scheme for Virtualized Clusters.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

GSR: A Global Stripe-Based Redistribution Approach to Accelerate RAID-5 Scaling.
Proceedings of the 41st International Conference on Parallel Processing, 2012

Reducing SSD read latency via NAND flash program and erase suspension.
Proceedings of the 10th USENIX conference on File and Storage Technologies, 2012

Delta-FTL: improving SSD lifetime via exploiting content locality.
Proceedings of the European Conference on Computer Systems, 2012

SDM: A Stripe-Based Data Migration Scheme to Improve the Scalability of RAID-6.
Proceedings of the 2012 IEEE International Conference on Cluster Computing, 2012

2011
Design and Evaluation of an Online Anomaly Detector for Distributed Storage Systems.
J. Softw., 2011

Victim Disk First: An Asymmetric Cache to Boost the Performance of Disk Arrays under Faulty Conditions.
Proceedings of the 2011 USENIX Annual Technical Conference, 2011

Hybrid Co-scheduling Optimizations for Concurrent Applications in Virtualized Environments.
Proceedings of the Sixth International Conference on Networking, Architecture, and Storage, 2011

H-Code: A Hybrid MDS Array Code to Optimize Partial Stripe Writes in RAID-6.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

HDP code: A Horizontal-Diagonal Parity Code to Optimize I/O load balancing in RAID-6.
Proceedings of the 2011 IEEE/IFIP International Conference on Dependable Systems and Networks, 2011

2010
A Dynamic Performance-Based Flow Control Method for High-Speed Data Transfer.
IEEE Trans. Parallel Distributed Syst., 2010

An Online Performance Anomaly Detector in Cluster File Systems.
Proceedings of the Third International Symposium on Parallel Architectures, 2010

Characterizing the Dependability of Distributed Storage Systems Using a Two-Layer Hidden Markov Model-Based Approach.
Proceedings of the Fifth International Conference on Networking, Architecture, and Storage, 2010

BPAC: An adaptive write buffer management scheme for flash-based Solid State Drives.
Proceedings of the IEEE 26th Symposium on Mass Storage Systems and Technologies, 2010

DiffECC: Improving SSD Read Performance Using Differentiated Error Correction Coding Schemes.
Proceedings of the MASCOTS 2010, 2010

An adaptive I/O load distribution scheme for distributed systems.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Reproducing non-deterministic bugs with lightweight recording in production environments.
Proceedings of the 29th International Performance Computing and Communications Conference, 2010

Hint-K: An Efficient Multi-level Cache Using K-Step Hints.
Proceedings of the 39th International Conference on Parallel Processing, 2010

Code-M: A non-MDS erasure code scheme to support fast recovery from up to two-disk failures in storage systems.
Proceedings of the 2010 IEEE/IFIP International Conference on Dependable Systems and Networks, 2010

2009
Symmetric active/active metadata service for high availability parallel file systems.
J. Parallel Distributed Comput., 2009

An efficient design for fast memory registration in RDMA.
J. Netw. Comput. Appl., 2009

Hotspot Prediction and cache in distributed stream-processing storage systems.
Proceedings of the 28th International Performance Computing and Communications Conference, 2009

uStream: A User-Level Stream Protocol over Infiniband.
Proceedings of the 15th IEEE International Conference on Parallel and Distributed Systems, 2009

An Extensible I/O Performance Analysis Framework for Distributed Environments.
Proceedings of the Euro-Par 2009 Parallel Processing, 2009

Implementing WebGIS on Hadoop: A case study of improving small file I/O performance on HDFS.
Proceedings of the 2009 IEEE International Conference on Cluster Computing, August 31, 2009

2008
Failure Prediction Models for Proactive Fault Tolerance Within Storage Environments.
Proceedings of the 16th International Symposium on Modeling, 2008

Performance adaptive UDP for high-speed bulk data transfer over dedicated links.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

An Adaptive Cache Management Using Dual LRU Stacks to Improve Buffer Cache Performance.
Proceedings of the 2008 IEEE International Performance, 2008

Evaluation of Data Dissemination Methods in Wireless Sensor Networks.
Proceedings of the 2008 International Conference on Wireless Networks, 2008

Tolerating Temporal Correlated Failures from Cyclic Dependency in High Performance Computing Systems.
Proceedings of the 14th International Conference on Parallel and Distributed Systems, 2008

An Attribute-Based Dynamic Data Organization in Mass Storage Systems.
Proceedings of the Seventh International Conference on Grid and Cooperative Computing, 2008

Symmetric Active/Active High Availability for High-Performance Computing System Services: Accomplishments and Limitations.
Proceedings of the 8th IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2008), 2008

Symmetric Active/Active Replication for Dependent Services.
Proceedings of the Third International Conference on Availability, 2008

2007
Evaluation of iPVFS: A High Performance Parallel File System over iSCSI for Cluster Computing.
Int. J. Comput. Their Appl., 2007

A unified multiple-level cache for high performance storage systems.
Int. J. High Perform. Comput. Netw., 2007

An SRP Target Mode to Improve Read Performance of SRP-Based IB-SANs.
Proceedings of the Parallel and Distributed Processing and Applications, 2007

A Fast Delivery Protocol for Total Order Broadcasting.
Proceedings of the 16th International Conference on Computer Communications and Networks, 2007

Transparent Symmetric Active/Active Replication for Service-Level High Availability.
Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2007), 2007

On Programming Models for Service-Level High Availability.
Proceedings of the Second International Conference on Availability, 2007

2006
Symmetric Active/Active High Availability for High-Performance Computing System Services.
J. Comput., 2006

Book Review: Scalable and Secure Internet Services and Architecture.
Int. J. High Perform. Comput. Netw., 2006

Active/Active Replication for Highly Available HPC System Services.
Proceedings of the First International Conference on Availability, 2006

2005
SPEK: A Storage Performance Evaluation Kernel Module for Block-Level Storage Systems under Faulty Conditions.
IEEE Trans. Dependable Secur. Comput., 2005

A Unified Multiple-Level Cache for High Performance Storage Systems.
Proceedings of the 13th International Symposium on Modeling, 2005

Design and Evaluation of a High Performance Parallel File System.
Proceedings of the 30th Annual IEEE Conference on Local Computer Networks (LCN 2005), 2005

Efficient file sharing strategy in DHT based P2P systems.
Proceedings of the 24th IEEE International Performance Computing and Communications Conference, 2005

2004
STICS: SCSI-to-IP cache for storage area networks.
J. Parallel Distributed Comput., 2004

Online Remote Data Backup for iSCSI-Based Storage Systems.
Proceedings of the International Conference on Internet Computing, 2004

2003
Performance evaluation of distributed iSCSI RAID.
Proceedings of the International Workshop on Storage Network Architecture and Parallel I/Os, 2003

SPEK: A Storage Performance Evaluation Kernel Module for Block Level Storage Systems.
Proceedings of the 11th International Workshop on Modeling, 2003

A unified, low-overhead framework to support continuous profiling and optimization.
Proceedings of the 22nd IEEE International Performance Computing and Communications Conference, 2003

Performability Evaluation of Networked Storage Systems Using N-SPEK.
Proceedings of the 3rd IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2003), 2003

2002
Implementation and Performance Evaluation of RAPID-Cache under Linux.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2002

A Caching Strategy to Improve iSCSI Performance.
Proceedings of the 27th Annual IEEE Conference on Local Computer Networks (LCN 2002), 2002

Introducing SCSI-to-IP Cache for Storage Area Networks.
Proceedings of the 31st International Conference on Parallel Processing (ICPP 2002), 2002

2000
Performance Evaluation of Distributed Web Server Architectures under E-Commerce Workloads.
Proceedings of the International Conference on Internet Computing, 2000

