2025
Continuous-Time Object Segmentation Using High Temporal Resolution Event Camera.
IEEE Trans. Pattern Anal. Mach. Intell., February, 2025
2024
FreePrune: An Automatic Pruning Framework Across Various Granularities Based on Training-Free Evaluation.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., November, 2024
LightFS: A Lightweight Host-CSD Coordinated File System Optimizing for Heavy Small File Accesses.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., November, 2024
Optimizing the Performance of Consistency-Aware Deduplication Using Persistent Memory.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., June, 2024
A Fast Location-Aware Repair Strategy for Mobile Grouped Storage Clusters.
IEEE Internet Things J., June, 2024
BGS: Accelerate GNN training on multiple GPUs.
J. Syst. Archit., 2024
CEIU: Consistent and Efficient Incremental Update mechanism for mobile systems on flash storage.
J. Syst. Archit., 2024
FinerDedup: Sifting Fingerprints for Efficient Data Deduplication on Mobile Devices.
Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024
Finding Visual Saliency in Continuous Spike Stream.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
FSR: A host-storage collaborative mechanism for data path optimization of NDP operations.
J. Syst. Archit., October, 2023
V-WAFA: An Endurance Variation Aware Fine-Grained Allocator for Persistent Memory.
IEEE Trans. Computers, April, 2023
FedMDS: An Efficient Model Discrepancy-Aware Semi-Asynchronous Clustered Federated Learning Framework.
IEEE Trans. Parallel Distributed Syst., March, 2023
LFPR: A Lazy Fast Predictive Repair Strategy for Mobile Distributed Erasure Coded Cluster.
IEEE Internet Things J., 2023
Optimizing the Incremental Update Mechanism by Inlaying File Indexes on Flash Storage.
Proceedings of the 12th Non-Volatile Memory Systems and Applications Symposium, 2023
RadarSSD: A Computational Storage for Radar Signal Processing.
Proceedings of the 52nd International Conference on Parallel Processing, 2023
Data-Quality-Driven Federated Learning for Optimizing Communication Costs.
Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023
Re-compact: Structured Pruning and SpMM Kernel Co-design for Accelerating DNNs on GPUs.
Proceedings of the 41st IEEE International Conference on Computer Design, 2023
An Efficient Scheduling Algorithm for Multi-mode Tasks on Near-Data Processing SSDs.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2023
HBP: Hierarchically Balanced Pruning and Accelerator Co-Design for Efficient DNN Inference.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023
Optimizing the Performance of NDP Operations by Retrieving File Semantics in Storage.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023
2022
Flexible Clustered Federated Learning for Client-Level Data Distribution Shift.
IEEE Trans. Parallel Distributed Syst., 2022
SENTunnel: Fast Path for Sensor Data Access on Automotive Embedded Systems.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022
Self-Adapting Channel Allocation for Multiple Tenants Sharing SSD Devices.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022
ChordMap: Automated Mapping of Streaming Applications Onto CGRA.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022
Horae: A Hybrid I/O Request Scheduling Technique for Near-Data Processing-Based SSD.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022
eRDAC: Efficient and Reliable Remote Direct Access and Control for Embedded Systems.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022
FRL: Fast and Reconfigurable Accelerator for Distributed Sound Source Localization.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022
ELOFS: An Extensible Low-Overhead Flash File System for Resource-Scarce Embedded Devices.
IEEE Trans. Computers, 2022
Federated learning with workload-aware client scheduling in heterogeneous systems.
Neural Networks, 2022
Efficient persistent memory file systems using virtual superpages with multi-level allocator.
J. Syst. Archit., 2022
Towards highly-concurrent leaderless state machine replication for distributed systems.
J. Syst. Archit., 2022
CoDiscard: A revenue model based cross-layer cooperative discarding mechanism for flash memory devices.
J. Syst. Archit., 2022
CADedup: High-performance Consistency-aware Deduplication Based on Persistent Memory.
Proceedings of the IEEE 40th International Conference on Computer Design, 2022
VEA: An FPGA-Based Voxel Encoding Accelerator for 3D Object Detection with LiDAR.
Proceedings of the IEEE 40th International Conference on Computer Design, 2022
3DS: An Efficient DPDK-based Data Distribution Service for Distributed Real-time Applications.
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022
Weak Network Oriented Mobile Distributed Storage: A Hybrid Fault-Tolerance Scheme Based on Potential Replicas.
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022
CacheSifter: Sifting Cache Files for Boosted Mobile Performance and Lifetime.
Proceedings of the 20th USENIX Conference on File and Storage Technologies, 2022
Optimizing CoW-based File Systems on Open-Channel SSDs with Persistent Memory.
Proceedings of the 2022 Design, Automation & Test in Europe Conference & Exhibition, 2022
GATLB: A Granularity-Aware TLB to Support Multi-Granularity Pages in Hybrid Memory System.
Proceedings of the 2022 Design, Automation & Test in Europe Conference & Exhibition, 2022
SAPredictor: a simple and accurate self-adaptive predictor for hierarchical hybrid memory system.
Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022
Lazy repair with temporary redundancy(LRTR): reducing repair network traffic in erasure-coded storage.
Proceedings of the CF '22: 19th ACM International Conference on Computing Frontiers, Turin, Italy, May 17, 2022
2021
Improving the Performance of Deduplication-Based Storage Cache via Content-Driven Cache Management Methods.
IEEE Trans. Parallel Distributed Syst., 2021
Self-Balancing Federated Learning With Global Imbalanced Data in Mobile Systems.
IEEE Trans. Parallel Distributed Syst., 2021
On the Design of Minimal-Cost Pipeline Systems Satisfying Hard/Soft Real-Time Constraints.
IEEE Trans. Emerg. Top. Comput., 2021
Bridging Mismatched Granularity Between Embedded File Systems and Flash Memory.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2021
Contour: A Process Variation Aware Wear-Leveling Mechanism for Inodes of Persistent Memory File Systems.
IEEE Trans. Computers, 2021
MobileRE: A replicas prioritized hybrid fault tolerance strategy for mobile distributed system.
J. Syst. Archit., 2021
A machine learning assisted data placement mechanism for hybrid storage systems.
J. Syst. Archit., 2021
LPE: Locality-Based Dead Prediction in Exclusive TLB for Large Coverage.
J. Circuits Syst. Comput., 2021
Flexible Clustered Federated Learning for Client-Level Data Distribution Shift.
CoRR, 2021
FedGroup: Efficient Federated Learning via Decomposed Similarity-Based Clustering.
Proceedings of the 2021 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom), New York City, NY, USA, September 30, 2021
CSAFL: A Clustered Semi-Asynchronous Federated Learning Framework.
Proceedings of the International Joint Conference on Neural Networks, 2021
FedSAE: A Novel Self-Adaptive Federated Learning Framework in Heterogeneous Systems.
Proceedings of the International Joint Conference on Neural Networks, 2021
Forseti: An Efficient Basic-block-level Sensitivity Analysis Framework Towards Multi-bit Faults.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2021
DFShards: effective construction of MRCs online for non-stack algorithms.
Proceedings of the CF '21: Computing Frontiers Conference, 2021
AIR Cache: A Variable-Size Block Cache Based on Fine-Grained Management Method.
Proceedings of the Web and Big Data - 5th International Joint Conference, 2021
2020
APMigration: Improving Performance of Hybrid Memory Performance via An Adaptive Page Migration Method.
IEEE Trans. Parallel Distributed Syst., 2020
Downsizing Without Downgrading: Approximated Dynamic Time Warping on Nonvolatile Memories.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020
Separable Binary Convolutional Neural Network on Embedded Systems.
IEEE Trans. Computers, 2020
Optimizing synchronization mechanism for block-based file systems using persistent memory.
Future Gener. Comput. Syst., 2020
FedGroup: Ternary Cosine Similarity-based Clustered Federated Learning Framework toward High Accuracy in Heterogeneous Data.
CoRR, 2020
HydraFS: an efficient NUMA-aware in-memory file system.
Clust. Comput., 2020
SSDKeeper: Self-Adapting Channel Allocation to Improve the Performance of SSD Devices.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020
Themis: Malicious Wear Detection and Defense for Persistent Memory File Systems.
Proceedings of the 26th IEEE International Conference on Parallel and Distributed Systems, 2020
WMAlloc: A Wear-Leveling-Aware Multi-Grained Allocator for Persistent Memory File Systems.
Proceedings of the 26th IEEE International Conference on Parallel and Distributed Systems, 2020
Unified-TP: A Unified TLB and Page Table Cache Structure for Efficient Address Translation.
Proceedings of the 38th IEEE International Conference on Computer Design, 2020
MobileRE: A Hybrid Fault Tolerance Strategy Combining Erasure Codes and Replicas for Mobile Distributed Cluster.
Proceedings of the 22nd IEEE International Conference on High Performance Computing and Communications; 18th IEEE International Conference on Smart City; 6th IEEE International Conference on Data Science and Systems, 2020
Differentiating Cache Files for Fine-grain Management to Improve Mobile Performance and Lifetime.
Proceedings of the 12th USENIX Workshop on Hot Topics in Storage and File Systems, 2020
Optimizing Performance of Persistent Memory File Systems using Virtual Superpages.
Proceedings of the 2020 Design, Automation & Test in Europe Conference & Exhibition, 2020
LOFFS: A Low-Overhead File System for Large Flash Memory on Embedded Devices.
Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020
Efficient Multi-Grained Wear Leveling for Inodes of Persistent Memory File Systems.
Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020
2019
On the Design of Time-Constrained and Buffer-Optimal Self-Timed Pipelines.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2019
HiNextApp: A context-aware and adaptive framework for app prediction in mobile systems.
Sustain. Comput. Informatics Syst., 2019
FitCNN: A cloud-assisted and low-cost framework for updating CNNs on IoT devices.
Future Gener. Comput. Syst., 2019
CDAC: Content-Driven Deduplication-Aware Storage Cache.
Proceedings of the 35th Symposium on Mass Storage Systems and Technologies, 2019
Power-Aware Virtual Machine Placement for Mobile Edge Computing.
Proceedings of the 2019 International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, 2019
Optimizing the Data Transmission Scheme for Edge-Based Automatic Driving.
Proceedings of the 15th IEEE International Conference on Embedded Software and Systems, 2019
Archivist: A Machine Learning Assisted Data Placement Mechanism for Hybrid Storage Systems.
Proceedings of the 37th IEEE International Conference on Computer Design, 2019
Astraea: Self-Balancing Federated Learning for Improving Classification Accuracy of Mobile Deep Learning Applications.
Proceedings of the 37th IEEE International Conference on Computer Design, 2019
Reducing Write Amplification for Inodes of Journaling File System using Persistent Memory.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2019
UIMigrate: Adaptive Data Migration for Hybrid Non-Volatile Memory Systems.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2019
Tumbler: Energy Efficient Task Scheduling for Dual-Channel Solar-Powered Sensor Nodes.
Proceedings of the 56th Annual Design Automation Conference 2019, 2019
A Wear-Leveling-Aware Fine-Grained Allocator for Non-Volatile Memory.
Proceedings of the 56th Annual Design Automation Conference 2019, 2019
2018
Heterogeneous FPGA-Based Cost-Optimal Design for Timing-Constrained CNNs.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2018
Towards the Design of Efficient and Consistent Index Structure with Minimal Write Activities for Non-Volatile Memory.
IEEE Trans. Computers, 2018
带磨损均衡的小粒度非易失性内存管理机制 (In-page Wear-leveling Memory Management Based on Non-volatile Memory).
计算机科学, 2018
UMFS: An efficient user-space file system for non-volatile memory.
J. Syst. Archit., 2018
Synthesizing distributed pipelining systems with timing constraints via optimal functional unit assignment and communication selection.
J. Comput. Sci., 2018
DWARM: A wear-aware memory management scheme for in-memory file systems.
Future Gener. Comput. Syst., 2018
An Efficient File System for Hybrid In-Memory NVM and Block Devices.
Proceedings of the IEEE 7th Non-Volatile Memory Systems and Applications Symposium, 2018
Puppet: Energy Efficient Task Mapping For Storage-Less and Converter-Less Solar-Powered Non-Volatile Sensor Nodes.
Proceedings of the 36th IEEE International Conference on Computer Design, 2018
On the Design of Reliable Heterogeneous Systems via Checkpoint Placement and Core Assignment.
Proceedings of the 2018 on Great Lakes Symposium on VLSI, 2018
Efficient wear leveling for inodes of file systems on persistent memories.
Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018
2017
Optimal Functional-Unit Assignment for Heterogeneous Systems Under Timing Constraint.
IEEE Trans. Parallel Distributed Syst., 2017
面向内存文件系统的数据一致性更新机制研究 (Research on Data Consistency for In-memory File Systems).
计算机科学, 2017
Refinery swap: An efficient swap mechanism for hybrid DRAM-NVM systems.
Future Gener. Comput. Syst., 2017
BOSS: An Efficient Data Distribution Strategy for Object Storage Systems With Hybrid Devices.
IEEE Access, 2017
Towards the design of optimal range assignment for elevator groups under fluctuant traffic loads.
Proceedings of the 23rd IEEE International Conference on Embedded and Real-Time Computing Systems and Applications, 2017
UDORN: A design framework of persistent in-memory key-value database for NVM.
Proceedings of the IEEE 6th Non-Volatile Memory Systems and Applications Symposium, 2017
Optimal functional unit assignment and voltage selection for pipelined MPSoC with guaranteed probability on time performance.
Proceedings of the 18th ACM SIGPLAN/SIGBED Conference on Languages, 2017
2016
Properties of Self-Timed Ring Architectures for Deadlock-Free and Consistent Configuration Reaching Maximum Throughput.
J. Signal Process. Syst., 2016
Efficient Data Placement for Improving Data Access Performance on Domain-Wall Memory.
IEEE Trans. Very Large Scale Integr. Syst., 2016
A New Design of In-Memory File System Based on File Virtual Address Framework.
IEEE Trans. Computers, 2016
连接操作在SIMFS和EXT4上的性能比较 (Performance Comparison of Join Operations on SIMFS and EXT4).
计算机科学, 2016
A unified framework for designing high performance in-memory and hybrid memory file systems.
J. Syst. Archit., 2016
Performance Optimization for In-Memory File Systems on NUMA Machines.
Proceedings of the 17th International Conference on Parallel and Distributed Computing, 2016
The design and implementation of an efficient user-space in-memory file system.
Proceedings of the 5th Non-Volatile Memory Systems and Applications Symposium, 2016
Optimizing Data Placement of MapReduce on Ceph-Based Framework under Load-Balancing Constraint.
Proceedings of the 22nd IEEE International Conference on Parallel and Distributed Systems, 2016
Optimal Functional Assignment and Communication Selection under Timing Constraint for Self-Timed Pipelines.
Proceedings of the 13th International Conference on Embedded Software and Systems, 2016
The Design and Implementation of an Efficient Data Consistency Mechanism for In-Memory File Systems.
Proceedings of the 13th International Conference on Embedded Software and Systems, 2016
The design of an efficient swap mechanism for hybrid DRAM-NVM systems.
Proceedings of the 2016 International Conference on Embedded Software, 2016
Optimal functional-unit assignment and buffer placement for probabilistic pipelines.
Proceedings of the Eleventh IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis, 2016
The Design and Implementation of a High-Performance Hybrid Memory File System.
Proceedings of the International Conference on Advanced Cloud and Big Data, 2016
2015
Designing an efficient persistent in-memory file system.
Proceedings of the IEEE Non-Volatile Memory System and Applications Symposium, 2015
Prevent Deadlock and Remove Blocking for Self-Timed Systems.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2015
On the Design of High-Performance and Energy-Efficient Probabilistic Self-Timed Systems.
Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015
Optimizing data placement for reducing shift operations on domain wall memories.
Proceedings of the 52nd Annual Design Automation Conference, 2015
2013
Effective file data-block placement for different types of page cache on hybrid main memory architectures.
Des. Autom. Embed. Syst., 2013