2024
Deep Learning Workload Scheduling in GPU Datacenters: A Survey.
ACM Comput. Surv., June, 2024
Hardware-Software Collaborative Tiered-Memory Management Framework for Virtualization.
ACM Trans. Comput. Syst., May, 2024
A Novel Extensible Simulation Framework for CXL-Enabled Systems.
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference.
CoRR, 2024
Building a High-Performance Graph Storage on Top of Tree-Structured Key-Value Stores.
Big Data Min. Anal., 2024
Taming Hot Bloat Under Virtualization with HUGESCOPE.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024
Fuzzy Information Entropy and Region Biased Matrix Factorization for Web Service QoS Prediction.
Proceedings of the 36th International Conference on Software Engineering and Knowledge Engineering, 2024
RAHN: A Reputation Based Hourglass Network for Web Service QoS Prediction (S).
Proceedings of the 36th International Conference on Software Engineering and Knowledge Engineering, 2024
Characterization of Large Language Model Development in the Datacenter.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024
HyLoReF: A Reputation Based QoS Prediction Framework using Hybrid Location Information.
Proceedings of the IEEE International Conference on Web Services, 2024
EKRM: Efficient Key-Value Retrieval Method to Reduce Data Lookup Overhead for Redis.
Proceedings of the Euro-Par 2024: Parallel Processing, 2024
2023
FHPM: Fine-grained Huge Page Management For Virtualization.
CoRR, 2023
FLORIA: A Fast and Featherlight Approach for Predicting Cache Performance.
Proceedings of the 37th International Conference on Supercomputing, 2023
vTMM: Tiered Memory Management for Virtual Machines.
Proceedings of the Eighteenth European Conference on Computer Systems, 2023
2022
Astraea: A Fair Deep Learning Scheduler for Multi-Tenant GPU Clusters.
IEEE Trans. Parallel Distributed Syst., 2022
Accelerating Address Translation for Virtualization by Leveraging Hardware Mode.
IEEE Trans. Computers, 2022
HMM-V: Heterogeneous Memory Management for Virtualization.
CoRR, 2022
Deep Learning Workload Scheduling in GPU Datacenters: Taxonomy, Challenges and Vision.
CoRR, 2022
Graph Neural Networks Based Memory Inefficiency Detection Using Selective Sampling.
Proceedings of the SC22: International Conference for High Performance Computing, 2022
Exploring GNN based program embedding technologies for binary related tasks.
Proceedings of the 30th IEEE/ACM International Conference on Program Comprehension, 2022
Tear Up the Bubble Boom: Lessons Learned From a Deep Learning Research and Development Cluster.
Proceedings of the IEEE 40th International Conference on Computer Design, 2022
A Question-Oriented Propagation Network for News Reading Comprehension.
Proceedings of the IEEE International Conference on Acoustics, 2022
M3: A Multi-View Fusion and Multi-Decoding Network for Multi-Document Reading Comprehension.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Original Content Is All You Need! an Empirical Study on Leveraging Answer Summary for WikiHowQA Answer Selection Task.
Proceedings of the 29th International Conference on Computational Linguistics, 2022
2021
Penalty- and Locality-aware Memory Allocation in Redis Using Enhanced AET.
ACM Trans. Storage, 2021
Swift shadow paging (SSP): no write-protection but following TLB flushing.
Proceedings of the VEE '21: 17th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments, 2021
An Edge-Fencing Strategy for Optimizing SSSP Computations on Large-Scale Graphs.
Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021
GRAPHSPY: Fused Program Semantic Embedding through Graph Neural Networks for Memory Efficiency.
Proceedings of the 58th ACM/IEEE Design Automation Conference, 2021
2020
Huge Page Friendly Virtualized Memory Management.
J. Comput. Sci. Technol., 2020
GRAPHSPY: Fused Program Semantic-Level Embedding via Graph Neural Networks for Dead Store Detection.
CoRR, 2020
2019
Lightweight and Accurate Memory Allocation in Key-Value Cache.
Int. J. Parallel Program., 2019
EMBA: Efficient Memory Bandwidth Allocation to Improve Performance on Intel Commodity Processor.
Proceedings of the 48th International Conference on Parallel Processing, 2019
pRedis: Penalty and Locality Aware Memory Allocation in Redis.
Proceedings of the ACM Symposium on Cloud Computing, SoCC 2019, 2019
2018
Fast Miss Ratio Curve Modeling for Storage Cache.
ACM Trans. Storage, 2018
An empirical study on selectiviey of retweeting behaviors under multiple exposures in social networks.
J. Comput. Sci., 2018
HUB: hugepage ballooning in kernel-based virtual machines.
Proceedings of the International Symposium on Memory Systems, 2018
A Neural Network Model for Cache and Memory Prediction of Neural Networks.
Proceedings of the IEEE International Conference on Parallel & Distributed Processing with Applications, 2018
Working Set Size Estimation with Hugepages in Virtualization.
Proceedings of the IEEE International Conference on Parallel & Distributed Processing with Applications, 2018
Get Out of the Valley: Power-Efficient Address Mapping for GPUs.
Proceedings of the 45th ACM/IEEE Annual International Symposium on Computer Architecture, 2018
DCAPS: dynamic cache allocation with partial sharing.
Proceedings of the Thirteenth EuroSys Conference, 2018
PACE: Penalty Aware Cache Modeling with Enhanced AET.
Proceedings of the 9th Asia-Pacific Workshop on Systems, 2018
2017
Optimal Symbiosis and Fair Scheduling in Shared Cache.
IEEE Trans. Parallel Distributed Syst., 2017
Optimizing Locality-Aware Memory Management of Key-Value Caches.
IEEE Trans. Computers, 2017
Evaluating the impacts of hugepage on virtual machines.
Sci. China Inf. Sci., 2017
BACM: Barrier-Aware Cache Management for Irregular Memory-Intensive GPGPU Workloads.
Proceedings of the 2017 IEEE International Conference on Computer Design, 2017
POSTER: BACM: Barrier-Aware Cache Management for Irregular Memory-Intensive GPGPU Workloads.
Proceedings of the 26th International Conference on Parallel Architectures and Compilation Techniques, 2017
2016
Dynamic Memory Balancing for Virtualization.
ACM Trans. Archit. Code Optim., 2016
A survey of cloud resource management for complex engineering applications.
Frontiers Comput. Sci., 2016
Kinetic Modeling of Data Eviction in Cache.
Proceedings of the 2016 USENIX Annual Technical Conference, 2016
Barrier-Aware Warp Scheduling for Throughput Processors.
Proceedings of the 2016 International Conference on Supercomputing, 2016
2015
LAMA: Optimized Locality-aware Memory Allocation for Key-value Cache.
Proceedings of the 2015 USENIX Annual Technical Conference, 2015
Optimal Cache Partition-Sharing.
Proceedings of the 44th International Conference on Parallel Processing, 2015
Optimal Footprint Symbiosis in Shared Cache.
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015
Improving TLB Performance by Increasing Hugepage Ratio.
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015
2014
Performance Metrics and Models for Shared Cache.
J. Comput. Sci. Technol., 2014
Optimizing GPU Virtualization with Address Mapping and Delayed Submission.
Proceedings of the 2014 IEEE International Conference on High Performance Computing and Communications, 2014
2013
Revisiting memory management on virtualized environments.
ACM Trans. Archit. Code Optim., 2013
pVEE: A Personalized Virtualized Experimentation Environment for Education Based on Virtual Machines.
Proceedings of the Pervasive Computing and the Networked World, 2013
Failure Recovery: When the Cure Is Worse Than the Disease.
Proceedings of the 14th Workshop on Hot Topics in Operating Systems, 2013
Who decides migration? A migration lock mechanism for virtual machines.
Proceedings of the 9th International Conference on Network and Service Management, 2013
Towards Eliminating Memory Virtualization Overhead.
Proceedings of the Advanced Parallel Processing Technologies, 2013
2012
Dynamic cache partitioning based on hot page migration.
Frontiers Comput. Sci., 2012
A Dynamic Cache Partitioning Mechanism under Virtualization Environment.
Proceedings of the 11th IEEE International Conference on Trust, 2012
Design model execution engine based on web services for distributed geography modeling environment.
Proceedings of the 2012 IEEE International Geoscience and Remote Sensing Symposium, 2012
A model contract and model integration language for integrating geography models in distributed environment.
Proceedings of the 2012 IEEE International Geoscience and Remote Sensing Symposium, 2012
Optimizing Interactive Performance for Desktop-Virtualization Environment.
Proceedings of the Pervasive Computing and the Networked World, 2012
Live Migrating the Virtual Machine Directly Accessing a Physical NIC.
Proceedings of the 2012 IEEE Asia-Pacific Services Computing Conference, 2012
2011
Locating Unregistered Toponym for Web Map Services.
Int. J. Comput. Process. Orient. Lang., 2011
A Rule-Based Pretreatment Mechanism for Online Mobile Map Data.
Proceedings of the 74th IEEE Vehicular Technology Conference, 2011
Selective hardware/software memory virtualization.
Proceedings of the 7th International Conference on Virtual Execution Environments, 2011
Low Cost Working Set Size Tracking.
Proceedings of the 2011 USENIX Annual Technical Conference, 2011
Sharing and reusing geography models via model execution engine.
Proceedings of the 2011 IEEE International Geoscience and Remote Sensing Symposium, 2011
Managing and integrating geography models in distributed environment.
Proceedings of the 2011 IEEE International Geoscience and Remote Sensing Symposium, 2011
Model semantic network for massive spatial information.
Proceedings of the 2011 IEEE International Geoscience and Remote Sensing Symposium, 2011
2010
Dynamic memory paravirtualization transparent to guest OS.
Sci. China Inf. Sci., 2010
DMM: A dynamic memory mapping model for virtual machines.
Sci. China Inf. Sci., 2010
Byte-Map: A Novel Mobile Map Format Using Two-Byte Coordinates.
Proceedings of the 72nd IEEE Vehicular Technology Conference, 2010
LBS-p: A LBS Platform Supporting Online Map Services.
Proceedings of the 72nd IEEE Vehicular Technology Conference, 2010
Evaluating and Optimizing I/O Virtualization in Kernel-based Virtual Machine (KVM).
Proceedings of the Network and Parallel Computing, IFIP International Conference, 2010
Web Service encapsulation of fortran-based geographical model.
Proceedings of the IEEE International Geoscience & Remote Sensing Symposium, 2010
The design and implementation of GIS applications based on SOA.
Proceedings of the IEEE International Geoscience & Remote Sensing Symposium, 2010
A Survey on I/O Virtualization and Optimization.
Proceedings of the Fifth Annual ChinaGrid Conference, ChinaGrid 2010, Guangzhou, 2010
Detecting and Analyzing VM-exits.
Proceedings of the 10th IEEE International Conference on Computer and Information Technology, 2010
An Innovative Course about Network Storage and System Virtualization Technologies in PKU.
Proceedings of the 10th IEEE International Conference on Computer and Information Technology, 2010
2009
Dynamic memory balancing for virtual machines.
ACM SIGOPS Oper. Syst. Rev., 2009
Fast Booting Many Similar Virtual Machines.
Proceedings of the Systems and Virtualization Management. Standards and the Cloud, 2009
A Refined Mobile Map Format and Its Application.
Proceedings of the Advances in Spatial and Temporal Databases, 2009
A Simple Cache Partitioning Approach in a Virtualized Environment.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2009
REMOCA: Hypervisor Remote Disk Cache.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2009
Fast Live Cloning of Virtual Machine Based on Xen.
Proceedings of the 11th IEEE International Conference on High Performance Computing and Communications, 2009
2008
Programming grid: a computer-aided education system for programming courses based on online judge.
Proceedings of the First ACM Summit on Computing Education in China, 2008
ChinaV: Building Virtualized Computing System.
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications, 2008
Live and incremental whole-system migration of virtual machines using block-bitmap.
Proceedings of the 2008 IEEE International Conference on Cluster Computing, 29 September, 2008
A Rule-Based Event Handling Model.
Proceedings of the 3rd IEEE Asia-Pacific Services Computing Conference, 2008
2007
A three-phase algorithm for whole-system live migration of virtual machines.
Proceedings of the CHINA HPC 2007, 2007
2005
An Agent Approach to Spatial Information Grid Architecture Design.
Comput. Artif. Intell., 2005
A Hierarchical Component-based WebGIS and Its Key Technologies.
Comput. Artif. Intell., 2005
The Semantic Annotation of Emergency Event Cases.
Proceedings of the 2005 International Conference on Semantics, 2005
The Study and Application of Crime Emergency Ontology Event Model.
Proceedings of the Knowledge-Based Intelligent Information and Engineering Systems, 2005
XML Approach to Communication Design of WebGIS.
Proceedings of the Web Engineering, 5th International Conference, 2005
Ontological Model of Event for Integration of Inter-organization Applications.
Proceedings of the Computational Science and Its Applications, 2005
Spatial Data Channel in a Mobile Navigation System.
Proceedings of the Computational Science and Its Applications, 2005
PK+ Tree: An Improved Spatial Index Structure of PK Tree.
Proceedings of the Computational Science, 2005
Design Hierarchical Component-Based WebGIS.
Proceedings of the Computational Science, 2005
2004
Component-Based WebGIS and Its Spatial Cache Framework.
Proceedings of the Advances in Web-Age Information Management: 5th International Conference, 2004
Agent-based Spatial Information Collaboration and Parallel Mechanisms.
Proceedings of the 4th IEEE International Workshop on Source Code Analysis and Manipulation (SCAM 2004), 2004
GML Based Ubiquitous WebGIS.
Proceedings of the 4th IEEE International Workshop on Source Code Analysis and Manipulation (SCAM 2004), 2004
A Component-Based WebGIS Geo-Union.
Proceedings of the Web Engineering - 4th International Conference, 2004
SOM: A Novel Model for Defining Topological Line-Region Relations.
Proceedings of the Computational Science and Its Applications, 2004
Load Analysis and Load Control in Geo-agents.
Proceedings of the Computational Science, 2004
A Cache Mechanism for Component-Based WebGIS.
Proceedings of the Computational Science, 2004
A Metadata Framework for Distributed Geo-spatial Databases in Grid Environment.
Proceedings of the Grid and Cooperative Computing, 2004
Mapping Business Workflows onto Network Services Environments.
Proceedings of the Grid and Cooperative Computing, 2004
Design Open Sharing Framework for Spatial Information in Semantic Web.
Proceedings of the Grid and Cooperative Computing, 2004
QoS Analysis on Web Service Based Spatial Integration.
Proceedings of the Grid and Cooperative Computing, 2004
Spatial Application Integrating Infrastructure.
Proceedings of the Electronic Government: Third International Conference, 2004
Web Service and Geographical Information Integration.
Proceedings of the 28th International Computer Software and Applications Conference (COMPSAC 2004), 2004
2003
Spatial semantic network and agent-based framework for spatial information interoperation.
Proceedings of the 2003 IEEE International Conference on Information Reuse and Integration, 2003
Extension of spatial metadata for navigating distributed spatial data.
Proceedings of the 2003 IEEE International Geoscience and Remote Sensing Symposium, 2003
Extension of spatial metadata and agent-based spatial Data navigation mechanism.
Proceedings of the ACM-GIS 2003, 2003
Spatial Information Grid - An Agent Framework.
Proceedings of the Grid and Cooperative Computing, Second International Workshop, 2003
2002
Design and Implementation of Map Visualization Objects in Component-based WebGIS.
Proceedings of the 3rd International Conference on Web Information Systems Engineering Workshops, 2002