Yingwei Luo

Orcid: 0000-0002-7903-0717

According to our database, Yingwei Luo authored at least 119 papers between 2002 and 2024.


Bibliography

2024
Deep Learning Workload Scheduling in GPU Datacenters: A Survey.
ACM Comput. Surv., June, 2024

Hardware-Software Collaborative Tiered-Memory Management Framework for Virtualization.
ACM Trans. Comput. Syst., May, 2024

InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference.
CoRR, 2024

Building a High-Performance Graph Storage on Top of Tree-Structured Key-Value Stores.
Big Data Min. Anal., 2024

Taming Hot Bloat Under Virtualization with HUGESCOPE.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024

Characterization of Large Language Model Development in the Datacenter.
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

HyLoReF: A Reputation Based QoS Prediction Framework using Hybrid Location Information.
Proceedings of the IEEE International Conference on Web Services, 2024

EKRM: Efficient Key-Value Retrieval Method to Reduce Data Lookup Overhead for Redis.
Proceedings of the Euro-Par 2024: Parallel Processing, 2024

2023
FHPM: Fine-grained Huge Page Management For Virtualization.
CoRR, 2023

FLORIA: A Fast and Featherlight Approach for Predicting Cache Performance.
Proceedings of the 37th International Conference on Supercomputing, 2023

vTMM: Tiered Memory Management for Virtual Machines.
Proceedings of the Eighteenth European Conference on Computer Systems, 2023

2022
Astraea: A Fair Deep Learning Scheduler for Multi-Tenant GPU Clusters.
IEEE Trans. Parallel Distributed Syst., 2022

Accelerating Address Translation for Virtualization by Leveraging Hardware Mode.
IEEE Trans. Computers, 2022

HMM-V: Heterogeneous Memory Management for Virtualization.
CoRR, 2022

Deep Learning Workload Scheduling in GPU Datacenters: Taxonomy, Challenges and Vision.
CoRR, 2022

Graph Neural Networks Based Memory Inefficiency Detection Using Selective Sampling.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

Exploring GNN based program embedding technologies for binary related tasks.
Proceedings of the 30th IEEE/ACM International Conference on Program Comprehension, 2022

Tear Up the Bubble Boom: Lessons Learned From a Deep Learning Research and Development Cluster.
Proceedings of the IEEE 40th International Conference on Computer Design, 2022

A Question-Oriented Propagation Network for News Reading Comprehension.
Proceedings of the IEEE International Conference on Acoustics, 2022

M3: A Multi-View Fusion and Multi-Decoding Network for Multi-Document Reading Comprehension.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Original Content Is All You Need! An Empirical Study on Leveraging Answer Summary for WikiHowQA Answer Selection Task.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021
Penalty- and Locality-aware Memory Allocation in Redis Using Enhanced AET.
ACM Trans. Storage, 2021

Swift shadow paging (SSP): no write-protection but following TLB flushing.
Proceedings of the VEE '21: 17th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments, 2021

An Edge-Fencing Strategy for Optimizing SSSP Computations on Large-Scale Graphs.
Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021

GRAPHSPY: Fused Program Semantic Embedding through Graph Neural Networks for Memory Efficiency.
Proceedings of the 58th ACM/IEEE Design Automation Conference, 2021

2020
Huge Page Friendly Virtualized Memory Management.
J. Comput. Sci. Technol., 2020

GRAPHSPY: Fused Program Semantic-Level Embedding via Graph Neural Networks for Dead Store Detection.
CoRR, 2020

2019
Lightweight and Accurate Memory Allocation in Key-Value Cache.
Int. J. Parallel Program., 2019

EMBA: Efficient Memory Bandwidth Allocation to Improve Performance on Intel Commodity Processor.
Proceedings of the 48th International Conference on Parallel Processing, 2019

pRedis: Penalty and Locality Aware Memory Allocation in Redis.
Proceedings of the ACM Symposium on Cloud Computing, SoCC 2019, 2019

2018
Fast Miss Ratio Curve Modeling for Storage Cache.
ACM Trans. Storage, 2018

An empirical study on selectivity of retweeting behaviors under multiple exposures in social networks.
J. Comput. Sci., 2018

HUB: hugepage ballooning in kernel-based virtual machines.
Proceedings of the International Symposium on Memory Systems, 2018

A Neural Network Model for Cache and Memory Prediction of Neural Networks.
Proceedings of the IEEE International Conference on Parallel & Distributed Processing with Applications, 2018

Working Set Size Estimation with Hugepages in Virtualization.
Proceedings of the IEEE International Conference on Parallel & Distributed Processing with Applications, 2018

Get Out of the Valley: Power-Efficient Address Mapping for GPUs.
Proceedings of the 45th ACM/IEEE Annual International Symposium on Computer Architecture, 2018

DCAPS: dynamic cache allocation with partial sharing.
Proceedings of the Thirteenth EuroSys Conference, 2018

PACE: Penalty Aware Cache Modeling with Enhanced AET.
Proceedings of the 9th Asia-Pacific Workshop on Systems, 2018

2017
Optimal Symbiosis and Fair Scheduling in Shared Cache.
IEEE Trans. Parallel Distributed Syst., 2017

Optimizing Locality-Aware Memory Management of Key-Value Caches.
IEEE Trans. Computers, 2017

Evaluating the impacts of hugepage on virtual machines.
Sci. China Inf. Sci., 2017

BACM: Barrier-Aware Cache Management for Irregular Memory-Intensive GPGPU Workloads.
Proceedings of the 2017 IEEE International Conference on Computer Design, 2017

POSTER: BACM: Barrier-Aware Cache Management for Irregular Memory-Intensive GPGPU Workloads.
Proceedings of the 26th International Conference on Parallel Architectures and Compilation Techniques, 2017

2016
Dynamic Memory Balancing for Virtualization.
ACM Trans. Archit. Code Optim., 2016

A survey of cloud resource management for complex engineering applications.
Frontiers Comput. Sci., 2016

Kinetic Modeling of Data Eviction in Cache.
Proceedings of the 2016 USENIX Annual Technical Conference, 2016

Barrier-Aware Warp Scheduling for Throughput Processors.
Proceedings of the 2016 International Conference on Supercomputing, 2016

2015
LAMA: Optimized Locality-aware Memory Allocation for Key-value Cache.
Proceedings of the 2015 USENIX Annual Technical Conference, 2015

Optimal Cache Partition-Sharing.
Proceedings of the 44th International Conference on Parallel Processing, 2015

Optimal Footprint Symbiosis in Shared Cache.
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015

Improving TLB Performance by Increasing Hugepage Ratio.
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015

2014
Performance Metrics and Models for Shared Cache.
J. Comput. Sci. Technol., 2014

Optimizing GPU Virtualization with Address Mapping and Delayed Submission.
Proceedings of the 2014 IEEE International Conference on High Performance Computing and Communications, 2014

2013
Revisiting memory management on virtualized environments.
ACM Trans. Archit. Code Optim., 2013

pVEE: A Personalized Virtualized Experimentation Environment for Education Based on Virtual Machines.
Proceedings of the Pervasive Computing and the Networked World, 2013

Failure Recovery: When the Cure Is Worse Than the Disease.
Proceedings of the 14th Workshop on Hot Topics in Operating Systems, 2013

Who decides migration? A migration lock mechanism for virtual machines.
Proceedings of the 9th International Conference on Network and Service Management, 2013

Towards Eliminating Memory Virtualization Overhead.
Proceedings of the Advanced Parallel Processing Technologies, 2013

2012
Dynamic cache partitioning based on hot page migration.
Frontiers Comput. Sci., 2012

A Dynamic Cache Partitioning Mechanism under Virtualization Environment.
Proceedings of the 11th IEEE International Conference on Trust, 2012

Design model execution engine based on web services for distributed geography modeling environment.
Proceedings of the 2012 IEEE International Geoscience and Remote Sensing Symposium, 2012

A model contract and model integration language for integrating geography models in distributed environment.
Proceedings of the 2012 IEEE International Geoscience and Remote Sensing Symposium, 2012

Optimizing Interactive Performance for Desktop-Virtualization Environment.
Proceedings of the Pervasive Computing and the Networked World, 2012

Live Migrating the Virtual Machine Directly Accessing a Physical NIC.
Proceedings of the 2012 IEEE Asia-Pacific Services Computing Conference, 2012

2011
Locating Unregistered Toponym for Web Map Services.
Int. J. Comput. Process. Orient. Lang., 2011

A Rule-Based Pretreatment Mechanism for Online Mobile Map Data.
Proceedings of the 74th IEEE Vehicular Technology Conference, 2011

Selective hardware/software memory virtualization.
Proceedings of the 7th International Conference on Virtual Execution Environments, 2011

Low Cost Working Set Size Tracking.
Proceedings of the 2011 USENIX Annual Technical Conference, 2011

Sharing and reusing geography models via model execution engine.
Proceedings of the 2011 IEEE International Geoscience and Remote Sensing Symposium, 2011

Managing and integrating geography models in distributed environment.
Proceedings of the 2011 IEEE International Geoscience and Remote Sensing Symposium, 2011

Model semantic network for massive spatial information.
Proceedings of the 2011 IEEE International Geoscience and Remote Sensing Symposium, 2011

2010
Dynamic memory paravirtualization transparent to guest OS.
Sci. China Inf. Sci., 2010

DMM: A dynamic memory mapping model for virtual machines.
Sci. China Inf. Sci., 2010

Byte-Map: A Novel Mobile Map Format Using Two-Byte Coordinates.
Proceedings of the 72nd IEEE Vehicular Technology Conference, 2010

LBS-p: A LBS Platform Supporting Online Map Services.
Proceedings of the 72nd IEEE Vehicular Technology Conference, 2010

Evaluating and Optimizing I/O Virtualization in Kernel-based Virtual Machine (KVM).
Proceedings of the Network and Parallel Computing, IFIP International Conference, 2010

Web Service encapsulation of fortran-based geographical model.
Proceedings of the IEEE International Geoscience & Remote Sensing Symposium, 2010

The design and implementation of GIS applications based on SOA.
Proceedings of the IEEE International Geoscience & Remote Sensing Symposium, 2010

A Survey on I/O Virtualization and Optimization.
Proceedings of the Fifth Annual ChinaGrid Conference, ChinaGrid 2010, Guangzhou, 2010

Detecting and Analyzing VM-exits.
Proceedings of the 10th IEEE International Conference on Computer and Information Technology, 2010

An Innovative Course about Network Storage and System Virtualization Technologies in PKU.
Proceedings of the 10th IEEE International Conference on Computer and Information Technology, 2010

2009
Dynamic memory balancing for virtual machines.
ACM SIGOPS Oper. Syst. Rev., 2009

Fast Booting Many Similar Virtual Machines.
Proceedings of the Systems and Virtualization Management. Standards and the Cloud, 2009

A Refined Mobile Map Format and Its Application.
Proceedings of the Advances in Spatial and Temporal Databases, 2009

A Simple Cache Partitioning Approach in a Virtualized Environment.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2009

REMOCA: Hypervisor Remote Disk Cache.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2009

Fast Live Cloning of Virtual Machine Based on Xen.
Proceedings of the 11th IEEE International Conference on High Performance Computing and Communications, 2009

2008
Programming grid: a computer-aided education system for programming courses based on online judge.
Proceedings of the First ACM Summit on Computing Education in China, 2008

ChinaV: Building Virtualized Computing System.
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications, 2008

Live and incremental whole-system migration of virtual machines using block-bitmap.
Proceedings of the 2008 IEEE International Conference on Cluster Computing, 29 September, 2008

A Rule-Based Event Handling Model.
Proceedings of the 3rd IEEE Asia-Pacific Services Computing Conference, 2008

2007
A three-phase algorithm for whole-system live migration of virtual machines.
Proceedings of the CHINA HPC 2007, 2007

2005
An Agent Approach to Spatial Information Grid Architecture Design.
Comput. Artif. Intell., 2005

A Hierarchical Component-based WebGIS and Its Key Technologies.
Comput. Artif. Intell., 2005

The Semantic Annotation of Emergency Event Cases.
Proceedings of the 2005 International Conference on Semantics, 2005

The Study and Application of Crime Emergency Ontology Event Model.
Proceedings of the Knowledge-Based Intelligent Information and Engineering Systems, 2005

XML Approach to Communication Design of WebGIS.
Proceedings of the Web Engineering, 5th International Conference, 2005

Ontological Model of Event for Integration of Inter-organization Applications.
Proceedings of the Computational Science and Its Applications, 2005

Spatial Data Channel in a Mobile Navigation System.
Proceedings of the Computational Science and Its Applications, 2005

PK+ Tree: An Improved Spatial Index Structure of PK Tree.
Proceedings of the Computational Science, 2005

Design Hierarchical Component-Based WebGIS.
Proceedings of the Computational Science, 2005

2004
Component-Based WebGIS and Its Spatial Cache Framework.
Proceedings of the Advances in Web-Age Information Management: 5th International Conference, 2004

Agent-based Spatial Information Collaboration and Parallel Mechanisms.
Proceedings of the 4th IEEE International Workshop on Source Code Analysis and Manipulation (SCAM 2004), 2004

GML Based Ubiquitous WebGIS.
Proceedings of the 4th IEEE International Workshop on Source Code Analysis and Manipulation (SCAM 2004), 2004

A Component-Based WebGIS Geo-Union.
Proceedings of the Web Engineering - 4th International Conference, 2004

SOM: A Novel Model for Defining Topological Line-Region Relations.
Proceedings of the Computational Science and Its Applications, 2004

Load Analysis and Load Control in Geo-agents.
Proceedings of the Computational Science, 2004

A Cache Mechanism for Component-Based WebGIS.
Proceedings of the Computational Science, 2004

A Metadata Framework for Distributed Geo-spatial Databases in Grid Environment.
Proceedings of the Grid and Cooperative Computing, 2004

Mapping Business Workflows onto Network Services Environments.
Proceedings of the Grid and Cooperative Computing, 2004

Design Open Sharing Framework for Spatial Information in Semantic Web.
Proceedings of the Grid and Cooperative Computing, 2004

QoS Analysis on Web Service Based Spatial Integration.
Proceedings of the Grid and Cooperative Computing, 2004

Spatial Application Integrating Infrastructure.
Proceedings of the Electronic Government: Third International Conference, 2004

Web Service and Geographical Information Integration.
Proceedings of the 28th International Computer Software and Applications Conference (COMPSAC 2004), 2004

2003
Spatial semantic network and agent-based framework for spatial information interoperation.
Proceedings of the 2003 IEEE International Conference on Information Reuse and Integration, 2003

Extension of spatial metadata for navigating distributed spatial data.
Proceedings of the 2003 IEEE International Geoscience and Remote Sensing Symposium, 2003

Extension of spatial metadata and agent-based spatial Data navigation mechanism.
Proceedings of the ACM-GIS 2003, 2003

Spatial Information Grid - An Agent Framework.
Proceedings of the Grid and Cooperative Computing, Second International Workshop, 2003

2002
Design and Implementation of Map Visualization Objects in Component-based WebGIS.
Proceedings of the 3rd International Conference on Web Information Systems Engineering Workshops, 2002

