Minyi Guo

Orcid: 0000-0003-0034-2302

Affiliations:
  • Shanghai Jiao Tong University, Shanghai, China


According to our database, Minyi Guo authored at least 596 papers between 1997 and 2025.

Awards

IEEE Fellow 2018, "For contributions to performance optimization and resource management of parallel and distributed systems".

Bibliography

2025
FLAPS: fluctuation-aware power auction strategy for reducing the power overload probability.
Frontiers Comput. Sci., May, 2025

Dynamic-EC: an efficient dynamic erasure coding method for permissioned blockchain systems.
Frontiers Comput. Sci., January, 2025

BAFT: bubble-aware fault-tolerant framework for distributed DNN training with hybrid parallelism.
Frontiers Comput. Sci., January, 2025

2024
Enabling Long Range Point Cloud Registration in Vehicular Networks via Multi-Hop Relays.
IEEE Trans. Mob. Comput., December, 2024

Automatic Mapping of Heterogeneous DNN Models on Adaptive Multiaccelerator Systems.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., December, 2024

Taming Distributed One-Hop Multicasting in Millimeter-Wave VANETs.
IEEE Trans. Mob. Comput., November, 2024

Adaptive QoS-Aware Microservice Deployment With Excessive Loads via Intra- and Inter-Datacenter Scheduling.
IEEE Trans. Parallel Distributed Syst., September, 2024

Hardware-Software Co-Design Enabling Static and Dynamic Sparse Attention Mechanisms.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., September, 2024

Novas: Tackling Online Dynamic Video Analytics With Service Adaptation at Mobile Edge Servers.
IEEE Trans. Computers, September, 2024

Ada-WL: An Adaptive Wear-Leveling Aware Data Migration Approach for Flexible SSD Array Scaling in Clusters.
IEEE Trans. Computers, August, 2024

Bayesian-Driven Automated Scaling in Stream Computing With Multiple QoS Targets.
IEEE Trans. Parallel Distributed Syst., July, 2024

Elevation Changes of A'nyemaqen Snow Mountain Revealed with Satellite Remote Sensing.
Remote. Sens., July, 2024

Accelerating Sparse DNNs Based on Tiled GEMM.
IEEE Trans. Computers, May, 2024

FlowWalker: A Memory-efficient and High-performance GPU-based Dynamic Graph Random Walk Framework.
Proc. VLDB Endow., April, 2024

SHA: QoS-Aware Software and Hardware Auto-Tuning for Database Systems.
J. Comput. Sci. Technol., March, 2024

DQS: A QoS-driven routing optimization approach in SDN using deep reinforcement learning.
J. Parallel Distributed Comput., 2024

HOBBIT: A Mixed Precision Expert Offloading System for Fast MoE Inference.
CoRR, 2024

SparseTem: Boosting the Efficiency of CNN-Based Video Encoders by Exploiting Temporal Continuity.
CoRR, 2024

Inf-MLLM: Efficient Streaming Inference of Multimodal Large Language Models on a Single GPU.
CoRR, 2024

Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture.
CoRR, 2024

AutoVCoder: A Systematic Framework for Automated Verilog Code Generation using LLMs.
CoRR, 2024

vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving.
CoRR, 2024

SimGen: Simulator-conditioned Driving Scene Generation.
CoRR, 2024

Towards Fast Setup and High Throughput of GPU Serverless Computing.
CoRR, 2024

A Codesign of Scheduling and Parallelization for Large Model Training in Heterogeneous Clusters.
CoRR, 2024

CPM: A Cross-layer Power Management Facility to Enable QoS-Aware AIoT Systems.
Proceedings of the 32nd IEEE/ACM International Symposium on Quality of Service, 2024

PAS: Towards Accurate and Efficient Federated Learning with Parameter-Adaptive Synchronization.
Proceedings of the 32nd IEEE/ACM International Symposium on Quality of Service, 2024

A Tale of Two Domains: Exploring Efficient Architecture Design for Truly Autonomous Things.
Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024

Cicero: Addressing Algorithmic and Architectural Bottlenecks in Neural Rendering by Radiance Warping and Memory Optimizations.
Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024

CoCG: Fine-grained Cloud Game Co-location on Heterogeneous Platform.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

CKSM: An Efficient Memory Deduplication Method for Container-based Cloud Computing Systems.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

The Blind and the Elephant: A Preference-aware Edge Video Analytics Scheduler for Maximizing System Benefit.
Proceedings of the 53rd International Conference on Parallel Processing, 2024

FedCA: Efficient Federated Learning with Client Autonomy.
Proceedings of the 53rd International Conference on Parallel Processing, 2024

HMT: A Hybrid Mitigating and Transferring Approach on I/O Throughput Degradation for Erasure Coded Storage Systems.
Proceedings of the 53rd International Conference on Parallel Processing, 2024

M²SN: Adaptive and Dynamic Multi-modal Shortcut Network Architecture for Latency-Aware Applications.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Anole: Adapting Diverse Compressed Models for Cross-Scene Prediction on Mobile Devices.
Proceedings of the 44th IEEE International Conference on Distributed Computing Systems, 2024

HGR: A Hybrid Global Graph-Based Recovery Approach for Cloud Storage Systems with Failure and Straggler Nodes.
Proceedings of the 44th IEEE International Conference on Distributed Computing Systems, 2024

An Optimizing Framework on MLIR for Efficient FPGA-based Accelerator Generation.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2024

Embodied Understanding of Driving Scenarios.
Proceedings of the Computer Vision - ECCV 2024, 2024

Hierarchical Source-to-Post-Route QoR Prediction in High-Level Synthesis with GNNs.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2024

Extend Your Own Correspondences: Unsupervised Distant Point Cloud Registration by Progressive Distance Extension.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

RL-Cache: An Efficient Reinforcement Learning Based Cache Partitioning Approach for Multi-Tenant CDN Services.
Proceedings of the IEEE International Conference on Cluster Computing, 2024

Improving the Efficiency of Serverless Computing via Core-Level Power Management.
Proceedings of the 24th IEEE International Symposium on Cluster, 2024

FaaSMem: Improving Memory Efficiency of Serverless Computing with Memory Pool Architecture.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

FaaSGraph: Enabling Scalable, Efficient, and Cost-Effective Graph Processing with Serverless Computing.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

JUNO: Optimizing High-Dimensional Approximate Nearest Neighbour Search with Sparsity-Aware Algorithm and Ray-Tracing Core Mapping.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

Fractal: Joint Multi-Level Sparse Pattern Tuning of Accuracy and Performance for DNN Pruning.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

Amanda: Unified Instrumentation Framework for Deep Neural Networks.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

GMLake: Efficient and Transparent GPU Memory Defragmentation for Large-scale DNN Training with Virtual Memory Stitching.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
Cost-Effective Traffic Scheduling and Resource Allocation for Edge Service Provisioning.
IEEE/ACM Trans. Netw., December, 2023

Improving Cluster Utilization Through Adaptive Resource Management for Deep Neural Network and CPU Jobs Colocation.
IEEE Trans. Computers, December, 2023

Enabling Efficient Spatio-Temporal GPU Sharing for Network Function Virtualization.
IEEE Trans. Computers, October, 2023

FPGA sharing in the cloud: a comprehensive analysis.
Frontiers Comput. Sci., October, 2023

Optimizing GPU-Based Graph Sampling and Random Walk for Efficiency and Scalability.
IEEE Trans. Computers, September, 2023

Fargraph+: Excavating the parallelism of graph processing workload on RDMA-based far memory system.
J. Parallel Distributed Comput., July, 2023

ISPA: Exploiting Intra-SM Parallelism in GPUs via Fine-Grained Resource Management.
IEEE Trans. Computers, May, 2023

Meta-Learning Based Classification for Moving Object Trajectories in Mobile IoT.
IEEE Trans. Big Data, April, 2023

Async-fork: Mitigating Query Latency Spikes Incurred by the Fork-based Snapshot Mechanism from the OS Level.
Proc. VLDB Endow., 2023

Kronos: towards bus contention-aware job scheduling in warehouse scale computers.
Frontiers Comput. Sci., 2023

Adaptive CPU Resource Allocation for Emulator in Kernel-based Virtual Machine.
CoRR, 2023

Accelerating Generic Graph Neural Networks via Architecture, Compiler, Partition Method Co-Design.
CoRR, 2023

DFlow: Efficient Dataflow-based Invocation Workflow Execution for Function-as-a-Service.
CoRR, 2023

Nodens: Enabling Resource Efficient and Fast QoS Recovery of Dynamic Microservice Applications in Datacenters.
Proceedings of the 2023 USENIX Annual Technical Conference, 2023

BLAD: Adaptive Load Balanced Scheduling and Operator Overlap Pipeline For Accelerating The Dynamic GNN Training.
Proceedings of the International Conference for High Performance Computing, 2023

SMG: A System-Level Modality Gating Facility for Fast and Energy-Efficient Multimodal Computing.
Proceedings of the IEEE Real-Time Systems Symposium, 2023

High-Throughput GPU Random Walk with Fine-Tuned Concurrent Query Processing.
Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2023

Optimizing Dynamic Neural Networks with Brainstorm.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

On Efficient Packet Batching and Resource Allocation for GPU based NFV Acceleration.
Proceedings of the 31st IEEE/ACM International Symposium on Quality of Service, 2023

Improving Productivity and Efficiency of SSD Manufacturing Self-Test Process by Learning-Based Proactive Defect Prediction.
Proceedings of the IEEE International Test Conference, 2023

Architecting Efficient Multi-modal AIoT Systems.
Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023

OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization.
Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023

FIRST: Exploiting the Multi-Dimensional Attributes of Functions for Power-Aware Serverless Computing.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

APR: Online Distant Point Cloud Registration through Aggregated Point Cloud Reconstruction.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

MMBench: Benchmarking End-to-End Multi-modal DNNs and Understanding Their Hardware-Software Implications.
Proceedings of the IEEE International Symposium on Workload Characterization, 2023

PAC: Preference-Aware Co-location Scheduling on Heterogeneous NUMA Architectures To Improve Resource Utilization.
Proceedings of the 37th International Conference on Supercomputing, 2023

DW-LRC: A Dynamic Wide-stripe LRC Codes for Blockchain Data Under Malicious Node Scenarios.
Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023

PMR: Priority Memory Reclaim to Improve the Performance of Latency-Critical Services.
Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023

Microless: Cost-Efficient Hybrid Deployment of Microservices on IaaS VMs and Serverless.
Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023

Density-invariant Features for Distant Point Cloud Registration.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

STAG: Enabling Low Latency and Low Staleness of GNN-based Services with Dynamic Graphs.
Proceedings of the 41st IEEE International Conference on Computer Design, 2023

MMExit: Enabling Fast and Efficient Multi-modal DNN Inference with Adaptive Network Exits.
Proceedings of the Euro-Par 2023: Parallel Processing - 29th International Conference on Parallel and Distributed Computing, Limassol, Cyprus, August 28, 2023

MARS: Exploiting Multi-Level Parallelism for DNN Workloads on Adaptive Multi-Accelerator Systems.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

MonoATT: Online Monocular 3D Object Detection with Adaptive Token Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Maximizing the Utilization of GPUs Used by Cloud Gaming through Adaptive Co-location with Combo.
Proceedings of the 2023 ACM Symposium on Cloud Computing, SoCC 2023, 2023

Not All Resources are Visible: Exploiting Fragmented Shadow Resources in Shared-State Scheduler Architecture.
Proceedings of the 2023 ACM Symposium on Cloud Computing, SoCC 2023, 2023

DistSim: A performance model of large-scale hybrid distributed DNN training.
Proceedings of the 20th ACM International Conference on Computing Frontiers, 2023

AdaptGear: Accelerating GNN Training via Adaptive Subgraph-Level Kernels on GPUs.
Proceedings of the 20th ACM International Conference on Computing Frontiers, 2023

Efficient Scheduler Live Update for Linux Kernel with Modularization.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

DataFlower: Exploiting the Data-flow Paradigm for Serverless Workflow Orchestration.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

uGrapher: High-Performance Graph Operator Computation via Unified Abstraction for Graph Neural Networks.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
Identifying patients with Crohn's disease at high risk of primary nonresponse to infliximab using a radiomic-clinical model.
Int. J. Intell. Syst., December, 2022

The Serverless Computing Survey: A Technical Primer for Design Architecture.
ACM Comput. Surv., January, 2022

Online Thread Auto-Tuning for Performance Improvement and Resource Saving.
IEEE Trans. Parallel Distributed Syst., 2022

Efficient and Secure Deep Learning Inference in Trusted Processor Enabled Edge Clouds.
IEEE Trans. Parallel Distributed Syst., 2022

Adaptive Resource Efficient Microservice Deployment in Cloud-Edge Continuum.
IEEE Trans. Parallel Distributed Syst., 2022

PeerProbe: Estimating Vehicular Neighbor Distribution With Adaptive Compressive Sensing.
IEEE/ACM Trans. Netw., 2022

Exploiting big.LITTLE Batteries for Software Defined Management on Mobile Devices.
IEEE Trans. Mob. Comput., 2022

Integrated Power Anomaly Defense: Towards Oversubscription-Safe Data Centers.
IEEE Trans. Cloud Comput., 2022

Tapping into NFV Environment for Opportunistic Serverless Edge Function Deployment.
IEEE Trans. Computers, 2022

Toward QoS-Awareness and Improved Utilization of Spatial Multitasking GPUs.
IEEE Trans. Computers, 2022

Efficient Trustworthiness Management for Malicious User Detection in Big Data Collection.
IEEE Trans. Big Data, 2022

Reliability and Incentive of Performance Assessment for Decentralized Clouds.
J. Comput. Sci. Technol., 2022

Embedding-Based Similarity Computation for Massive Vehicle Trajectory Data.
IEEE Internet Things J., 2022

Preference-Aware Edge Server Placement in the Internet of Things.
IEEE Internet Things J., 2022

Modeling feature interactions for context-aware QoS prediction of IoT services.
Future Gener. Comput. Syst., 2022

Performance optimization for cloud computing systems in the microservice era: state-of-the-art and research opportunities.
Frontiers Comput. Sci., 2022

Efficient Activation Quantization via Adaptive Rounding Border for Post-Training Quantization.
CoRR, 2022

Oversubscribing GPU Unified Virtual Memory: Implications and Suggestions.
Proceedings of the ICPE '22: ACM/SPEC International Conference on Performance Engineering, Beijing, China, April 9, 2022

Help Rather Than Recycle: Alleviating Cold Startup in Serverless Computing Through Inter-Function Container Sharing.
Proceedings of the 2022 USENIX Annual Technical Conference, 2022

RunD: A Lightweight Secure Container Runtime for High-density Deployment and High-concurrency Startup in Serverless Computing.
Proceedings of the 2022 USENIX Annual Technical Conference, 2022

DVABatch: Diversity-aware Multi-Entry Multi-Exit Batching for Efficient Processing of DNN Services on GPUs.
Proceedings of the 2022 USENIX Annual Technical Conference, 2022

PilotFish: Harvesting Free Cycles of Cloud Gaming with Deep Learning Training.
Proceedings of the 2022 USENIX Annual Technical Conference, 2022

XHR-Code: An Efficient Wide Stripe Erasure Code to Reduce Cross-Rack Overhead in Cloud Storage Systems.
Proceedings of the 41st International Symposium on Reliable Distributed Systems, 2022

QoS-Aware Irregular Collaborative Inference for Improving Throughput of DNN Services.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

Performance Improvement Validation of Decision Tree Algorithms with Non-normalized Information Distance in Experiments.
Proceedings of the PRICAI 2022: Trends in Artificial Intelligence, 2022

Cloud-Native Server Consolidation for Energy-Efficient FaaS Deployment.
Proceedings of the Network and Parallel Computing, 2022

MoGDE: Boosting Mobile Monocular 3D Object Detection with Ground Depth Estimation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Towards Reliable AI Applications via Algorithm-Based Fault Tolerance on NVDLA.
Proceedings of the 18th International Conference on Mobility, Sensing and Networking, 2022

ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network Quantization.
Proceedings of the 55th IEEE/ACM International Symposium on Microarchitecture, 2022

PRM: An Efficient Partial Recovery Method to Accelerate Training Data Reconstruction for Distributed Deep Learning Applications in Cloud Storage Systems.
Proceedings of the 30th IEEE/ACM International Symposium on Quality of Service, 2022

Excavating the Potential of Graph Workload on RDMA-based Far Memory Architecture.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

Exploring Efficient Microservice Level Parallelism.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

QoS-awareness of Microservices with Excessive Loads via Inter-Datacenter Scheduling.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

CSC: Collaborative System Configuration for I/O-Intensive Applications in Multi-Tenant Clouds.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

PAME: precision-aware multi-exit DNN serving for reducing latencies of batched inferences.
Proceedings of the ICS '22: 2022 International Conference on Supercomputing, Virtual Event, June 28, 2022

ERP: An Efficient Rewrite Scheme to Improve the Inline Deduplication Restore Performance in Backup Systems.
Proceedings of the 28th IEEE International Conference on Parallel and Distributed Systems, 2022

SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

mmV2V: Combating One-hop Multicasting in Millimeter-wave Vehicular Networks.
Proceedings of the 42nd IEEE International Conference on Distributed Computing Systems, 2022

LoADPart: Load-Aware Dynamic Partition of Deep Neural Networks for Edge Offloading.
Proceedings of the 42nd IEEE International Conference on Distributed Computing Systems, 2022

HyFarM: Task Orchestration on Hybrid Far Memory for High Performance Per Bit.
Proceedings of the IEEE 40th International Conference on Computer Design, 2022

GRPU: An Efficient Graph-based Cross-Rack Parallel Update Scheme for Cloud Storage Systems.
Proceedings of the IEEE 40th International Conference on Computer Design, 2022

RCS: A Redirection Computational Scheduler to Accelerate Straggler Recovery for Erasure Coded Cloud Storage System.
Proceedings of the IEEE 40th International Conference on Computer Design, 2022

Nesting Forward Automatic Differentiation for Memory-Efficient Deep Neural Network Training.
Proceedings of the IEEE 40th International Conference on Computer Design, 2022

Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2022

SALO: an efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences.
Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

Characterizing and orchestrating VM reservation in geo-distributed clouds to improve the resource efficiency.
Proceedings of the 13th Symposium on Cloud Computing, SoCC 2022, 2022

Astraea: towards QoS-aware and resource-efficient multi-stage GPU services.
Proceedings of the ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022, 2022

VELTAIR: towards high-performance multi-tenant deep learning services via adaptive compilation and scheduling.
Proceedings of the ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022, 2022

FaaSFlow: enable efficient workflow execution for function-as-a-service.
Proceedings of the ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022, 2022

Transkimmer: Transformer Learns to Layer-wise Skim.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Block-Skim: Efficient Question Answering for Transformer.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Petrel: Heterogeneity-Aware Distributed Deep Learning Via Hybrid Synchronization.
IEEE Trans. Parallel Distributed Syst., 2021

Adaptive Preference-Aware Co-Location for Improving Resource Utilization of Power Constrained Datacenters.
IEEE Trans. Parallel Distributed Syst., 2021

E²bird: Enhanced Elastic Batch for Improving Responsiveness and Throughput of Deep Learning Services.
IEEE Trans. Parallel Distributed Syst., 2021

Learning Graph Representation With Generative Adversarial Nets.
IEEE Trans. Knowl. Data Eng., 2021

Flexible Aggregate Nearest Neighbor Queries and its Keyword-Aware Variant on Road Networks.
IEEE Trans. Knowl. Data Eng., 2021

Falcon: Addressing Stragglers in Heterogeneous Parameter Server Via Multiple Parallelism.
IEEE Trans. Computers, 2021

Grus: Toward Unified-memory-efficient High-performance Graph Processing on GPU.
ACM Trans. Archit. Code Optim., 2021

Pagurus: Eliminating Cold Startup in Serverless Computing with Inter-Action Container Sharing.
CoRR, 2021

ZIPPER: Exploiting Tile- and Operator-level Parallelism for General and Scalable Graph Neural Network Acceleration.
CoRR, 2021

A dynamic network traffic classifier using supervised ML for a Docker-based SDN network.
Connect. Sci., 2021

A Comprehensive Inspection of the Straggler Problem.
Computer, 2021

Editorial for the special issue on high performance distributed computing.
CCF Trans. High Perform. Comput., 2021

Enable simultaneous DNN services based on deterministic operator overlap and precise latency prediction.
Proceedings of the International Conference for High Performance Computing, 2021

EC-Scheduler: A Load-Balanced Scheduler to Accelerate the Straggler Recovery for Erasure Coded Storage Systems.
Proceedings of the 29th IEEE/ACM International Symposium on Quality of Service, 2021

BiPS: Hotness-aware Bi-tier Parameter Synchronization for Recommendation Models.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

AuTraScale: An Automated and Transfer Learning Solution for Streaming System Auto-Scaling.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

Rack-Scaling: An efficient rack-based redistribution method to accelerate the scaling of cloud disk arrays.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

AlphaR: Learning-Powered Resource Management for Irregular, Dynamic Microservice Graph.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

QoS-Aware and Resource Efficient Microservice Deployment in Cloud-Edge Continuum.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

Distributed Neighbor Distribution Estimation with Adaptive Compressive Sensing in VANETs.
Proceedings of the 40th IEEE Conference on Computer Communications, 2021

Characterizing and Demystifying the Implicit Convolution Algorithm on Commercial Matrix-Multiplication Accelerators.
Proceedings of the IEEE International Symposium on Workload Characterization, 2021

Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection.
Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021

Spring Buddy: A Self-Adaptive Elastic Memory Management Scheme for Efficient Concurrent Allocation/Deallocation in Cloud Computing Systems.
Proceedings of the 27th IEEE International Conference on Parallel and Distributed Systems, 2021

TempNet: Online Semantic Segmentation on Large-scale Point Cloud Series.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

CHARM: Collaborative Host and Accelerator Resource Management for GPU Datacenters.
Proceedings of the 39th IEEE International Conference on Computer Design, 2021

Exploiting Intra-SM Parallelism in GPUs via Persistent and Elastic Blocks.
Proceedings of the 39th IEEE International Conference on Computer Design, 2021

Lazy-WL: A Wear-aware Load Balanced Data Redistribution Method for Efficient SSD Array Scaling.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

Lasagna: Accelerating Secure Deep Learning Inference in SGX-enabled Edge Cloud.
Proceedings of the SoCC '21: ACM Symposium on Cloud Computing, 2021

Skywalker: Efficient Alias-Method-Based Graph Sampling and Random Walk on GPUs.
Proceedings of the 30th International Conference on Parallel Architectures and Compilation Techniques, 2021

2020
A Cyclic Game for Service-Oriented Resource Allocation in Edge Computing.
IEEE Trans. Serv. Comput., 2020

Modeling Latent Relation to Boost Things Categorization Service.
IEEE Trans. Serv. Comput., 2020

Learning User Preference from Heterogeneous Information for Store-Type Recommendation.
IEEE Trans. Serv. Comput., 2020

Renewable Energy-Aware Big Data Analytics in Geo-Distributed Data Centers with Reinforcement Learning.
IEEE Trans. Netw. Sci. Eng., 2020

Skia: Scalable and Efficient In-Memory Analytics for Big Spatial-Textual Data.
IEEE Trans. Knowl. Data Eng., 2020

Incremental Throughput Allocation of Heterogeneous Storage With No Disruptions in Dynamic Setting.
IEEE Trans. Computers, 2020

eXnet: An Efficient Approach for Emotion Recognition in the Wild.
Sensors, 2020

Joint Topic-Semantic-aware Social Matrix Factorization for online voting recommendation.
Knowl. Based Syst., 2020

Predicting and reining in application-level slowdown on spatial multitasking GPUs.
J. Parallel Distributed Comput., 2020

Probabilistic robust regression with adaptive weights - a case study on face recognition.
Frontiers Comput. Sci., 2020

Towards QoS-Aware and Resource-Efficient GPU Microservices Based on Spatial Multitasking GPUs In Datacenters.
CoRR, 2020

Survey and design of paleozoic: a high-performance compiler tool chain for deep learning inference accelerator.
CCF Trans. High Perform. Comput., 2020

Editorial for the special issue on operating systems and programming systems for HPC.
CCF Trans. High Perform. Comput., 2020

Architectural Implications of Graph Neural Networks.
IEEE Comput. Archit. Lett., 2020

Spool: Reliable Virtualized NVMe Storage Pool in Public Cloud Infrastructure.
Proceedings of the 2020 USENIX Annual Technical Conference, 2020

AZ-Recovery: An Efficient Crossing-AZ Recovery Scheme for Erasure Coded Cloud Storage Systems.
Proceedings of the International Symposium on Reliable Distributed Systems, 2020

ANT-man: towards agile power management in the microservice era.
Proceedings of the International Conference for High Performance Computing, 2020

Alita: comprehensive performance isolation through bias resource management for public clouds.
Proceedings of the International Conference for High Performance Computing, 2020

Accelerating sparse DNN models without hardware-support via tile-wise sparsity.
Proceedings of the International Conference for High Performance Computing, 2020

Ptolemy: Architecture Support for Robust Deep Learning.
Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

DLFusion: An Auto-Tuning Compiler for Layer Fusion on Deep Neural Network Accelerator.
Proceedings of the IEEE International Conference on Parallel & Distributed Processing with Applications, 2020

Sturgeon: Preference-aware Co-location for Improving Utilization of Power Constrained Computers.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

Amoeba: QoS-Awareness and Reduced Resource Usage of Microservices with Serverless Computing.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

EC-Fusion: An Efficient Hybrid Erasure Coding Framework to Improve Both Application and Recovery Performance in Cloud Storage Systems.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

OVERSEE: Outsourcing Verification to Enable Resource Sharing in Edge Environment.
Proceedings of the ICPP 2020: 49th International Conference on Parallel Processing, 2020

URSA: Precise Capacity Planning and Fair Scheduling based on Low-level Statistics for Public Clouds.
Proceedings of the ICPP 2020: 49th International Conference on Parallel Processing, 2020

Petrel: Community-aware Synchronous Parallel for Heterogeneous Parameter Server.
Proceedings of the 40th IEEE International Conference on Distributed Computing Systems, 2020

CODA: Improving Resource Utilization by Slimming and Co-locating DNN and CPU Jobs.
Proceedings of the 40th IEEE International Conference on Distributed Computing Systems, 2020

FAGR: An Efficient File-aware Graph Recovery Scheme for Erasure Coded Cloud Storage Systems.
Proceedings of the 38th IEEE International Conference on Computer Design, 2020

Asymmetric Resilience: Exploiting Task-Level Idempotency for Transient Error Recovery in Accelerator-Based Systems.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2020

Balancing Efficiency and Flexibility for DNN Acceleration via Temporal GPU-Systolic Array Integration.
Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

How Far Does BERT Look At: Distance-based Clustering and Analysis of BERT's Attention.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019
Personalized Exposure Control Using Adaptive Metering and Reinforcement Learning.
IEEE Trans. Vis. Comput. Graph., 2019

Improving Power Efficiency for Online Video Streaming Service: A Self-Adaptive Approach.
IEEE Trans. Sustain. Comput., 2019

Making Big Data Open in Edges: A Resource-Efficient Blockchain-Based Approach.
IEEE Trans. Parallel Distributed Syst., 2019

CongraPlus: Towards Efficient Processing of Concurrent Graph Queries on NUMA Machines.
IEEE Trans. Parallel Distributed Syst., 2019

Exploring High-Order User Preference on the Knowledge Graph for Recommender Systems.
ACM Trans. Inf. Syst., 2019

Fast Coflow Scheduling via Traffic Compression and Stage Pipelining in Datacenter Networks.
IEEE Trans. Computers, 2019

DR Refresh: Releasing DRAM Potential by Enabling Read Accesses Under Refresh.
IEEE Trans. Computers, 2019

Bandwidth and Locality Aware Task-stealing for Manycore Architectures with Bandwidth-Asymmetric Memory.
ACM Trans. Archit. Code Optim., 2019

TACD: A throughput allocation method based on variant of Cobb-Douglas for hybrid storage system.
J. Parallel Distributed Comput., 2019

CATIRI: An Efficient Method for Content-and-Text Based Image Retrieval.
J. Comput. Sci. Technol., 2019

A Comprehensive Survey of Blockchain: From Theory to IoT Applications and Beyond.
IEEE Internet Things J., 2019

PAM: an efficient power-aware multilevel cache policy to reduce energy consumption of storage systems.
Frontiers Comput. Sci., 2019

URSA: Precise Capacity Planning and Contention-aware Scheduling for Public Clouds.
CoRR, 2019

Position-Aware Convolutional Networks for Traffic Prediction.
CoRR, 2019

Multi-Task Feature Learning for Knowledge Graph Enhanced Recommendation.
Proceedings of the World Wide Web Conference, 2019

Knowledge Graph Convolutional Networks for Recommender Systems.
Proceedings of the World Wide Web Conference, 2019

A Comprehensive Rearranging Priority Based Method To Accelerate the Reconstruction of RAID Arrays.
Proceedings of the 38th International Symposium on Reliable Distributed Systems Workshops, 2019

Characterizing Perception Module Performance and Robustness in Production-Scale Autonomous Driving System.
Proceedings of the Network and Parallel Computing, 2019

AZ-Code: An Efficient Availability Zone Level Erasure Code to Provide High Fault Tolerance in Cloud Storage Systems.
Proceedings of the 35th Symposium on Mass Storage Systems and Technologies, 2019

Characterizing and orchestrating NFV-ready servers for efficient edge data processing.
Proceedings of the International Symposium on Quality of Service, 2019

SprintCon: Controllable and Efficient Computational Sprinting for Data Center Servers.
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Themis: Predicting and Reining in Application-Level Slowdown on Spatial Multitasking GPUs.
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Excavating the Potential of GPU for Accelerating Graph Traversal.
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Optimizing the Parity Check Matrix for Efficient Decoding of RS-Based Cloud Storage Systems.
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Laius: Towards latency awareness and improved utilization of spatial multitasking accelerators in datacenters.
Proceedings of the ACM International Conference on Supercomputing, 2019

Avalon: towards QoS awareness and improved utilization through multi-resource management in datacenters.
Proceedings of the ACM International Conference on Supercomputing, 2019

Approximate Code: A Cost-Effective Erasure Coding Framework for Tiered Video Storage in Cloud Systems.
Proceedings of the 48th International Conference on Parallel Processing, 2019

When Power Oversubscription Meets Traffic Flood Attack: Re-Thinking Data Center Peak Load Management.
Proceedings of the 48th International Conference on Parallel Processing, 2019

Unleashing the Scalability Potential of Power-Constrained Data Center in the Microservice Era.
Proceedings of the 48th International Conference on Parallel Processing, 2019

Characterizing and Balancing the Workloads of Semi-Containerized Clouds.
Proceedings of the 25th IEEE International Conference on Parallel and Distributed Systems, 2019

Optimizing the Aggregated Throughput of GPUs in Public Clouds Based on Adaptive Kernel Reordering.
Proceedings of the 25th IEEE International Conference on Parallel and Distributed Systems, 2019

Falcon: Towards Computation-Parallel Deep Learning in Heterogeneous Parameter Server.
Proceedings of the 39th IEEE International Conference on Distributed Computing Systems, 2019

A Cyclic Game for Joint Cooperation and Competition of Edge Resource Allocation.
Proceedings of the 39th IEEE International Conference on Distributed Computing Systems, 2019

Service Demand Prediction with Incomplete Historical Data.
Proceedings of the 39th IEEE International Conference on Distributed Computing Systems, 2019

Adversarial Defense Through Network Profiling Based Path Extraction.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

PSL: Exploiting Parallelism, Sparsity and Locality to Accelerate Matrix Factorization on x86 Platforms.
Proceedings of the Benchmarking, Measuring, and Optimizing, 2019

POSTER: Precise Capacity Planning for Database Public Clouds.
Proceedings of the 28th International Conference on Parallel Architectures and Compilation Techniques, 2019

2018
Power consumption analysis of video streaming in 4G LTE networks.
Wirel. Networks, 2018

MeLoDy: A Long-Term Dynamic Quality-Aware Incentive Mechanism for Crowdsourcing.
IEEE Trans. Parallel Distributed Syst., 2018

Top-k Critical Vertices Query on Shortest Path.
IEEE Trans. Knowl. Data Eng., 2018

A Dynamical and Load-Balanced Flow Scheduling Approach for Big Data Centers in Clouds.
IEEE Trans. Cloud Comput., 2018

CNFET-Based High Throughput SIMD Architecture.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2018

Contention and Locality-Aware Work-Stealing for Iterative Applications in Multi-Socket Computers.
IEEE Trans. Computers, 2018

Optimizing power consumption of mobile devices for video streaming over 4G LTE networks.
Peer-to-Peer Netw. Appl., 2018

DCF: A Dataflow-Based Collaborative Filtering Training Algorithm.
Int. J. Parallel Program., 2018

HSCS: a hybrid shared cache scheduling scheme for multiprogrammed workloads.
Frontiers Comput. Sci., 2018

Ripple Network: Propagating User Preferences on the Knowledge Graph for Recommender Systems.
CoRR, 2018

Personalized Attention-Aware Exposure Control Using Reinforcement Learning.
CoRR, 2018

Learning Human Activities through Wi-Fi Channel State Information with Multiple Access Points.
IEEE Commun. Mag., 2018

An Efficient Graph Query Framework with Structural Recursion.
Comput. J., 2018

Toward multi-programmed workloads with different memory footprints: a self-adaptive last level cache scheduling scheme.
Sci. China Inf. Sci., 2018

KSM: Online Application-Level Performance Slowdown Prediction for Spatial Multitasking GPGPU.
IEEE Comput. Archit. Lett., 2018

QoE-driven big data management in pervasive edge computing environment.
Big Data Min. Anal., 2018

DKN: Deep Knowledge-Aware Network for News Recommendation.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

SHINE: Signed Heterogeneous Information Network Embedding for Sentiment Link Prediction.
Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, 2018

Swallow: Joint Online Scheduling and Coflow Compression in Datacenter Networks.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Rebalance Modern Bike Sharing System: Spatio-Temporal Data Prediction and Path Planning for Multiple Carriers.
Proceedings of the 24th IEEE International Conference on Parallel and Distributed Systems, 2018

Deep learning based classification for paddy pests & diseases recognition.
Proceedings of 2018 International Conference on Mathematics and Artificial Intelligence, 2018

Flexible Aggregate Nearest Neighbor Queries in Road Networks.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

Power Grab in Aggressively Provisioned Data Centers: What is the Risk and What Can Be Done About It.
Proceedings of the 36th IEEE International Conference on Computer Design, 2018

DR DRAM: Accelerating Memory-Read-Intensive Applications.
Proceedings of the 36th IEEE International Conference on Computer Design, 2018

CLIBE: Precise Cluster-Level I/O Bandwidth Enforcement in Distributed File System.
Proceedings of the 20th IEEE International Conference on High Performance Computing and Communications; 16th IEEE International Conference on Smart City; 4th IEEE International Conference on Data Science and Systems, 2018

Fine-Grained Location Recommendation Based on User Textual Reviews in LBSNs.
Proceedings of the Green, Pervasive, and Cloud Computing - 13th International Conference, 2018

Distributed In-Memory Analytics for Big Temporal Data.
Proceedings of the Database Systems for Advanced Applications, 2018

RippleNet: Propagating User Preferences on the Knowledge Graph for Recommender Systems.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Jointly Modeling Structural and Textual Representation for Knowledge Graph Completion in Zero-Shot Scenario.
Proceedings of the Web and Big Data - Second International Joint Conference, 2018

GraphGAN: Graph Representation Learning With Generative Adversarial Nets.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Deep Representation-Decoupling Neural Networks for Monaural Music Mixture Separation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Delay-Minimized Routing in Mobile Cognitive Networks for Time-Critical Applications.
IEEE Trans. Ind. Informatics, 2017

GraphLoc: a graph-based method for indoor subarea localization with zero-configuration.
Pers. Ubiquitous Comput., 2017

Reverse Furthest Neighbors Query in Road Networks.
J. Comput. Sci. Technol., 2017

A Hint Frequency Based Approach to Enhancing the I/O Performance of Multilevel Cache Storage Systems.
J. Comput. Sci. Technol., 2017

Smart Infrastructure Design for Smart Cities.
IT Prof., 2017

The improved indoor localisation algorithm based on wireless sensor network.
Int. J. Comput. Sci. Eng., 2017

Mobile Crowdsensing in Software Defined Opportunistic Networks.
IEEE Commun. Mag., 2017

Re2l: An efficient output-sensitive algorithm for computing Boolean operations on circular-arc polygons and its applications.
Comput. Aided Des., 2017

TransT: Type-Based Multiple Embedding Representations for Knowledge Graph Completion.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2017

Understanding customer behaviour in urban shopping mall from WiFi logs.
Proceedings of the 2017 IEEE International Conference on Pervasive Computing and Communications Workshops, 2017

Reinforcement learning-based adaptive resource management of differentiated services in geo-distributed data centers.
Proceedings of the 25th IEEE/ACM International Symposium on Quality of Service, 2017

Electro: Toward QoS-Aware Power Management for Latency-Critical Applications.
Proceedings of the 2017 IEEE International Symposium on Parallel and Distributed Processing with Applications and 2017 IEEE International Conference on Ubiquitous Computing and Communications (ISPA/IUCC), 2017

Preemption-Aware Kernel Scheduling for GPUs.
Proceedings of the 2017 IEEE International Symposium on Parallel and Distributed Processing with Applications and 2017 IEEE International Conference on Ubiquitous Computing and Communications (ISPA/IUCC), 2017

Quality of Service Support for Fine-Grained Sharing on GPUs.
Proceedings of the 44th Annual International Symposium on Computer Architecture, 2017

Favorable Block First: A Comprehensive Cache Scheme to Accelerate Partial Stripe Recovery of Triple Disk Failure Tolerant Arrays.
Proceedings of the 46th International Conference on Parallel Processing, 2017

Joint Topic-Semantic-aware Social Recommendation for Online Voting.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Prophet: Precise QoS Prediction on Non-Preemptive Accelerators to Improve Utilization in Warehouse-Scale Computers.
Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems, 2017

Task Scheduling for Multi-core and Parallel Architectures - Challenges, Solutions and Perspectives
Springer, ISBN: 978-981-10-6237-7, 2017

2016
Mobility Prediction Based Joint Stable Routing and Channel Assignment for Mobile Ad Hoc Cognitive Networks.
IEEE Trans. Parallel Distributed Syst., 2016

On Traffic-Aware Partition and Aggregation in MapReduce for Big Data Applications.
IEEE Trans. Parallel Distributed Syst., 2016

Joint Optimization of Lifetime and Transport Delay under Reliability Constraint Wireless Sensor Networks.
IEEE Trans. Parallel Distributed Syst., 2016

HyperspaceFlow: A System-Level Design Methodology for Smart Space.
IEEE Trans. Emerg. Top. Comput., 2016

Pricing and Repurchasing for Big Data Processing in Multi-Clouds.
IEEE Trans. Emerg. Top. Comput., 2016

A context-aware search system for Internet of Things based on hierarchical context model.
Telecommun. Syst., 2016

LSCD: A Low-Storage Clone Detection Protocol for Cyber-Physical Systems.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2016

A Feasible IP Traceback Framework through Dynamic Deterministic Packet Marking.
IEEE Trans. Computers, 2016

Rank-Aware Dynamic Migrations and Adaptive Demotions for DRAM Power Management.
IEEE Trans. Computers, 2016

Long-term location privacy protection for location-based services in mobile cloud computing.
Soft Comput., 2016

Real-Time Locating Systems Using Active RFID for Internet of Things.
IEEE Syst. J., 2016

Mobile Target Detection in Wireless Sensor Networks With Adjustable Sensing Frequency.
IEEE Syst. J., 2016

A Social-Network-Optimized Taxi-Sharing Service.
IT Prof., 2016

SMe: explicit & implicit constrained-space probabilistic threshold range queries for moving objects.
GeoInformatica, 2016

Adaptive demand-aware work-stealing in multi-programmed multi-core architectures.
Concurr. Comput. Pract. Exp., 2016

Simultaneous Multikernel: Fine-Grained Sharing of GPUs.
IEEE Comput. Archit. Lett., 2016

How video streaming consumes power in 4G LTE networks.
Proceedings of the 17th IEEE International Symposium on A World of Wireless, 2016

Primary user activity prediction based joint topology control and stable routing in mobile cognitive networks.
Proceedings of the IEEE Wireless Communications and Networking Conference, 2016

A Graph-Based Method for Indoor Subarea Localization with Zero-Configuration.
Proceedings of the 2016 Intl IEEE Conferences on Ubiquitous Intelligence & Computing, 2016

Online Credit Card Fraud Detection: A Hybrid Framework with Big Data Technologies.
Proceedings of the 2016 IEEE Trustcom/BigDataSE/ISPA, 2016

Towards Scalable and Reliable In-Memory Storage System: A Case Study with Redis.
Proceedings of the 2016 IEEE Trustcom/BigDataSE/ISPA, 2016

Simba: Efficient In-Memory Spatial Analytics.
Proceedings of the 2016 International Conference on Management of Data, 2016

Profiling energy consumption of DASH video streaming over 4G LTE networks.
Proceedings of the 8th International Workshop on Mobile Video, 2016

Power Attack Defense: Securing Battery-Backed Data Centers.
Proceedings of the 43rd ACM/IEEE Annual International Symposium on Computer Architecture, 2016

Zero-Chunk: An Efficient Cache Algorithm to Accelerate the I/O Processing of Data Deduplication.
Proceedings of the 22nd IEEE International Conference on Parallel and Distributed Systems, 2016

Practical private shortest path computation based on Oblivious Storage.
Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016

BDR: A Balanced Data Redistribution scheme to accelerate the scaling process of XOR-based Triple Disk Failure Tolerant arrays.
Proceedings of the 34th IEEE International Conference on Computer Design, 2016

SAWS: Selective Asymmetry-Aware Work-Stealing for Asymmetric Multi-core Architectures.
Proceedings of the 18th IEEE International Conference on High Performance Computing and Communications; 14th IEEE International Conference on Smart City; 2nd IEEE International Conference on Data Science and Systems, 2016

Simultaneous Multikernel GPU: Multi-tasking throughput processors via fine-grained sharing.
Proceedings of the 2016 IEEE International Symposium on High Performance Computer Architecture, 2016

Simba: spatial in-memory big data analysis.
Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, GIS 2016, Burlingame, California, USA, October 31, 2016

2015
Scalable Multicore k-NN Search via Subspace Clustering for Filtering.
IEEE Trans. Parallel Distributed Syst., 2015

Secrecy Capacity Optimization via Cooperative Relaying and Jamming for WANETs.
IEEE Trans. Parallel Distributed Syst., 2015

Probabilistic Range Query over Uncertain Moving Objects in Constrained Two-Dimensional Space.
IEEE Trans. Knowl. Data Eng., 2015

Synergy of Dynamic Frequency Scaling and Demotion on DRAM Power Management: Models and Optimizations.
IEEE Trans. Computers, 2015

Locality-Aware Work Stealing Based on Online Profiling and Auto-Tuning for Multisocket Multicore Architectures.
ACM Trans. Archit. Code Optim., 2015

OFScheduler: A Dynamic Network Optimizer for MapReduce in Heterogeneous Cluster.
Int. J. Parallel Program., 2015

Joint channel assignment, stable routing and adaptive power control in mobile cognitive networks.
Proceedings of the 2015 IEEE Wireless Communications and Networking Conference, 2015

Joint rate, channel and route selection for cognitive radio ad hoc networks.
Proceedings of the 2015 IEEE Wireless Communications and Networking Conference, 2015

PCM: A Parity-Check Matrix Based Approach to Improve Decoding Performance of XOR-based Erasure Codes.
Proceedings of the 34th IEEE Symposium on Reliable Distributed Systems, 2015

Parallelism vs. speculation: exploiting speculative genetic algorithm on GPU.
Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores, 2015

Fast Proof Generation for Verifying Cloud Search.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

Efficient Selection Algorithm for Fast k-NN Search on GPUs.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

Code 5-6: An Efficient MDS Array Coding Scheme to Accelerate Online RAID Level Migration.
Proceedings of the 44th International Conference on Parallel Processing, 2015

Unsupervised Extraction of Video Highlights via Robust Recurrent Auto-Encoders.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

EH-Code: An Extended MDS Code to Improve Single Write Performance of Disk Arrays for Correcting Triple Disk Failures.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2015

Joint Routing and Channel Assignment for Delay Minimization in Multi-Channel Multi-Flow Mobile Cognitive Ad Hoc Networks.
Proceedings of the 2015 IEEE Global Communications Conference, 2015

TIP-Code: A Three Independent Parity Code to Tolerate Triple Disk Failures with Optimal Update Complexity.
Proceedings of the 45th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2015

BPS: A Balanced Partial Stripe Write Scheme to Improve the Write Performance of RAID-6.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

Cowic: A Column-Wise Independent Compression for Log Stream Analysis.
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015

2014
Loop Transforming for Reducing Data Alignment on Multi-Core SIMD Processors.
J. Signal Process. Syst., 2014

On the Multicast Lifetime of WANETs with Multibeam Antennas: Formulation, Algorithms, and Analysis.
IEEE Trans. Computers, 2014

Adaptive workload-aware task scheduling for single-ISA asymmetric multicore architectures.
ACM Trans. Archit. Code Optim., 2014

Modeling and Defending against Adaptive BitTorrent Worms in Peer-to-Peer Networks.
ACM Trans. Auton. Adapt. Syst., 2014

CPU + GPU scheduling with asymptotic profiling.
Parallel Comput., 2014

A calibration algorithm for maze micromouse continuous smooth turning.
Int. J. Embed. Syst., 2014

Architecture-based design and optimization of genetic algorithms on multi- and many-core systems.
Future Gener. Comput. Syst., 2014

LSShare: an efficient multiple query optimization system in the cloud.
Distributed Parallel Databases, 2014

Supervised hashing with latent factor models.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

DWS: Demand-aware Work-Stealing in Multi-programmed Multi-core Architectures.
Proceedings of the 2014 PPOPP International Workshop on Programming Models and Applications for Multicores and Manycores, 2014

Energy efficient data access and storage through HW/SW co-design.
Proceedings of the SIGPLAN/SIGBED Conference on Languages, 2014

EEWA: Energy-Efficient Workload-Aware Task Scheduling in Multi-core Architectures.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

LAWS: locality-aware work-stealing for multi-socket multi-core architectures.
Proceedings of the 2014 International Conference on Supercomputing, 2014

HFA: A Hint Frequency-based approach to enhance the I/O performance of multi-level cache storage systems.
Proceedings of the 20th IEEE International Conference on Parallel and Distributed Systems, 2014

Data filtering for scalable high-dimensional k-NN search on multicore systems.
Proceedings of the 23rd International Symposium on High-Performance Parallel and Distributed Computing, 2014

A Knowledge Based Approach for Tackling Mislabeled Multi-class Big Social Data.
Proceedings of the Semantic Web: Trends and Challenges - 11th International Conference, 2014

Welcome from DSAA 2014 chairs.
Proceedings of the International Conference on Data Science and Advanced Analytics, 2014

CSF protein dynamic driver network: At the crossroads of brain tumorigenesis.
Proceedings of the 2014 IEEE International Conference on Bioinformatics and Biomedicine, 2014

A scalable and topology configurable protocol for distributed parameter synchronization.
Proceedings of the Asia-Pacific Workshop on Systems, 2014

SRP: A routing protocol for data center networks.
Proceedings of the 16th Asia-Pacific Network Operations and Management Symposium, 2014

2013
Adaptive Cache Aware Bitier Work-Stealing in Multisocket Multicore Architectures.
IEEE Trans. Parallel Distributed Syst., 2013

Decentralized checking of context inconsistency in pervasive computing environments.
J. Supercomput., 2013

Semi-sparse algorithm based on multi-layer optimization for recommender system.
J. Supercomput., 2013

HAT: history-based auto-tuning MapReduce in heterogeneous environments.
J. Supercomput., 2013

Scheduling Co-Design for Reliability and Energy in Cyber-Physical Systems.
IEEE Trans. Emerg. Top. Comput., 2013

Hybrid CPU Management for Adapting to the Diversity of Virtual Machines.
IEEE Trans. Computers, 2013

Tag-based personalized image ranking in event browsing.
Peer-to-Peer Netw. Appl., 2013

Survey on context-awareness in ubiquitous media.
Multim. Tools Appl., 2013

A blind image copyright protection scheme for e-government.
J. Vis. Commun. Image Represent., 2013

A segmentation-free method for image classification based on pixel-wise matching.
J. Comput. Syst. Sci., 2013

New acoustic monitoring method using cross-correlation of primary frequency spectrum.
J. Ambient Intell. Humaniz. Comput., 2013

An efficient classification approach for large-scale mobile ubiquitous computing.
Inf. Sci., 2013

Fast dimension reduction for document classification based on imprecise spectrum analysis.
Inf. Sci., 2013

A Generic Tree-Like Index Framework in the Cloud.
Proceedings of the Web Information Systems Engineering - WISE 2013, 2013

Improving Rocchio Algorithm for Updating User Profile in Recommender Systems.
Proceedings of the Web Information Systems Engineering - WISE 2013, 2013

Automatic Locality Exploitation in the Codelet Model.
Proceedings of the 12th IEEE International Conference on Trust, 2013

CAP: co-scheduling based on asymptotic profiling in CPU+GPU hybrid systems.
Proceedings of the 2013 PPOPP International Workshop on Programming Models and Applications for Multicores and Manycores, 2013

An energy-efficient and scalable eDRAM-based register file architecture for GPGPU.
Proceedings of the 40th Annual International Symposium on Computer Architecture, 2013

Adaptive Non-Local Means for Image Denoising using Turbulent PSO with No-Reference Measures.
Proceedings of the International Symposium on Biometrics and Security Technologies, 2013

Performance Tuning on Multicore Systems for Feature Matching within Image Collections.
Proceedings of the 42nd International Conference on Parallel Processing, 2013

HMHS: Hybrid Multistage Heuristic Scheduling Algorithm for Heterogeneous MapReduce System.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2013

Performance Bottlenecks in Manycore Systems: A Case Study on Large Scale Feature Matching within Image Collections.
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing, 2013

A dynamical Deterministic Packet Marking scheme for DDoS traceback.
Proceedings of the 2013 IEEE Global Communications Conference, 2013

LABERIO: Dynamic load-balanced Routing in OpenFlow-enabled Networks.
Proceedings of the 27th IEEE International Conference on Advanced Information Networking and Applications, 2013

ShmStreaming: A Shared Memory Approach for Improving Hadoop Streaming Performance.
Proceedings of the 27th IEEE International Conference on Advanced Information Networking and Applications, 2013

An Automatical Moderating System for FML Using Hashing Regression.
Proceedings of the Advanced Data Mining and Applications - 9th International Conference, 2013

2012
Adaptive Forwarding Delay Control for VANET Data Aggregation.
IEEE Trans. Parallel Distributed Syst., 2012

Optimally Maximizing Iteration-Level Loop Parallelism.
IEEE Trans. Parallel Distributed Syst., 2012

Communication-free data alignment for arrays with exponential references in parallelizing compilers for scalable parallel systems.
J. Supercomput., 2012

Molecular solutions of the RSA public-key cryptosystem on a DNA-based computer.
J. Supercomput., 2012

Context-aware HCI service selection.
Mob. Inf. Syst., 2012

A chain-cluster based routing algorithm for wireless sensor networks.
J. Intell. Manuf., 2012

An efficient and scalable ubiquitous storage scheme for delay-sensitive IT applications.
J. Intell. Manuf., 2012

An efficient deadlock prevention approach for service oriented transaction processing.
Comput. Math. Appl., 2012

Hole Avoiding in Advance Routing with Hole Recovery Mechanism in Wireless Sensor Networks.
Ad Hoc Sens. Wirel. Networks, 2012

Service-Oriented Wireless Sensor Networks and an Energy-Aware Mesh Routing Algorithm.
Ad Hoc Sens. Wirel. Networks, 2012

Long Duration Broadcast Authentication for Wireless Sensor Networks.
Proceedings of the 75th IEEE Vehicular Technology Conference, 2012

A Quick and Reliable Routing for Infrastructure Surveillance with Wireless Sensor Networks.
Proceedings of the IEEE 31st Symposium on Reliable Distributed Systems, 2012

Manhattan hashing for large-scale image retrieval.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

RAMZzz: rank-aware dram power management with dynamic migrations and demotions.
Proceedings of the SC Conference on High Performance Computing Networking, 2012

PMA: Pixel-based multi-anchor algorithm for image recognition on multi-core systems.
Proceedings of the 2012 PPOPP International Workshop on Programming Models and Applications for Multicores and Manycores, 2012

Semi-sparse algorithm based on multi-layer optimization for recommendation system.
Proceedings of the 2012 PPOPP International Workshop on Programming Models and Applications for Multicores and Manycores, 2012

WATS: Workload-Aware Task Scheduling in Asymmetric Multi-core Architectures.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

CATS: cache aware task-stealing based on online profiling in multi-socket multi-core architectures.
Proceedings of the International Conference on Supercomputing, 2012

Minimum Latency Broadcasting with Conflict Awareness in Wireless Sensor Networks.
Proceedings of the 41st International Conference on Parallel Processing, 2012

DirectedPush - A High Performance Peer-to-Peer Live Streaming System Using Network Coding.
Proceedings of the 18th IEEE International Conference on Parallel and Distributed Systems, 2012

AgileRegulator: A hybrid voltage regulator scheme redeeming dark silicon for power efficiency in a multicore architecture.
Proceedings of the 18th IEEE International Symposium on High Performance Computer Architecture, 2012

Inverted Grid-Based kNN Query Processing with MapReduce.
Proceedings of the Seventh ChinaGrid Annual Conference, ChinaGrid 2012, Beijing, 2012

Emoticon Smoothed Language Models for Twitter Sentiment Analysis.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011
TASA: Tag-Free Activity Sensing Using RFID Tag Arrays.
IEEE Trans. Parallel Distributed Syst., 2011

Compiler-assisted dynamic scratch-pad memory management with space overlapping for embedded systems.
Softw. Pract. Exp., 2011

Preface.
J. Comput. Sci. Technol., 2011

Spatial Localization of Concurrent Multiple Sound Sources Using Phase Candidate Histogram.
J. Adv. Comput. Intell. Intell. Informatics, 2011

Hierarchical attribute-based encryption and scalable user revocation for sharing data in cloud servers.
Comput. Secur., 2011

A Shadow-Like Task Migration Model Based on Context Semantics for Mobile and Pervasive Environments.
Comput. Informatics, 2011

More convenient more overhead: the performance evaluation of Hadoop streaming.
Proceedings of the Research in Applied Computation Symposium, 2011

A Method of Context-Driven HCI Service Selection in Multimodal Interaction Environments.
Proceedings of the 14th International Conference on Network-Based Information Systems, 2011

Improvements on Sequential Minimal Optimization Algorithm for Support Vector Machine Based on Semi-sparse Algorithm.
Proceedings of the Fifth International Conference on Innovative Mobile and Internet Services in Ubiquitous Computing, 2011

An Effective Deadlock Prevention Mechanism for Distributed Transaction Management.
Proceedings of the Fifth International Conference on Innovative Mobile and Internet Services in Ubiquitous Computing, 2011

Circuit Emulation Services over SCTP.
Proceedings of the Fifth International Conference on Innovative Mobile and Internet Services in Ubiquitous Computing, 2011

Trying Linear Network Coding on a Network Flow Processor.
Proceedings of the Fifth International Conference on Innovative Mobile and Internet Services in Ubiquitous Computing, 2011

HARVEST: A Task-objective Efficient Data Collection Scheme in Wireless Sensor and Actor Networks.
Proceedings of the Third International Conference on Communications and Mobile Computing, 2011

An Efficient Approach of Power Reducing for Scratch-Pad Memory Based Embedded Systems.
Proceedings of the 2011 International Conference on Parallel Processing Workshops, 2011

CAB: Cache Aware Bi-tier Task-Stealing in Multi-socket Multi-core Architecture.
Proceedings of the International Conference on Parallel Processing, 2011

Towards Context-Aware Ubiquitous Transaction Processing: A Model and Algorithm.
Proceedings of IEEE International Conference on Communications, 2011

A Scalable Multiprocessor Architecture for Pervasive Computing.
Proceedings of the Advances in Grid and Pervasive Computing - 6th International Conference, 2011

Proportional Response Based Bandwidth Allocation for Layered P2P Live Streaming.
Proceedings of the Global Communications Conference, 2011

Architecture-based Performance Evaluation of Genetic Algorithms on Multi/Many-core Systems.
Proceedings of the 14th IEEE International Conference on Computational Science and Engineering, 2011

PPMLT: A Pipeline Based Processing Model of Long Transactions.
Proceedings of the 25th IEEE International Conference on Advanced Information Networking and Applications, 2011

Mechanism Design for Stochastic Virtual Resource Allocation in Non-cooperative Cloud Systems.
Proceedings of the IEEE International Conference on Cloud Computing, 2011

2010
Designing energy efficient target tracking protocol with quality monitoring in wireless sensor networks.
J. Supercomput., 2010

Balanced bipartite graph based register allocation for network processors in mobile and wireless networks.
Mob. Inf. Syst., 2010

Tier-Based Scalable and Secure Routing for Wireless Sensor Networks with Mobile Sinks.
IEICE Trans. Inf. Syst., 2010

Trusted Routing Based on Dynamic Trust Mechanism in Mobile Ad-Hoc Networks.
IEICE Trans. Inf. Syst., 2010

A Secure and Scalable Rekeying Mechanism for Hierarchical Wireless Sensor Networks.
IEICE Trans. Inf. Syst., 2010

Context-Aware Workflow Management For Intelligent Navigation Applications In Pervasive Environments.
Intell. Autom. Soft Comput., 2010

Context reasoning using extended evidence theory in pervasive computing environments.
Future Gener. Comput. Syst., 2010

Dynamic scratch-pad memory management with data pipelining for embedded systems.
Concurr. Comput. Pract. Exp., 2010

Dynamic Itinerary Planning for Mobile Agents with a Content-Specific Approach in Wireless Sensor Networks.
Proceedings of the 72nd IEEE Vehicular Technology Conference, 2010

The Core Degree Based Tag Reduction on Chip Multiprocessor to Balance Energy Saving and Performance Overhead.
Proceedings of the Network and Parallel Computing, IFIP International Conference, 2010

GridTDK: A Grid Transaction Development Kit.
Proceedings of the 13th International Conference on Network-Based Information Systems, 2010

MTTF of Composite Web Services.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2010

Exploring the Limits of Tag Reduction for Energy Saving on a Multi-core Processor.
Proceedings of the 39th International Conference on Parallel Processing, 2010

A Parallel Skeleton Library for Embedded Multicores.
Proceedings of the 39th International Conference on Parallel Processing, 2010

xMozart: A Novel Platform for Intelligent Task Migration.
Proceedings of the CISIS 2010, 2010

Quantum Algorithms and Mathematical Representation of Bio-molecular Solutions for the Clique Problem in a Finite-dimensional Hilbert Space.
Proceedings of the International Conference on Computational Aspects of Social Networks, 2010

Quantum Algorithms and Mathematical Representation of Bio-molecular Solutions for the Hitting-set Problem on a Quantum Computer.
Proceedings of the International Conference on Computational Aspects of Social Networks, 2010

A Context Conflict Resolution with Optimized Mediation.
Proceedings of the 24th IEEE International Conference on Advanced Information Networking and Applications Workshops, 2010

SAMR: A Self-adaptive MapReduce Scheduling Algorithm in Heterogeneous Environment.
Proceedings of the 10th IEEE International Conference on Computer and Information Technology, 2010

2009
Flexible Deterministic Packet Marking: An IP Traceback System to Find the Real Source of Attacks.
IEEE Trans. Parallel Distributed Syst., 2009

Adaptive location updates for mobile sinks in wireless sensor networks.
J. Supercomput., 2009

Loop scheduling and bank type assignment for heterogeneous multi-bank memory.
J. Parallel Distributed Comput., 2009

An innovative analyser for multi-classifier e-mail classification based on grey list analysis.
J. Netw. Comput. Appl., 2009

Black Bridge: A Scatternet Formation Algorithm for Solving a New Emerging Problem.
J. Inf. Process. Syst., 2009

Improved Resource Allocation Algorithms for Practical Image Encoding in a Ubiquitous Computing Environment.
J. Comput., 2009

A message complexity oriented design of distributed algorithm for long-lived multicasting in wireless sensor networks.
Int. J. Sens. Networks, 2009

Multipath Routing with Reliable Nodes in Large-Scale Mobile Ad-Hoc Networks.
IEICE Trans. Inf. Syst., 2009

An effective state-based predictive approach for leakage energy management on embedded systems.
Des. Autom. Embed. Syst., 2009

Special issue: Network and Parallel Computing.
Comput. Syst. Sci. Eng., 2009

Quantum Algorithms of Bio-molecular Solutions for the Clique Problem on a Quantum Computer.
CoRR, 2009

A scalable key pre-distribution mechanism for large-scale wireless sensor networks.
Concurr. Comput. Pract. Exp., 2009

A class-feature-centroid classifier for text categorization.
Proceedings of the 18th International Conference on World Wide Web, 2009

Service-oriented multimedia delivery in pervasive space.
Proceedings of the 2009 IEEE Wireless Communications and Networking Conference, 2009

I-Cache Tag Reduction for Low Power Chip Multiprocessor.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2009

An Efficient Algorithm for Multimedia Delivery in Pervasive Space.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2009

Exploring the Multicast Lifetime Capacity of WANETs with Directional Multibeam Antennas.
Proceedings of the INFOCOM 2009. 28th IEEE International Conference on Computer Communications, 2009

An Accurate and Energy Efficient Fetch Direction Orientation Mechanism for Trace Cache.
Proceedings of the ICPPW 2009, 2009

Global Variable Partition with Virtually Shared Scratch Pad Memory to Minimize Schedule Length.
Proceedings of the ICPPW 2009, 2009

An Efficient Collaborative Filtering Approach Using Smoothing and Fusing.
Proceedings of the ICPP 2009, 2009

Context-Aware Multimedia Processing System in a Pervasive Environment.
Proceedings of the Global Communications Conference, 2009. GLOBECOM 2009, Honolulu, Hawaii, USA, 30 November, 2009

An Improved Approach to Tag Reduction on Low Power CMP with Trade-Off of Energy and Performance.
Proceedings of the Fourth International Conference on Frontier of Computer Science and Technology, 2009

Development of General-Purpose Processing Element and Network-Based Dataflow Processing System: Part One.
Proceedings of the Fourth International Conference on Frontier of Computer Science and Technology, 2009

A Realistic Interference Model in Ad Hoc Networks.
Proceedings of the Fourth International Conference on Frontier of Computer Science and Technology, 2009

Development of General-Purpose Processing Element and Network-Based Dataflow Processing System: Part Two.
Proceedings of the Fourth International Conference on Frontier of Computer Science and Technology, 2009

Network-Based Data Flow Processing System with Variable Data Granularity for Inter-module Communication.
Proceedings of the Fourth International Conference on Frontier of Computer Science and Technology, 2009

Analysis of the Availability of Composite Web Services.
Proceedings of the Fourth International Conference on Frontier of Computer Science and Technology, 2009

Extended Dempster-Shafer Theory in Context Reasoning for Ubiquitous Computing Environments.
Proceedings of the 12th IEEE International Conference on Computational Science and Engineering, 2009

Dynamic Scratch-Pad Memory Management with Data Pipelining for Embedded Systems.
Proceedings of the 12th IEEE International Conference on Computational Science and Engineering, 2009

Black Bridge: A Scatternet Formation Algorithm for Solving a New Emerging Problem.
Proceedings of the 12th IEEE International Conference on Computational Science and Engineering, 2009

A Trade-Off Approach to Optimal Resource Allocation Algorithm with Cache Technology in Ubiquitous Computing Environment.
Proceedings of the 12th IEEE International Conference on Computational Science and Engineering, 2009

A Register Framework for Network Processors with Banked Register File.
Proceedings of the 2009 International Conference on Complex, 2009

Efficient Task Allocation Method to Improve Network Processor Throughput.
Proceedings of the 2009 International Conference on Complex, 2009

Optimal loop parallelization for maximizing iteration-level parallelism.
Proceedings of the 2009 International Conference on Compilers, 2009

The Design and Evaluation of a Selective Way Based Trace Cache.
Proceedings of the Advanced Parallel Processing Technologies, 8th International Symposium, 2009

Transaction Management for Reliable Grid Applications.
Proceedings of the IEEE 23rd International Conference on Advanced Information Networking and Applications, 2009

2008
Topology Design of Network-Coding-Based Multicast Networks.
IEEE Trans. Parallel Distributed Syst., 2008

Improving the parallelism of iterative methods by aggressive loop fusion.
J. Supercomput., 2008

Advances in high performance computing.
J. Supercomput., 2008

Implementation of an Intelligent Urban Traffic Management System Based on a City Grid Infrastructure.
J. Inf. Sci. Eng., 2008

Special Issue: Network Attacks and Defense Systems.
Comput. Syst. Sci. Eng., 2008

An Adaptive Context-Aware Transaction Model for Mobile and Ubiquitous Computing.
Comput. Informatics, 2008

A tradeoff analysis on message complexity and lifetime optimality for a distributed multicast algorithm in WSNs.
Proceedings of the Twenty-Seventh Annual ACM Symposium on Principles of Distributed Computing, 2008

A linear message distributed multicast algorithm with guaranteed directional communication lifetime in WANETs.
Proceedings of the IEEE 5th International Conference on Mobile Adhoc and Sensor Systems, 2008

iShadow: Yet Another Pervasive Computing Environment.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2008

An Improved Design for UMP (Ubiquitous Multi-processor) System.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2008

Consistent Music Recommendation in Heterogeneous Pervasive Environment.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2008

ISOS: Space Overlapping Based on Iteration Access Patterns for Dynamic Scratch-pad Memory Management in Embedded Systems.
Proceedings of the 9th International Conference for Young Computer Scientists, 2008

Performance Analysis of Resource Allocation Algorithms Using Cache Technology for Pervasive Computing System.
Proceedings of the 9th International Conference for Young Computer Scientists, 2008

Towards Context-Aware Workflow Management for Ubiquitous Computing.
Proceedings of the International Conference on Embedded Software and Systems, 2008

Lifetime Approximation Schemes Allow Multicast Algorithm with Linear Message Complexity in Wireless Sensor Networks.
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications, 2008

A Performance Guaranteed Distributed Multicast Algorithm for Long-Lived Directional Communications in WANETs.
Proceedings of the High Performance Computing, 2008

Scalable and Secure Routing for Large-Scale Sensor Networks.
Proceedings of the 2008 IEEE/IFIP International Conference on Embedded and Ubiquitous Computing (EUC 2008), 2008

A State-Based Predictive Approach for Leakage Reduction of Functional Units.
Proceedings of the 2008 IEEE/IFIP International Conference on Embedded and Ubiquitous Computing (EUC 2008), 2008

An Improved Design of the Ubiquitous Learning System Based on Sensor Networks.
Proceedings of the 2008 IEEE/IFIP International Conference on Embedded and Ubiquitous Computing (EUC 2008), 2008

Quantum algorithms for bio-molecular solutions to the satisfiability problem on a quantum computer.
Proceedings of the IEEE Congress on Evolutionary Computation, 2008

Performance Analysis of Heuristic Algorithms for Lifetime-Aware Directional Multicasting in Wireless Ad Hoc Networks.
Proceedings of the 22nd International Conference on Advanced Information Networking and Applications, 2008

2007
Restoration Probability Modelling for Active Restoration-Based Optical Networks with Correlation Among Backup Routes.
IEEE Trans. Parallel Distributed Syst., 2007

A degree-constrained QoS-aware routing algorithm for application layer multicast.
Inf. Sci., 2007

A transactional grid workflow service for ShanghaiGrid.
Int. J. Web Grid Serv., 2007

Constructing Bio-molecular Databases on a DNA-based Computer.
CoRR, 2007

Efficient group key management for multi-privileged groups.
Comput. Commun., 2007

Energy-Efficient Dual Prediction-Based Data Gathering for Environmental Monitoring Applications.
Proceedings of the IEEE Wireless Communications and Networking Conference, 2007

Hole Avoiding in Advance Routing in Wireless Sensor Networks.
Proceedings of the IEEE Wireless Communications and Networking Conference, 2007

ShanghaiGrid and intelligent urban traffic applications.
Proceedings of the CHINA HPC 2007, 2007

Hardware Implementation of Common Protocol Interface for a Network-Based Multiprocessor.
Proceedings of the Parallel and Distributed Processing and Applications, 2007

Wireless Mesh Sensor Networks in Pervasive Environment: a Reliable Architecture and Routing Protocol.
Proceedings of the 2007 International Conference on Parallel Processing Workshops (ICPP Workshops 2007), 2007

RARE: An Energy-Efficient Target Tracking Protocol for Wireless Sensor Networks.
Proceedings of the 2007 International Conference on Parallel Processing Workshops (ICPP Workshops 2007), 2007

Local Update-Based Routing Protocol in Wireless Sensor Networks with Mobile Sinks.
Proceedings of IEEE International Conference on Communications, 2007

Design of a Stabilizing Second-Order Congestion Controller for Large-Delay Networks.
Proceedings of IEEE International Conference on Communications, 2007

ID-Based Hierarchical Key Graph Scheme in Multi-Privileged Group Communications.
Proceedings of the Global Communications Conference, 2007

Polynomial Regression for Data Gathering in Environmental Monitoring Applications.
Proceedings of the Global Communications Conference, 2007

Transaction Management for Grid Workflow Applications.
Proceedings of the Grid and Cooperative Computing, 2007

Designing Piecewise QoS Routing Protocol in Large-Scale MANETs.
Proceedings of the Japan-China Joint Workshop on Frontier of Computer Science and Technology, 2007

Performance Evaluation to Optimize the UMP System Focusing on Network Transmission Speed.
Proceedings of the Japan-China Joint Workshop on Frontier of Computer Science and Technology, 2007

Ubiquitous Laboratory: A Research Support Environment for Ubiquitous Learning Based on Sensor Networks.
Proceedings of the Emerging Directions in Embedded and Ubiquitous Computing, 2007

U-LES: Active E-mobile Device and E-server for Senility User Friendly in u-Healthcare.
Proceedings of The 2nd IEEE Asia-Pacific Services Computing Conference, 2007

Multiprocessor Simulator System Based on Multi-way Cluster Using Double-buffered Model.
Proceedings of the 21st International Conference on Advanced Information Networking and Applications (AINA 2007), 2007

UMP-PerComp: A Ubiquitous Multiprocessor Network-Based Pipeline Processing Framework for Pervasive Computing Environments.
Proceedings of the 21st International Conference on Advanced Information Networking and Applications (AINA 2007), 2007

An Effective Trust Establishment Scheme for Authentication in Mobile Ad Hoc Networks.
Proceedings of the Seventh International Conference on Computer and Information Technology (CIT 2007), 2007

2006
Overall Blocking Behavior Analysis of General Banyan-Based Optical Switching Networks.
IEEE Trans. Parallel Distributed Syst., 2006

Foreword.
J. Supercomput., 2006

Fast parallel bio-molecular solutions: the set-basis problem.
Int. J. Comput. Sci. Eng., 2006

A Multicast Based Anonymous Information Sharing Protocol for Peer-to-Peer Systems.
IEICE Trans. Inf. Syst., 2006

Message Scheduling for Irregular Data Redistribution in Parallelizing Compilers.
IEICE Trans. Inf. Syst., 2006

Special Section on Parallel/Distributed Computing and Networking.
IEICE Trans. Inf. Syst., 2006

A taxonomy of application scheduling tools for high performance cluster computing.
Clust. Comput., 2006

Ontology-Based Composition of Web Services for Ubiquitous Computing.
Proceedings of the Frontiers of High Performance Computing and Networking, 2006

CoopStream: A Cooperative Cache Based Streaming Schedule Scheme for On-demand Media Services on Overlay Networks.
Proceedings of the 2006 International Conference on Parallel Processing (ICPP 2006), 2006

BAIMD: A Responsive Rate Control for TCP over Optical Burst Switched (OBS) Networks.
Proceedings of IEEE International Conference on Communications, 2006

A GML-Based Mobile Device Trace Monitoring System.
Proceedings of the Emerging Directions in Embedded and Ubiquitous Computing, 2006

A High Performance Simulator System for a Multiprocessor System Based on a Multi-way Cluster.
Proceedings of the Advances in Computer Systems Architecture, 11th Asia-Pacific Conference, 2006

2005
Solving the Independent-set Problem in a DNA-based Supercomputer Model.
Parallel Process. Lett., 2005

Improving communication scheduling for array redistribution.
J. Parallel Distributed Comput., 2005

One-dimensional I test and direction vector I test with array references by induction variable.
Int. J. High Perform. Comput. Netw., 2005

A Heuristic Routing Algorithm for Degree-Constrained Minimum Overall Latency Application Layer Multicast.
Proceedings of the Parallel and Distributed Processing and Applications, 2005

Effective Resource Allocation in a JXTA-Based Grid Computing Platform JXTPIA.
Proceedings of the Parallel and Distributed Processing and Applications, 2005

Communication-Free Data Alignment for Arrays with Exponential References Using Elementary Linear Algebra.
Proceedings of the Parallel and Distributed Processing and Applications, 2005

Keynote Address: Energy-Aware Compiler Scheduling for VLIW Embedded Software.
Proceedings of the 34th International Conference on Parallel Processing Workshops (ICPP 2005 Workshops), 2005

Enabling Loop Fusion and Tiling for Cache Performance by Fixing Fusion-Preventing Data Dependences.
Proceedings of the 34th International Conference on Parallel Processing (ICPP 2005), 2005

Process Migration for MPI Applications based on Coordinated Checkpoint.
Proceedings of the 11th International Conference on Parallel and Distributed Systems, 2005

Keynote 4: Can Parallel Software Catch up with Parallel Hardware? Trends in Automatic Parallelization.
Proceedings of the Third International Conference on Information Technology and Applications (ICITA 2005), 2005

Fast Parallel DNA-Based Algorithms for Molecular Computation: Determining a Prime Number.
Proceedings of the Third International Conference on Information Technology and Applications (ICITA 2005), 2005

An IP Routing Inspired Information Search Scheme for Semantic Overlay Networks.
Proceedings of the High Performance Computing and Communications, 2005

JXTPIA: A JXTA-Based P2P Network Interface and Architecture for Grid Computing.
Proceedings of the High Performance Computing and Communications, 2005

Optical flooding cluster switching (OFCS).
Proceedings of the Global Telecommunications Conference, 2005. GLOBECOM '05, St. Louis, Missouri, USA, 28 November, 2005

A Scalable and Reliable Multiple Home Regions Based Location Service in Mobile Ad Hoc Networks.
Proceedings of the Embedded and Ubiquitous Computing, 2005

2004
A Divide-and-Conquer Algorithm for Irregular Redistribution in Parallelizing Compilers.
J. Supercomput., 2004

Editorial: Parallel and Distributed Processing with Applications.
J. Supercomput., 2004

Fast parallel molecular solution to the dominating-set problem on massively parallel bio-computing.
Parallel Comput., 2004

Using sticker to solve the 3-dimensional matching problem in molecular supercomputers.
Int. J. High Perform. Comput. Netw., 2004

A Parallel Implementation of Multi-Domain High-Order Navier-Stokes Equations Using MPI.
IEICE Trans. Inf. Syst., 2004

Fast Parallel Solution for Set-Packing and Clique Problems by DNA-Based Computing.
IEICE Trans. Inf. Syst., 2004

Foreword.
IEICE Trans. Inf. Syst., 2004

Programming Support for MPMD Parallel Computing in ClusterGOP.
IEICE Trans. Inf. Syst., 2004

Towards solution of the set-splitting problem on gel-based DNA computing.
Future Gener. Comput. Syst., 2004

The Non-continuous Direction Vector I Test.
Proceedings of the 7th International Symposium on Parallel Architectures, 2004

A Genetic Algorithm for Dynamic Routing and Wavelength Assignment in WDM Networks.
Proceedings of the Parallel and Distributed Processing and Applications, 2004

Automatic Parallelization and Optimization for Irregular Scientific Applications.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

Distributed MD4 Password Hashing with Grid Computing Package BOINC.
Proceedings of the Grid and Cooperative Computing, 2004

Dynamic Routing and Wavelength Assignment in WDM Networks with Ant-Based Agents.
Proceedings of the Embedded and Ubiquitous Computing, 2004

Implementing Cooperative Caching in Distributed Streaming Media Server Clusters.
Proceedings of the Embedded and Ubiquitous Computing, 2004

Location-Aware Information Retrieval for Mobile Computing.
Proceedings of the Embedded and Ubiquitous Computing, 2004

Fast Parallel Molecular Algorithms for DNA-based Computation: Factoring Integers.
Proceedings of the 4th IEEE International Symposium on BioInformatics and BioEngineering (BIBE 2004), 2004

Switching Cost, Market Effects and the Pricing Model of e-Commerce.
Proceedings of the 2004 IEEE International Conference on Services Computing (SCC 2004), 2004

Efficient List Algorithms for Irregular Block Redistribution in Parallelizing Compilers.
Proceedings of the 2004 International Conference on Computer and Information Technology (CIT 2004), 2004

2003
Symbolic Communication Set Generation for Irregular Parallel Applications.
J. Supercomput., 2003

Parallel and distributed scientific and engineering computing.
Parallel Comput., 2003

A scalable HPF implementation of a finite-volume computational electromagnetics application on a CRAY T3E parallel system.
Concurr. Comput. Pract. Exp., 2003

A Virtual XML Database Engine for Relational Databases.
Proceedings of the Database and XML Technologies, 2003

A Scheme of Interactive Data Mining Support System in Parallel and Distributed Environment.
Proceedings of the Parallel and Distributed Processing and Applications, 2003

An Efficient Algorithm for Irregular Redistributions in Parallelizing Compilers.
Proceedings of the Parallel and Distributed Processing and Applications, 2003

Solving the Set-Splitting Problem in Sticker-Based Model and the Adleman-Lipton Model.
Proceedings of the Parallel and Distributed Processing and Applications, 2003

Is Cook's Theorem Correct for DNA-Based Computing?
Proceedings of the High Performance Computing, 5th International Symposium, 2003

Molecular Fast Solution for Set-basis Problem on Sticker-based Model.
Proceedings of the 32nd International Conference on Parallel Processing Workshops (ICPP 2003 Workshops), 2003

Parallel Biometrics Computing Using Mobile Agents.
Proceedings of the 32nd International Conference on Parallel Processing (ICPP 2003), 2003

Effective OpenMP Extensions for Irregular Applications on Cluster Environments.
Proceedings of the Grid and Cooperative Computing, Second International Workshop, 2003

Accessing Relational Databases via XML Schema.
Proceedings of the 15th Conference on Advanced Information Systems Engineering (CAiSE '03), 2003

On Transformation to Redundancy Free XML Schema from Relational Database Schema.
Proceedings of the Web Technologies and Applications, 5th Asian-Pacific Web Conference, 2003

2002
JAPS-II: A Source to Source Parallelizing Compiler for Java.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2002

A Proposal of High Performance Data Mining System.
Proceedings of the Applied Parallel Computing Advanced Scientific Computing, 2002

Reducing Communication Cost for Parallelizing Irregular Scientific Codes.
Proceedings of the Applied Parallel Computing Advanced Scientific Computing, 2002

Optimization Techniques for Parallel Codes of Irregular Scientific Computations.
Proceedings of the 31st International Conference on Parallel Processing Workshops (ICPP 2002 Workshops), 2002

2001
A Framework for Efficient Data Redistribution on Distributed Memory Multicomputers.
J. Supercomput., 2001

Scheduling and Automatic Parallelization.
Parallel Distributed Comput. Pract., 2001

Denotational Semantics of an HPF-Like Data-Parallel Language Model.
Parallel Process. Lett., 2001

2000
Contention-free communication scheduling for array redistribution.
Parallel Comput., 2000

Communication Scheduling with Considering Message Length for Array Redistribution.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2000

1998
Improving Performance of Multi-Dimensional Array Redistribution on Distributed Memory Machines.
Proceedings of the 3rd International Workshop on High-Level Programming Models and Supportive Environments (HIPS '98), 30 March, 1998

1997
Automatic transformation from data flow diagram to structure chart.
ACM SIGSOFT Softw. Eng. Notes, 1997

