Rui Wang

ORCID: 0000-0003-2741-6033

Affiliations:
  • Beihang University, Sino-German Joint Software Institute, Beijing, China
  • Beihang University, School of Computer Science and Engineering, Beijing, China


According to our database, Rui Wang authored at least 74 papers between 2008 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
HAOTuner: A Hardware Adaptive Operator Auto-Tuner for Dynamic Shape Tensor Compilers.
IEEE Trans. Computers, November, 2023

2022
Passive Motion Detection via mmWave Communication System.
Proceedings of the 95th IEEE Vehicular Technology Conference, 2022

2021
MIPSGPU: Minimizing Pipeline Stalls for GPUs With Non-Blocking Execution.
IEEE Trans. Computers, 2021

Guardauto: A Decentralized Runtime Protection System for Autonomous Driving.
IEEE Trans. Computers, 2021

Mutual calibration training: Training deep neural networks with noisy labels using dual-models.
Comput. Vis. Image Underst., 2021

Enabling Large-Reach TLBs for High-Throughput Processors by Exploiting Memory Subregion Contiguity.
CoRR, 2021

CompactNet: Platform-Aware Automatic Optimization for Convolutional Neural Networks.
Proceedings of the PMAM@PPoPP 2021: Proceedings of the Twelfth International Workshop on Programming Models and Applications for Multicores and Manycores, 2021

2020
Thread-Level Locking for SIMT Architectures.
IEEE Trans. Parallel Distributed Syst., 2020

Temperature-Aware DRAM Cache Management - Relaxing Thermal Constraints in 3-D Systems.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020

Extremely Low-bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures.
Proceedings of the ICPP 2020: 49th International Conference on Parallel Processing, 2020

CCIED: Cache-aided Collaborative Intelligence Between Edge Devices.
Proceedings of the 22nd IEEE International Conference on High Performance Computing and Communications; 18th IEEE International Conference on Smart City; 6th IEEE International Conference on Data Science and Systems, 2020

2019
Accelerating in-memory transaction processing using general purpose graphics processing units.
Future Gener. Comput. Syst., 2019

A novel index system describing program runtime characteristics for workload consolidation.
Frontiers Comput. Sci., 2019

CompactNet: Platform-Aware Automatic Optimization for Convolutional Neural Networks.
CoRR, 2019

Multiple Algorithms Against Multiple Hardware Architectures: Data-Driven Exploration on Deep Convolution Neural Network.
Proceedings of the Network and Parallel Computing, 2019

GraphQ: Scalable PIM-Based Graph Processing.
Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, 2019

Structure Characteristic-Aware Pruning Strategy for Convolutional Neural Networks.
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019

Towards a General and Efficient Linked-List Hash Table on GPUs.
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019

FLONet: Fewer Labeling Cost Active Learning for Deep Neural Network.
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019

LADet: A Light-weight and Adaptive Network for Multi-scale Object Detection.
Proceedings of The 11th Asian Conference on Machine Learning, 2019

2018
SRAM- and STT-RAM-based hybrid, shared last-level cache for on-chip CPU-GPU heterogeneous architectures.
J. Supercomput., 2018

T1000: Mitigating the memory footprint of convolution neural networks with decomposition and re-fusion.
Future Gener. Comput. Syst., 2018

Sparsing Deep Neural Network Using Semi-Discrete Matrix Decomposition.
IEEE Access, 2018

A network traffic flow prediction with deep learning approach for large-scale metropolitan area network.
Proceedings of the 2018 IEEE/IFIP Network Operations and Management Symposium, 2018

Nodes contact probability estimation approach based on Bayesian network for DTN.
Proceedings of the 2018 IEEE/IFIP Network Operations and Management Symposium, 2018

Network Alarm Flood Pattern Mining Algorithm Based on Multi-dimensional Association.
Proceedings of the 21st ACM International Conference on Modeling, 2018

Multi-level virtual desktop security enhancement technology based on docker and X server.
Proceedings of the International Conference on Geoinformatics and Data Analysis, 2018

A QOS-aware dynamic resources management for data center.
Proceedings of the International Conference on Geoinformatics and Data Analysis, 2018

Research on Asynchronous Inter-VM Communication Mechanism Based on Embedded Hypervisor.
Proceedings of the 2018 IEEE 42nd Annual Computer Software and Applications Conference, 2018

EffectFace: A Fast and Efficient Deep Neural Network Model for Face Recognition.
Proceedings of the Advanced Computer Architecture - 12th Conference, 2018

2017
Achieving Versatile and Simultaneous Cache Optimizations With Nonvolatile SRAM.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2017

Efficient Asynchronous Communication between Virtual Machines in Embedded Systems.
Proceedings of the 19th IEEE International Conference on High Performance Computing and Communications; 15th IEEE International Conference on Smart City; 3rd IEEE International Conference on Data Science and Systems, 2017

2016
Managing Server Clusters on Renewable Energy Mix.
ACM Trans. Auton. Adapt. Syst., 2016

Coordinating workload balancing and power switching in renewable energy powered data center.
Frontiers Comput. Sci., 2016

QIM: Quantifying Hyperparameter Importance for Deep Learning.
Proceedings of the Network and Parallel Computing, 2016

Scheduling Tasks with Mixed Timing Constraints in GPU-Powered Real-Time Systems.
Proceedings of the 2016 International Conference on Supercomputing, 2016

Restricted Boltzmann Machines and Deep Belief Networks on Sunway Cluster.
Proceedings of the 18th IEEE International Conference on High Performance Computing and Communications; 14th IEEE International Conference on Smart City; 2nd IEEE International Conference on Data Science and Systems, 2016

Lock-based synchronization for GPU architectures.
Proceedings of the ACM International Conference on Computing Frontiers, CF'16, 2016

2015
Reducing DRAM refreshing in an error correction manner.
Sci. China Inf. Sci., 2015

Improving multiprocessor performance with fine-grain coherence bypass.
Sci. China Inf. Sci., 2015

Leveraging Non-Volatile Storage to Achieve Versatile Cache Optimizations.
IEEE Comput. Archit. Lett., 2015

Merging of P2P Overlays Over Mobile Ad Hoc Network: Evaluation of Three Approaches.
Ad Hoc Sens. Wirel. Networks, 2015

An Efficient Transmission Method for Bulk Data Based on Network Coding in Delay Tolerant Network.
Proceedings of the 18th ACM International Conference on Modeling, 2015

Adaptive Assignment for Quality-Aware Mobile Sensing Network with Strategic Users.
Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015

Optimizing Soft Real-Time Scheduling Performance for Virtual Machines with SRT-Xen.
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015

2014
An Efficient and Scalable Routing for MANETs.
Wirel. Pers. Commun., 2014

Lightweight dynamic partitioning for last-level cache of multicore processor on real system.
J. Supercomput., 2014

Towards Automated Provisioning and Emergency Handling in Renewable Energy Powered Datacenters.
J. Comput. Sci. Technol., 2014

Software Transactional Memory for GPU Architectures.
IEEE Comput. Archit. Lett., 2014

Speedup Critical Stage of Machine Learning with Batch Scheduling in GPU.
Proceedings of the Network and Parallel Computing, 2014

Memory Centric Hardware Prefetching in Multi-core Processors.
Proceedings of the Trustworthy Computing and Services - International Conference, 2014

Managing Green Datacenters Powered by Hybrid Renewable Energy Systems.
Proceedings of the 11th International Conference on Autonomic Computing, 2014

Remapping NUCA: Improving NUCA Cache's Power Efficiency.
Proceedings of the 2014 IEEE International Conference on High Performance Computing and Communications, 2014

Lessons from Experimental Methodology of Cache Hierarchy Changes with the Memory Technology.
Proceedings of the 17th IEEE International Conference on Computational Science and Engineering, 2014

Dual Power: Integrating Renewable Energy into Green Datacenters without Grid Tie Inverter.
Proceedings of the 17th IEEE International Conference on Computational Science and Engineering, 2014

Software Transactional Memory for GPU Architectures.
Proceedings of the 12th Annual IEEE/ACM International Symposium on Code Generation and Optimization, 2014

2013
Chameleon: Adapting throughput server to time-varying green power budget using online learning.
Proceedings of the International Symposium on Low Power Electronics and Design (ISLPED), 2013

M&C: A Software Solution to Reduce Errors Caused by Incoherent Caches on GPUs in Unstructured Graphic Algorithm.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2013

Interference-Aware Program Scheduling for Multicore Processors.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2013

COMSP: Correlated Contact and Message Scheduling Policy in DTN.
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing, 2013

PT: A Lightweight Job Scheduler with Remote Interface for Multiprocessors.
Proceedings of the 16th IEEE International Conference on Computational Science and Engineering, 2013

2012
MANET adaptive structured P2P overlay.
Peer-to-Peer Netw. Appl., 2012

2011
Enhancing cooperation with multiple stage auctions in opportunistic routing for wireless mesh networks.
Proceedings of the 12th IFIP/IEEE International Symposium on Integrated Network Management, 2011

2010
Throughput maximization with bargaining game in cognitive radio networks.
Proceedings of the 3rd IFIP Wireless Days Conference 2010, 2010

A Fair Thread-Aware Memory Scheduling Algorithm for Chip Multiprocessor.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2010

ORSP: An Efficient Resource Acquisition Policy for Peer-to-Peer Mesh Streaming Systems.
Proceedings of the GCC 2010, 2010

AIM: An Auction Incentive Mechanism in Wireless Networks with Opportunistic Routing.
Proceedings of the 13th IEEE International Conference on Computational Science and Engineering, 2010

2009
Re-exploring the Potential of Using Tree Structure in P2P Live Streaming Networks.
Proceedings of the NPC 2009, 2009

Optimizing Transmission in Multi-Flow Streaming Overlay Networks.
Proceedings of the NPC 2009, 2009

Tuning Performance of P2P Mesh Streaming System Using a Network Evolution Approach.
Proceedings of the Scalable Information Systems, 4th International ICST Conference, 2009

2008
An Adaptive Network Node Architecture for Evolutionary Networks.
Proceedings of the IEEE International Conference on Networking, Sensing and Control, 2008

GSON: A Group Based Hierarchically Structured Overlay Network.
Proceedings of the 12th IEEE International Workshop on Future Trends of Distributed Computing Systems, 2008

An Architecture for Distributed Controllable Networks and Manageable Node Based on Network Processor.
Proceedings of the Progress in WWW Research and Development, 2008

An evolutionary node architecture and performance optimization.
Proceedings of the 6th ACS/IEEE International Conference on Computer Systems and Applications, 2008

