Zhongzhi Luan
Orcid: 0000-0002-7186-0556
According to our database1,
Zhongzhi Luan
authored at least 185 papers
between 2003 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
IEEE Trans. Computers, July, 2024
IEEE Trans. Parallel Distributed Syst., June, 2024
Towards optimized tensor code generation for deep learning on sunway many-core processor.
Frontiers Comput. Sci., April, 2024
Adaptive Auto-Tuning Framework for Global Exploration of Stencil Optimization on GPUs.
IEEE Trans. Parallel Distributed Syst., January, 2024
ElasticBatch: A Learning-Augmented Elastic Scheduling System for Batch Inference on MIG.
IEEE Trans. Parallel Distributed Syst., 2024
FDLoRA: Personalized Federated Learning of Large Language Model via Dual LoRA Tuning.
CoRR, 2024
INSPIRIT: Optimizing Heterogeneous Task Scheduling through Adaptive Priority in Task-based Runtime Systems.
CoRR, 2024
CoRR, 2024
Minions: Accelerating Large Language Model Inference with Adaptive and Collective Speculative Decoding.
CoRR, 2024
Building a domain-specific compiler for emerging processors with a reusable approach.
Sci. China Inf. Sci., 2024
Proceedings of the IEEE International Conference on Software Analysis, 2024
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2024
Proceedings of the 53rd International Conference on Parallel Processing, 2024
PRoof: A Comprehensive Hierarchical Profiling Framework for Deep Neural Networks with Roofline Analysis.
Proceedings of the 53rd International Conference on Parallel Processing, 2024
Proceedings of the 33rd International Symposium on High-Performance Parallel and Distributed Computing, 2024
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
2023
HAOTuner: A Hardware Adaptive Operator Auto-Tuner for Dynamic Shape Tensor Compilers.
IEEE Trans. Computers, November, 2023
IEEE Trans. Computers, September, 2023
CCF Trans. High Perform. Comput., September, 2023
Computer, August, 2023
IEEE Trans. Netw. Serv. Manag., June, 2023
swSpAMM: optimizing large-scale sparse approximate matrix multiplication on Sunway Taihulight.
Frontiers Comput. Sci., 2023
TrivialSpy: Identifying Software Triviality via Fine-grained and Dataflow-based Value Profiling.
Proceedings of the International Conference for High Performance Computing, 2023
EasyScale: Elastic Training with Consistent Accuracy and Improved Utilization on GPUs.
Proceedings of the International Conference for High Performance Computing, 2023
Exploiting Input Tensor Dynamics in Activation Checkpointing for Efficient Training on GPU.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023
Proceedings of the 37th International Conference on Supercomputing, 2023
Proceedings of the 52nd International Conference on Parallel Processing, 2023
Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023
Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023
gGMED: Towards GPU Accelerated Geometric Modeling Evaluation and Derivative Processes.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2023
Proceedings of the IEEE International Conference on High Performance Computing & Communications, 2023
Proceedings of the IEEE International Conference on High Performance Computing & Communications, 2023
Large-Scale Parallelization and Optimization of Lattice QCD on Tianhe New Generation Supercomputer.
Proceedings of the IEEE International Conference on High Performance Computing & Communications, 2023
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023
2022
REVAL: Recommend Which Variables to Log With Pretrained Model and Graph Neural Network.
IEEE Trans. Netw. Serv. Manag., December, 2022
J. Supercomput., 2022
IEEE Trans. Computers, 2022
QoS-aware dynamic resource allocation with improved utilization and energy efficiency on GPU.
Parallel Comput., 2022
Frontiers Comput. Sci., 2022
CoRR, 2022
FamilySeer: Towards Optimized Tensor Codes by Exploiting Computation Subgraph Similarity.
CoRR, 2022
Proceedings of the SC22: International Conference for High Performance Computing, 2022
Proceedings of the 2022 IEEE/IFIP Network Operations and Management Symposium, 2022
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022
Proceedings of the ICS '22: 2022 International Conference on Supercomputing, Virtual Event, June 28, 2022
Proceedings of the 51st International Conference on Parallel Processing, 2022
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022
Proceedings of the 18th International Conference on Network and Service Management, 2022
2021
IEEE Trans. Parallel Distributed Syst., 2021
J. Supercomput., 2021
Inf. Sci., 2021
dgQuEST: Accelerating Large Scale Quantum Circuit Simulation through Hybrid CPU-GPU Memory Hierarchies.
Proceedings of the Network and Parallel Computing, 2021
Proceedings of the ICS '21: 2021 International Conference on Supercomputing, 2021
Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors.
Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021
Proceedings of the 2021 IEEE 23rd Int Conf on High Performance Computing & Communications; 7th Int Conf on Data Science & Systems; 19th Int Conf on Smart City; 7th Int Conf on Dependability in Sensor, 2021
Proceedings of the IEEE International Conference on Cluster Computing, 2021
PriPro: Towards Effective Privacy Protection on Edge-Cloud System running DNN Inference.
Proceedings of the 21st IEEE/ACM International Symposium on Cluster, 2021
2020
IEEE Trans. Parallel Distributed Syst., 2020
IEEE Trans. Parallel Distributed Syst., 2020
IEEE Trans. Parallel Distributed Syst., 2020
IEEE Trans. Netw. Serv. Manag., 2020
CoRR, 2020
Proceedings of the Supercomputing Frontiers - 6th Asian Conference, 2020
Proceedings of the International Conference for High Performance Computing, 2020
Proceedings of the International Conference for High Performance Computing, 2020
Proceedings of the NOMS 2020, 2020
Extremely Low-bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures.
Proceedings of the ICPP 2020: 49th International Conference on Parallel Processing, 2020
Proceedings of the 2020 International Conference on Information Networking, 2020
Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2020
Proceedings of the 16th International Conference on Network and Service Management, 2020
swRodinia: A Benchmark Suite for Exploiting Architecture Properties of Sunway Processor.
Proceedings of the Benchmarking, Measuring, and Optimizing, 2020
2019
Accelerating in-memory transaction processing using general purpose graphics processing units.
Future Gener. Comput. Syst., 2019
A novel index system describing program runtime characteristics for workload consolidation.
Frontiers Comput. Sci., 2019
Comput. Sci. Eng., 2019
CoRR, 2019
CoRR, 2019
CoRR, 2019
CCF Trans. High Perform. Comput., 2019
Performance Evaluation and Analysis of Linear Algebra Kernels in the Prototype Tianhe-3 Cluster.
Proceedings of the Supercomputing Frontiers - 5th Asian Conference, 2019
Modeling Power Consumption of The Code Execution Using Performance Counters Statistics.
Proceedings of the 20th International Conference on Parallel and Distributed Computing, 2019
Multiple Algorithms Against Multiple Hardware Architectures: Data-Driven Exploration on Deep Convolution Neural Network.
Proceedings of the Network and Parallel Computing, 2019
Proceedings of the Algorithms and Architectures for Parallel Processing, 2019
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019
Anomaly Detection Models Based on Context-Aware Sequential Long Short-Term Memory Learning.
Proceedings of the 2019 IEEE Global Communications Conference, 2019
L-DAG: Enabling Loopy Workflow in Scientific Application with Automatic DAG Transformation.
Proceedings of the 2019 IEEE Intl Conf on Dependable, 2019
Proceedings of the 2019 IEEE International Conference on Cluster Computing, 2019
Proceedings of the 16th ACM International Conference on Computing Frontiers, 2019
Proceedings of The 11th Asian Conference on Machine Learning, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
IEEE Trans. Parallel Distributed Syst., 2018
SRAM- and STT-RAM-based hybrid, shared last-level cache for on-chip CPU-GPU heterogeneous architectures.
J. Supercomput., 2018
T1000: Mitigating the memory footprint of convolution neural networks with decomposition and re-fusion.
Future Gener. Comput. Syst., 2018
Proceedings of the 51st Annual IEEE/ACM International Symposium on Microarchitecture, 2018
Estimating Software Energy Consumption with Machine Learning Approach by Software Performance Feature.
Proceedings of the IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, 2018
Proceedings of the IEEE International Conference on Parallel & Distributed Processing with Applications, 2018
Proceedings of the 20th IEEE International Conference on High Performance Computing and Communications; 16th IEEE International Conference on Smart City; 4th IEEE International Conference on Data Science and Systems, 2018
Proceedings of the 14th International Conference on Network and Service Management, 2018
Performance Analysis and Optimization of Cyro-EM Structure Determination in RELION-2.
Proceedings of the Advanced Computer Architecture - 12th Conference, 2018
2017
PSOM: Periodic Self-Organizing Maps for unsupervised anomaly detection in periodic time series.
Proceedings of the 25th IEEE/ACM International Symposium on Quality of Service, 2017
iDPL: A scalable and flexible inter-continental testbed for data placement research and experiment.
Proceedings of the 2017 IEEE Symposium on Computers and Communications, 2017
PowerChief: Intelligent Power Allocation for Multi-Stage Applications to Improve Responsiveness on Power Constrained CMP.
Proceedings of the 44th Annual International Symposium on Computer Architecture, 2017
Proceedings of the 13th International Conference on Network and Service Management, 2017
Proceedings of the 2017 IEEE International Conference on Computer and Information Technology, 2017
2016
Coordinating workload balancing and power switching in renewable energy powered data center.
Frontiers Comput. Sci., 2016
Proceedings of the Network and Parallel Computing, 2016
Proceedings of the 24th IEEE/ACM International Symposium on Quality of Service, 2016
Proceedings of the 2016 International Conference on Supercomputing, 2016
Proceedings of the 18th IEEE International Conference on High Performance Computing and Communications; 14th IEEE International Conference on Smart City; 2nd IEEE International Conference on Data Science and Systems, 2016
Proceedings of the ACM International Conference on Computing Frontiers, CF'16, 2016
Efficient Power Allocation under Global Power Cap and Application-Level Power Budget.
Proceedings of the 7th International Conference on Cloud Computing and Big Data, 2016
VinaSC: Scalable Autodock Vina with fine-grained scheduling on heterogeneous platform.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2016
2015
Sci. China Inf. Sci., 2015
Proceedings of the 23rd IEEE International Symposium on Quality of Service, 2015
Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015
Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015
Proceedings of the 4th IEEE International Conference on Cloud Networking, 2015
Proceedings of the International Conference on Cloud Computing and Big Data, 2015
Proceedings of the International Conference on Cloud Computing and Big Data, 2015
2014
Future Gener. Comput. Syst., 2014
Proceedings of the Trustworthy Computing and Services - International Conference, 2014
Proceedings of the Trustworthy Computing and Services - International Conference, 2014
Information Integration of Heterogeneous Employment Service Information of College Graduates.
Proceedings of the Trustworthy Computing and Services - International Conference, 2014
Proceedings of the Trustworthy Computing and Services - International Conference, 2014
Proceedings of the 2014 IEEE International Conference on High Performance Computing and Communications, 2014
Lessons from Experimental Methodology of Cache Hierarchy Changes with the Memory Technology.
Proceedings of the 17th IEEE International Conference on Computational Science and Engineering, 2014
A Network Performance Based Data Placement Policy in Distributed Data-Intensive Applications.
Proceedings of the 14th IEEE International Conference on Computer and Information Technology, 2014
2013
Proceedings of the Trustworthy Computing and Services, 2013
Proceedings of the Trustworthy Computing and Services, 2013
Proceedings of the 2013 IFIP/IEEE International Symposium on Integrated Network Management (IM 2013), 2013
Proceedings of the Algorithms and Architectures for Parallel Processing, 2013
M&C: A Software Solution to Reduce Errors Caused by Incoherent Caches on GPUs in Unstructured Graphic Algorithm.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2013
Proceedings of the Algorithms and Architectures for Parallel Processing, 2013
Using Paralleled-PEs Method to Resolve the Bursting Data in Distributed Stream Processing System.
Proceedings of the 16th IEEE International Conference on Computational Science and Engineering, 2013
Proceedings of the 2013 International Conference on Cloud and Service Computing, 2013
Proceedings of the 2013 International Conference on Cloud and Service Computing, 2013
Proceedings of the 2013 International Conference on Cloud and Service Computing, 2013
Empowering Designers to Estimate Function-Level Power for Developing Green Applications.
Proceedings of the 2013 International Conference on Cloud and Service Computing, 2013
2012
IEICE Trans. Commun., 2012
Proceedings of the 13th International Conference on Parallel and Distributed Computing, 2012
Proceedings of the 15th International Conference on Network-Based Information Systems, 2012
Providing High Availability for Distributed Stream Processing Application with Replica Placement.
Proceedings of the 15th International Conference on Network-Based Information Systems, 2012
Proceedings of the Trustworthy Computing and Services - International Conference, ISCTCS 2012, Beijing, China, May 28, 2012
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012
Proceedings of the 41st International Conference on Parallel Processing Workshops, 2012
Proceedings of the 41st International Conference on Parallel Processing Workshops, 2012
Proceedings of the 2012 IEEE International Conference on Cluster Computing Workshops, 2012
Proceedings of the 2012 IEEE International Conference on Cluster Computing Workshops, 2012
2011
Enhancing cooperation with multiple stage auctions in opportunistic routing for wireless mesh networks.
Proceedings of the 12th IFIP/IEEE International Symposium on Integrated Network Management, 2011
Proceedings of the IEEE Ninth International Conference on Dependable, 2011
Proceedings of the 7th International Conference on Network and Service Management, 2011
Proceedings of the 7th International Conference on Network and Service Management, 2011
CDebugger: A scalable parallel debugger with dynamic communication topology configuration.
Proceedings of the 2011 International Conference on Cloud and Service Computing, 2011
Proceedings of the 2011 International Conference on Cloud and Service Computing, 2011
2010
Proceedings of the Algorithms and Architectures for Parallel Processing, 2010
Proceedings of the Algorithms and Architectures for Parallel Processing, 2010
Proceedings of the 12th IEEE International Conference on High Performance Computing and Communications, 2010
ORSP: An Efficient Resource Acquisition Policy for Peer-to-Peer Mesh Streaming Systems.
Proceedings of the GCC 2010, 2010
Proceedings of the GCC 2010, 2010
2009
Proceedings of the NPC 2009, 2009
Proceedings of the International Conference on Networking, Architecture, and Storage, 2009
A Two-Phase Log-Based Fault Recovery Mechanism in Master/Worker Based Computing Environment.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2009
R-ECS: reliable elastic computing services for building virtual computing environment.
Proceedings of the 2nd International Conference on Interaction Sciences: Information Technology, 2009
Proceedings of the Scalable Information Systems, 4th International ICST Conference, 2009
Proceedings of the 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, 2009
2008
Design of a Sequential Tentative Hold Protocol for Efficient Coordination of Web Services Transaction.
Proceedings of the IEEE International Conference on Networking, Sensing and Control, 2008
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications, 2008
Proceedings of the Seventh International Conference on Grid and Cooperative Computing, 2008
DDGrid: A Grid Computing Environment with Massive Concurrency and Fault-Tolerance Support.
Proceedings of the Seventh International Conference on Grid and Cooperative Computing, 2008
2007
Proceedings of The 2nd IEEE Asia-Pacific Services Computing Conference, 2007
2006
Proceedings of the Fifth International Conference on Networking and the International Conference on Systems (ICN / ICONS / MCL 2006), 2006
2004
Proceedings of the Grid and Cooperative Computing, 2004
2003
To Manage Grid Using Dynamically Constructed Network Management Concept: An Early Thought.
Proceedings of the Grid and Cooperative Computing, Second International Workshop, 2003
Proceedings of the Advanced Parallel Programming Technologies, 5th International Workshop, 2003