Guojing Cong

Orcid: 0000-0003-0850-7714

According to our database¹, Guojing Cong authored at least 88 papers between 2004 and 2024.

Collaborative distances:

Dijkstra number² of three.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

Predicting Drug Effects from High-Dimensional, Asymmetric Drug Datasets by Using Graph Neural Networks: A Comprehensive Analysis of Multitarget Drug Effect Prediction.

[BibT_eX]

[DOI]

Avishek Bose

Guojing Cong

CoRR, 2024

Optimizing Distributed Training on Frontier for Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the ISC High Performance 2024 Research Paper Proceedings (39th International Conference), 2024

Comparative Study of Large Language Model Architectures on Frontier.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

Exploration of Novel Neuromorphic Methodologies for Materials Applications.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Neuromorphic Systems, 2024

Transductive Spiking Graph Neural Networks for Loihi.

[BibT_eX]

[DOI]

Proceedings of the Great Lakes Symposium on VLSI 2024, 2024

2023

Improving materials property predictions for graph neural networks with minimal feature engineering <sup>*</sup>.

[BibT_eX]

[DOI]

Guojing Cong

Victor Fung

Mach. Learn. Sci. Technol., September, 2023

AI-aided multiscale modeling of physiologically-significant blood clots.

[BibT_eX]

[DOI]

Comput. Phys. Commun., June, 2023

Optimizing Distributed Training on Frontier for Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies.

[BibT_eX]

[DOI]

Cindy Orozco Bohorquez

Massimiliano Lupo Pasini

CoRR, 2023

Hyperparameter Optimization and Feature Inclusion in Graph Neural Networks for Spiking Implementation.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning and Applications, 2023

Clustering High-dimensional Toxicogenomics Data with Rare Signals.

[BibT_eX]

[DOI]

Guojing Cong

Scott Auerbach

Proceedings of the IEEE International Conference on Data Mining, 2023

Clustering and GNN prediction with DrugMatrix.

[BibT_eX]

[DOI]

Jiaji Ma

Guojing Cong

Scott Auerbach

Proceedings of the IEEE International Conference on Big Data, 2023

2022

Scalable multiscale modeling of platelets with 100 million particles.

[BibT_eX]

[DOI]

J. Supercomput., 2022

Prediction of CO<sub>2</sub> Adsorption in Nano-Pores with Graph Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2022

Neuromorphic Computing for Scientific Applications.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM Redefining Scalability for Diversely Heterogeneous Architectures Workshop, 2022

Exaflops Biomedical Knowledge Graph Analytics.

[BibT_eX]

[DOI]

Proceedings of the SC22: International Conference for High Performance Computing, 2022

Semi-Supervised Graph Structure Learning on Neuromorphic Computers.

[BibT_eX]

[DOI]

Proceedings of the ICONS 2022: International Conference on Neuromorphic Systems, Knoxville, TN, USA, July 27, 2022

Augmenting Graph Convolution with Distance Preserving Embedding for Improved Learning.

[BibT_eX]

[DOI]

Guojing Cong

Seung-Hwan Lim

Steven Young

Proceedings of the IEEE International Conference on Data Mining Workshops, 2022

Extensive Attention Mechanisms in Graph Neural Networks for Materials Discovery.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Data Mining Workshops, 2022

2021

Artificial intelligence for accelerating time integrations in multiscale modeling.

[BibT_eX]

[DOI]

J. Comput. Phys., 2021

CASTELO: clustered atom subtypes aided lead optimization - a combined machine learning and molecular modeling method.

[BibT_eX]

[DOI]

BMC Bioinform., 2021

Enabling AI-Accelerated Multiscale Modeling of Thrombogenesis at Millisecond and Molecular Resolutions on Supercomputers.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing - 36th International Conference, 2021

Versatile feature learning with graph convolutions and graph structures.

[BibT_eX]

[DOI]

Guojing Cong

Seung-Hwan Lim

Proceedings of the 2021 International Conference on Data Mining, 2021

Elastic distributed training with fast convergence and efficient resource utilization.

[BibT_eX]

[DOI]

Guojing Cong

Proceedings of the 20th IEEE International Conference on Machine Learning and Applications, 2021

Visual Understanding of COVID-19 Knowledge Graph for Predictive Analysis.

[BibT_eX]

[DOI]

Seung-Hwan Lim

Junghoon Chae

Guojing Cong

Drahomira Herrmannova

Robert M. Patton

Ramakrishnan Kannan

Thomas E. Potok

Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

2020

Accelerate Distributed Stochastic Descent for Nonconvex Optimization with Momentum.

[BibT_eX]

[DOI]

Guojing Cong

Tianyi Liu

Proceedings of the 6th IEEE/ACM Workshop on Machine Learning in High Performance Computing Environments, 2020

Fast Training of Deep Neural Networks for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Partial data permutation for training deep neural networks.

[BibT_eX]

[DOI]

Guojing Cong

Li Zhang

Chih-Chieh Yang

Proceedings of the 20th IEEE/ACM International Symposium on Cluster, 2020

Design of AI-Enhanced Drug Lead Optimization Workflow for HPC and Cloud.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

2019

Fast neural network training on a cluster of GPUs for action recognition with high accuracy.

[BibT_eX]

[DOI]

J. Parallel Distributed Comput., 2019

A Distributed Hierarchical SGD Algorithm with Sparse Global Reduction.

[BibT_eX]

[DOI]

Fan Zhou

Guojing Cong

CoRR, 2019

Video Action Recognition With an Additional End-to-End Trained Temporal Stream.

[BibT_eX]

[DOI]

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Preparation and optimization of a diverse workload for a large-scale heterogeneous system.

[BibT_eX]

[DOI]

Ian Karlin

Yoonho Park

Bronis R. de Supinski

Sara Kokkila Schumacher

Guillaume Thomas-Collignon

Proceedings of the International Conference for High Performance Computing, 2019

Reducing global reductions in large-scale distributed training.

[BibT_eX]

[DOI]

Guojing Cong

Chih-Chieh Yang

Fan Zhou

Proceedings of the 48th International Conference on Parallel Processing, 2019

Accelerating Data Loading in Deep Neural Network Training.

[BibT_eX]

[DOI]

Chih-Chieh Yang

Guojing Cong

Proceedings of the 26th IEEE International Conference on High Performance Computing, 2019

2018

Accelerating Deep Neural Network Training for Action Recognition on a Cluster of GPUs.

[BibT_eX]

[DOI]

Proceedings of the 30th International Symposium on Computer Architecture and High Performance Computing, 2018

On the Convergence Properties of a K-step Averaging Stochastic Gradient Descent Algorithm for Nonconvex Optimization.

[BibT_eX]

[DOI]

Fan Zhou

Guojing Cong

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

2017

Foreword to the special issue of the 18th IEEE international conference on computational science and engineering (CSE2015).

[BibT_eX]

[DOI]

Christian Plessl

Guojing Cong

João M. P. Cardoso

Concurr. Comput. Pract. Exp., 2017

Accelerating deep neural network learning for speech recognition on a cluster of GPUs.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning on HPC Environments, 2017

An Efficient, Distributed Stochastic Gradient Descent Algorithm for Deep-Learning Applications.

[BibT_eX]

[DOI]

Guojing Cong

Onkar Bhardwaj

Minwei Feng

Proceedings of the 46th International Conference on Parallel Processing, 2017

A Hierarchical, Bulk-Synchronous Stochastic Gradient Descent Algorithm for Deep-Learning Applications on GPU Clusters.

[BibT_eX]

[DOI]

Guojing Cong

Onkar Bhardwaj

Proceedings of the 16th IEEE International Conference on Machine Learning and Applications, 2017

2016

Practical Efficiency of Asynchronous Stochastic Gradient Descent.

[BibT_eX]

[DOI]

Onkar Bhardwaj

Guojing Cong

Proceedings of the 2nd Workshop on Machine Learning in HPC Environments, 2016

Composable Locality Optimizations for Accelerating Parallel Forest Computations.

[BibT_eX]

[DOI]

Guojing Cong

Ilie Gabriel Tanase

Proceedings of the 18th IEEE International Conference on High Performance Computing and Communications; 14th IEEE International Conference on Smart City; 2nd IEEE International Conference on Data Science and Systems, 2016

2015

Parallelism-centric optimization and performance study of a finance aggregation engine on modern NUMA systems.

[BibT_eX]

[DOI]

Proceedings of the 8th Workshop on High Performance Computational Finance, 2015

Memory Centric Computation (Mc2) for Large-Scale Graph Processing.

[BibT_eX]

[DOI]

Kattamuri Ekanadham

Guojing Cong

Proceedings of the 27th International Symposium on Computer Architecture and High Performance Computing, 2015

Parallel Strategies for Solving Large Unit Commitment Problems in the California ISO Planning Model.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

Accelerating Minimum Spanning Forest Computations on Multicore Platforms.

[BibT_eX]

[DOI]

Guojing Cong

Ilie Gabriel Tanase

Yinglong Xia

Proceedings of the Euro-Par 2015: Parallel Processing Workshops, 2015

2014

A Synchronous Parallel Max-Flow Algorithm for Real-World Networks.

[BibT_eX]

[DOI]

Guojing Cong

Proceedings of the 2014 IEEE International Conference on High Performance Computing and Communications, 2014

Fast Parallel Connected Components Algorithms on GPUs.

[BibT_eX]

[DOI]

Guojing Cong

Paul Muzio

Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014

2013

Maximizing the performance of irregular applications on multithreaded, NUMA systems.

[BibT_eX]

[DOI]

Guojing Cong

Hui-Fang Wen

Proceedings of the 3rd Workshop on Irregular Applications - Architectures and Algorithms, 2013

Mapping applications for high performance on multithreaded, NUMA systems.

[BibT_eX]

[DOI]

Guojing Cong

Hui-Fang Wen

Proceedings of the Computing Frontiers Conference, 2013

2012

A Systematic Approach toward Automated Performance Analysis and Tuning.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2012

Application data prefetching on the IBM blue gene/Q supercomputer.

[BibT_eX]

[DOI]

Proceedings of the SC Conference on High Performance Computing Networking, 2012

A static analysis tool using a three-step approach for data races in HPC programs.

[BibT_eX]

[DOI]

Proceedings of the 10th Workshop on Parallel and Distributed Systems: Testing, 2012

An Efficient Framework for Multi-dimensional Tuning of High Performance Computing Applications.

[BibT_eX]

[DOI]

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

Optimizing Large-scale Graph Analysis on Multithreaded, Multicore Platforms.

[BibT_eX]

[DOI]

Guojing Cong

Konstantin Makarychev

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

Tool-assisted Optimization of Shared-memory Accesses in UPC Applications.

[BibT_eX]

[DOI]

Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, 2012

2011

Hybrid Programming With SIMPLE.

[BibT_eX]

[DOI]

Guojing Cong

David A. Bader

Proceedings of the Encyclopedia of Parallel Computing, 2011

SWARM: A Parallel Programming Framework for Multicore Processors.

[BibT_eX]

[DOI]

David A. Bader

Guojing Cong

Proceedings of the Encyclopedia of Parallel Computing, 2011

Spanning Tree, Minimum Weight.

[BibT_eX]

[DOI]

David A. Bader

Guojing Cong

Proceedings of the Encyclopedia of Parallel Computing, 2011

Graph Algorithms.

[BibT_eX]

[DOI]

David A. Bader

Guojing Cong

Proceedings of the Encyclopedia of Parallel Computing, 2011

Optimizing Large-Scale Graph Analysis on a Multi-threaded, Multi-core Platform.

[BibT_eX]

[DOI]

Guojing Cong

Konstantin Makarychev

Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

2010

Workload performance characterization of DARPA HPCS benchmarks.

[BibT_eX]

[DOI]

Concurr. Comput. Pract. Exp., 2010

Fast PGAS Implementation of Distributed Graph Algorithms.

[BibT_eX]

[DOI]

Guojing Cong

George Almási

Vijay A. Saraswat

Proceedings of the Conference on High Performance Computing Networking, 2010

Application tuning through bottleneck-driven refactoring.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Guided Performance Analysis Combining Profile and Trace Tools.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2010 Parallel Processing Workshops, 2010

2009

Towards a framework for automated performance tuning.

[BibT_eX]

[DOI]

Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

A Holistic Approach towards Automated Performance Analysis and Tuning.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2009 Parallel Processing, 2009

Improving Memory Access Locality for Large-Scale Graph Analysis Applications.

[BibT_eX]

Guojing Cong

Konstantin Makarychev

Proceedings of the 22nd International Conference on Parallel and Distributed Computing and Communication Systems, 2009

2008

A scalable, asynchronous spanning tree algorithm on a cluster of SMPs.

[BibT_eX]

[DOI]

Guojing Cong

Hanhong Xue

Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

A framework for automated performance bottleneck detection.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Solving Large, Irregular Graph Problems Using Adaptive Work-Stealing.

[BibT_eX]

[DOI]

Guojing Cong

Sreedhar B. Kodali

Sriram Krishnamoorthy

Doug Lea

Vijay A. Saraswat

Tong Wen

Proceedings of the 2008 International Conference on Parallel Processing, 2008

2007

Design of Multithreaded Algorithms for Combinatorial Problems.

[BibT_eX]

[DOI]

Proceedings of the Handbook of Parallel Computing - Models, Algorithms and Applications., 2007

Efficient Parallel Graph Algorithms for Multicore and Multiprocessors.

[BibT_eX]

[DOI]

Guojing Cong

David A. Bader

Proceedings of the Handbook of Parallel Computing - Models, Algorithms and Applications., 2007

A productivity centered application performance tuning framework.

[BibT_eX]

[DOI]

Proceedings of the 2nd International Conference on Performance Evaluation Methodolgies and Tools, 2007

A Productivity Centered Tools Framework for Application Performance Tuning.

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference on the Quantitative Evaluaiton of Systems (QEST 2007), 2007

Techniques for Designing Efficient Parallel Graph Algorithms for SMPs and Multicore Processors.

[BibT_eX]

[DOI]

Guojing Cong

David A. Bader

Proceedings of the Parallel and Distributed Processing and Applications, 2007

A Selective Pro ling Tool: Towards Automatic Performance Tuning.

[BibT_eX]

[DOI]

Abhinav Bhatele

Guojing Cong

Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

2006

Designing irregular parallel algorithms with mutual exclusion and lock-free protocols.

[BibT_eX]

[DOI]

Guojing Cong

David A. Bader

J. Parallel Distributed Comput., 2006

Fast shared-memory algorithms for computing the minimum spanning forest of sparse graphs.

[BibT_eX]

[DOI]

David A. Bader

Guojing Cong

J. Parallel Distributed Comput., 2006

A Study on the Locality Behavior of Minimum Spanning Tree Algorithms.

[BibT_eX]

[DOI]

Guojing Cong

Simone Sbaraglia

Proceedings of the High Performance Computing, 2006

2005

A fast, parallel spanning tree algorithm for symmetric multiprocessors (SMPs).

[BibT_eX]

[DOI]

David A. Bader

Guojing Cong

J. Parallel Distributed Comput., 2005

An Experimental Study of Parallel Biconnected Components Algorithms on Symmetric Multiprocessors (SMPs).

[BibT_eX]

[DOI]

Guojing Cong

David A. Bader

Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005

On the Architectural Requirements for Efficient Execution of Graph Algorithms.

[BibT_eX]

[DOI]

David A. Bader

Guojing Cong

John Feo

Proceedings of the 34th International Conference on Parallel Processing (ICPP 2005), 2005

An Empirical Analysis of Parallel Random Permutation Algorithms ON SMPs.

[BibT_eX]

Guojing Cong

David A. Bader

Proceedings of the ISCA 18th International Conference on Parallel and Distributed Computing Systems, 2005

2004

A Fast, Parallel Spanning Tree Algorithm for Symmetric Multiprocessors.

[BibT_eX]

[DOI]

David A. Bader

Guojing Cong

Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

The Euler Tour Technique and Parallel Rooted Spanning Tree.

[BibT_eX]

[DOI]

Guojing Cong

David A. Bader

Proceedings of the 33rd International Conference on Parallel Processing (ICPP 2004), 2004

Lock-Free Parallel Algorithms: An Experimental Study.

[BibT_eX]

[DOI]

Guojing Cong

David A. Bader

Proceedings of the High Performance Computing, 2004

Guojing Cong

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...