Junshi Chen

Orcid: 0000-0002-1430-9899

According to our database1, Junshi Chen authored at least 52 papers between 2003 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Wiometrics: Comparative Performance of Artificial Neural Networks for Wireless Navigation.
IEEE Trans. Veh. Technol., September, 2024

SWattention: designing fast and memory-efficient attention for a new Sunway Supercomputer.
J. Supercomput., July, 2024

Uncovering the performance bottleneck of modern HPC processor with static code analyzer: a case study on Kunpeng 920.
CCF Trans. High Perform. Comput., June, 2024

Extending the limit of LR-TDDFT on two different approaches: Numerical algorithms and new Sunway heterogeneous supercomputer.
Parallel Comput., 2024

PWDFT-SW: Extending the Limit of Plane-Wave DFT Calculations to 16K Atoms on the New Sunway Supercomputer.
CoRR, 2024

Pruner: An Efficient Cross-Platform Tensor Compiler with Dual Awareness.
CoRR, 2024

Enabling 13K-Atom Excited-State GW Calculations via Low-Rank Approximations and HPC on the New Sunway Supercomputer.
Proceedings of the International Conference for High Performance Computing, 2024

DB-SpGEMM: A Massively Distributed Block-Sparse Matrix-Matrix Multiplication for Linear-Scaling DFT Calculations.
Proceedings of the 53rd International Conference on Parallel Processing, 2024

Multi-level Load Balancing Strategies for Massively Parallel Smoothed Particle Hydrodynamics Simulation.
Proceedings of the 53rd International Conference on Parallel Processing, 2024

A<sup>3</sup>PIM: An Automated, Analytic and Accurate Processing-in-Memory Offloader.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2024

2023
Deep learning representations for quantum many-body systems on heterogeneous hardware.
Mach. Learn. Sci. Technol., March, 2023

High performance computing for first-principles Kohn-Sham density functional theory towards exascale supercomputers.
CCF Trans. High Perform. Comput., March, 2023

swMPAS-A: Scaling MPAS-A to 39 Million Heterogeneous Cores on the New Generation Sunway Supercomputer.
IEEE Trans. Parallel Distributed Syst., 2023

Flexible Density-based Multipath Component Clustering Utilizing Ground Truth Pose.
Proceedings of the 98th IEEE Vehicular Technology Conference, 2023

Extended FastSLAM Using Cellular Multipath Component Delays and Angular Information.
Proceedings of the 97th IEEE Vehicular Technology Conference, 2023

Establishing a Modeling System in 3-km Horizontal Resolution for Global Atmospheric Circulation triggered by Submarine Volcanic Eruptions with 400 Billion Smoothed Particle Hydrodynamics.
Proceedings of the International Conference for High Performance Computing, 2023

SWSPH: A Massively Parallel SPH Implementation for Hundred-Billion-Particle Simulation on New Sunway Supercomputer.
Proceedings of the Euro-Par 2023: Parallel Processing - 29th International Conference on Parallel and Distributed Computing, Limassol, Cyprus, August 28, 2023

High-Resolution Channel Sounding and Parameter Estimation in Multi-Site Cellular Networks.
Proceedings of the 2023 Joint European Conference on Networks and Communications & 6G Summit, 2023

2022
Bridging the Gap between Deep Learning and Frustrated Quantum Spin System for Extreme-Scale Simulations on New Generation of Sunway Supercomputer.
IEEE Trans. Parallel Distributed Syst., 2022

Whole-genome sequencing and gene sharing network analysis powered by machine learning identifies antibiotic resistance sharing between animals, humans and environment in livestock farming.
PLoS Comput. Biol., 2022

Urban Navigation with LTE using a Large Antenna Array and Machine Learning.
Proceedings of the 95th IEEE Vehicular Technology Conference, 2022

AI for Quantum Mechanics: High Performance Quantum Many-Body Simulations via Deep Learning.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

2.5 Million-Atom Ab Initio Electronic-Structure Simulation of Complex Metallic Heterostructures with DGDFT.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

Accelerating Parallel First-Principles Excited-State Calculation by Low-Rank Approximation with K-Means Clustering.
Proceedings of the 51st International Conference on Parallel Processing, 2022

High-Performance Matrix Multiplication on the New Generation Shenwei Processor.
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022

Machine Learning-enabled Performance Model for DNN Applications and AI Accelerator.
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022

Quantifying Throughput of Basic Blocks on ARM Microarchitectures by Static Code Analyzers: A Case Study on Kunpeng 920.
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022

2021
Towards Efficient Short-Range Pair Interaction on Sunway Many-Core Architecture.
J. Comput. Sci. Technol., 2021

swFLOW: A large-scale distributed framework for deep learning on Sunway TaihuLight supercomputer.
Inf. Sci., 2021

RDMA-Based Apache Storm for High-Performance Stream Data Processing.
Int. J. Parallel Program., 2021

Symplectic structure-preserving particle-in-cell whole-volume simulation of tokamak plasmas to 111.3 trillion particles and 25.7 billion grids.
Proceedings of the International Conference for High Performance Computing, 2021

2020
Distributed deep learning system for cancerous region detection on Sunway TaihuLight.
CCF Trans. High Perform. Comput., 2020

SLAM using LTE Multipath Component Delays.
Proceedings of the 91st IEEE Vehicular Technology Conference, 2020

RDMA-Based Apache Storm for High-Performance Stream Data Processing.
Proceedings of the Network and Parallel Computing, 2020

An Efficient Multi-GPU Implementation for Linear-Response Time-Dependent Density Functional Theory.
Proceedings of the 22nd IEEE International Conference on High Performance Computing and Communications; 18th IEEE International Conference on Smart City; 6th IEEE International Conference on Data Science and Systems, 2020

Optimizing Astrophysical Simulation Software on Sunway Heterogeneous Manycore Architecture.
Proceedings of the 22nd IEEE International Conference on High Performance Computing and Communications; 18th IEEE International Conference on Smart City; 6th IEEE International Conference on Data Science and Systems, 2020

2019
众核平台上广度优先搜索算法的优化 (Optimization of Breadth-first Search Algorithm Based on Many-core Platform).
计算机科学, 2019

Improving the Performance of MongoDB with RDMA.
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019

TripletRun: A Dataflow Runtime Simulator and Its Performance Model.
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019

Redesign NAMD Molecular Dynamics Non-Bonded Force-Field on Sunway Manycore Processor.
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019

An effective method for operations placement in Tensor Flow.
Proceedings of the 3rd International Conference on High Performance Compilation, 2019

2018
PEPS++: Towards Extreme-Scale Simulations of Strongly Correlated Quantum Many-Particle Models on Sunway TaihuLight.
IEEE Trans. Parallel Distributed Syst., 2018

2017
A Dataflow-Based Runtime Support on a 100P Actual System.
Proceedings of the 2017 IEEE International Symposium on Parallel and Distributed Processing with Applications and 2017 IEEE International Conference on Ubiquitous Computing and Communications (ISPA/IUCC), 2017

Refactoring the Molecular Docking Simulation for Heterogeneous, Manycore Processors Systems.
Proceedings of the 2017 IEEE International Symposium on Parallel and Distributed Processing with Applications and 2017 IEEE International Conference on Ubiquitous Computing and Communications (ISPA/IUCC), 2017

A hierarchical grid algorithm for accelerating high-performance conjugate gradient benchmark on sunway many-core processor.
Proceedings of the 3rd International Conference on Communication and Information Processing, 2017

Pipelining Computation and Optimization Strategies for Scaling GROMACS on the Sunway Many-Core Processor.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2017

2015
Local State Reusing for Efficient Model Checking of Multithreaded Programs.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2015

2012
VSCP: A Cache Controlling Method for Improving Single Thread Performance in Multicore System.
Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, 2012

2010
Performance Analysis of Multicarrier DS-CDMA with Antenna Array in a Fading Channel.
Wirel. Pers. Commun., 2010

2007
Performance Analysis of MT-CDMA System with Antenna Array in a Multipath Fading Channel.
Wirel. Pers. Commun., 2007

Performance of space-frequency combining at the antenna array of a MC-CDMA system.
Wirel. Pers. Commun., 2007

2003
An ESPRIT based DOA estimation for CDMA frequency-selective fading channels.
Proceedings of the IEEE 14th International Symposium on Personal, 2003


  Loading...