Haojie Wang

Orcid: 0000-0003-4605-148X

According to our database, Haojie Wang authored at least 67 papers between 2012 and 2025.

Bibliography

2025
An efficient CNN accelerator for pattern-compressed sparse neural networks on FPGA.
Neurocomputing, 2025

2024
Graph-Centric Performance Analysis for Large-Scale Parallel Applications.
IEEE Trans. Parallel Distributed Syst., July, 2024

Firefighting Water Jet Trajectory Detection from Unmanned Aerial Vehicle Imagery Using Learnable Prompt Vectors.
Sensors, June, 2024

A Study on Predicting the Deviation of Jet Trajectory Falling Point under the Influence of Random Wind.
Sensors, June, 2024

A Multiharmonic Sources Localization Algorithm Based on ICA and Posterior Harmonic Admittance.
IEEE Trans. Instrum. Meas., 2024

A high-performance dataflow-centric optimization framework for deep learning inference on the edge.
J. Syst. Archit., 2024

PLANET: A Multi-objective Graph Neural Network Model for Protein-Ligand Binding Affinity Prediction.
J. Chem. Inf. Model., 2024

ARIC: An Activity Recognition Dataset in Classroom Surveillance Images.
CoRR, 2024

Optimal Kernel Orchestration for Tensor Programs with Korch.
CoRR, 2024

Wind-induced response of rapeseed seedling stage and lodging prediction based on UAV imagery and machine learning methods.
Comput. Electron. Agric., 2024

Research on Gangue Detection Algorithm Based on Cross-Scale Feature Fusion and Dynamic Pruning.
Algorithms, 2024

MAGPY: Compiling Eager Mode DNN Programs by Monitoring Execution States.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024

WiseGraph: Optimizing GNN with Joint Workload Partition of Graph and Operations.
Proceedings of the Nineteenth European Conference on Computer Systems, 2024

AdaPipe: Optimizing Pipeline Parallelism with Adaptive Recomputation and Partitioning.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

Optimal Kernel Orchestration for Tensor Programs with Korch.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
Optimizing DNNs With Partially Equivalent Transformations and Automated Corrections.
IEEE Trans. Computers, December, 2023

Multiple Nyström Kernel Adaptive Filter Under Minimum Generalized Cauchy Loss Criterion.
IEEE Trans. Circuits Syst. II Express Briefs, April, 2023

Unified Programming Models for Heterogeneous High-Performance Computers.
J. Comput. Sci. Technol., February, 2023

Fiber Optic Sensing Technology and Vision Sensing Technology for Structural Health Monitoring.
Sensors, 2023

Numerical computation of the stress concentration between closely located stiff inclusions of general shapes.
CoRR, 2023

PowerFusion: A Tensor Compiler with Explicit Data Movement Description and Instruction-level Graph IR.
CoRR, 2023

Implications on managing inventory systems for products with stock-dependent demand and nonlinear holding cost via the adaptive EOQ policy.
Comput. Oper. Res., 2023

Adaptive Massive MIMO Hybrid Precoding Based on Meta Learning.
Proceedings of the International Conference on Wireless Communications and Signal Processing, 2023

GraphSet: High Performance Graph Mining through Equivalent Set Transformations.
Proceedings of the International Conference for High Performance Computing, 2023

EINNET: Optimizing Tensor Programs with Derivation-Based Transformations.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

Where to Forget: A New Attention Stability Metric for Continual Learning Evaluation.
Proceedings of the Digital Multimedia Communications, 2023

Design of Fall Detection System based on YOLOv3.
Proceedings of the 2023 7th International Conference on Electronic Information Technology and Computer Engineering, 2023

Xenos: Dataflow-Centric Optimization to Accelerate Model Inference on Edge Devices.
Proceedings of the Database Systems for Advanced Applications, 2023

2022
Detecting Performance Variance for Parallel Applications Without Source Code.
IEEE Trans. Parallel Distributed Syst., 2022

TC-Stream: Large-Scale Graph Triangle Counting on a Single Machine Using GPUs.
IEEE Trans. Parallel Distributed Syst., 2022

L1-norm constraint kernel adaptive filtering framework for precise and robust indoor localization under the internet of things.
Inf. Sci., 2022

OLLIE: Derivation-based Tensor Program Optimizer.
CoRR, 2022

UniQ: A Unified Programming Model for Efficient Quantum Circuit Simulation.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

Vapro: performance variance detection and diagnosis for production-run parallel applications.
Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022

BaGuaLu: targeting brain scale pretrained models with over 37 million cores.
Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022

PerFlow: a domain specific framework for automatic performance analysis of parallel applications.
Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022

FasterMoE: modeling and optimizing training of large-scale dynamic pre-trained models.
Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022

Scaling graph traversal to 281 trillion edges with 40 million cores.
Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022

FreeTensor: a free-form DSL with holistic optimizations for irregular tensor programs.
Proceedings of the PLDI '22: 43rd ACM SIGPLAN International Conference on Programming Language Design and Implementation, San Diego, CA, USA, June 13, 2022

Efficiently emulating high-bitwidth computation with low-bitwidth hardware.
Proceedings of the ICS '22: 2022 International Conference on Supercomputing, Virtual Event, June 28, 2022

An Efficient Sparse CNNs Accelerator on FPGA.
Proceedings of the IEEE International Conference on Cluster Computing, 2022

2021
A Robust Student's t-Based Kernel Adaptive Filter.
IEEE Trans. Circuits Syst. II Express Briefs, 2021

An Improved Timed Elastic Band (TEB) Algorithm of Autonomous Ground Vehicle (AGV) in Complex Environment.
Sensors, 2021

An Automated Snow Mapper Powered by Machine Learning.
Remote. Sens., 2021

Chukonu: A Fully-Featured Big Data Processing System by Efficiently Integrating a Native Compute Engine into Spark.
Proc. VLDB Endow., 2021

2020 CATARACTS Semantic Segmentation Challenge.
CoRR, 2021

LotusSQL: SQL engine for high-performance big data systems.
Big Data Min. Anal., 2021

PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections.
Proceedings of the 15th USENIX Symposium on Operating Systems Design and Implementation, 2021

Few-Shot Learning with Unlabeled Outlier Exposure.
Proceedings of the MultiMedia Modeling - 27th International Conference, 2021

Cross-Domain Landmarks Detection in Mitral Regurgitation.
Proceedings of the Deep Generative Models, and Data Augmentation, Labelling, and Imperfections, 2021

HyQuas: hybrid partitioner based quantum circuit simulation system on GPU.
Proceedings of the ICS '21: 2021 International Conference on Supercomputing, 2021

Sparker: Efficient Reduction for More Scalable Machine Learning with Spark.
Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021

2020
ScalAna: automating scaling loss detection with graph analysis.
Proceedings of the International Conference for High Performance Computing, 2020

Identifying scalability bottlenecks for large-scale parallel programs with graph analysis.
Proceedings of the PPoPP '20: 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2020

Hand Pose Estimation for Hand-Object Interaction Cases using Augmented Autoencoder.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Few-Shot Classification with Transductive Data Clustering Transformation.
Proceedings of the Neural Information Processing - 27th International Conference, 2020

Learning Class Prototypes Via Anisotropic Combination of Aligned Modalities for Few-Shot Learning.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

2019
Spread-n-share: improving application performance and cluster throughput with resource-aware job placement.
Proceedings of the International Conference for High Performance Computing, 2019

A Stacked Global-Shutter CMOS Imager with SC-Type Hybrid-GS Pixel and Self-Knee Point Calibration Single Frame HDR and On-Chip Binarization Algorithm for Smart Vision Applications.
Proceedings of the IEEE International Solid-State Circuits Conference, 2019

Solving Six-Player Games via Online Situation Estimation.
Proceedings of the 31st IEEE International Conference on Tools with Artificial Intelligence, 2019

The Multiple parameters double fuzzy decoupling PID algorithm for pig breeding system.
Proceedings of the 6th International Conference on Systems and Informatics, 2019

2018
Spindle: Informed Memory Access Monitoring.
Proceedings of the 2018 USENIX Annual Technical Conference, 2018

ExtTra: Short-Term Traffic Flow Prediction Based on Extremely Randomized Trees.
Proceedings of the Neural Information Processing - 25th International Conference, 2018

2017
The influence of internal current loop on transient response performance of I-V droop controlled paralleled DC-DC converters.
Proceedings of the IECON 2017 - 43rd Annual Conference of the IEEE Industrial Electronics Society, Beijing, China, October 29, 2017

2016
A Dual Threshold Secret Sharing Scheme among Weighted Participants of Special Right.
Proceedings of the IEEE First International Conference on Data Science in Cyberspace, 2016

2013
Combining frequent 2-itemsets and statistical features for texture classification in wavelet domain.
Proceedings of the International Conference on Advances in Computing, 2013

2012
A new algorithm based on complex wavelet transform for protein sequence classification.
Proceedings of the 2012 International Conference on Advances in Computing, 2012

