Wenjing Ma

Orcid: 0000-0001-8757-651X

According to our database, Wenjing Ma authored at least 77 papers between 2006 and 2024.

Bibliography

2024
Adaptive Biomimetic Neuronal Circuit System Based on Myelin Sheath Function.
IEEE Trans. Consumer Electron., February, 2024

LogicPrpBank: A Corpus for Logical Implication and Equivalence.
CoRR, 2024

Logic-based Benders decomposition for order acceptance and scheduling on heterogeneous factories with carbon caps.
Comput. Oper. Res., 2024

Cotton-YOLO: Improved YOLOV7 for rapid detection of foreign fibers in seed cotton.
Comput. Electron. Agric., 2024

Structure preserving FEM for the perturbed wave equation of quantum mechanics.
Appl. Math. Lett., 2024

Real-time scheduling for two-stage assembly flowshop with dynamic job arrivals by deep reinforcement learning.
Adv. Eng. Informatics, 2024

Uncertainty-Aware Pre-Trained Foundation Models for Patient Risk Prediction via Gaussian Process.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

2023
Logic-based Benders decomposition for order acceptance and scheduling in distributed manufacturing.
Adv. Eng. Informatics, October, 2023

MFFT: A GPU Accelerated Highly Efficient Mixed-Precision Large-Scale FFT Framework.
ACM Trans. Archit. Code Optim., September, 2023

An Optimized Framework for Matrix Factorization on the New Sunway Many-core Platform.
ACM Trans. Archit. Code Optim., June, 2023

Evolving the HPL benchmark towards multi-GPGPU clusters.
CCF Trans. High Perform. Comput., March, 2023

Editorial for the special issue on new algorithms and software for E-scale high performance computing.
CCF Trans. High Perform. Comput., March, 2023

Publisher Correction: xMath2.0: a high-performance extended math library for SW26010-Pro many-core processor.
CCF Trans. High Perform. Comput., March, 2023

xMath2.0: a high-performance extended math library for SW26010-Pro many-core processor.
CCF Trans. High Perform. Comput., March, 2023

A Survey on Knowledge Graphs for Healthcare: Resources, Applications, and Promises.
CoRR, 2023

HiPrompt: Few-Shot Biomedical Knowledge Fusion via Hierarchy-Oriented Prompting.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

GFFT: a Task Graph Based Fast Fourier Transform Optimization Framework.
Proceedings of the 52nd International Conference on Parallel Processing, 2023

2022
Data mining method for monitoring students' distance learning behaviour based on decision tree.
Int. J. Data Min. Bioinform., 2022

Quantum-Inspired Distributed Memetic Algorithm.
Complex Syst. Model. Simul., 2022

LRcell: detecting the source of differential expression at the sub-cell-type level from bulk RNA-seq data.
Briefings Bioinform., 2022

EasyView: Enabling and Scheduling Tensor Views in Deep Learning Compilers.
Proceedings of the 51st International Conference on Parallel Processing, 2022

2021
High-Capacity Reversible Data Hiding in Encrypted Images using Adaptive Encoding.
CoRR, 2021

2020
Enabling Highly Efficient Batched Matrix Multiplications on SW26010 Many-core Processor.
ACM Trans. Archit. Code Optim., 2020

Optimal network selection algorithms under the multi-network coexistence environment based on attribute decision.
Int. J. Internet Protoc. Technol., 2020

Reversible Data Hiding in Encrypted Images Based on Bit plane Compression of Prediction Error.
CoRR, 2020

Solving a trillion unknowns per second with HPGMG on Sunway TaihuLight.
Clust. Comput., 2020

2019
Enabling Highly Efficient k-Means Computations on the SW26010 Many-Core Processor of Sunway TaihuLight.
J. Comput. Sci. Technol., 2019

2018
Extreme-Scale High-Order WENO Simulations of 3-D Detonation Wave with 10 Million Cores.
ACM Trans. Archit. Code Optim., 2018

Extreme-Scale Realistic Stencil Computations on Sunway TaihuLight with Ten Million Cores.
Proceedings of the 18th IEEE/ACM International Symposium on Cluster, 2018

2017
A Smartphone Camera-Based Indoor Positioning Algorithm of Crowded Scenarios with the Assistance of Deep CNN.
Sensors, 2017

Numerical simulations of migration and coalescence behavior of microvoids driven by diffusion and electric field in solder interconnects.
Microelectron. Reliab., 2017

Communication-aware task scheduling algorithm for heterogeneous computing.
Int. J. High Perform. Comput. Netw., 2017

Localized Fault Recovery for Nested Fork-Join Programs.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

26 PFLOPS Stencil Computations for Atmospheric Modeling on Sunway TaihuLight.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

Towards Highly Efficient DGEMM on the Emerging SW26010 Many-Core Processor.
Proceedings of the 46th International Conference on Parallel Processing, 2017

2016
Highly Optimized Code Generation for Stencil Codes with Computation Reuse for GPUs.
J. Comput. Sci. Technol., 2016

Bridging Semantic Gap Between App Names: Collective Matrix Factorization for Similar Mobile App Recommendation.
Proceedings of the Web Information Systems Engineering - WISE 2016, 2016

Data-Oriented Runtime Scheduling Framework on Multi-GPUs.
Proceedings of the 2016 IEEE Trustcom/BigDataSE/ISPA, 2016

HPSVM: Heterogeneous Parallel SVM with Factorization Based IPM Algorithm on CPU-GPU Cluster.
Proceedings of the 24th Euromicro International Conference on Parallel, 2016

GPU-FV: Realtime Fisher Vector and Its Applications in Video Monitoring.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

Online variational Bayesian Support Vector Regression.
Proceedings of the 2016 International Joint Conference on Neural Networks, 2016

GB-RC4: Effective brute force attacks on RC4 algorithm using GPU.
Proceedings of the Seventh International Green and Sustainable Computing Conference, 2016

Multi-Scale Fully Convolutional Network for Fast Face Detection.
Proceedings of the British Machine Vision Conference 2016, 2016

GLDA: Parallel Gibbs Sampling for Latent Dirichlet Allocation on GPU.
Proceedings of the Advanced Computer Architecture - 11th Conference, 2016

2015
Global transformations for legacy parallel applications via structural analysis and rewriting.
Parallel Comput., 2015

Detect Similar Mobile Applications with Transfer Learning.
Proceedings of the 2015 IEEE International Conference on Smart City/SocialCom/SustainCom/DataCom/SC2 2015, 2015

PE-TLD: Parallel Extended Tracking-Learning-Detection for Multi-target Tracking.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2015

2014
High performance two-dimensional phase unwrapping on GPUs.
Proceedings of the Computing Frontiers Conference, CF'14, 2014

2013
Optimizing tensor contraction expressions for hybrid CPU-GPU execution.
Clust. Comput., 2013

Study of web guide slippage phenomena in roll-to-roll embossing system.
Proceedings of the 10th IEEE International Conference on Control and Automation, 2013

2012
Compiler and runtime support for enabling reduction computations on heterogeneous systems.
Concurr. Comput. Pract. Exp., 2012

Ad serving using a compact allocation plan.
Proceedings of the 13th ACM Conference on Electronic Commerce, 2012

SHALE: an efficient algorithm for allocation of guaranteed display advertising.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

Water quality model parameters inversion based on improved stochastic optimization.
Proceedings of the 2012 IEEE International Geoscience and Remote Sensing Symposium, 2012

Data-driven fault tolerance for work stealing computations.
Proceedings of the International Conference on Supercomputing, 2012

GMProf: A low-overhead, fine-grained profiling approach for GPU programs.
Proceedings of the 19th International Conference on High Performance Computing, 2012

2011
Poster: FOX: a fault-oblivious extreme scale execution environment.
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, 2011

An execution strategy and optimized runtime support for parallelizing irregular reductions on modern GPUs.
Proceedings of the 25th International Conference on Supercomputing, 2011, Tucson, AZ, USA, May 31, 2011

Practical Loop Transformations for Tensor Contraction Expressions on Multi-level Memory Hierarchies.
Proceedings of the Compiler Construction - 20th International Conference, 2011

Parameterized Micro-benchmarking: An Auto-tuning Approach for Complex Applications.
Proceedings of the 2011 International Conference on Parallel Architectures and Compilation Techniques, 2011

2010
A Light-Size AKA Mechanism for Optimal Distributed AAA authorization Architecture.
Proceedings of the 71st IEEE Vehicular Technology Conference, 2010

AUTO-GC: Automatic translation of data mining applications to GPU clusters.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Compiler and runtime support for enabling generalized reduction computations on heterogeneous parallel configurations.
Proceedings of the 24th International Conference on Supercomputing, 2010

Parallelizing an Information Theoretic Co-clustering Algorithm Using a Cloud Middleware.
Proceedings of the ICDMW 2010, 2010

An integer programming framework for optimizing shared memory use on GPUs.
Proceedings of the 2010 International Conference on High Performance Computing, 2010

Approaches for parallelizing reductions on modern GPUs.
Proceedings of the 2010 International Conference on High Performance Computing, 2010

Acceleration of Streamed Tensor Contraction Expressions on GPGPU-Based Clusters.
Proceedings of the 2010 IEEE International Conference on Cluster Computing, 2010

Pricing guaranteed contracts in online display advertising.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

2009
Risk Adjusted Set Membership Identification of Wiener Systems.
IEEE Trans. Autom. Control., 2009

A compiler and runtime system for enabling data mining applications on gpus.
Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2009

Online allocation of display advertisements subject to advanced sales contracts.
Proceedings of the 3rd ACM SIGKDD Workshop on Data Mining and Audience Intelligence for Advertising, 2009

A translation system for enabling data mining applications on GPUs.
Proceedings of the 23rd international conference on Supercomputing, 2009

2008
Exploiting Computing Power on Graphics Processing Unit.
Proceedings of the International Conference on Computer Science and Software Engineering, 2008

An Optimization Method to Develop AAA Architectures with MIPv6 Mobility Support.
Proceedings of the 3rd IEEE Asia-Pacific Services Computing Conference, 2008

2007
A risk adjusted approach to robust simultaneous fault detection and isolation.
Autom., 2007

Risk adjusted identification of a class of nonlinear systems.
Proceedings of the 46th IEEE Conference on Decision and Control, 2007

2006
Risk Adjusted Identification of Wiener Systems.
Proceedings of the 45th IEEE Conference on Decision and Control, 2006

