Wenjing Ma

CCF Trans. High Perform. Comput., March, 2023

Publisher Correction: xMath2.0: a high-performance extended math library for SW26010-Pro many-core processor.

[BibT_eX]

[DOI]

CCF Trans. High Perform. Comput., March, 2023

xMath2.0: a high-performance extended math library for SW26010-Pro many-core processor.

[BibT_eX]

[DOI]

CCF Trans. High Perform. Comput., March, 2023

A Survey on Knowledge Graphs for Healthcare: Resources, Applications, and Promises.

[BibT_eX]

[DOI]

CoRR, 2023

HiPrompt: Few-Shot Biomedical Knowledge Fusion via Hierarchy-Oriented Prompting.

[BibT_eX]

[DOI]

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

GFFT: a Task Graph Based Fast Fourier Transform Optimization Framework.

[BibT_eX]

[DOI]

Proceedings of the 52nd International Conference on Parallel Processing, 2023

2022

Data mining method for monitoring students' distance learning behaviour based on decision tree.

[BibT_eX]

[DOI]

Ketong Liu

Andi Gao

Int. J. Data Min. Bioinform., 2022

Quantum-Inspired Distributed Memetic Algorithm.

[BibT_eX]

[DOI]

Complex Syst. Model. Simul., 2022

<i>LRcell</i>: detecting the source of differential expression at the sub-cell-type level from bulk RNA-seq data.

[BibT_eX]

[DOI]

Briefings Bioinform., 2022

EasyView: Enabling and Scheduling Tensor Views in Deep Learning Compilers.

[BibT_eX]

[DOI]

Proceedings of the 51st International Conference on Parallel Processing, 2022

2021

High-Capacity Reversible Data Hiding in Encrypted Images using Adaptive Encoding.

[BibT_eX]

[DOI]

Youqing Wu

Zhaoxia Yin

CoRR, 2021

2020

Enabling Highly Efficient Batched Matrix Multiplications on SW26010 Many-core Processor.

[BibT_eX]

[DOI]

Lijuan Jiang

Chao Yang

ACM Trans. Archit. Code Optim., 2020

Optimal network selection algorithms under the multi-network coexistence environment based on attribute decision.

[BibT_eX]

[DOI]

Chungeng Ma

Lixia Hou

Int. J. Internet Protoc. Technol., 2020

Reversible Data Hiding in Encrypted Images Based on Bit plane Compression of Prediction Error.

[BibT_eX]

[DOI]

CoRR, 2020

Solving a trillion unknowns per second with HPGMG on Sunway TaihuLight.

[BibT_eX]

[DOI]

Clust. Comput., 2020

2019

Enabling Highly Efficient k-Means Computations on the SW26010 Many-Core Processor of Sunway TaihuLight.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., 2019

2018

Extreme-Scale High-Order WENO Simulations of 3-D Detonation Wave with 10 Million Cores.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2018

Extreme-Scale Realistic Stencil Computations on Sunway TaihuLight with Ten Million Cores.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE/ACM International Symposium on Cluster, 2018

2017

A Smartphone Camera-Based Indoor Positioning Algorithm of Crowded Scenarios with the Assistance of Deep CNN.

[BibT_eX]

[DOI]

Sensors, 2017

Numerical simulations of migration and coalescence behavior of microvoids driven by diffusion and electric field in solder interconnects.

[BibT_eX]

[DOI]

Microelectron. Reliab., 2017

Communication-aware task scheduling algorithm for heterogeneous computing.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Netw., 2017

Localized Fault Recovery for Nested Fork-Join Programs.

[BibT_eX]

[DOI]

Gokcen Kestor

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

26 PFLOPS Stencil Computations for Atmospheric Modeling on Sunway TaihuLight.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

Towards Highly Efficient DGEMM on the Emerging SW26010 Many-Core Processor.

[BibT_eX]

[DOI]

Proceedings of the 46th International Conference on Parallel Processing, 2017

2016

Highly Optimized Code Generation for Stencil Codes with Computation Reuse for GPUs.

[BibT_eX]

[DOI]

Daniel G. Chavarría-Miranda

Kan Gao

Guoping Long

J. Comput. Sci. Technol., 2016

Bridging Semantic Gap Between App Names: Collective Matrix Factorization for Similar Mobile App Recommendation.

[BibT_eX]

[DOI]

Proceedings of the Web Information Systems Engineering - WISE 2016, 2016

Data-Oriented Runtime Scheduling Framework on Multi-GPUs.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Trustcom/BigDataSE/ISPA, 2016

HPSVM: Heterogeneous Parallel SVM with Factorization Based IPM Algorithm on CPU-GPU Cluster.

[BibT_eX]

[DOI]

Proceedings of the 24th Euromicro International Conference on Parallel, 2016

GPU-FV: Realtime Fisher Vector and Its Applications in Video Monitoring.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

Online variational Bayesian Support Vector Regression.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Joint Conference on Neural Networks, 2016

GB-RC4: Effective brute force attacks on RC4 algorithm using GPU.

[BibT_eX]

[DOI]

Proceedings of the Seventh International Green and Sustainable Computing Conference, 2016

Multi-Scale Fully Convolutional Network for Fast Face Detection.

[BibT_eX]

[DOI]

Proceedings of the British Machine Vision Conference 2016, 2016

GLDA: Parallel Gibbs Sampling for Latent Dirichlet Allocation on GPU.

[BibT_eX]

[DOI]

Proceedings of the Advanced Computer Architecture - 11th Conference, 2016

2015

Global transformations for legacy parallel applications via structural analysis and rewriting.

[BibT_eX]

[DOI]

Ajay Panyala

Adrian Prantl

Parallel Comput., 2015

Detect Similar Mobile Applications with Transfer Learning.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Smart City/SocialCom/SustainCom/DataCom/SC2 2015, 2015

PE-TLD: Parallel Extended Tracking-Learning-Detection for Multi-target Tracking.

[BibT_eX]

[DOI]

Proceedings of the Algorithms and Architectures for Parallel Processing, 2015

2014

High performance two-dimensional phase unwrapping on GPUs.

[BibT_eX]

[DOI]

Proceedings of the Computing Frontiers Conference, CF'14, 2014

2013

Optimizing tensor contraction expressions for hybrid CPU-GPU execution.

[BibT_eX]

[DOI]

Oreste Villa

Karol Kowalski

Clust. Comput., 2013

Study of web guide slippage phenomena in roll-to-roll embossing syste.

[BibT_eX]

[DOI]

Proceedings of the 10th IEEE International Conference on Control and Automation, 2013

2012

Compiler and runtime support for enabling reduction computations on heterogeneous systems.

[BibT_eX]

[DOI]

Concurr. Comput. Pract. Exp., 2012

Ad serving using a compact allocation plan.

[BibT_eX]

[DOI]

Peiji Chen

Srinath Mandalapu

Chandrashekhar Nagarajan

Jayavel Shanmugasundaram

Proceedings of the 13th ACM Conference on Electronic Commerce, 2012

SHALE: an efficient algorithm for allocation of guaranteed display advertising.

[BibT_eX]

[DOI]

Vijay Bharadwaj

Peiji Chen

Chandrashekhar Nagarajan

Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

Water quality model parameters inversion based on improved stochastic optimization.

[BibT_eX]

[DOI]

Junping Zhang

Jiaguo Qi

Proceedings of the 2012 IEEE International Geoscience and Remote Sensing Symposium, 2012

Data-driven fault tolerance for work stealing computations.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Supercomputing, 2012

GMProf: A low-overhead, fine-grained profiling approach for GPU programs.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on High Performance Computing, 2012

2011

Poster: FOX: a fault-oblivious extreme scale execution environment.

[BibT_eX]

[DOI]

Ronald G. Minnich

Curtis L. Janssen

Andres Marquez

Maya B. Gokhale

Ponnuswamy Sadayappan

Eric Van Hensbergen

Jonathan Appavoo

Jim McKie

Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, 2011

An execution strategy and optimized runtime support for parallelizing irregular reductions on modern GPUs.

[BibT_eX]

[DOI]

Proceedings of the 25th International Conference on Supercomputing, 2011, Tucson, AZ, USA, May 31, 2011

Practical Loop Transformations for Tensor Contraction Expressions on Multi-level Memory Hierarchies.

[BibT_eX]

[DOI]

Proceedings of the Compiler Construction - 20th International Conference, 2011

Parameterized Micro-benchmarking: An Auto-tuning Approach for Complex Applications.

[BibT_eX]

[DOI]

Proceedings of the 2011 International Conference on Parallel Architectures and Compilation Techniques, 2011

2010

A Light-Size AKA Mechanism for Optimal Distributed AAA authorization Architecture.

[BibT_eX]

[DOI]

Mei Song

Proceedings of the 71st IEEE Vehicular Technology Conference, 2010

AUTO-GC: Automatic translation of data mining applications to GPU clusters.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Compiler and runtime support for enabling generalized reduction computations on heterogeneous parallel configurations.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Supercomputing, 2010

Parallelizing an Information Theoretic Co-clustering Algorithm Using a Cloud Middleware.

[BibT_eX]

[DOI]

Proceedings of the ICDMW 2010, 2010

An integer programming framework for optimizing shared memory use on GPUs.

[BibT_eX]

[DOI]

Proceedings of the 2010 International Conference on High Performance Computing, 2010

Approaches for parallelizing reductions on modern GPUs.

[BibT_eX]

[DOI]

Proceedings of the 2010 International Conference on High Performance Computing, 2010

Acceleration of Streamed Tensor Contraction Expressions on GPGPU-Based Clusters.

[BibT_eX]

[DOI]

Oreste Villa

Karol Kowalski

Proceedings of the 2010 IEEE International Conference on Cluster Computing, 2010

Pricing guaranteed contracts in online display advertising.

[BibT_eX]

[DOI]

Vijay Bharadwaj

Michael Schwarz

Jayavel Shanmugasundaram

Erik Vee

Jack Xie

Jian Yang

Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

2009

Risk Adjusted Set Membership Identification of Wiener Systems.

[BibT_eX]

[DOI]

IEEE Trans. Autom. Control., 2009

A compiler and runtime system for enabling data mining applications on gpus.

[BibT_eX]

[DOI]

Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2009

Online allocation of display advertisements subject to advanced sales contracts.

[BibT_eX]

[DOI]

Proceedings of the 3rd ACM SIGKDD Workshop on Data Mining and Audience Intelligence for Advertising, 2009

A translation system for enabling data mining applications on GPUs.

[BibT_eX]

[DOI]

Proceedings of the 23rd international conference on Supercomputing, 2009

2008

Exploiting Computing Power on Graphics Processing Unit.

[BibT_eX]

[DOI]

Ziyi Liu

Proceedings of the International Conference on Computer Science and Software Engineering, 2008

An Optimization Method to Develop AAA Architectures with MIPv6 Mobility Support.

[BibT_eX]

[DOI]

Proceedings of the 3rd IEEE Asia-Pacific Services Computing Conference, 2008

2007

A risk adjusted approach to robust simultaneous fault detection and isolation.

[BibT_eX]

[DOI]

Mario Sznaier

Constantino M. Lagoa

Autom., 2007

Risk adjusted identification of a class of nonlinear systems.

[BibT_eX]

[DOI]

Mario Sznaier

Constantino Lagoa