Proceedings of the 2017 IEEE International Symposium on Parallel and Distributed Processing with Applications and 2017 IEEE International Conference on Ubiquitous Computing and Communications (ISPA/IUCC), 2017

2015

GF(2m)上椭圆曲线标量乘的硬件结构实现 (Hardware Implementation of Scalar Multiplication on Elliptic Curves over GF(2m)).

[BibT_eX]

[DOI]

计算机科学, 2015

面向定制结构的稀疏矩阵分块方法 (Sparse Matrix Blocking Method for Custom Architecture).

[BibT_eX]

[DOI]

计算机科学, 2015

A deeply-pipelined FPGA-based SpMV accelerator with a hardware-friendly storage scheme.

[BibT_eX]

[DOI]

IEICE Electron. Express, 2015

2013

High-Performance Architecture for the Conjugate Gradient Solver on FPGAs.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. II Express Briefs, 2013

2012

A High Performance and Memory Efficient LU Decomposer on FPGAs.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2012

Parallelizing sparse LU decomposition on FPGAs.

[BibT_eX]

[DOI]

Proceedings of the 2012 International Conference on Field-Programmable Technology, 2012

2010

A Unified Co-Processor Architecture for Matrix Decomposition.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., 2010

FPGA accelerating double/quad-double high precision floating-point applications for ExaScale computing.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Supercomputing, 2010

Automatic synthesis of processor arrays with local memories on FPGAs.

[BibT_eX]

[DOI]

Guiming Wu

Yong Dou

Miao Wang

Proceedings of the International Conference on Field-Programmable Technology, 2010

High performance and memory efficient implementation of matrix multiplication on FPGAs.

[BibT_eX]

[DOI]

Guiming Wu

Yong Dou

Miao Wang

Proceedings of the International Conference on Field-Programmable Technology, 2010

Blocking LU Decomposition for FPGAs.

[BibT_eX]

[DOI]

Guiming Wu

Yong Dou

Gregory D. Peterson

Proceedings of the 18th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2010

2009

A coarse-grained reconfigurable computing architecture with loop self-pipelining.

[BibT_eX]

[DOI]

Sci. China Ser. F Inf. Sci., 2009

Exploiting Fine-Grained Pipeline Parallelism for Wavefront Computations on Multicore Platforms.

[BibT_eX]

[DOI]

Proceedings of the ICPPW 2009, 2009

A Fine-grained Pipelined Implementation of the LINPACK Benchmark on FPGAs.

[BibT_eX]

[DOI]

Proceedings of the FCCM 2009, 2009

2008

Computation rotating for data reuse.

[BibT_eX]

[DOI]

Proceedings of the 13th Asia-Pacific Computer Systems Architecture Conference, 2008

2007

Instruction Selection for Subword Level Parallelism Optimizations for Application Specific Instruction Processors.

[BibT_eX]

[DOI]

Miao Wang

Guiming Wu

Zhiying Wang

Proceedings of the Parallel and Distributed Processing and Applications, 2007

The Implementation of a Coarse-Grained Reconfigurable Architecture with Loop Self-pipelining.

[BibT_eX]

[DOI]

Yong Dou

Jinhui Xu

Guiming Wu

Proceedings of the Reconfigurable Computing: Architectures, 2007

2006

Designing a Coarse-Grained Reconfigurable Architecture Using Loop Self-Pipelining.

[BibT_eX]

[DOI]

Proceedings of the Advances in Computer Systems Architecture, 11th Asia-Pacific Conference, 2006

Guiming Wu

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...