Yi Yang
Orcid: 0000-0003-1462-5100Affiliations:
- NEC Laboratories America, Department of Computing Systems Architecture, Princeton, NJ, USA
- North Carolina State University, Department of Electrical and Computer Engineering, Raleigh, NC, USA (former)
According to our database1,
Yi Yang
authored at least 34 papers
between 2010 and 2022.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2022
ACM Trans. Embed. Comput. Syst., November, 2022
2021
Proceedings of the IEEE International Conference on Smart Computing, 2021
Proceedings of the Middleware '21: 22nd International Middleware Conference, Québec City, Canada, December 6, 2021
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021
2017
Accelerating deep neural network training with inconsistent stochastic gradient descent.
Neural Networks, 2017
2016
Proceedings of the International Conference for High Performance Computing, 2016
BLASX: A High Performance Level-3 BLAS Library for Heterogeneous Multi-GPU Computing.
Proceedings of the 2016 International Conference on Supercomputing, 2016
Proceedings of the 45th International Conference on Parallel Processing, 2016
2015
J. Comput. Sci. Technol., 2015
BLASX: A High Performance Level-3 BLAS Library for Heterogeneous Multi-GPU Computing.
CoRR, 2015
Proceedings of the Languages and Compilers for Parallel Computing, 2015
Proceedings of the 13th Annual IEEE/ACM International Symposium on Code Generation and Optimization, 2015
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015
2014
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2014
Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture, 2014
Understanding the tradeoffs between software-managed vs. hardware-managed caches in GPUs.
Proceedings of the 2014 IEEE International Symposium on Performance Analysis of Systems and Software, 2014
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014
Proceedings of the 2014 International Conference on Supercomputing, 2014
Proceedings of the 20th IEEE International Symposium on High Performance Computer Architecture, 2014
Proceedings of the Numerical Computations with GPUs, 2014
2013
J. Parallel Distributed Comput., 2013
Int. J. Parallel Program., 2013
Proceedings of the International Conference for High Performance Computing, 2013
Exploiting uniform vector instructions for GPGPU performance, energy efficiency, and opportunistic reliability enhancement.
Proceedings of the International Conference on Supercomputing, 2013
2012
ACM Trans. Archit. Code Optim., 2012
Apricot: an optimizing compiler and productivity tool for x86-compatible many-core coprocessors.
Proceedings of the International Conference on Supercomputing, 2012
Proceedings of the 41st International Conference on Parallel Processing, 2012
Proceedings of the 18th IEEE International Symposium on High Performance Computer Architecture, 2012
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012
Many-thread aware instruction-level parallelism: architecting shader cores for GPU computing.
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012
2010
Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2010
Proceedings of the 2010 ACM SIGPLAN Conference on Programming Language Design and Implementation, 2010
Proceedings of 3rd Workshop on General Purpose Processing on Graphics Processing Units, 2010