Youhui Zhang
ORCID: 0000-0003-2333-3580
According to our database, Youhui Zhang authored at least 82 papers between 1999 and 2024.
Bibliography
2024
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2024
DTrans: A Dataflow-transformation FPGA Accelerator with Nonlinear-operators fusion aiming for the Generative Model.
Proceedings of the 34th International Conference on Field-Programmable Logic and Applications, 2024
2023
IEEE Trans. Parallel Distributed Syst., September, 2023
MAICC: A Lightweight Many-core Architecture with In-Cache Computing for Multi-DNN Parallel Inference.
Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture, 2023
Proceedings of the IEEE/ACM International Symposium on Low Power Electronics and Design, 2023
Proceedings of the International Joint Conference on Neural Networks, 2023
2022
ACM J. Emerg. Technol. Comput. Syst., 2022
Frontiers Comput. Neurosci., 2022
EcoForecast: An interpretable data-driven approach for short-term macroeconomic forecasting using N-BEATS neural network.
Eng. Appl. Artif. Intell., 2022
CCF Trans. High Perform. Comput., 2022
Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022
Proceedings of the 22nd IEEE International Symposium on Cluster, 2022
2021
A Reduced Architecture for ReRAM-Based Neural Network Accelerator and Its Software Stack.
IEEE Trans. Computers, 2021
Regu2D: Accelerating Vectorization of SpMV on Intel Processors through 2D-partitioning and Regular Arrangement.
Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021
2020
IEEE Trans. Parallel Distributed Syst., 2020
IEEE Trans. Parallel Distributed Syst., 2020
CoRR, 2020
SuSy: A Programming Model for Productive Construction of High-Performance Systolic Arrays on FPGAs.
Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2020
2019
A Unified Framework for Training, Mapping and Simulation of ReRAM-Based Convolutional Neural Network Acceleration.
IEEE Comput. Archit. Lett., 2019
Design Guidelines of RRAM based Neural-Processing-Unit: A Joint Device-Circuit-Algorithm Analysis.
Proceedings of the 56th Annual Design Automation Conference 2019, 2019
FPSA: A Full System Stack Solution for Reconfigurable ReRAM-based NN Accelerator Architecture.
Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, 2019
2018
Bridging the Gap Between Neural Networks and Neuromorphic Hardware with A Neural Network Compiler.
CoRR, 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Bridge the Gap between Neural Networks and Neuromorphic Hardware with a Neural Network Compiler.
Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 2018
2017
IEEE Trans. Parallel Distributed Syst., 2017
Int. J. High Perform. Comput. Netw., 2017
Proceedings of the 46th International Conference on Parallel Processing Workshops, 2017
Proceedings of the 26th International Conference on Parallel Architectures and Compilation Techniques, 2017
2016
A Cloud Gaming System Based on User-Level Virtualization and Its Resource Scheduling.
IEEE Trans. Parallel Distributed Syst., 2016
J. Comput. Sci. Technol., 2016
NEUTRAMS: Neural network transformation and co-design under neuromorphic hardware constraints.
Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016
Proceedings of the Algorithms and Architectures for Parallel Processing, 2016
Proceedings of the 18th IEEE International Conference on High Performance Computing and Communications; 14th IEEE International Conference on Smart City; 2nd IEEE International Conference on Data Science and Systems, 2016
Proceedings of the 2016 International Conference on Compilers, 2016
2015
Solving the Global Atmospheric Equations through Heterogeneous Reconfigurable Platforms.
ACM Trans. Reconfigurable Technol. Syst., 2015
Software-Based Lightweight Multithreading to Overlap Memory-Access Latencies of Commodity Processors.
Proceedings of the 44th International Conference on Parallel Processing, 2015
Position-aware thread-level speculative parallelization for large-scale chip-multiprocessor.
Proceedings of the 12th ACM International Conference on Computing Frontiers, 2015
2014
Proceedings of the Algorithms and Architectures for Parallel Processing, 2014
Proceedings of the IEEE 25th International Conference on Application-Specific Systems, 2014
2013
Future Gener. Comput. Syst., 2013
Employing intelligence in object-based storage devices to provide attribute-based file access.
Sci. China Inf. Sci., 2013
Proceedings of the Network and Parallel Computing - 10th IFIP International Conference, 2013
Aegis: partitioning data block for efficient recovery of stuck-at-faults in phase change memory.
Proceedings of the 46th Annual IEEE/ACM International Symposium on Microarchitecture, 2013
Proceedings of the Algorithms and Architectures for Parallel Processing, 2013
Accelerating solvers for global atmospheric equations through mixed-precision data flow engine.
Proceedings of the 23rd International Conference on Field programmable Logic and Applications, 2013
Proceedings of the 2013 IEEE Symposium on Low-Power and High-Speed Chips, 2013
2012
2011
IEICE Electron. Express, 2011
Intell. Autom. Soft Comput., 2011
Sci. China Inf. Sci., 2011
Using User-Level Virtualization in Desktop Grid Clients for Application Delivery and Sandboxing.
Proceedings of the Fourth International Symposium on Parallel Architectures, 2011
2010
IEEE Trans. Serv. Comput., 2010
Efficient Monte Carlo-based options pricing on graphics processors and its optimizations.
Sci. China Inf. Sci., 2010
Proceedings of the Third International Symposium on Parallel Architectures, 2010
2009
IEICE Electron. Express, 2009
2008
Proceedings of the PACIIA 2008, 2008
Proceedings of the 22nd Large Installation System Administration Conference, 2008
Proceedings of the 13th Asia-Pacific Computer Systems Architecture Conference, 2008
Proceedings of the 13th Asia-Pacific Computer Systems Architecture Conference, 2008
IDRS: Combining File-level Intrusion Detection with Block-level Data Recovery based on iSCSI.
Proceedings of the Third International Conference on Availability, 2008
2006
Proceedings of the 18th Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2006), 2006
Proceedings of the 12th International Conference on Parallel and Distributed Systems, 2006
Seamless Peripherals Integration for Network Computers based on the Reversed Server Message Block Protocol.
Proceedings of the 2006 International Conference on Networking and Services (ICNS 2006), 2006
2005
Thckpt: Transparent Checkpointing of Linux Processes Under IA-64.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2005
Proceedings of the 34th International Conference on Parallel Processing Workshops (ICPP 2005 Workshops), 2005
Proceedings of the Advances in Computer Systems Architecture, 10th Asia-Pacific Conference, 2005
2004
Proceedings of the Network and Parallel Computing, IFIP International Conference, 2004
Proceedings of the Parallel and Distributed Processing and Applications, 2004
Proceedings of the Embedded Software and Systems, First International Conference, 2004
Proceedings of the Grid and Cooperative Computing, 2004
Proceedings of the Advances in Computer Systems Architecture, 9th Asia-Pacific Conference, 2004
2003
ACM SIGOPS Oper. Syst. Rev., 2003
2002
ACM SIGOPS Oper. Syst. Rev., 2002
2001
Transparent Checkpointing and Rollback Recovery Mechanism for Windows NT Applications.
ACM SIGOPS Oper. Syst. Rev., 2001
1999
ACM SIGOPS Oper. Syst. Rev., 1999