Guangyu Sun

Liang Shi

Jingtong Hu

CCF Trans. High Perform. Comput., December, 2022

The Case for FPGA-Based Edge Computing.

[BibT_eX]

[DOI]

IEEE Trans. Mob. Comput., 2022

PIMulator-NN: An Event-Driven, Cross-Level Simulation Framework for Processing-In-Memory-Based Neural Network Accelerators.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

Flatfish: A Reinforcement Learning Approach for Application-Aware Address Mapping.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

A Survey of Trustworthy Graph Learning: Reliability, Explainability, and Privacy Protection.

[BibT_eX]

[DOI]

CoRR, 2022

PetS: A Unified Framework for Parameter-Efficient Transformers Serving.

[BibT_eX]

[DOI]

Proceedings of the 2022 USENIX Annual Technical Conference, 2022

GNNSampler: Bridging the Gap Between Sampling Algorithms of GNN and Hardware.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

Latency-aware Spatial-wise Dynamic Networks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

An 82nW 0.53pJ/SOP Clock-Free Spiking Neural Network with 40µs Latency for AloT Wake-Up Functions Using Ultimate-Event-Driven Bionic Architecture and Computing-in-Memory Technique.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Solid-State Circuits Conference, 2022

Enabling High-Quality Uncertainty Quantification in a PIM Designed for Bayesian Neural Network.

[BibT_eX]

[DOI]

Meng-Fan Marvin Chang

Tianchan Guan

Xin Si

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2022

PTQ4ViT: Post-training Quantization for Vision Transformers with Twin Uniform Quantization.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Tailor: removing redundant operations in memristive analog neural network accelerators.

[BibT_eX]

[DOI]

Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

A Mapping Model of SNNs to Neuromorphic Hardware.

[BibT_eX]

[DOI]

Proceedings of the 4th IEEE International Conference on Artificial Intelligence Circuits and Systems, 2022

GNNear: Accelerating Full-Batch Training of Graph Neural Networks with near-Memory Processing.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2022

2021

STAR: Synthesis of Stateful Logic in RRAM Targeting High Area Utilization.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2021

CIB-HIER: Centralized Input Buffer Design in Hierarchical High-radix Routers.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2021

Area Efficient Pattern Representation of Binary Neural Networks on RRAM.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., 2021

PTQ4ViT: Post-Training Quantization Framework for Vision Transformers.

[BibT_eX]

[DOI]

CoRR, 2021

GCNear: A Hybrid Architecture for Efficient GCN Training with Near-Memory Processing.

[BibT_eX]

[DOI]

CoRR, 2021

Energon: Towards Efficient Acceleration of Transformers Using Dynamic Sparse Attention.

[BibT_eX]

[DOI]

CoRR, 2021

PTQ-SL: Exploring the Sub-layerwise Post-training Quantization.

[BibT_eX]

[DOI]

CoRR, 2021

METRO: A Software-Hardware Co-Design of Interconnections for Spatial DNN Accelerators.

[BibT_eX]

[DOI]

CoRR, 2021

Agatha: Smart Contract for DNN Computation.

[BibT_eX]

[DOI]

CoRR, 2021

NAS4RRAM: neural network architecture search for inference on RRAM-based accelerators.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2021

PipeZK: Accelerating Zero-Knowledge Proof with a Pipelined Architecture.

[BibT_eX]

[DOI]

Proceedings of the 48th ACM/IEEE Annual International Symposium on Computer Architecture, 2021

SSR: A Skeleton-based Synthesis Flow for Hybrid Processing-in-RRAM Modes.

[BibT_eX]

[DOI]

Feng Wang

Guojie Luo

Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2021

Rapid Configuration of Asynchronous Recurrent Neural Networks for ASIC Implementations.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE High Performance Extreme Computing Conference, 2021

BlockGNN: Towards Efficient GNN Acceleration Using Block-Circulant Weight Matrices.

[BibT_eX]

[DOI]

Proceedings of the 58th ACM/IEEE Design Automation Conference, 2021

Reconfigurable ASIC Implementation of Asynchronous Recurrent Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 27th IEEE International Symposium on Asynchronous Circuits and Systems, 2021

2020

Fork Path: Batching ORAM Requests to Remove Redundant Memory Accesses.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020

Crane: Mitigating Accelerator Under-utilization Caused by Sparsity Irregularities in CNNs.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2020

Bigflow: A General Optimization Layer for Distributed Computing Frameworks.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., 2020

Preface.

[BibT_eX]

[DOI]

Wen-Guang Chen

Ying-Wei Luo

Guang-Yu Sun

J. Comput. Sci. Technol., 2020

Customizing Trusted AI Accelerators for Efficient Privacy-Preserving Machine Learning.

[BibT_eX]

[DOI]

Peichen Xie

Xuanle Ren

CoRR, 2020

ENAS4D: Efficient Multi-stage CNN Architecture Search for Dynamic Inference.

[BibT_eX]

[DOI]

CoRR, 2020

Edge-Stream: a Stream Processing Approach for Distributed Applications on a Hierarchical Edge-computing System.

[BibT_eX]

[DOI]

Proceedings of the 5th IEEE/ACM Symposium on Edge Computing, 2020

MobiLattice: A Depth-wise DCNN Accelerator with Hybrid Digital/Analog Nonvolatile Processing-In-Memory Block.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2020

SaFace: Towards Scenario-aware Face Recognition via Edge Computing System.

[BibT_eX]

[DOI]

Proceedings of the 3rd USENIX Workshop on Hot Topics in Edge Computing, 2020

S2DNAS: Transforming Static CNN Model for Dynamic Inference via Neural Architecture Search.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Hardware-assisted Service Live Migration in Resource-limited Edge Computing Systems.

[BibT_eX]

[DOI]

Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

GNN-PIM: A Processing-in-Memory Architecture for Graph Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Advanced Computer Architecture - 13th Conference, 2020

Characterizing Membership Privacy in Stochastic Gradient Langevin Dynamics.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Caffeine: Toward Uniformed Representation and Acceleration for Deep Convolutional Neural Networks.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2019

GraphH: A Processing-in-Memory Architecture for Large-Scale Graph Processing.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2019

RC-NVM: Dual-Addressing Non-Volatile Memory Architecture Supporting Both Row and Column Memory Accesses.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2019

EdgeFlow: Open-Source Multi-layer Data Flow Processing in Edge Computing for 5G and Beyond.

[BibT_eX]

[DOI]

IEEE Netw., 2019

Joint Task Assignment, Transmission, and Computing Resource Allocation in Multilayer Mobile Edge Computing Systems.

[BibT_eX]

[DOI]

IEEE Internet Things J., 2019

Generalization in Generative Adversarial Networks: A Novel Perspective from Privacy Protection.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

BAYHENN: Combining Bayesian Deep Learning and Homomorphic Encryption for Secure DNN Inference.

[BibT_eX]

[DOI]

Peichen Xie

Bingzhe Wu

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Accelerate service live migration in resource-limited edge computing systems.

[BibT_eX]

[DOI]

Zhe Zhou

Xintong Li

Proceedings of the 4th ACM/IEEE Symposium on Edge Computing, 2019

ZUMA: Enabling Direct Insertion/Deletion Operations with Emerging Skyrmion Racetrack Memory.

[BibT_eX]

[DOI]

Proceedings of the 56th Annual Design Automation Conference 2019, 2019

P3SGD: Patient Privacy Preserving SGD for Regularizing Deep CNNs in Pathological Image Classification.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Parallel Stateful Logic in RRAM: Theoretical Analysis and Arithmetic Design.

[BibT_eX]

[DOI]

Proceedings of the 30th IEEE International Conference on Application-specific Systems, 2019

G2C: A Generator-to-Classifier Framework Integrating Multi-Stained Visual Cues for Pathological Glomerulus Classification.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Optimizing Cache Bypassing and Warp Scheduling for GPUs.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2018

CRAT: Enabling Coordinated Register Allocation and Thread-Level Parallelism Optimization for GPUs.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2018

V-PIM: An Analytical Overhead Model for Processing-in-Memory Architectures.

[BibT_eX]

[DOI]

Proceedings of the IEEE 7th Non-Volatile Memory Systems and Applications Symposium, 2018

Shadow Block: Accelerating ORAM Accesses with Data Duplication.

[BibT_eX]

[DOI]

Proceedings of the 51st Annual IEEE/ACM International Symposium on Microarchitecture, 2018

Path Prefetching: Accelerating Index Searches for In-Memory Databases.

[BibT_eX]

[DOI]

Proceedings of the 36th IEEE International Conference on Computer Design, 2018

PM3: Power Modeling and Power Management for Processing-in-Memory.

[BibT_eX]

[DOI]

Tong Meng

Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2018

RC-NVM: Enabling Symmetric Row and Column Memory Accesses for In-memory Databases.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2018

2017

Data Backup Optimization for Nonvolatile SRAM in Energy Harvesting Sensor Nodes.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2017

Pseudo-Differential Sensing Framework for STT-MRAM: A Cross-Layer Perspective.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2017

SRPGAN: Perceptual Generative Adversarial Network for Single Image Super Resolution.

[BibT_eX]

[DOI]

CoRR, 2017

Reducing Overfitting in Deep Convolutional Neural Networks Using Redundancy Regularizer.

[BibT_eX]

[DOI]

Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2017, 2017

FP-DNN: An Automated Framework for Mapping Deep Neural Networks onto FPGAs with RTL-HLS Hybrid Templates.

[BibT_eX]

[DOI]

Proceedings of the 25th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2017

Protect non-volatile memory from wear-out attack based on timing difference of row buffer hit/miss.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2017

SPMS: Strand based persistent memory system.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2017

Toss-up Wear Leveling: Protecting Phase-Change Memories from Inconsistent Write Patterns.

[BibT_eX]

[DOI]

Xian Zhang

Proceedings of the 54th Annual Design Automation Conference, 2017

FPGA-based accelerator for long short-term memory recurrent neural networks.

[BibT_eX]

[DOI]

Proceedings of the 22nd Asia and South Pacific Design Automation Conference, 2017

2016

Perspectives of Racetrack Memory for Large-Capacity On-Chip Memory: From Device to System.

[BibT_eX]

[DOI]

Jacques-Olivier Klein

Dafine Ravelosona

Weisheng Zhao

IEEE Trans. Circuits Syst. I Regul. Pap., 2016

Statistical Cache Bypassing for Non-Volatile Memory.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2016

Accelerate context switch by racetrack-SRAM hybrid cells.

[BibT_eX]

[DOI]

Weiqi Zhang

Proceedings of the IEEE/ACM International Symposium on Nanoscale Architectures, 2016

np-ECC: Nonadjacent position error correction code for racetrack memory.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Symposium on Nanoscale Architectures, 2016

Energy-Efficient CNN Implementation on a Deeply Pipelined FPGA Cluster.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Symposium on Low Power Electronics and Design, 2016

NXgraph: An efficient graph processing system on a single machine.

[BibT_eX]

[DOI]

Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016

The Applications of NVM Technology in Hardware Security.

[BibT_eX]

[DOI]

Proceedings of the 26th edition on Great Lakes Symposium on VLSI, 2016

Exploring Main Memory Design Based on Racetrack Memory Technology.

[BibT_eX]

[DOI]

Proceedings of the 26th edition on Great Lakes Symposium on VLSI, 2016

PDS: pseudo-differential sensing scheme for STT-MRAM.

[BibT_eX]

[DOI]

Proceedings of the 53rd Annual Design Automation Conference, 2016

Pin Tumbler Lock: A shift based encryption mechanism for racetrack memory.

[BibT_eX]

[DOI]

Proceedings of the 21st Asia and South Pacific Design Automation Conference, 2016

A novel PUF based on cell error rate distribution of STT-RAM.

[BibT_eX]

[DOI]

Proceedings of the 21st Asia and South Pacific Design Automation Conference, 2016

Performance-centric register file design for GPUs using racetrack memory.

[BibT_eX]

[DOI]

Proceedings of the 21st Asia and South Pacific Design Automation Conference, 2016

2015

An Efficient Compiler Framework for Cache Bypassing on GPUs.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2015

Exploring data placement in racetrack memory based scratchpad memory.

[BibT_eX]

[DOI]

Proceedings of the IEEE Non-Volatile Memory System and Applications Symposium, 2015

An architecture-level cache simulation framework supporting advanced PMA STT-MRAM.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE/ACM International Symposium on Nanoscale Architectures, 2015

Atlas: Baidu's key-value storage system for cloud data.

[BibT_eX]

[DOI]

Proceedings of the IEEE 31st Symposium on Mass Storage Systems and Technologies, 2015

Fork path: improving efficiency of ORAM by removing redundant memory accesses.

[BibT_eX]

[DOI]

Proceedings of the 48th International Symposium on Microarchitecture, 2015

Enabling coordinated register allocation and thread-level parallelism optimization for GPUs.

[BibT_eX]

[DOI]

Proceedings of the 48th International Symposium on Microarchitecture, 2015

Leveraging emerging nonvolatile memory in high-level synthesis with loop transformations.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Symposium on Low Power Electronics and Design, 2015

Perspectives of racetrack memory based on current-induced domain wall motion: From device to system.

[BibT_eX]

[DOI]

Yue Zhang

Jacques-Olivier Klein

Dafine Ravelosona

Weisheng Zhao

Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, 2015

Hi-fi playback: tolerating position errors in shift operations of racetrack memory.

[BibT_eX]

[DOI]

Proceedings of the 42nd Annual International Symposium on Computer Architecture, 2015

Coordinated static and dynamic cache bypassing for GPUs.

[BibT_eX]

[DOI]

Proceedings of the 21st IEEE International Symposium on High Performance Computer Architecture, 2015

Optimizing FPGA-based Accelerator Design for Deep Convolutional Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2015 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2015

From device to system: cross-layer design exploration of racetrack memory.

[BibT_eX]

[DOI]

Jacques-Olivier Klein

Proceedings of the 2015 Design, Automation & Test in Europe Conference & Exhibition, 2015

An energy efficient backup scheme with low inrush current for nonvolatile SRAM in energy harvesting sensor nodes.

[BibT_eX]

[DOI]

Proceedings of the 2015 Design, Automation & Test in Europe Conference & Exhibition, 2015

A STT-RAM-based low-power hybrid register file for GPGPUs.

[BibT_eX]

[DOI]

Proceedings of the 52nd Annual Design Automation Conference, 2015

Quantitative modeling of racetrack memory, a tradeoff among area, performance, and power.

[BibT_eX]

[DOI]

Proceedings of the 20th Asia and South Pacific Design Automation Conference, 2015

InterFS: An Interplanted Distributed File System to Improve Storage Utilization.

[BibT_eX]

[DOI]

Proceedings of the 6th Asia-Pacific Workshop on Systems, 2015

Improving Memory Access Performance of In-Memory Key-Value Store Using Data Prefetching Techniques.

[BibT_eX]

[DOI]

Proceedings of the Advanced Parallel Processing Technologies, 2015

2014

GRT: A Reconfigurable SDR Platform with High Performance and Usability.

[BibT_eX]

[DOI]

SIGARCH Comput. Archit. News, 2014

SBAC: a statistics based cache bypassing method for asymmetric-access caches.

[BibT_eX]

[DOI]

Proceedings of the International Symposium on Low Power Electronics and Design, 2014

Rapid design space exploration of two-level unified caches.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systemss, 2014

Half-DRAM: A high-bandwidth and low-power DRAM architecture from the rethinking of fine-grained activation.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE 41st International Symposium on Computer Architecture, 2014

CREAM: A Concurrent-Refresh-Aware DRAM Memory architecture.

[BibT_eX]

[DOI]

Proceedings of the 20th IEEE International Symposium on High Performance Computer Architecture, 2014

Adaptive placement and migration policy for an STT-RAM-based hybrid cache.

[BibT_eX]

[DOI]

Proceedings of the 20th IEEE International Symposium on High Performance Computer Architecture, 2014

3D-SWIFT: a high-performance 3D-stacked wide IO DRAM.

[BibT_eX]

[DOI]

Proceedings of the Great Lakes Symposium on VLSI 2014, GLSVLSI '14, Houston, TX, USA - May 21, 2014

An efficient design and implementation of LSM-tree based key-value store on open-channel SSD.

[BibT_eX]

[DOI]

Proceedings of the Ninth Eurosys Conference 2014, 2014

NoC-Sprinting: Interconnect for Fine-Grained Sprinting in the Dark Silicon Era.

[BibT_eX]

[DOI]

Jia Zhan

Proceedings of the 51st Annual Design Automation Conference 2014, 2014

Prefetching techniques for STT-RAM based last-level cache in CMP systems.

[BibT_eX]

[DOI]

Proceedings of the 19th Asia and South Pacific Design Automation Conference, 2014

2013

Optimizing GPU energy efficiency with 3D die-stacking graphics memory and reconfigurable memory interface.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2013

Exploring the vulnerability of CMPs to soft errors with 3D stacked nonvolatile memory.

[BibT_eX]

[DOI]

ACM J. Emerg. Technol. Comput. Syst., 2013

Active SSD design for energy-efficiency improvement of web-scale data analysis.

[BibT_eX]

[DOI]

Proceedings of the International Symposium on Low Power Electronics and Design (ISLPED), 2013

Designing scratchpad memory architecture with emerging STT-RAM memory technologies.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Lazy Precharge: An overhead-free method to reduce precharge overhead for memory parallelism improvement of DRAM system.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE 31st International Conference on Computer Design, 2013

Asymmetric-access aware optimization for STT-RAM caches with process variations.

[BibT_eX]

[DOI]

Proceedings of the Great Lakes Symposium on VLSI 2013 (part of ECRC), 2013

An efficient run-time encryption scheme for non-volatile main memory.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Compilers, 2013

A Probabilistic Data Replacement Strategy for Flash-Based Hybrid Storage System.

[BibT_eX]

[DOI]

Proceedings of the Web Technologies and Applications - 15th Asia-Pacific Web Conference, 2013

2012

Performance/Thermal-Aware Design of 3D-Stacked L2 Caches for CMPs.

[BibT_eX]

[DOI]

Huazhong Yang

ACM Trans. Design Autom. Electr. Syst., 2012

Energy-efficient GPU design with reconfigurable in-package graphics memory.

[BibT_eX]

[DOI]

Proceedings of the International Symposium on Low Power Electronics and Design, 2012

Improving energy efficiency of write-asymmetric memories by log style write.

[BibT_eX]

[DOI]

Proceedings of the International Symposium on Low Power Electronics and Design, 2012

Multi-level cell STT-RAM: Is it realistic or just a dream?

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE/ACM International Conference on Computer-Aided Design, 2012

Modeling and design exploration of FBDRAM as on-chip memory.

[BibT_eX]

[DOI]

Cong Xu

Proceedings of the 2012 Design, Automation & Test in Europe Conference & Exhibition, 2012

3DHLS: Incorporating high-level synthesis in physical planning of three-dimensional (3D) ICs.

[BibT_eX]

[DOI]

Proceedings of the 2012 Design, Automation & Test in Europe Conference & Exhibition, 2012

2011

Influence of Stacked 3D Memory/Cache Architectures on GPUs.

[BibT_eX]

[DOI]

Narayanan Vijaykrishnan

Proceedings of the 3D Integration for NoC-based SoC Architectures, 2011

Three-dimensional Integrated Circuits: Design, EDA, and Architecture.

[BibT_eX]

[DOI]

Found. Trends Electron. Des. Autom., 2011

Exploiting Heterogeneity for Energy Efficiency in Chip Multiprocessors.

[BibT_eX]

[DOI]

Vijaykrishnan Narayanan

IEEE J. Emerg. Sel. Topics Circuits Syst., 2011

Moguls: a model to explore the memory hierarchy for bandwidth improvements.

[BibT_eX]

[DOI]

Christopher J. Hughes

Proceedings of the 38th International Symposium on Computer Architecture (ISCA 2011), 2011

Architecting on-chip interconnects for stacked 3D STT-RAM caches in CMPs.

[BibT_eX]

[DOI]

Narayanan Vijaykrishnan

Chita R. Das

Proceedings of the 38th International Symposium on Computer Architecture (ISCA 2011), 2011

Energy-efficient multi-level cell phase-change memory system with data encoding.

[BibT_eX]

[DOI]

Proceedings of the IEEE 29th International Conference on Computer Design, 2011

Exploring the vulnerability of CMPs to soft errors with 3D stacked non-volatile memory.

[BibT_eX]

[DOI]

Proceedings of the IEEE 29th International Conference on Computer Design, 2011

Enabling architectural innovations using non-volatile memory.

[BibT_eX]

[DOI]

Vijaykrishnan Narayanan

Vinay Saripalli

Karthik Swaminathan

Ravindhiran Mukundrajan

Suman Datta

Proceedings of the 21st ACM Great Lakes Symposium on VLSI 2010, 2011

Emerging non-volatile memories: opportunities and challenges.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Hardware/Software Codesign and System Synthesis, 2011

A frequent-value based PRAM memory architecture.

[BibT_eX]

[DOI]

Proceedings of the 16th Asia South Pacific Design Automation Conference, 2011

Using NEM relay to improve 3DIC cost efficiency.

[BibT_eX]

[DOI]

Tao Zhang

Proceedings of the 2011 IEEE International 3D Systems Integration Conference (3DIC), Osaka, Japan, January 31, 2011

Fabrication cost analysis for 2D, 2.5D, and 3D IC designs.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE International 3D Systems Integration Conference (3DIC), Osaka, Japan, January 31, 2011

2010

Variable-Latency Adder (VL-Adder) Designs for Low Power and NBTI Tolerance.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2010

A Hybrid solid-state storage architecture for the performance, energy consumption, and lifetime improvement.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on High-Performance Computer Architecture (HPCA-16 2010), 2010

Energy- and endurance-aware design of phase change memory caches.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation and Test in Europe, 2010

Cost-driven 3D integration with interconnect layers.

[BibT_eX]

[DOI]

Proceedings of the 47th Design Automation Conference, 2010

2009

Exploration of 3D stacked L2 cache design for high performance and efficient thermal control.

[BibT_eX]

[DOI]

Xiaoxia Wu

Proceedings of the 2009 International Symposium on Low Power Electronics and Design, 2009

3D GPU architecture using cache stacking: Performance, cost, power and thermal analysis.

[BibT_eX]

[DOI]

Proceedings of the 27th International Conference on Computer Design, 2009

A novel architecture of the 3D stacked MRAM L2 cache for CMPs.

[BibT_eX]

[DOI]

Proceedings of the 15th International Conference on High-Performance Computer Architecture (HPCA-15 2009), 2009

A criticality-driven microarchitectural three dimensional (3D) floorplanner.

[BibT_eX]

[DOI]

Vijaykrishnan Narayanan

Proceedings of the 14th Asia South Pacific Design Automation Conference, 2009

Arithmetic unit design using 180nm TSV-based 3D stacking technology.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on 3D System Integration, 2009

2008

Thermal-aware Design Considerations for Application-Specific Instruction Set Processor.

[BibT_eX]

[DOI]

Anand Sivasubramaniam

Proceedings of the IEEE Symposium on Application Specific Processors, 2008

A Variation Aware High Level Synthesis Framework.

[BibT_eX]

[DOI]

Feng Wang