2024
Runtime Performance Anomaly Diagnosis in Production HPC Systems Using Active Learning.
IEEE Trans. Parallel Distributed Syst., April, 2024
Data center and load aggregator coordination towards electricity demand response.
Sustain. Comput. Informatics Syst., 2024
An Online Probabilistic Distributed Tracing System.
CoRR, 2024
A New Dataflow Implementation to Improve Energy Efficiency of Monolithic 3D Systolic Arrays.
CoRR, 2024
LLMs Cannot Reliably Identify and Reason About Security Vulnerabilities (Yet?): A Comprehensive Evaluation, Framework, and Benchmarks.
Proceedings of the IEEE Symposium on Security and Privacy, 2024
Analysis of Power Consumption and GPU Power Capping for MILC.
Proceedings of the SC24-W: Workshops of the International Conference for High Performance Computing, 2024
SOPHIE: A Scalable Recurrent Ising Machine Using Optically Addressed Phase Change Memory.
Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024
Enhanced Detection of Thermal Covert Channel Attacks in Multicore Processors.
Proceedings of the 25th International Symposium on Quality Electronic Design, 2024
PraxiPaaS: A Decomposable Machine Learning System for Efficient Container Package Discovery.
Proceedings of the IEEE International Conference on Cloud Engineering, 2024
Unleashing Performance Insights with Online Probabilistic Tracing.
Proceedings of the IEEE International Conference on Cloud Engineering, 2024
Energy-Efficient Dataflow Design for Monolithic 3D Systolic Arrays with Resistive RAM.
Proceedings of the 15th IEEE International Green and Sustainable Computing Conference, 2024
Conductor: A Collaboration Framework for Multi-Data-Center Demand Response.
Proceedings of the 15th IEEE International Green and Sustainable Computing Conference, 2024
Data Center Demand Response for Sustainable Computing: Myth or Opportunity?
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2024
2023
TREAD-M3D: Temperature-Aware DNN Accelerators for Monolithic 3-D Mobile Systems.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., December, 2023
Can Large Language Models Identify And Reason About Security Vulnerabilities? Not Yet.
CoRR, 2023
An End-to-End HPC Framework for Dynamic Power Objectives.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
Prodigy: Towards Unsupervised Anomaly Detection in Production HPC Systems.
Proceedings of the International Conference for High Performance Computing, 2023
Processing-in-Memory Using Optically-Addressed Phase Change Memory.
Proceedings of the IEEE/ACM International Symposium on Low Power Electronics and Design, 2023
Enabling Privacy-preserving Multidimensional Network Telemetry with Autoencoders.
Proceedings of the IEEE International Conference on Cloud Engineering, 2023
Poster Paper: Efficient Navigation of Cloud Performance with 'nuffTrace.
Proceedings of the IEEE International Conference on Cloud Engineering, 2023
MicroFaaS on OpenFaaS: An Embedded Platform for Running Cloud Functions.
Proceedings of the IEEE International Conference on Cloud Engineering, 2023
Temperature-Aware Sizing of Multi-Chip Module Accelerators for Multi-DNN Workloads.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2023
2022
High Bandwidth Thermal Covert Channel in 3-D-Integrated Multicore Processors.
IEEE Trans. Very Large Scale Integr. Syst., 2022
HPC Data Center Participation in Demand Response: An Adaptive Policy With QoS Assurance.
IEEE Trans. Sustain. Comput., 2022
Praxi: Cloud Software Discovery That Learns From Practice.
IEEE Trans. Cloud Comput., 2022
PACT: An Extensible Parallel Thermal Simulator for Emerging Integration and Cooling Technologies.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022
Architecting Optically Controlled Phase Change Memory.
ACM Trans. Archit. Code Optim., 2022
VAIF: Variance-driven Automated Instrumentation Framework.
ACM SIGOPS Oper. Syst. Rev., 2022
Temperature-Aware Monolithic 3D DNN Accelerators for Biomedical Applications.
CoRR, 2022
Site-Wide HPC Data Center Demand Response.
Proceedings of the IEEE High Performance Extreme Computing Conference, 2022
Guiding Hardware-Driven Turbo with Application Performance Awareness.
Proceedings of the 13th IEEE International Green and Sustainable Computing Conference, 2022
MicroFaaS: Energy-efficient Serverless on Bare-metal Single-board Computers.
Proceedings of the 2022 Design, Automation & Test in Europe Conference & Exhibition, 2022
ALBADross: Active Learning Based Anomaly Diagnosis for Production HPC Systems.
Proceedings of the IEEE International Conference on Cluster Computing, 2022
2021
ECOGreen: Electricity Cost Optimization for Green Datacenters in Emerging Power Markets.
IEEE Trans. Sustain. Comput., 2021
Monolithic 3D Integrated Circuits: Recent Trends and Future Prospects.
IEEE Trans. Circuits Syst. II Express Briefs, 2021
PROWAVES: Proactive Runtime Wavelength Selection for Energy-Efficient Photonic NoCs.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2021
Proctor: A Semi-Supervised Performance Anomaly Diagnosis Framework for Production HPC Systems.
Proceedings of the High Performance Computing - 36th International Conference, 2021
Tritium: A Cross-layer Analytics System for Enhancing Microservice Rollouts in the Cloud.
Proceedings of the WOC '21: Proceedings of the Seventh International Workshop on Container Technologies and Container Clouds, 2021
Introducing Application Awareness Into a Unified Power Management Stack.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021
Using Monitoring Data to Improve HPC Performance via Network-Data-Driven Allocation.
Proceedings of the 2021 IEEE High Performance Extreme Computing Conference, 2021
E2EWatch: An End-to-End Anomaly Diagnosis Framework for Production HPC Systems.
Proceedings of the Euro-Par 2021: Parallel Processing, 2021
A Data Center Demand Response Policy for Real-World Workload Scenarios in HPC.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2021
Automating instrumentation choices for performance problems in distributed applications with VAIF.
Proceedings of the SoCC '21: ACM Symposium on Cloud Computing, 2021
Iter8: Online Experimentation in the Cloud.
Proceedings of the SoCC '21: ACM Symposium on Cloud Computing, 2021
Temperature-Aware Optimization of Monolithic 3D Deep Neural Network Accelerators.
Proceedings of the ASPDAC '21: 26th Asia and South Pacific Design Automation Conference, 2021
2020
LoCool: Fighting Hot Spots Locally for Improving System Energy Efficiency.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020
Cross-Layer Co-Optimization of Network Design and Chiplet Placement in 2.5-D Systems.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020
Counterfactual Explanations for Machine Learning on Multivariate Time Series Data.
CoRR, 2020
ConfEx: A Framework for Automating Text-based Software Configuration Analysis in the Cloud.
CoRR, 2020
Coordinated Demand Response By Data Centers Using Inverse Optimization.
Proceedings of the 2020 IEEE International Conference on Communications, 2020
ACE: Just-in-time Serverless Software Component Discovery Through Approximate Concrete Execution.
Proceedings of the WoSC@Middleware 2020: Proceedings of the 2020 Sixth International Workshop on Serverless Computing, 2020
Version Detection for Software Discovery in the Cloud.
Proceedings of the Middleware '20 Demos and Posters: Proceedings of the 21st International Middleware Conference Demos and Posters, 2020
Bandwidth Allocation in Silicon-Photonic Networks Using Application Instrumentation.
Proceedings of the 2020 IEEE High Performance Extreme Computing Conference, 2020
JACKPOT: Online Experimentation of Cloud Microservices.
Proceedings of the 12th USENIX Workshop on Hot Topics in Cloud Computing, 2020
A Learning-Based Thermal Simulation Framework for Emerging Two-Phase Cooling Technologies.
Proceedings of the 2020 Design, Automation & Test in Europe Conference & Exhibition, 2020
POPSTAR: a Robust Modular Optical NoC Architecture for Chiplet-based 3D Integrated Systems.
Proceedings of the 2020 Design, Automation & Test in Europe Conference & Exhibition, 2020
System-level Evaluation of Chip-Scale Silicon Photonic Networks for Emerging Data-Intensive Applications.
Proceedings of the 2020 Design, Automation & Test in Europe Conference & Exhibition, 2020
Quantifying the impact of network congestion on application performance and network metrics.
Proceedings of the IEEE International Conference on Cluster Computing, 2020
2019
Online Diagnosis of Performance Variation in HPC Systems Using Machine Learning.
IEEE Trans. Parallel Distributed Syst., 2019
EnergyQARE: QoS-Aware Data Center Participation in Smart Grid Regulation Service Reserve Provision.
ACM Trans. Model. Perform. Evaluation Comput. Syst., 2019
Maestro: Autonomous QoS Management for Mobile Applications Under Thermal Constraints.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2019
CAPE: A cross-layer framework for accurate microprocessor power estimation.
Integr., 2019
Praxi: Cloud Software Discovery That Learns From Practice.
Proceedings of the 20th International Middleware Conference Demos and Posters, 2019
RANDR: Record and Replay for Android Applications via Targeted Runtime Instrumentation.
Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering, 2019
Modeling and Optimization of Chip Cooling with Two-Phase Vapor Chambers.
Proceedings of the 2019 IEEE/ACM International Symposium on Low Power Electronics and Design, 2019
HPAS: An HPC Performance Anomaly Suite for Reproducing Performance Variations.
Proceedings of the 48th International Conference on Parallel Processing, 2019
An Overview of Thermal Challenges and Opportunities for Monolithic 3D ICs.
Proceedings of the 2019 on Great Lakes Symposium on VLSI, 2019
Data Center Participation in Demand Response Programs with Quality-of-Service Guarantees.
Proceedings of the Tenth ACM International Conference on Future Energy Systems, 2019
Data Center Demand Response Pricing Using Inverse Optimization.
Proceedings of the Tenth ACM International Conference on Future Energy Systems, 2019
WAVES: Wavelength Selection for Power-Efficient 2.5D-Integrated Photonic NoCs.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2019
Towards Practical Record and Replay for Mobile Applications.
Proceedings of the 56th Annual Design Automation Conference 2019, 2019
An automated, cross-layer instrumentation framework for diagnosing performance problems in distributed applications.
Proceedings of the ACM Symposium on Cloud Computing, SoCC 2019, 2019
2018
Report on DATE 2018 in Dresden, Germany.
IEEE Des. Test, 2018
Proteus: Detecting Android Emulators from Instruction-Level Profiles.
Proceedings of the Research in Attacks, Intrusions, and Defenses, 2018
Towards a Cross-Layer Framework for Accurate Power Modeling of Microprocessor Designs.
Proceedings of the 28th International Symposium on Power and Timing Modeling, 2018
Design Optimization of 3D Multi-Processor System-on-Chip with Integrated Flow Cell Arrays.
Proceedings of the International Symposium on Low Power Electronics and Design, 2018
Level-Spread: A New Job Allocation Policy for Dragonfly Networks.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018
MOCA: Memory Object Classification and Allocation in Heterogeneous Memory Systems.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018
A cross-layer methodology for design and optimization of networks in 2.5D systems.
Proceedings of the International Conference on Computer-Aided Design, 2018
Tangram: Colocating HPC Applications with Oversubscription.
Proceedings of the 2018 IEEE High Performance Extreme Computing Conference, 2018
Taxonomist: Application Detection Through Rich Monitoring Data.
Proceedings of the Euro-Par 2018: Parallel Processing, 2018
ConfEx: Towards Automating Software Configuration Analytics in the Cloud.
Proceedings of the 48th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops, 2018
2017
Scale & Cap: Scaling-Aware Resource Management for Consolidated Multi-threaded Applications.
ACM Trans. Design Autom. Electr. Syst., 2017
Adaptive Tuning of Photonic Devices in a Photonic NoC Through Dynamic Workload Allocation.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2017
Diagnosing Performance Variations in HPC Applications Using Machine Learning.
Proceedings of the High Performance Computing - 32nd International Conference, 2017
Unveiling the Interplay Between Global Link Arrangements and Network Management Algorithms on Dragonfly Networks.
Proceedings of the 17th IEEE/ACM International Symposium on Cluster, 2017
User-profile-based analytics for detecting cloud security breaches.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017
2016
Communication and cooling aware job allocation in data centers for communication-intensive workloads.
J. Parallel Distributed Comput., 2016
Automated system change discovery and management in the cloud.
IBM J. Res. Dev., 2016
Adapt&Cap: Coordinating System- and Application-Level Adaptation for Power-Constrained Systems.
IEEE Des. Test, 2016
QScale: thermally-efficient QoS management on heterogeneous mobile platforms.
Proceedings of the 35th International Conference on Computer-Aided Design, 2016
Providing Sustainable Performance in Thermally Constrained Mobile Devices.
Proceedings of the 14th ACM/IEEE Symposium on Embedded Systems for Real-Time Multimedia, 2016
Cross-layer floorplan optimization for silicon photonic NoCs in many-core systems.
Proceedings of the 2016 Design, Automation & Test in Europe Conference & Exhibition, 2016
DeltaSherlock: Identifying changes in the cloud.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016
2015
Leakage-Aware Cooling Management for Improving Server Energy Efficiency.
IEEE Trans. Parallel Distributed Syst., 2015
Simulation and optimization of HPC job allocation for jointly reducing communication and cooling costs.
Sustain. Comput. Informatics Syst., 2015
Dynamic Cache Pooling in 3D Multicore Processors.
ACM J. Emerg. Technol. Comput. Syst., 2015
On the Impacts of Greedy Thermal Management in Mobile Devices.
IEEE Embed. Syst. Lett., 2015
Adaptive sprinting: How to get the most out of Phase Change based passive cooling.
Proceedings of the IEEE/ACM International Symposium on Low Power Electronics and Design, 2015
PaCMap: Topology Mapping of Unstructured Communication Patterns onto Non-contiguous Allocations.
Proceedings of the 29th ACM on International Conference on Supercomputing, 2015
Just Enough is More: Achieving Sustainable Performance in Mobile Devices under Thermal Limitations.
Proceedings of the IEEE/ACM International Conference on Computer-Aided Design, 2015
Dynamic workload and cooling management in high-efficiency data centers.
Proceedings of the Sixth International Green and Sustainable Computing Conference, 2015
Optimizing energy storage participation in emerging power markets.
Proceedings of the Sixth International Green and Sustainable Computing Conference, 2015
Data center optimal regulation service reserve provision with explicit modeling of quality of service dynamics.
Proceedings of the 54th IEEE Conference on Decision and Control, 2015
2014
Message Passing-Aware Power Management on Many-Core Systems.
J. Low Power Electron., 2014
Sharing and placement of on-chip laser sources in silicon-photonic NoCs.
Proceedings of the Eighth IEEE/ACM International Symposium on Networks-on-Chip, 2014
CoolBudget: Data center power budgeting with workload and cooling asymmetry awareness.
Proceedings of the 32nd IEEE International Conference on Computer Design, 2014
Modeling and analysis of Phase Change Materials for efficient thermal management.
Proceedings of the 32nd IEEE International Conference on Computer Design, 2014
An investigation of Unified Memory Access performance in CUDA.
Proceedings of the IEEE High Performance Extreme Computing Conference, 2014
Reducing the data center electricity costs through participation in smart grid programs.
Proceedings of the International Green Computing Conference, 2014
Thermal management of manycore systems with silicon-photonic networks.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2014
Detecting and identifying system changes in the cloud via discovery by example.
Proceedings of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), 2014
The data center as a grid load stabilizer.
Proceedings of the 19th Asia and South Pacific Design Automation Conference, 2014
2013
Thermal Management in Many Core Systems.
Proceedings of the Evolutionary Based Solutions for Green Computing, 2013
GreenCool: An Energy-Efficient Liquid Cooling Design Technique for 3-D MPSoCs Via Channel Width Modulation.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2013
Energy Management in Wireless Mobile Systems Using Dynamic Task Assignment.
J. Low Power Electron., 2013
Dynamic cache pooling for improving energy efficiency in 3D stacked multicore processors.
Proceedings of the 21st IEEE/IFIP International Conference on VLSI and System-on-Chip, 2013
Accelerometer-based hand gesture recognition using feature weighted naïve bayesian classifiers and dynamic time warping.
Proceedings of the 18th International Conference on Intelligent User Interfaces, 2013
vCap: Adaptive power capping for virtualized servers.
Proceedings of the International Symposium on Low Power Electronics and Design (ISLPED), 2013
Adaptive Power and Resource Management Techniques for Multi-threaded Workloads.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013
Dynamic server power capping for enabling data center participation in power markets.
Proceedings of the IEEE/ACM International Conference on Computer-Aided Design, 2013
Optimizing communication and cooling costs in HPC data centers via intelligent job allocation.
Proceedings of the International Green Computing Conference, 2013
Energy-efficient server consolidation for multi-threaded applications in the cloud.
Proceedings of the International Green Computing Conference, 2013
3D-MMC: a modular 3D multi-core architecture with efficient resource pooling.
Proceedings of the Design, Automation and Test in Europe, 2013
Leakage and temperature aware server control for improving energy efficiency in data centers.
Proceedings of the Design, Automation and Test in Europe, 2013
Real-time power control of data centers for providing Regulation Service.
Proceedings of the 52nd IEEE Conference on Decision and Control, 2013
2012
Introduction to the special section on adaptive power management for energy and temperature-aware computing systems.
ACM Trans. Design Autom. Electr. Syst., 2012
Adaptive Power Capping for Servers with Multithreaded Workloads.
IEEE Micro, 2012
Topology-aware reliability optimization for multiprocessor systems.
Proceedings of the 20th IEEE/IFIP International Conference on VLSI and System-on-Chip, 2012
SST + gem5 = a scalable simulation infrastructure for high performance computing.
Proceedings of the International ICST Conference on Simulation Tools and Techniques, 2012
Temperature-aware computing: Achievements and remaining challenges.
Proceedings of the 2012 International Green Computing Conference, 2012
Analysis and runtime management of 3D systems with stacked DRAM for boosting energy efficiency.
Proceedings of the 2012 Design, Automation & Test in Europe Conference & Exhibition, 2012
Reducing the energy cost of computing through efficient co-scheduling of parallel workloads.
Proceedings of the 2012 Design, Automation & Test in Europe Conference & Exhibition, 2012
Quantifying the impact of frequency scaling on the energy efficiency of the single-chip cloud computer.
Proceedings of the 2012 Design, Automation & Test in Europe Conference & Exhibition, 2012
Optimizing energy efficiency of 3-D multicore systems with stacked DRAM under power and thermal constraints.
Proceedings of the 49th Annual Design Automation Conference 2012, 2012
2011
Energy-Efficient Multiobjective Thermal Control for Liquid-Cooled 3-D Stacked Architectures.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2011
Attaining Single-Chip, High-Performance Computing through 3D Systems with Active Cooling.
IEEE Micro, 2011
Pack & Cap: adaptive DVFS and thread packing under power caps.
Proceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture, 2011
Thermal analysis and active cooling management for 3D MPSoCs.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2011), 2011
Identifying the optimal energy-efficient operating points of parallel workloads.
Proceedings of the 2011 IEEE/ACM International Conference on Computer-Aided Design, 2011
Express Virtual Channels with Taps (EVC-T): A Flow Control Technique for Network-on-Chip (NoC) in Manycore Systems.
Proceedings of the IEEE 19th Annual Symposium on High Performance Interconnects, 2011
Exploring performance, power, and temperature characteristics of 3D systems with on-chip DRAM.
Proceedings of the 2011 International Green Computing Conference and Workshops, 2011
Software optimization for performance, energy, and thermal distribution: Initial case studies.
Proceedings of the 2011 International Green Computing Conference and Workshops, 2011
Run-time energy management of manycore systems through reconfigurable interconnects.
Proceedings of the 21st ACM Great Lakes Symposium on VLSI 2011, 2011
2010
Fuzzy control for enforcing energy efficiency in high-performance 3D systems.
Proceedings of the 2010 International Conference on Computer-Aided Design, 2010
Energy-efficient variable-flow liquid cooling in 3D stacked architectures.
Proceedings of the Design, Automation and Test in Europe, 2010
DynAHeal: Dynamic energy efficient task assignment for wireless healthcare systems.
Proceedings of the Design, Automation and Test in Europe, 2010
Hybrid dynamic energy and thermal management in heterogeneous embedded multiprocessor SoCs.
Proceedings of the 15th Asia South Pacific Design Automation Conference, 2010
2009
Efficient thermal management for multiprocessor systems.
PhD thesis, 2009
Utilizing Predictors for Efficient Thermal Management in Multiprocessor SoCs.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2009
Thermal Modeling and Management of Liquid-Cooled 3D Stacked Architectures.
Proceedings of the VLSI-SoC: Technologies for Systems Integration, 2009
Evaluating the impact of job scheduling and power management on processor lifetime for chip multiprocessors.
Proceedings of the Eleventh International Joint Conference on Measurement and Modeling of Computer Systems, 2009
Temperature- and Cost-Aware Design of 3D Multiprocessor Architectures.
Proceedings of the 12th Euromicro Conference on Digital System Design, 2009
Dynamic thermal management in 3D multicore architectures.
Proceedings of the Design, Automation and Test in Europe, 2009
2008
Static and Dynamic Temperature-Aware Scheduling for Multiprocessor SoCs.
IEEE Trans. Very Large Scale Integr. Syst., 2008
Proactive temperature management in MPSoCs.
Proceedings of the 2008 International Symposium on Low Power Electronics and Design, 2008
Proactive temperature balancing for low cost thermal management in MPSoCs.
Proceedings of the 2008 International Conference on Computer-Aided Design, 2008
Temperature management in multiprocessor SoCs using online learning.
Proceedings of the 45th Design Automation Conference, 2008
Temperature-aware MPSoC scheduling for reducing hot spots and gradients.
Proceedings of the 13th Asia South Pacific Design Automation Conference, 2008
2007
Transient fault prediction based on anomalies in processor events.
Proceedings of the 2007 Design, Automation and Test in Europe Conference and Exposition, 2007
Temperature aware task scheduling in MPSoCs.
Proceedings of the 2007 Design, Automation and Test in Europe Conference and Exposition, 2007
2006
Analysis and Optimization of MPSoC Reliability.
J. Low Power Electron., 2006
A simulation methodology for reliability analysis in multi-core SoCs.
Proceedings of the 16th ACM Great Lakes Symposium on VLSI 2006, Philadelphia, PA, USA, April 30, 2006