Siva Kumar Sastry Hari

Orcid: 0000-0001-8346-7981

According to our database, Siva Kumar Sastry Hari authored at least 47 papers between 2007 and 2024.

Safety-Critical Scenario Generation Via Reinforcement Learning Based Editing.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

MPGemmFI: A Fault Injection Technique for Mixed Precision GEMM in ML Applications.
CoRR, 2023

CuRobo: Parallelized Collision-Free Minimum-Jerk Robot Motion Generation.
CoRR, 2023

ALBERTA: ALgorithm-Based Error Resilience in Transformer Architectures.
CoRR, 2023

VaPr: Variable-Precision Tensors to Accelerate Robot Motion Planning.
IROS, 2023

CuRobo: Parallelized Collision-Free Robot Motion Generation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Making Convolutions Resilient Via Algorithm-Based Error Detection Techniques.
IEEE Trans. Dependable Secur. Comput., 2022

Characterizing and Mitigating Soft Errors in GPU DRAM.
IEEE Micro, 2022

Towards Precision-Aware Fault Tolerance Approaches for Mixed-Precision Applications.
Proceedings of the 12th IEEE/ACM Workshop on Fault Tolerance for HPC at eXtreme Scale, 2022

Exploiting Temporal Data Diversity for Detecting Safety-critical Faults in AV Compute Systems.
Proceedings of the 52nd Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2022

Zhuyi: perception processing rate estimation for safety in autonomous vehicles.
Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

Generating and Characterizing Scenarios for Safety Testing of Autonomous Vehicles.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2021

Suraksha: A Framework to Analyze the Safety Implications of Perception Design Choices in AVs.
Proceedings of the 32nd IEEE International Symposium on Software Reliability Engineering, 2021

Optimizing Selective Protection for CNN Resilience.
Proceedings of the 32nd IEEE International Symposium on Software Reliability Engineering, 2021

Demystifying GPU Reliability: Comparing and Combining Beam Experiments, Fault Simulation, and Profiling.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

Suraksha: A Quantitative AV Safety Evaluation Framework to Analyze Safety Implications of Perception Design Choices.
Proceedings of the 51st Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops, 2021

NVBitFI: Dynamic Fault Injection for GPUs.
Proceedings of the 51st Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2021

Simulation Driven Design and Test for Safety of AI Based Autonomous Vehicles.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Estimating Silent Data Corruption Rates Using a Two-Level Model.
CoRR, 2020

HarDNN: Feature Map Vulnerability Evaluation in CNNs.
CoRR, 2020

GPU-trident: efficient modeling of error propagation in GPU programs.
Proceedings of the International Conference for High Performance Computing, 2020

AV-FUZZER: Finding Safety Violations in Autonomous Driving Systems.
Proceedings of the 31st IEEE International Symposium on Software Reliability Engineering, 2020

PyTorchFI: A Runtime Perturbation Tool for DNNs.
Proceedings of the 50th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops, 2020

Kayotee: A Fault Injection-based System to Assess the Safety and Reliability of Autonomous Vehicles to Faults and Errors.
CoRR, 2019

GPU snapshot: checkpoint offloading for GPU-dense systems.
Proceedings of the ACM International Conference on Supercomputing, 2019

On the Trend of Resilience for GPU-Dense Systems.
Proceedings of the 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2019

ML-Based Fault Injection for Autonomous Vehicles: A Case for Bayesian Fault Injection.
Proceedings of the 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2019

Optimizing software-directed instruction replication for GPU error detection.
Proceedings of the International Conference for High Performance Computing, 2018

SwapCodes: Error Codes for Hardware-Software Cooperative GPU Pipeline Error Detection.
Proceedings of the 51st Annual IEEE/ACM International Symposium on Microarchitecture, 2018

Modeling Soft-Error Propagation in Programs.
Proceedings of the 48th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2018

Understanding error propagation in deep learning neural network (DNN) accelerators and applications.
Proceedings of the International Conference for High Performance Computing, 2017

SASSIFI: An architecture-level fault injection tool for GPU application resilience evaluation.
Proceedings of the 2017 IEEE International Symposium on Performance Analysis of Systems and Software, 2017

Approxilyzer: Towards a systematic framework for instruction-level approximate computing and its application to hardware resiliency.
Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016

Flexible software profiling of GPU architectures.
Proceedings of the 42nd Annual International Symposium on Computer Architecture, 2015

Locality-Driven Dynamic GPU Cache Bypassing.
Proceedings of the 29th ACM on International Conference on Supercomputing, 2015

Hardware Fault Recovery for I/O Intensive Applications.
ACM Trans. Archit. Code Optim., 2014

GangES: Gang error simulation for hardware resiliency evaluation.
Proceedings of the ACM/IEEE 41st International Symposium on Computer Architecture, 2014

Preserving application reliability on unreliable hardware.
PhD thesis, 2013

Relyzer: Application Resiliency Analyzer for Transient Faults.
IEEE Micro, 2013

Low-cost program-level detectors for reducing silent data corruptions.
Proceedings of the IEEE/IFIP International Conference on Dependable Systems and Networks, 2012

CrashTest'ing SWAT: Accurate, gate-level evaluation of symptom-based resiliency solutions.
Proceedings of the 2012 Design, Automation & Test in Europe Conference & Exhibition, 2012

Relyzer: exploiting application-level fault equivalence to analyze application resiliency to transient faults.
Proceedings of the 17th International Conference on Architectural Support for Programming Languages and Operating Systems, 2012

Architectures for online error detection and recovery in multicore processors.
Proceedings of the Design, Automation and Test in Europe, 2011

mSWAT: low-cost hardware fault detection and diagnosis for multicore systems.
Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-42 2009), 2009

Accurate microarchitecture-level fault modeling for studying hardware faults.
Proceedings of the 15th International Conference on High-Performance Computer Architecture (HPCA-15 2009), 2009

Automatic Constraint Based Test Generation for Behavioral HDL Models.
IEEE Trans. Very Large Scale Integr. Syst., 2008

Power Virus Generation Using Behavioral Models of Circuits.
Proceedings of the 25th IEEE VLSI Test Symposium (VTS 2007), 2007
