Sudhakar Yalamanchili
Affiliations:- Georgia Institute of Technology, Atlanta, USA
According to our database1,
Sudhakar Yalamanchili
authored at least 174 papers
between 1982 and 2021.
Collaborative distances:
Collaborative distances:
Awards
IEEE Fellow
IEEE Fellow 2014, "For contributions to high-performance multiprocessor architecture and communication".
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
On csauthors.net:
Bibliography
2021
MAHASIM: Machine-Learning Hardware Acceleration Using a Software-Defined Intelligent Memory System.
J. Signal Process. Syst., 2021
Efficiently Solving Partial Differential Equations in a Partially Reconfigurable Specialized Hardware.
IEEE Trans. Computers, 2021
Proceedings of the 13th International Conference on Computer and Automation Engineering, 2021
2020
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2020
Proceedings of the 2020 Design, Automation & Test in Europe Conference & Exhibition, 2020
2019
IEEE Micro, 2019
IBM J. Res. Dev., 2019
Proceedings of the 56th Annual Design Automation Conference 2019, 2019
Proceedings of the 28th International Conference on Parallel Architectures and Compilation Techniques, 2019
2018
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2018
Instruction-throughput regulation in computer processors with data-center applications.
Discret. Event Dyn. Syst., 2018
TRINITY: Coordinated Performance, Energy and Temperature Management in 3D Processor-Memory Stacks.
CoRR, 2018
CoRR, 2018
Proceedings of the International Conference on Computer-Aided Design, 2018
Slim NoC: A Low-Diameter On-Chip Network Topology for High Energy Efficiency and Scalability.
Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 2018
2017
Proceedings of the International Symposium on Memory Systems, 2017
Proceedings of the International Symposium on Memory Systems, 2017
Demystifying the characteristics of 3D-stacked memories: A case study for Hybrid Memory Cube.
Proceedings of the 2017 IEEE International Symposium on Workload Characterization, 2017
Application-Specific Performance-Aware Energy Optimization on Android Mobile Devices.
Proceedings of the 2017 IEEE International Symposium on High Performance Computer Architecture, 2017
Proceedings of the 56th IEEE Annual Conference on Decision and Control, 2017
2016
FNM: An Enhanced Null-Message Algorithm for Parallel Simulation of Multicore Systems.
ACM Trans. Model. Comput. Simul., 2016
IEEE Comput. Archit. Lett., 2016
Proceedings of the 13th International Workshop on Discrete Event Systems, 2016
Proceedings of the 4th International Workshop on Energy Efficient Supercomputing, 2016
General-purpose join algorithms for large graph triangle listing on heterogeneous systems.
Proceedings of the 9th Annual Workshop on General Purpose Processing using Graphics Processing Unit, 2016
Understanding the Impact of Air and Microfluidics Cooling on Performance of 3D Stacked Memory Systems.
Proceedings of the Second International Symposium on Memory Systems, 2016
Proceedings of the 43rd ACM/IEEE Annual International Symposium on Computer Architecture, 2016
Neurocube: A Programmable Digital Neuromorphic Architecture with High-Density 3D Memory.
Proceedings of the 43rd ACM/IEEE Annual International Symposium on Computer Architecture, 2016
Proceedings of the 2016 IEEE International Symposium on High Performance Computer Architecture, 2016
2015
Architectural Reliability: Lifetime Reliability Characterization and Management ofMany-Core Processors.
IEEE Comput. Archit. Lett., 2015
Proceedings of the 2015 International Symposium on Memory Systems, 2015
Proceedings of the 2015 International Symposium on Memory Systems, 2015
Near Data Processing: Impact and Optimization of 3D Memory System Architecture on the Uncore.
Proceedings of the 2015 International Symposium on Memory Systems, 2015
Dynamic thread block launch: a lightweight execution mechanism to support irregular applications on GPUs.
Proceedings of the 42nd Annual International Symposium on Computer Architecture, 2015
Proceedings of the 42nd Annual International Symposium on Computer Architecture, 2015
Proceedings of the IEEE International Reliability Physics Symposium, 2015
Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015
Proceedings of the 22nd IEEE International Conference on High Performance Computing, 2015
Temperature regulation in multicore processors using adjustable-gain integral controllers.
Proceedings of the 2015 IEEE Conference on Control Applications, 2015
2014
Control Principles and On-Chip Circuits for Active Cooling Using Integrated Superlattice-Based Thin-Film Thermoelectric Devices.
IEEE Trans. Very Large Scale Integr. Syst., 2014
ACM Trans. Design Autom. Electr. Syst., 2014
Microelectron. J., 2014
Proceedings of the International Workshop on Accelerating Data Management Systems Using Modern Processor and Storage Architectures, 2014
Proceedings of the 7th International ICST Conference on Simulation Tools and Techniques, 2014
Proceedings of the 2014 LLVM Compiler Infrastructure in HPC, 2014
Bubble sharing: Area and energy efficient adaptive routers using centralized buffers.
Proceedings of the Eighth IEEE/ACM International Symposium on Networks-on-Chip, 2014
Proceedings of the 2014 IEEE International Symposium on Performance Analysis of Systems and Software, 2014
Energy Introspector: A parallel, composable framework for integrated power-reliability-thermal modeling for multicore architectures.
Proceedings of the 2014 IEEE International Symposium on Performance Analysis of Systems and Software, 2014
Characterization and analysis of dynamic parallelism in unstructured GPU applications.
Proceedings of the 2014 IEEE International Symposium on Workload Characterization, 2014
Proceedings of the 22nd IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2014
Proceedings of the 12th Annual IEEE/ACM International Symposium on Code Generation and Optimization, 2014
Proceedings of the Seventh Workshop on General Purpose Processing Using GPUs, 2014
Efficient Instrumentation of GPGPU Applications Using Information Flow Analysis and Symbolic Execution.
Proceedings of the Seventh Workshop on General Purpose Processing Using GPUs, 2014
2013
Adaptive virtual channel partitioning for network-on-chip in heterogeneous architectures.
ACM Trans. Design Autom. Electr. Syst., 2013
Design space exploration of on-chip ring interconnection for a CPU-GPU heterogeneous architecture.
J. Parallel Distributed Comput., 2013
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2013
Proceedings of the SIGSIM Principles of Advanced Discrete Simulation, 2013
Proceedings of the 2013 Seventh IEEE/ACM International Symposium on Networks-on-Chip (NoCS), 2013
Proceedings of the 2013 IEEE 21st International Symposium on Modelling, 2013
Proceedings of the 40th Annual International Symposium on Computer Architecture, 2013
Oncilla: A GAS runtime for efficient resource allocation and data movement in accelerated clusters.
Proceedings of the 2013 IEEE International Conference on Cluster Computing, 2013
Proceedings of the 6th Workshop on General Purpose Processor Using Graphics Processing Units, 2013
2012
Characterization and transformation of unstructured control flow in bulk synchronous GPU applications.
Int. J. High Perform. Comput. Appl., 2012
Instruction-based energy estimation methodology for asymmetric manycore processor simulations.
Proceedings of the International ICST Conference on Simulation Tools and Techniques, 2012
Proceedings of the 2012 SC Companion: High Performance Computing, 2012
Designing Configurable, Modifiable and Reusable Components for Simulation of Multicore Systems.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012
Proceedings of the 2012 Workshop on Rapid Simulation and Performance Evaluation: Methods and Tools, 2012
Kernel Weaver: Automatically Fusing Database Primitives for Efficient GPU Computation.
Proceedings of the 45th Annual IEEE/ACM International Symposium on Microarchitecture, 2012
Lynx: A dynamic instrumentation system for data-parallel applications on GPGPU architectures.
Proceedings of the 2012 IEEE International Symposium on Performance Analysis of Systems & Software, 2012
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012
Proceedings of the 31st IEEE International Performance Computing and Communications Conference, 2012
Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, 2012
Proceedings of the 19th International Conference on High Performance Computing, 2012
Proceedings of the 10th Annual IEEE/ACM International Symposium on Code Generation and Optimization, 2012
Proceedings of the 51th IEEE Conference on Decision and Control, 2012
Proceedings of the American Control Conference, 2012
2011
Proceedings of the Encyclopedia of Parallel Computing, 2011
A Scalable Design Methodology for Energy Minimization of STTRAM: A Circuit and Architecture Perspective.
IEEE Trans. Very Large Scale Integr. Syst., 2011
Keeneland: Bringing Heterogeneous GPU Computing to the Computational Science Community.
Comput. Sci. Eng., 2011
Proceedings of the 44rd Annual IEEE/ACM International Symposium on Microarchitecture, 2011
A framework for dynamically instrumenting GPU compute applications within GPU Ocelot.
Proceedings of 4th Workshop on General Purpose Processing on Graphics Processing Units, 2011
Regulating Locality vs. Parallelism Tradeoffs in Multiple Memory Controller Environments.
Proceedings of the 2011 International Conference on Parallel Architectures and Compilation Techniques, 2011
2010
Proceedings of the 2010 International Symposium on Low Power Electronics and Design, 2010
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010
Proceedings of the 8th IEEE International Conference on Control and Automation, 2010
Proceedings of the International Green Computing Conference 2010, 2010
Proceedings of 3rd Workshop on General Purpose Processing on Graphics Processing Units, 2010
HyVM - Hybrid Virtual Machines - Efficient Use of Future Heterogeneous Chip Multiprocessors.
Proceedings of the Architecture of Computing Systems, 2010
Ocelot: a dynamic optimization framework for bulk-synchronous applications in heterogeneous systems.
Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques, 2010
2009
Proceedings of the IEEE Computer Society Annual Symposium on VLSI, 2009
Proceedings of the 2009 IEEE International Symposium on Workload Characterization, 2009
A methodology for robust, energy efficient design of Spin-Torque-Transfer RAM arrays at scaled technologies.
Proceedings of the 2009 International Conference on Computer-Aided Design, 2009
2008
Proceedings of the 17th International Symposium on High-Performance Distributed Computing (HPDC-17 2008), 2008
Proceedings of the High Performance Computing, 2008
Proceedings of the 16th IEEE International Symposium on Field-Programmable Custom Computing Machines, 2008
2007
Proceedings of the 25th International Conference on Computer Design, 2007
Proceedings of the Architecture of Computing Systems, 2007
2006
SIGARCH Comput. Archit. News, 2006
J. Parallel Distributed Comput., 2006
Proceedings of the 24th International Conference on Computer Design (ICCD 2006), 2006
2005
Traffic Scheduling Solutions with QoS Support for an Input-Buffered MultiMedia Router.
IEEE Trans. Parallel Distributed Syst., 2005
2004
ShareStreams: A Scalable Architecture and Hardware Support for High-Speed QoS Packet Schedulers.
Proceedings of the 12th IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM 2004), 2004
A Framework for Compiler Driven Design Space Exploration for Embedded System Customization.
Proceedings of the Advances in Computer Science, 2004
2003
A Hardware Approach to QoS Support in Cluster Environments: The Multimedia Router MMR.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2003
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003
A Solution for Handling Hybrid Traffic in Clustered Environments: The MultiMedia Router MMR.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003
2002
A Tunable Communications Library for Data Injection.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2002
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002
A multimedia router architecture to provide high performance and QoS guarantees to mixed traffic.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002
Proceedings of the 10th Annual IEEE Symposium on High Performance Interconnects (HOTIC 2002), August 21, 2002
Proceedings of the High Performance Computing, 2002
Proceedings of the High Performance Computing, 2002
Proceedings of the IEEE 5th Workshop on Multimedia Signal Processing, 2002
2001
Proceedings of the 15th International Parallel & Distributed Processing Symposium (IPDPS-01), 2001
Proceedings of the Networking, 2001
2000
IEEE Trans. Parallel Distributed Syst., 2000
IEEE Trans. Parallel Distributed Syst., 2000
An Extensible Message Layer for High-Performance Clusters.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2000
Proceedings of the 14th International Parallel & Distributed Processing Symposium (IPDPS'00), 2000
1999
IEEE Trans. Parallel Distributed Syst., 1999
Proceedings of the IEEE International Conference on Microelectronic Systems Education, 1999
Proceedings of the Fifth International Symposium on High-Performance Computer Architecture, 1999
Proceedings of the 8th Heterogeneous Computing Workshop, 1999
Proceedings of the Network-Based Parallel Computing: Communication, 1999
1998
IEEE Trans. Parallel Distributed Syst., 1998
Proceedings of the Fourth IEEE Real-Time Technology and Applications Symposium, 1998
1997
Proceedings of the 18th IEEE Real-Time Systems Symposium (RTSS '97), 1997
Proceedings of the Parallel Computer Routing and Communication, 1997
Proceedings of the 11th International Parallel Processing Symposium (IPPS '97), 1997
Proceedings of the Proceedings 1997 International Conference on Computer Design: VLSI in Computers & Processors, 1997
Architectural Support for Reducing Communication Overhead in Multiprocessor Interconnection Networks.
Proceedings of the 3rd IEEE Symposium on High-Performance Computer Architecture (HPCA '97), 1997
Interconnection networks - an engineering approach.
IEEE, ISBN: 978-0-8186-7800-4, 1997
1996
IEEE Trans. Computers, 1996
Distributed Deadlock-Free Routing in Faulty, Pipelined, Direct Interconnection Networks.
IEEE Trans. Computers, 1996
Paradigms for Modeling and Simulation of Multiprocessor Architectures.
Int. J. Comput. Simul., 1996
Proceedings of the 1996 workshop on Computer architecture education, 1996
Proceedings of the Eighth IEEE Symposium on Parallel and Distributed Processing, 1996
Proceedings of IPPS '96, 1996
Proceedings of the 1996 International Conference on Parallel Processing, 1996
Proceedings of the 3rd International Conference on High Performance Computing, 1996
Proceedings of the 1996 European Design and Test Conference, 1996
1995
IEEE Trans. Parallel Distributed Syst., 1995
ACM Trans. Model. Comput. Simul., 1995
Partitioning and mapping in embedded multiprocessor architectures in the presence of constraints.
Concurr. Pract. Exp., 1995
Proceedings of the 22nd Annual International Symposium on Computer Architecture, 1995
Proceedings of IPPS '95, 1995
Software Based Fault-Tolerant Oblivious Routing in Pipelined Networks.
Proceedings of the 1995 International Conference on Parallel Processing, 1995
1994
IEEE Trans. Knowl. Data Eng., 1994
Int. J. Comput. Simul., 1994
Proceedings of the MASCOTS '94, Proceedings of the Second International Workshop on Modeling, Analysis, and Simulation On Computer and Telecommunication Systems, January 31, 1994
Proceedings of the 21st Annual International Symposium on Computer Architecture. Chicago, 1994
Proceedings of the Proceedings 1994 International Conference on Parallel and Distributed Systems, 1994
1993
Proceedings of the Fifth IEEE Symposium on Parallel and Distributed Processing, 1993
Proceedings of the Fifth IEEE Symposium on Parallel and Distributed Processing, 1993
Proceedings of the Seventh International Parallel Processing Symposium, 1993
1992
Proceedings of the Fourth IEEE Symposium on Parallel and Distributed Processing, 1992
Proceedings of the 6th International Parallel Processing Symposium, 1992
Parallel Optimization and Execution of Large Join Queries.
Proceedings of the International Conference on Fifth Generation Computer Systems. FGCS 1992, 1992
1991
Proceedings of the Third IEEE Symposium on Parallel and Distributed Processing, 1991
1987
IEEE Trans. Computers, 1987
Pattern Recognit., 1987
1985
1984
Proceedings of the First International Conference on Data Engineering, 1984
1982
Comput. Graph. Image Process., 1982