Antonino Tumeo

Joseph B. Manzano

Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

Hardware acceleration of complex machine learning models through modern high-level synthesis.

Serena Curzel

Proceedings of the CF '22: 19th ACM International Conference on Computing Frontiers, Turin, Italy, May 17, 2022

SODA-OPT an MLIR based flow for co-design and high-level synthesis.

Serena Curzel

David R. Kaeli

Proceedings of the CF '22: 19th ACM International Conference on Computing Frontiers, Turin, Italy, May 17, 2022

VWC-BERT: Scaling Vulnerability-Weakness-Exploit Mapping on Modern AI Accelerators.

Siddhartha Shankar Das

Proceedings of the IEEE International Conference on Big Data, 2022

MLIR Loop Optimizations for High-Level Synthesis: A Case Study.

Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2022

2021

ARENA: Asynchronous Reconfigurable Accelerator Ring to Enable Data-Centric Parallel Computing.

IEEE Trans. Parallel Distributed Syst., 2021

HAM: Hotspot-Aware Manager for Improving Communications With 3D-Stacked Memory.

IEEE Trans. Computers, 2021

Energy characterization of graph workloads.

Ankur Limaye

Tosiron Adegbija

Sustain. Comput. Informatics Syst., 2021

EXAGRAPH: Graph and combinatorial methods for enabling exascale applications.

Sivasankaran Rajamanickam

Oguz Selvitopi

Nathan R. Tallent

Int. J. High Perform. Comput. Appl., 2021

The future is big graphs: a community view on graph processing systems.

Commun. ACM, 2021

High-Level Synthesis of Parallel Specifications Coupling Static and Dynamic Controllers.

Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

DynPaC: Coarse-Grained, Dynamic, and Partially Reconfigurable Array for Streaming Applications.

Cheng Tan

Tong Geng

Chenhao Xie

Proceedings of the 39th IEEE International Conference on Computer Design, 2021

Automated Generation of Integrated Digital and Spiking Neuromorphic Machine Learning Accelerators.

Serena Curzel

Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2021

AURORA: Automated Refinement of Coarse-Grained Reconfigurable Accelerators.

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2021

Invited: Bambu: an Open-Source Research Framework for the High-Level Synthesis of Complex Applications.

Proceedings of the 58th ACM/IEEE Design Automation Conference, 2021

Towards Automatic and Agile AI/ML Accelerator Design with End-to-End Synthesis.

Jeff Jun Zhang

Gu-Yeon Wei

David Brooks

Proceedings of the 32nd IEEE International Conference on Application-specific Systems, 2021

OpenCGRA: Democratizing Coarse-Grained Reconfigurable Arrays.

Cheng Tan

Jeff Zhang

Proceedings of the 32nd IEEE International Conference on Application-specific Systems, 2021

2020

Introduction to the TOPC Special Issue on Innovations in Systems for Irregular Applications, Part 2.

Fabrizio Petrini

ACM Trans. Parallel Comput., 2020

Introduction to the TOPC Special Issue on Innovations in Systems for Irregular Applications, Part 1.

Fabrizio Petrini

ACM Trans. Parallel Comput., 2020

ARENA: Asynchronous Reconfigurable Accelerator Ring to Enable Data-Centric Parallel Computing.

CoRR, 2020

Preempt: scalable epidemic interventions using submodular optimization on multi-GPU systems.

Prathyush Sambaturu

Anil Vullikanti

Proceedings of the International Conference for High Performance Computing, 2020

AWB-GCN: A Graph Convolutional Network Accelerator with Runtime Workload Rebalancing.

Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

Message from the workshop chairs.

Scott McMillan

Manoj Kumar

Danai Koutra

Tim Mattson

Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

cuRipples: influence maximization on multi-GPU systems.

Maurizio Drocco

Proceedings of the ICS '20: 2020 International Conference on Supercomputing, 2020

OpenCGRA: An Open-Source Unified Framework for Modeling, Testing, and Evaluating CGRAs.

Proceedings of the 38th IEEE International Conference on Computer Design, 2020

SODA: a New Synthesis Infrastructure for Agile Hardware Design of Machine Learning Accelerators.

Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2020

Invited: Software Defined Accelerators From Learning Tools Environment.

Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

2019

Special Issue on: Systems for Learning, Inferencing, and Discovering (SLID).

J. Parallel Distributed Comput., 2019

UWB-GCN: Hardware Acceleration of Graph-Convolution-Network through Runtime Workload Rebalancing.

CoRR, 2019

Advert: An Asynchronous Runtime for Fine-Grained Network Systems.

Proceedings of the IEEE/ACM Third Annual Workshop on Emerging Parallel and Distributed Runtime Systems and Middleware, 2019

PIMS: a lightweight processing-in-memory accelerator for stencil computations.

Proceedings of the International Symposium on Memory Systems, 2019

Introduction to GrAPL 2019.

Tim Mattson

Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019

MAC: Memory Access Coalescer for 3D-Stacked Memory.

Proceedings of the 48th International Conference on Parallel Processing, 2019

Scaling and Quality of Modularity Optimization Methods for Graph Clustering.

Proceedings of the 2019 IEEE High Performance Extreme Computing Conference, 2019

A Parallel Graph Environment for Real-World Data Analytics Workflows.

Maurizio Drocco

Jesun Sahariar Firoz

Thejaka Amila Kanewala

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2019

Data and model convergence: a case for software defined architectures.

Proceedings of the 16th ACM International Conference on Computing Frontiers, 2019

Software defined architectures for data analytics.

Proceedings of the 24th Asia and South Pacific Design Automation Conference, 2019

POSTER: Memory Hotspot Optimization for Data-Intensive Applications.

Proceedings of the 28th International Conference on Parallel Architectures and Compilation Techniques, 2019

2018

Guest Editorial: Special Issue on Computing Frontiers.

Int. J. Parallel Program., 2018

Adaptive anonymization of data using b-edge cover.

Arif Khan

Krzysztof Choromanski

Alex Pothen

S. M. Ferdous

Proceedings of the International Conference for High Performance Computing, 2018

MiniVite: A Graph Analytics Benchmarking Tool for Massively Parallel Systems.

Assefaw H. Gebremedhin

Proceedings of the 2018 IEEE/ACM Performance Modeling, 2018

Introduction to GraML 2018.

Assefaw Hadish Gebremedhin

Abhinav Vishnu

Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

Distributed Louvain Algorithm for Graph Community Detection.

Hao Lu

Assefaw Hadish Gebremedhin

Arif Khan

Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Scalable Distributed Memory Community Detection Using Vite.

Assefaw H. Gebremedhin

Proceedings of the 2018 IEEE High Performance Extreme Computing Conference, 2018

2017

Exploring Efficient Hardware Support for Applications with Irregular Memory Patterns on Multinode Manycore Architectures.

IEEE Trans. Parallel Distributed Syst., 2017

Exploring performance and energy tradeoffs for irregular applications: A case study on the Tilera many-core architecture.

Ajay Panyala

Joseph B. Manzano

J. Parallel Distributed Comput., 2017

Introduction to GraML Workshop.

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Community Detection on the GPU.

Md. Naim

Fredrik Manne

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

Exploring DataVortex Systems for Irregular Applications.

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

Scalable static and dynamic community detection using Grappolo.

Hao Lu

Proceedings of the 2017 IEEE High Performance Extreme Computing Conference, 2017

Architecture independent integrated early performance and energy estimation.

Proceedings of the Eighth International Green and Sustainable Computing Conference, 2017

Pushing the Limits of Irregular Access Patterns on Emerging Network Architecture: A Case Study.

Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017

2016

Special Issue on Theory and Practice of Irregular Applications (TaPIA).

Parallel Comput., 2016

Assessing Advanced Technology in CENATE.

Proceedings of the IEEE International Conference on Networking, 2016

Modeling the Impact of Silicon Photonics on Graph Analytics.

Nathan R. Tallent

Kevin J. Barker

Andrès Márquez

Darren J. Kerbyson

Adolfy Hoisie

Proceedings of the IEEE International Conference on Networking, 2016

Efficient synthesis of graph methods: a dynamically scheduled architecture.

Proceedings of the 35th International Conference on Computer-Aided Design, 2016

Exploring Data Vortex Network Architectures.

Proceedings of the 24th IEEE Annual Symposium on High-Performance Interconnects, 2016

A dynamically scheduled architecture for the synthesis of graph methods.

Proceedings of the 2016 IEEE Hot Chips 28 Symposium (HCS), 2016

A Dynamically Scheduled Architecture for the Synthesis of Graph Database Queries.

Proceedings of the 24th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2016

Enabling the high level synthesis of data analytics accelerators.

Proceedings of the Eleventh IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis, 2016

2015

Special Issue on Architectures and Algorithms for Irregular Applications (AAIA) - Guest editors' introduction.

J. Parallel Distributed Comput., 2015

Irregular Applications: From Architectures to Algorithms [Guest editors' introduction].

Computer, 2015

In-Memory Graph Databases for Web-Scale Data.

Computer, 2015

High Level Synthesis of RDF Queries for Graph Analytics.

Proceedings of the IEEE/ACM International Conference on Computer-Aided Design, 2015

Optimizing Approximate Weighted Matching on Nvidia Kepler K40.

Md. Naim

Fredrik Manne

Johannes Langguth

Proceedings of the 22nd IEEE International Conference on High Performance Computing, 2015

Inter-procedural resource sharing in High Level Synthesis through function proxies.

Proceedings of the 25th International Conference on Field Programmable Logic and Applications, 2015

Function Proxies for Improved Resource Sharing in High Level Synthesis.

Proceedings of the 23rd IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2015

High-Performance, Distributed Dictionary Encoding of RDF Datasets.

Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

Optimizing irregular applications for energy and performance on the Tilera many-core architecture.

Ajay Panyala

Joseph B. Manzano

Proceedings of the 12th ACM International Conference on Computing Frontiers, 2015

Power and performance trade-offs for Space Time Adaptive Processing.

Proceedings of the 26th IEEE International Conference on Application-specific Systems, 2015

GEMS: Graph Database Engine for Multithreaded Systems.

Jesse Weaver

Gregory Todd Williams

David J. Haglin

Proceedings of the Big Data - Algorithms, Analytics, and Applications, 2015

2014

Toward a data scalable solution for facilitating discovery of science resources.

Jesse Weaver

Parallel Comput., 2014

Scaling Semantic Graph Databases in Size and Performance.

IEEE Micro, 2014

Scaling Irregular Applications through Data Aggregation and Software Multithreading.

Mateo Valero

Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

High-level synthesis of memory bound and irregular parallel applications with Bambu.

Proceedings of the 2014 IEEE Hot Chips 26 Symposium (HCS), 2014

An adaptive Memory Interface Controller for improving bandwidth utilization of hybrid and reconfigurable systems.

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2014

A Flexible CUDA LU-Based Solver for Small, Batched Linear Systems.

Nitin Gawande

Proceedings of the Numerical Computations with GPUs, 2014

2013

Composing Data Parallel Code for a SPARQL Graph Engine.

Proceedings of the International Conference on Social Computing, SocialCom 2013, 2013

Toward a data scalable solution for facilitating discovery of scientific data resources.

Proceedings of the 2013 International Workshop on Data-Intensive Scalable Computing Systems, 2013

YAPPA: A compiler-based parallelization framework for irregular applications on MPSoCs.

Proceedings of the 24th IEEE International Symposium on Rapid System Prototyping, 2013

Prototyping hardware support for irregular applications.

Proceedings of the 2013 Workshop on Rapid Simulation and Performance Evaluation: Methods and Tools, 2013

Exploring manycore multinode systems for irregular applications with FPGA prototyping.

Proceedings of the 2013 IEEE Hot Chips 25 Symposium (HCS), 2013

Power/Performance Trade-Offs of Small Batched LU Based Solvers on GPUs.

Proceedings of the Euro-Par 2013 Parallel Processing, 2013

Accelerating subsurface transport simulation on heterogeneous clusters.

Nitin Gawande

Proceedings of the 2013 IEEE International Conference on Cluster Computing, 2013

Accelerating semantic graph databases on commodity clusters.

Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

Exploring hardware support for scaling irregular applications on multi-node multi-core architectures.

Proceedings of the 24th International Conference on Application-Specific Systems, 2013

Ant Colony Optimization for mapping, scheduling and placing in reconfigurable systems.

Proceedings of the 2013 NASA/ESA Conference on Adaptive Hardware and Systems, 2013

2012

Fast and Accurate Simulation of the Cray XMT Multithreaded Supercomputer.

IEEE Trans. Parallel Distributed Syst., 2012

Aho-Corasick String Matching on Shared and Distributed-Memory Parallel Architectures.

IEEE Trans. Parallel Distributed Syst., 2012

Approximate weighted matching on emerging manycore and multithreaded architectures.

Int. J. High Perform. Comput. Appl., 2012

Designing Next-Generation Massively Multithreaded Architectures for Irregular Applications.

Computer, 2012

A High Performance Computing Network and System Simulator for the Power Grid: NGNS^2.

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Efficient Sorting on the Tilera Manycore Architecture.

Proceedings of the IEEE 24th International Symposium on Computer Architecture and High Performance Computing, 2012

A Bandwidth-Optimized Multi-core Architecture for Irregular Applications.

Proceedings of the 12th IEEE/ACM International Symposium on Cluster, 2012

2011

Towards efficient execution of irregular applications: panel outline.

Proceedings of the first workshop on Irregular applications: architectures and algorithm, 2011

Irregular applications: architectures & algorithms.

Proceedings of the first workshop on Irregular applications: architectures and algorithm, 2011

Contention Modeling for Multithreaded Distributed Shared Memory Machines: The Cray XMT.

Proceedings of the 11th IEEE/ACM International Symposium on Cluster, 2011

Experiences with String Matching on the Fermi Architecture.

Proceedings of the Architecture of Computing Systems - ARCS 2011, 2011

Emulating Transactional Memory on FPGA Multiprocessors.

Proceedings of the Architecture of Computing Systems - ARCS 2011, 2011

2010

Ant Colony Heuristic for Mapping and Scheduling Tasks and Communications on Heterogeneous Embedded Systems.

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2010

Accelerating DNA analysis applications on GPU clusters.

Proceedings of the IEEE 8th Symposium on Application Specific Processors, 2010

Multiprocessor systems-on-chip synthesis using multi-objective evolutionary computation.

Proceedings of the Genetic and Evolutionary Computation Conference, 2010

A Compact Transactional Memory Multiprocessor System on FPGA.

Proceedings of the International Conference on Field Programmable Logic and Applications, 2010

A reconfigurable multiprocessor architecture for a reliable face recognition implementation.

Proceedings of the Design, Automation and Test in Europe, 2010

Efficient pattern matching on GPUs for intrusion detection systems.
