Aydin Buluç

Orcid: 0000-0001-7253-9038

Affiliations:
  • Lawrence Berkeley National Laboratory, USA
  • University of California, Berkeley, USA


According to our database1, Aydin Buluç authored at least 137 papers between 2008 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
DyG-DPCD: A Distributed Parallel Community Detection Algorithm for Large-Scale Dynamic Graphs.
Int. J. Parallel Program., February, 2025

2024
Exabiome: Advancing Microbial Science through Exascale Computing.
Comput. Sci. Eng., 2024

Anonymized Network Sensing Graph Challenge.
CoRR, 2024

What is Normal? A Big Data Observational Science Model of Anonymized Internet Traffic.
CoRR, 2024

Distributed-Memory Randomized Algorithms for Sparse Tensor CP Decomposition.
Proceedings of the 36th ACM Symposium on Parallelism in Algorithms and Architectures, 2024

A Sparsity-Aware Distributed-Memory Algorithm for Sparse-Sparse Matrix Multiplication.
Proceedings of the International Conference for High Performance Computing, 2024

CPMA: An Efficient Batch-Parallel Compressed Set Without Pointers.
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2024

Distributed Matrix-Based Sampling for Graph Neural Network Training.
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024

Fast multiplication of random dense matrices with sparse matrices.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

RDMA-Based Algorithms for Sparse Matrix Multiplication on GPUs.
Proceedings of the 38th ACM International Conference on Supercomputing, 2024

Sparsity-Aware Communication for Distributed Graph Neural Network Training.
Proceedings of the 53rd International Conference on Parallel Processing, 2024

2023
Exploring temporal community evolution: algorithmic approaches and parallel optimization for dynamic community detection.
Appl. Netw. Sci., December, 2023

Fast multiplication of random dense matrices with fixed sparse matrices.
CoRR, 2023

Space Efficient Sequence Alignment for SRAM-Based Computing: X-Drop on the Graphcore IPU.
Proceedings of the International Conference for High Performance Computing, 2023

Fast Exact Leverage Score Sampling from Khatri-Rao Products with Applications to Tensor Decomposition.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

C++ and Interoperability Between Libraries: The GraphBLAS C++ Specification.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

Optimizing Search Layouts in Packed Memory Arrays.
Proceedings of the Symposium on Algorithm Engineering and Experiments, 2023

2022
Combinatorial BLAS 2.0: Scaling Combinatorial Algorithms on Distributed-Memory Systems.
IEEE Trans. Parallel Distributed Syst., 2022

GraphBLAST: A High-Performance Linear Algebra-based Graph Framework on the GPU.
ACM Trans. Math. Softw., 2022

Fast intratumor heterogeneity inference from single-cell sequencing data.
Nat. Comput. Sci., 2022

GraphBLAS on the Edge: High Performance Streaming of Network Traffic.
CoRR, 2022

Extreme-Scale Many-against-Many Protein Similarity Search.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

Scalable Irregular Parallelism with GPUs: Getting CPUs Out of the Way.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

Sparse Matrices Powering Three Pillars of Science: Simulation, Data, and Learning.
Proceedings of the ISSAC '22: International Symposium on Symbolic and Algebraic Computation, Villeneuve-d'Ascq, France, July 4, 2022


Parallel Algorithms for Adding a Collection of Sparse Matrices.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022

GraphBLAS: C++ Iterators for Sparse Matrices.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022

Distributed-Memory Sparse Kernels for Machine Learning.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

Parallel Algorithms for Masked Sparse Matrix-Matrix Products.
Proceedings of the 51st International Conference on Parallel Processing, 2022

Distributed-Memory Parallel Contig Generation for De Novo Long-Read Genome Assembly.
Proceedings of the 51st International Conference on Parallel Processing, 2022

Atos: A Task-Parallel GPU Scheduler for Graph Analytics.
Proceedings of the 51st International Conference on Parallel Processing, 2022

Large Scale Enrichment and Statistical Cyber Characterization of Network Traffic.
Proceedings of the IEEE High Performance Extreme Computing Conference, 2022



2021
Introduction to the Special Issue for SPAA 2018 - Part 2.
ACM Trans. Parallel Comput., 2021

Introduction to the Special Issue for SPAA 2018: Part 1.
ACM Trans. Parallel Comput., 2021

EXAGRAPH: Graph and combinatorial methods for enabling exascale applications.
Int. J. High Perform. Comput. Appl., 2021

Atos: A Task-Parallel GPU Dynamic Scheduling Framework for Dynamic Irregular Computations.
CoRR, 2021

Randomized Algorithms for Scientific Computing (RASC).
CoRR, 2021

10 Years Later: Cloud Computing is Closing the Performance Gap.
Proceedings of the ICPE '21: ACM/SPEC International Conference on Performance Engineering, 2021

Terrace: A Hierarchical Graph Container for Skewed Dynamic Graphs.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Accelerating large scale <i>de novo</i> metagenome assembly using GPUs.
Proceedings of the International Conference for High Performance Computing, 2021

Asynchrony versus bulk-synchrony for a generalized N-body problem from genomics.
Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

Distributed-Memory k-mer Counting on GPUs.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

Communication-Avoiding and Memory-Constrained Sparse Matrix-Matrix Multiplication at Extreme Scale.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

Parallel String Graph Construction and Transitive Reduction for De Novo Genome Assembly.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

Introduction to GraphBLAS 2.0.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2021

Distributed-memory parallel algorithms for sparse times tall-skinny-dense matrix multiplication.
Proceedings of the ICS '21: 2021 International Conference on Supercomputing, 2021

Scaling Generalized N-Body Problems, A Case Study from Genomics.
Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021


BELLA: Berkeley Efficient Long-Read to Long-Read Aligner and Overlapper.
Proceedings of the 2021 SIAM Conference on Applied and Computational Discrete Algorithms, 2021

2020
A Distributed-Memory Algorithm for Computing a Heavy-Weight Perfect Matching on Bipartite Graphs.
SIAM J. Sci. Comput., 2020

Parallel algorithms for finding connected components using linear algebra.
J. Parallel Distributed Comput., 2020

PersGNN: Applying Topological Data Analysis and Geometric Deep Learning to Structure-Based Protein Function Prediction.
CoRR, 2020

The Parallelism Motifs of Genomic Data Analysis.
CoRR, 2020

ADEPT: a domain independent sequence alignment strategy for gpu architectures.
BMC Bioinform., 2020

Reducing communication in graph neural network training.
Proceedings of the International Conference for High Performance Computing, 2020

Distributed many-to-many protein sequence alignment using sparse matrices.
Proceedings of the International Conference for High Performance Computing, 2020

LOGAN: High-Performance GPU-Based X-Drop Long-Read Alignment.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

A High-Throughput Solver for Marginalized Graph Kernels on GPU.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

Optimizing High Performance Markov Clustering for Pre-Exascale Architectures.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

GPU accelerated partial order multiple sequence alignment for long reads self-correction.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

Considerations for a Distributed GraphBLAS API.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

A Roadmap for the GraphBLAS C++ API.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

2019
Performance optimization, modeling and analysis of sparse matrix-matrix products on multi-core and many-core processors.
Parallel Comput., 2019

RDMA vs. RPC for Implementing Distributed Data Structures.
Proceedings of the 9th IEEE/ACM Workshop on Irregular Applications: Architectures and Algorithms, 2019

Graph Coloring on the GPU.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019

LAGraph: A Community Effort to Collect Graph Algorithms Built on Top of the GraphBLAS.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019

LACC: A Linear-Algebraic Algorithm for Finding Connected Components in Distributed Memory.
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

diBELLA: Distributed Long Read to Long Read Alignment.
Proceedings of the 48th International Conference on Parallel Processing, 2019

BCL: A Cross-Platform Distributed Data Structures Library.
Proceedings of the 48th International Conference on Parallel Processing, 2019

H-INDEX: Hash-Indexing for Parallel Triangle Counting on GPUs.
Proceedings of the 2019 IEEE High Performance Extreme Computing Conference, 2019

2018
BCL: A Cross-Platform Distributed Container Library.
CoRR, 2018

A distributed-memory approximation algorithm for maximum weight perfect bipartite matching.
CoRR, 2018

Integrated Model, Batch, and Domain Parallelism in Training Neural Networks.
Proceedings of the 30th on Symposium on Parallelism in Algorithms and Architectures, 2018

Extreme scale de novo metagenome assembly.
Proceedings of the International Conference for High Performance Computing, 2018

High-Performance Sparse Matrix-Matrix Products on Intel KNL and Multicore Architectures.
Proceedings of the 47th International Conference on Parallel Processing, 2018

Implementing Push-Pull Efficiently in GraphBLAS.
Proceedings of the 47th International Conference on Parallel Processing, 2018

Design Principles for Sparse Matrix Multiplication on the GPU.
Proceedings of the Euro-Par 2018: Parallel Processing, 2018

Communication-Avoiding Optimization Methods for Distributed Massive-Scale Sparse Inverse Covariance Estimation.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

2017
Computing Maximum Cardinality Matchings in Parallel on Bipartite Graphs via Tree-Grafting.
IEEE Trans. Parallel Distributed Syst., 2017

A High Performance Block Eigensolver for Nuclear Configuration Interaction Calculations.
IEEE Trans. Parallel Distributed Syst., 2017

Integrated Model and Data Parallelism in Training Neural Networks.
CoRR, 2017

Communication-Avoiding Optimization Methods for Massive-Scale Graphical Model Structure Learning.
CoRR, 2017

Extreme-Scale De Novo Genome Assembly.
CoRR, 2017

Distributed-Memory Breadth-First Search on Massive Graphs.
CoRR, 2017

Scaling deep learning on GPU and knights landing clusters.
Proceedings of the International Conference for High Performance Computing, 2017

MerBench: PGAS Benchmarks for High Performance Genome Assembly.
Proceedings of PAW@SC 2017: Second Annual PGAS Applications Workshop, 2017

Design of the GraphBLAS API for C.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Introduction to GABB Workshop.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

The Reverse Cuthill-McKee Algorithm in Distributed-Memory.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

Towards a GraphBLAS Library in Chapel.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

A Work-Efficient Parallel Sparse Matrix-Sparse Vector Multiplication Algorithm.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

GraphBLAS C API: Ideas for future versions of the specification.
Proceedings of the 2017 IEEE High Performance Extreme Computing Conference, 2017

Performance Characterization of De Novo Genome Assembly on Leading Parallel Systems.
Proceedings of the Euro-Par 2017: Parallel Processing - 23rd International Conference on Parallel and Distributed Computing, Santiago de Compostela, Spain, August 28, 2017

2016
Recent Advances in Graph Partitioning.
Proceedings of the Algorithm Engineering - Selected Results and Surveys, 2016

Exploiting Multiple Levels of Parallelism in Sparse Matrix-Matrix Multiplication.
SIAM J. Sci. Comput., 2016

A matrix-algebraic formulation of distributed-memory maximal cardinality matching algorithms in bipartite graphs.
Parallel Comput., 2016

LiRa: A New Likelihood-Based Similarity Score for Collaborative Filtering.
CoRR, 2016

Communication-Avoiding Parallel Sparse-Dense Matrix-Matrix Multiplication.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Distributed-Memory Algorithms for Maximum Cardinality Matching in Bipartite Graphs.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016


2015
Special issue "Graph analysis for scientific discovery".
Parallel Comput., 2015

Parallel processing of filtered queries in attributed semantic graphs.
J. Parallel Distributed Comput., 2015

HipMer: an extreme-scale de novo genome assembler.
Proceedings of the International Conference for High Performance Computing, 2015

merAligner: A Fully Parallel Sequence Aligner.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

A Parallel Tree Grafting Algorithm for Maximum Cardinality Matching in Bipartite Graphs.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

Parallel Triangle Counting and Enumeration Using Matrix Algebra.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

Graphs, Matrices, and the GraphBLAS: Seven Good Reasons.
Proceedings of the International Conference on Computational Science, 2015

Distributed-Memory Algorithms for Maximal Cardinality Matching Using Matrix Algebra.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

Efficient data reduction for large-scale genetic mapping.
Proceedings of the 6th ACM Conference on Bioinformatics, 2015

2014
Parallel De Bruijn Graph Construction and Traversal for De Novo Genome Assembly.
Proceedings of the International Conference for High Performance Computing, 2014

GABB Introduction.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Optimizing Sparse Matrix-Multiple Vectors Multiplication for Nuclear Configuration Interaction Calculations.
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

Efficient and accurate clustering for large-scale genetic mapping.
Proceedings of the 2014 IEEE International Conference on Bioinformatics and Biomedicine, 2014

2013
Recent Advances in Graph Partitioning.
CoRR, 2013

Communication optimal parallel multiplication of sparse random matrices.
Proceedings of the 25th ACM Symposium on Parallelism in Algorithms and Architectures, 2013

Minimizing Communication in All-Pairs Shortest Paths.
Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

High-Productivity and High-Performance Analysis of Filtered Semantic Graphs.
Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

Distributed Memory Breadth-First Search Revisited: Enabling Bottom-Up Search.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013


2012
Parallel Sparse Matrix-Matrix Multiplication and Indexing: Implementation and Experiments.
SIAM J. Sci. Comput., 2012

A Flexible Open-Source Toolbox for Scalable Complex Graph Analysis.
Proceedings of the Twelfth SIAM International Conference on Data Mining, 2012

Scalable complex graph analysis with the knowledge discovery toolbox.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Graph partitioning for scalable distributed graph computations.
Proceedings of the Graph Partitioning and Graph Clustering, 2012

Can Network-Offload Based Non-blocking Neighborhood MPI Collectives Improve Communication Overheads of Irregular Graph Algorithms?
Proceedings of the 2012 IEEE International Conference on Cluster Computing Workshops, 2012

High-performance analysis of filtered semantic graphs.
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012

2011
The Combinatorial BLAS: design, implementation, and applications.
Int. J. High Perform. Comput. Appl., 2011

Parallel breadth-first search on distributed memory systems.
Proceedings of the Conference on High Performance Computing Networking, 2011

Reduced-Bandwidth Multithreaded Algorithms for Sparse Matrix-Vector Multiplication.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Implementing Sparse Matrices for Graph Algorithms.
Proceedings of the Graph Algorithms in the Language of Linear Algebra, 2011

New Ideas in Sparse Matrix Matrix Multiplication.
Proceedings of the Graph Algorithms in the Language of Linear Algebra, 2011

2010
Solving path problems on the GPU.
Parallel Comput., 2010

Highly Parallel Sparse Matrix-Matrix Multiplication
CoRR, 2010

2009
Parallel sparse matrix-vector and matrix-transpose-vector multiplication using compressed sparse blocks.
Proceedings of the SPAA 2009: Proceedings of the 21st Annual ACM Symposium on Parallelism in Algorithms and Architectures, 2009

2008
On the representation and multiplication of hypersparse matrices.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Challenges and Advances in Parallel Sparse Matrix-Matrix Multiplication.
Proceedings of the 2008 International Conference on Parallel Processing, 2008


  Loading...