Cevdet Aykanat

Orcid: 0000-0002-4559-1321

According to our database, Cevdet Aykanat authored at least 142 papers between 1987 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.








Stochastic Gradient Descent for matrix completion: Hybrid parallelization on shared- and distributed-memory systems.
Knowl. Based Syst., January, 2024

Scaling Stratified Stochastic Gradient Descent for Distributed Matrix Completion.
IEEE Trans. Knowl. Data Eng., October, 2023

Minimizing Staleness and Communication Overhead in Distributed SGD for Collaborative Filtering.
IEEE Trans. Computers, October, 2023

Enhancing Block Cimmino for Sparse Linear Systems with Dense Columns via Schur Complement.
SIAM J. Sci. Comput., April, 2023

Load balanced locality-aware parallel SGD on multicore architectures for latent factor based collaborative filtering.
Future Gener. Comput. Syst., 2023

Scalable Unsupervised ML: Latency Hiding in Distributed Sparse Tensor Decomposition.
IEEE Trans. Parallel Distributed Syst., 2022

Partitioning and Reordering for Spike-Based Distributed-Memory Parallel Gauss-Seidel.
SIAM J. Sci. Comput., 2022

Simultaneous Computational and Data Load Balancing in Distributed-Memory Setting.
SIAM J. Sci. Comput., 2022

Partitioning Models for General Medium-Grain Parallel Sparse Tensor Decomposition.
IEEE Trans. Parallel Distributed Syst., 2021

True Load Balancing for Matricized Tensor Times Khatri-Rao Product.
IEEE Trans. Parallel Distributed Syst., 2021

Fast shared-memory streaming multilevel graph partitioning.
J. Parallel Distributed Comput., 2021

Reduce Operations: Send Volume Balancing While Minimizing Latency.
IEEE Trans. Parallel Distributed Syst., 2020

Cartesian Partitioning Models for 2D and 3D Parallel SpGEMM Algorithms.
IEEE Trans. Parallel Distributed Syst., 2020

Reordering sparse matrices into block-diagonal column-overlapped form.
J. Parallel Distributed Comput., 2020

Scaling sparse matrix-matrix multiplication in the Accumulo database.
Distributed Parallel Databases, 2020

Cascade-aware partitioning of large graph databases.
VLDB J., 2019

Spatiotemporal Graph and Hypergraph Partitioning Models for Sparse Matrix-Vector Multiplication on Many-Core Architectures.
IEEE Trans. Parallel Distributed Syst., 2019

A Hypergraph Partitioning Model for Profile Minimization.
SIAM J. Sci. Comput., 2019

Locality-aware and load-balanced static task scheduling for MapReduce.
Future Gener. Comput. Syst., 2019

Regularizing irregularly sparse point-to-point communications.
Proceedings of the International Conference for High Performance Computing, 2019

Improving Medium-Grain Partitioning for Scalable Sparse Tensor Decomposition.
IEEE Trans. Parallel Distributed Syst., 2018

Partitioning Models for Scaling Parallel Sparse Matrix-Matrix Multiplication.
ACM Trans. Parallel Comput., 2018

A Novel Partitioning Method for Accelerating the Block Cimmino Algorithm.
SIAM J. Sci. Comput., 2018

1.5D Parallel Sparse Matrix-Vector Multiply.
SIAM J. Sci. Comput., 2018

Optimizing nonzero-based sparse matrix partitioning models via reducing latency.
J. Parallel Distributed Comput., 2018

A Recursive Hypergraph Bipartitioning Framework for Reducing Bandwidth and Latency Costs Simultaneously.
IEEE Trans. Parallel Distributed Syst., 2017

Exploiting Locality in Sparse Matrix-Matrix Multiplication on Many-Core Architectures.
IEEE Trans. Parallel Distributed Syst., 2017

Parallel Minimum Norm Solution of Sparse Block Diagonal Column Overlapped Underdetermined Systems.
ACM Trans. Math. Softw., 2017

A machine learning approach for result caching in web search engines.
Inf. Process. Manag., 2017

Addressing Volume and Latency Overheads in 1D-parallel Sparse Matrix-Vector Multiplication.
Proceedings of the Euro-Par 2017: Parallel Processing - 23rd International Conference on Parallel and Distributed Computing, Santiago de Compostela, Spain, August 28, 2017

Parallelization of Sparse Matrix Kernels for Big Data Applications.
Proceedings of the Resource Management for Big Data Platforms, 2016

Locality-Aware Parallel Sparse Matrix-Vector and Matrix-Transpose-Vector Multiplication on Many-Core Processors.
IEEE Trans. Parallel Distributed Syst., 2016

Reducing latency cost in 2D sparse matrix partitioning models.
Parallel Comput., 2016

Improving performance of sparse matrix dense matrix multiplication on large-scale parallel systems.
Parallel Comput., 2016

A Novel Method for Scaling Iterative Solvers: Avoiding Latency Overhead of Parallel Sparse-Matrix Vector Multiplies.
IEEE Trans. Parallel Distributed Syst., 2015

Semi-two-dimensional Partitioning for Parallel Sparse Matrix-Vector Multiplication.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

Improving the Performance of Independent Task Assignment Heuristics MinMin, MaxMin and Sufferage.
IEEE Trans. Parallel Distributed Syst., 2014

Temporal Workload-Aware Replicated Partitioning for Social Networks.
IEEE Trans. Knowl. Data Eng., 2014

Simultaneous Input and Output Matrix Partitioning for Outer-Product-Parallel Sparse Matrix-Matrix Multiplication.
SIAM J. Sci. Comput., 2014

Constrained Min-Cut Replication for K-Way Hypergraph Partitioning.
INFORMS J. Comput., 2014

1-D and 2-D Parallel Algorithms for All-Pairs Similarity Problem.
CoRR, 2014

A term-based inverted index partitioning model for efficient distributed query processing.
ACM Trans. Web, 2013

Query-Log Aware Replicated Declustering.
IEEE Trans. Parallel Distributed Syst., 2013

Hypergraph Partitioning Based Models and Methods for Exploiting Cache Locality in Sparse Matrix-Vector Multiplication.
SIAM J. Sci. Comput., 2013

A Recursive Bipartitioning Algorithm for Permuting Sparse Square Matrices into Block Diagonal Form with Overlap.
SIAM J. Sci. Comput., 2013

Document replication strategies for geographically distributed web search engines.
Inf. Process. Manag., 2013

Active node determination for correlated data gathering in wireless sensor networks.
Comput. Networks, 2013

Incorporating the surfing behavior of web users into PageRank.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Multilevel Segmentation of Histopathological Images Using Cooccurrence of Tissue Objects.
IEEE Trans. Biomed. Eng., 2012

Partitioning Hypergraphs in Scientific Computing Applications through Vertex Separators on Graphs.
SIAM J. Sci. Comput., 2012

Replicated partitioning for undirected hypergraphs.
J. Parallel Distributed Comput., 2012

Analyzing and enhancing OSKI for sparse matrix-vector multiplication.
CoRR, 2012

Technical Report on Hypergraph-Partitioning-Based Models and Methods for Exploiting Cache Locality in Sparse-Matrix Vector Multiplication.
CoRR, 2012

A Parallel Framework for In-Memory Construction of Term-Partitioned Inverted Indexes.
Comput. J., 2012

Investigation of Load Balancing Scalability in Space Plasma Simulations.
Proceedings of the Applied Parallel and Scientific Computing, 2012

Hypergraph Partitioning.
Proceedings of the Encyclopedia of Parallel Computing, 2011

PaToH (Partitioning Tool for Hypergraphs).
Proceedings of the Encyclopedia of Parallel Computing, 2011

Parallel Frequent Item Set Mining with Selective Item Replication.
IEEE Trans. Parallel Distributed Syst., 2011

Site-Based Partitioning and Repartitioning Techniques for Parallel PageRank Computation.
IEEE Trans. Parallel Distributed Syst., 2011

Hypergraph Partitioning-Based Fill-Reducing Ordering for Symmetric Matrices.
SIAM J. Sci. Comput., 2011

Improving the Load Balancing Performance of Vlasiator.
CoRR, 2011

Hypergraph Partitioning through Vertex Separators on Graphs.
CoRR, 2011

Energy-price-driven query processing in multi-center web search engines.
Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Hypergraph-Theoretic Partitioning Models for Parallel Web Crawling.
Proceedings of the Computer and Information Sciences II, 2011

Memory Resident Parallel Inverted Index Construction.
Proceedings of the Computer and Information Sciences II, 2011

CoDet: sentence-based containment detection in news corpora.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

On Two-Dimensional Sparse Matrix Partitioning: Models, Methods, and a Recipe.
SIAM J. Sci. Comput., 2010

A Matrix Partitioning Interface to PaToH in MATLAB.
Parallel Comput., 2010

Efficient successor retrieval operations for aggregate query processing on clustered road networks.
Inf. Sci., 2010

A link-based storage scheme for efficient aggregate query processing on clustered road networks.
Inf. Syst., 2010

Query forwarding in geographically distributed search engines.
Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Data and Model Driven Hybrid Approach to Activity Scoring of Cyclic Pathways.
Proceedings of the Computer and Information Sciences, 2010

Selective Replicated Declustering for Arbitrary Queries.
Proceedings of the Euro-Par 2009 Parallel Processing, 2009

A Model for Task Repartitioning under Data Replication.
Proceedings of the Combinatorial Scientific Computing, 01.02. - 06.02.2009, 2009

One-dimensional partitioning for heterogeneous systems: Theory and practice.
J. Parallel Distributed Comput., 2008

Multi-level direct K-way hypergraph partitioning with multiple constraints and fixed vertices.
J. Parallel Distributed Comput., 2008

Clustering spatial networks for aggregate query processing: A hypergraph approach.
Inf. Syst., 2008

Chat mining: Predicting user and message attributes in computer-mediated communication.
Inf. Process. Manag., 2008

Hypergraph-Partitioning-Based Remapping Models for Image-Space-Parallel Direct Volume Rendering of Unstructured Grids.
IEEE Trans. Parallel Distributed Syst., 2007

Partitioning Sparse Matrices for Parallel Preconditioned Iterative Methods.
SIAM J. Sci. Comput., 2007

Revisiting Hypergraph Models for Sparse Matrix Partitioning.
SIAM Rev., 2007

Parallel image restoration using surrogate constraint methods.
J. Parallel Distributed Comput., 2007

Heuristics for scheduling file-sharing tasks on heterogeneous systems with distributed repositories.
J. Parallel Distributed Comput., 2007

Adaptive decomposition and remapping algorithms for object-space-parallel direct volume rendering of unstructured grids.
J. Parallel Distributed Comput., 2007

Architecture of a grid-enabled Web search engine.
Inf. Process. Manag., 2007

Iterative-Improvement-Based Heuristics for Adaptive Scheduling of Tasks Sharing Files on Heterogeneous Master-Slave Environments.
IEEE Trans. Parallel Distributed Syst., 2006

Task assignment in heterogeneous computing systems.
J. Parallel Distributed Comput., 2006

Performance of query processing implementations in ranking-based text retrieval systems using inverted indices.
Inf. Process. Manag., 2006

A Web-Site-Based Partitioning Technique for Reducing Preprocessing Overhead of Parallel PageRank Computation.
Proceedings of the Applied Parallel Computing. State of the Art in Scientific Computing, 2006

Effect of Inverted Index Partitioning Schemes on Performance of Query Processing in Parallel Text Retrieval Systems.
Proceedings of the Computer and Information Sciences, 2006

Chat Mining for Gender Prediction.
Proceedings of the Advances in Information Systems, 4th International Conference, 2006

Iterative-improvement-based declustering heuristics for multi-disk databases.
Inf. Syst., 2005

Encapsulating Multiple Communication-Cost Metrics in Partitioning Sparse Rectangular Matrices for Parallel Matrix-Vector Multiplies.
SIAM J. Sci. Comput., 2004

Permuting Sparse Rectangular Matrices into Block-Diagonal Form.
SIAM J. Sci. Comput., 2004

Fast optimal load balancing algorithms for 1D partitioning.
J. Parallel Distributed Comput., 2004

Hypergraph Models and Algorithms for Data-Pattern-Based Clustering.
Data Min. Knowl. Discov., 2004

A Message Ordering Problem in Parallel Programs.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2004

Data-Parallel Web Crawling Models.
Proceedings of the Computer and Information Sciences, 2004

A Space Optimization for FP-Growth.
Proceedings of the FIMI '04, 2004

Direct volume rendering of unstructured grids.
Comput. Graph., 2003

Minimizing Communication Cost in Fine-Grain Partitioning of Sparse Matrices.
Proceedings of the Computer and Information Sciences, 2003

Image-Space-Parallel Direct Volume Rendering on a Cluster of PCs.
Proceedings of the Computer and Information Sciences, 2003

Adaptive Routing on the New Switch Chip for IBM SP Systems.
J. Parallel Distributed Comput., 2001

A hypergraph-partitioning approach for coarse-grain decomposition.
Proceedings of the 2001 ACM/IEEE conference on Supercomputing, 2001

A Fine-Grain Hypergraph Model for 2D Decomposition of Sparse Matrices.
Proceedings of the 15th International Parallel & Distributed Processing Symposium (IPDPS-01), 2001

Image-Space Decomposition Algorithms for Sort-First Parallel Volume Rendering of Unstructured Grids.
J. Supercomput., 2000

Hypergraph-Partitioning-Based Decomposition for Parallel Sparse-Matrix Vector Multiplication.
IEEE Trans. Parallel Distributed Syst., 1999

A fast neural-network algorithm for VLSI cell placement.
Neural Networks, 1998

Object-space parallel polygon rendering on hypercubes.
Comput. Graph., 1998

A parallel scaled conjugate-gradient algorithm for the solution phase of gathering radiosity on hypercubes.
Vis. Comput., 1997

Two novel multiway circuit partitioning algorithms using relaxed locking.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 1997

A Comparison of Spatial Subdivision Algorithms for Sort-First Rendering.
Proceedings of the High-Performance Computing and Networking, 1997

Sparse matrix decomposition with optimal load balancing.
Proceedings of the Fourth International Conference on High-Performance Computing, 1997

A parallel progressive radiosity algorithm based on patch data circulation.
Comput. Graph., 1996

An Effective Model to Decompose Linear Programs for Parallel Solution.
Proceedings of the Applied Parallel Computing, 1996

Decomposing Irregularly Sparse Matrices for Parallel Matrix-Vector Multiplication.
Proceedings of the Parallel Algorithms for Irregularly Structured Problems, 1996

Adaptive Source Routing in Multistage Interconnection Networks.
Proceedings of IPPS '96, 1996

Active Pixel Merging on Hypercube Multicomputers.
Proceedings of the High-Performance Computing and Networking, 1996

Efficient Fast Hartley Transform Algorithms for Hypercube-Connected Multicomputers.
IEEE Trans. Parallel Distributed Syst., 1995

Circuit partitioning using mean field annealing.
Neurocomputing, 1995

Efficient Multiple-Way Graph Partitioning Algorithms.
Proceedings of the Seventh SIAM Conference on Parallel Processing for Scientific Computing, 1995

Comparison of Two Image-Space Subdivision Algorithms for Direct Volume Rendering on Distributed-Memory Multicomputers.
Proceedings of the Applied Parallel Computing, 1995

Decomposing Linear Programs for Parallel Solution.
Proceedings of the Applied Parallel Computing, 1995

An Efficient Mean Field Annealing Formulation for Mapping Unstructured Domains to Hypercubes.
Proceedings of the Parallel Algorithms for Irregularly Structured Problems, 1995

Efficient Overlapped FFT Algorithms for Hypercube-connected Multicomputers.
Parallel Algorithms Appl., 1994

Efficient parallel spatial subdivision algorithm for object-based parallel ray tracing.
Comput. Aided Des., 1994

Routing Algorithms for IBM SP1.
Proceedings of the Parallel Computer Routing and Communication, 1994

A Global Routing Heuristic for FPGAs Based on Mean Field Annealing.
Proceedings of the Field-Programmable Logic, 1994

An Efficient Mapping Heuristic for Mesh-Connected Parallel Architectures Based on Mean Field Annealing.
Proceedings of the Parallel Processing: CONPAR 94, 1994

Progressive refinement radiosity on ring-connected multicomputers.
Proceedings of the 1993 Symposium on Parallel Rendering, 1993

A New Mapping Heuristic Based on Mean Field Annealing.
J. Parallel Distributed Comput., 1992

A Fault-Tolerant Hexagonal Systolic Array.
Inf. Process. Lett., 1992

Circuit partitioning using parallel mean field annealing algorithms.
Proceedings of the Third IEEE Symposium on Parallel and Distributed Processing, 1991

Efficient Parallel Maze Routing Algorithms on a Hypercube Multicomputer.
Proceedings of the International Conference on Parallel Processing, 1991

An Overlapped FFT Algorithm for Hypercube Multicomputers.
Proceedings of the International Conference on Parallel Processing, 1991

Parallelization of Lee's Routing Algorithm on a Hypercube Multicomputer.
Proceedings of the Distributed Memory Computing, 2nd European Conference, 1991

Vectorization and parallelization of the conjugate gradient algorithm on hypercube-connected vector processors.
Microprocessing and Microprogramming, 1990

Iterative Algorithms for Solution of Large Sparse Systems of Linear Equations on Hypercubes.
IEEE Trans. Computers, 1988

A Reconfiguration Algorithm for Fault Tolerance in a Hypercube Multiprocessor.
Inf. Process. Lett., 1988

Logic fault simulation on a vector hypercube multiprocessor.
Proceedings of the Third Conference on Hypercube Concurrent Computers and Applications, 1988

Implementation of the conjugate gradient algorithm on a vector hypercube multiprocessor.
Proceedings of the Third Conference on Hypercube Concurrent Computers and Applications, 1988

Large Grain Parallel Conjugate Gradient Algorithms on a Hypercube Multiprocessor.
Proceedings of the International Conference on Parallel Processing, 1987
