Narayanan Sundaram

According to our database, Narayanan Sundaram authored at least 39 papers between 2008 and 2021.

First-Generation Inference Accelerator Deployment at Facebook.
CoRR, 2021

The LDBC Graphalytics Benchmark.
CoRR, 2020

Matrix-normal models for fMRI analysis.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

Bridging the Gap between HPC and Big Data frameworks.
Proc. VLDB Endow., 2017

High-Performance Incremental SVM Learning on Intel® Xeon Phi™ Processors.
Proceedings of the High Performance Computing - 32nd International Conference, 2017

Deep learning at 15PF: supervised and semi-supervised classification for scientific data.
Proceedings of the International Conference for High Performance Computing, 2017

Galactos: computing the anisotropic 3-point correlation function for 2 billion galaxies.
Proceedings of the International Conference for High Performance Computing, 2017

Exploring optimizations on shared-memory platforms for parallel triangle counting algorithms.
Proceedings of the 2017 IEEE High Performance Extreme Computing Conference, 2017

Efficient Approximation Algorithms for Weighted b-Matching.
SIAM J. Sci. Comput., 2016

LDBC Graphalytics: A Benchmark for Large-Scale Graph Analysis on Parallel and Distributed Platforms.
Proc. VLDB Endow., 2016

Designing scalable b-Matching algorithms on distributed memory multiprocessors by approximation.
Proceedings of the International Conference for High Performance Computing, 2016

Graphicionado: A high-performance and energy-efficient accelerator for graph analytics.
Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016

PANDA: Extreme Scale Parallel K-Nearest Neighbor on Distributed Architectures.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

GraphPad: Optimized Graph Primitives for Parallel and Distributed Platforms.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Data tiering in heterogeneous memory systems.
Proceedings of the Eleventh European Conference on Computer Systems, 2016

GraphIn: An Online High Performance Incremental Graph Processing Framework.
Proceedings of the Euro-Par 2016: Parallel Processing, 2016

Real-time full correlation matrix analysis of fMRI data.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

GraphMat: High performance graph analytics made productive.
Proc. VLDB Endow., 2015

GraphMat: High performance graph analytics made productive.
CoRR, 2015

Parallel Efficient Sparse Matrix-Matrix Multiplication on Multicore Platforms.
Proceedings of the High Performance Computing - 30th International Conference, 2015

Exploiting NVM in large-scale graph analytics.
Proceedings of the 3rd Workshop on Interactions of NVM/FLASH with Operating Systems and Workloads, 2015

Full correlation matrix analysis of fMRI data on Intel® Xeon Phi™ coprocessors.
Proceedings of the International Conference for High Performance Computing, 2015

BD-CATS: big data clustering at trillion particle scale.
Proceedings of the International Conference for High Performance Computing, 2015

Improving graph partitioning for modern graphs and architectures.
Proceedings of the 5th Workshop on Irregular Applications - Architectures and Algorithms, 2015

Scalable Bayesian Optimization Using Deep Neural Networks.
Proceedings of the 32nd International Conference on Machine Learning, 2015

Sparsifying Synchronization for High-Performance Shared-Memory Sparse Triangular Solver.
Proceedings of the Supercomputing - 29th International Conference, 2014

GenBase: a complex analytics genomics benchmark.
Proceedings of the International Conference on Management of Data, 2014

Navigating the maze of graph analytics frameworks using massive graph datasets.
Proceedings of the International Conference on Management of Data, 2014

Pardicle: Parallel Approximate Density-Based Clustering.
Proceedings of the International Conference for High Performance Computing, 2014

Streaming Similarity Search over one Billion Tweets using Parallel Locality-Sensitive Hashing.
Proc. VLDB Endow., 2013

Making Computer Vision Computationally Efficient.
PhD thesis, 2012

Long term video segmentation through pixel level spectral clustering on GPUs.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Considerations When Evaluating Microprocessor Platforms.
Proceedings of the 3rd USENIX Workshop on Hot Topics in Parallelism, 2011

PALLAS: Mapping Applications onto Manycore.
Proceedings of the Multiprocessor System-on-Chip - Hardware Design and Tool Integration, 2011

Dense Point Trajectories by GPU-Accelerated Large Displacement Optical Flow.
Proceedings of the Computer Vision, 2010

A framework for efficient and scalable execution of domain-specific templates on GPUs.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Efficient, high-quality image contour detection.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Optimizing the use of GPU memory in applications with large data sets.
Proceedings of the 16th International Conference on High Performance Computing, 2009

Fast support vector machine training and classification on graphics processors.
Proceedings of the Machine Learning, 2008
