Proceedings of the Euro-Par 2023: Parallel Processing - 29th International Conference on Parallel and Distributed Computing, Limassol, Cyprus, August 28, 2023

Asynchronous Decentralized Bayesian Optimization for Large Scale Hyperparameter Optimization.

[BibT_eX]

[DOI]

Romain Égelé

Isabelle Guyon

Venkatram Vishwanath

Prasanna Balaprakash

Proceedings of the 19th IEEE International Conference on e-Science, 2023

2022

PythonFOAM: In-situ data analyses with OpenFOAM and Python.

[BibT_eX]

[DOI]

J. Comput. Sci., 2022

Intelligent resolution: Integrating Cryo-EM with AI-driven multi-resolution simulations to observe the severe acute respiratory syndrome coronavirus-2 replication-transcription machinery in action.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2022

Operation-Level Performance Benchmarking of Graph Neural Networks for Scientific Applications.

[BibT_eX]

[DOI]

Ryien Hosseini

Filippo Simini

Venkatram Vishwanath

CoRR, 2022

Asynchronous Distributed Bayesian Optimization at HPC Scale.

[BibT_eX]

[DOI]

CoRR, 2022

Neural Architecture Search for Transformers: A Survey.

[BibT_eX]

[DOI]

Krishna Teja Chitty-Venkata

Murali Emani

Venkatram Vishwanath

Arun K. Somani

IEEE Access, 2022

AI Benchmarking for Science: Efforts from the MLCommons Science Working Group.

[BibT_eX]

[DOI]

Christine R. Kirkpatrick

Proceedings of the High Performance Computing. ISC High Performance 2022 International Workshops - Hamburg, Germany, May 29, 2022

A Comprehensive Evaluation of Novel AI Accelerators for Deep Learning Workloads.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Workshop on Performance Modeling, 2022

Efficient Design Space Exploration for Sparse Mixed Precision Neural Architectures.

[BibT_eX]

[DOI]

Krishna Teja Chitty-Venkata

Murali Emani

Venkatram Vishwanath

Arun K. Somani

Proceedings of the HPDC '22: The 31st International Symposium on High-Performance Parallel and Distributed Computing, Minneapolis, MN, USA, 27 June 2022, 2022

HDF5 Cache VOL: Efficient and Scalable Parallel I/O through Caching Data on Node-local Storage.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Symposium on Cluster, 2022

Toward an In-Depth Analysis of Multifidelity High Performance Computing Systems.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Symposium on Cluster, 2022

Stimulus: Accelerate Data Management for Scientific AI applications in HPC.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Symposium on Cluster, 2022

2021

Accelerating Scientific Applications With SambaNova Reconfigurable Dataflow Architecture.

[BibT_eX]

[DOI]

Volodymyr V. Kindratenko

Anne C. Elster

Comput. Sci. Eng., 2021

MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems.

[BibT_eX]

[DOI]

CoRR, 2021

AgEBO-tabular: joint neural architecture and hyperparameter search with autotuned data-parallel training for tabular data.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2021

Stream-AI-MD: streaming AI-driven adaptive molecular simulations for heterogeneous computing platforms.

[BibT_eX]

[DOI]

Proceedings of the PASC '21: Platform for Advanced Scientific Computing Conference, 2021

MLPerf™ HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM Workshop on Machine Learning in High Performance Computing Environments, 2021

DLIO: A Data-Centric Benchmark for Scientific Deep Learning Applications.

[BibT_eX]

[DOI]

Proceedings of the 21st IEEE/ACM International Symposium on Cluster, 2021

2020

A machine learning workflow for molecular analysis: application to melting points.

[BibT_eX]

[DOI]

Ganesh Sivaraman

Nicholas E. Jackson

Benjamín Sánchez-Lengeling

Álvaro Vázquez-Mayagoitia

Alán Aspuru-Guzik

Venkatram Vishwanath

Juan J. de Pablo

Mach. Learn. Sci. Technol., 2020

ExaHDF5: Delivering Efficient Parallel I/O on Exascale Computing Systems.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., 2020

A terminology for in situ visualization and analysis systems.

[BibT_eX]

[DOI]

Christopher R. Johnson

Int. J. High Perform. Comput. Appl., 2020

AgEBO-Tabular: Joint Neural Architecture and Hyperparameter Search with Autotuned Data-Parallel Training for Tabular Data.

[BibT_eX]

[DOI]

CoRR, 2020

SeeSAw: Optimizing Performance of In-Situ Analytics Applications under Power Constraints.

[BibT_eX]

[DOI]

Ivana Marincic

Venkatram Vishwanath

Henry Hoffmann

Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

2019

Balsam: Automated Scheduling and Execution of Dynamic, Data-Intensive HPC Workflows.

[BibT_eX]

[DOI]

CoRR, 2019

Scaling Distributed Training of Flood-Filling Networks on HPC Infrastructure for Brain Mapping.

[BibT_eX]

[DOI]

CoRR, 2019

A Benchmarking Study to Evaluate Apache Spark on Large-Scale Supercomputers.

[BibT_eX]

[DOI]

George K. Thiruvathukal

CoRR, 2019

MELA: A Visual Analytics Tool for Studying Multifidelity HPC System Logs.

[BibT_eX]

[DOI]

Proceedings of the 3rd IEEE/ACM Industry/University Joint International Workshop on Data-center Automation, 2019

Balsam: Near Real-Time Experimental Data Analysis on Supercomputers.

[BibT_eX]

[DOI]

Proceedings of the 1st IEEE/ACM Annual Workshop on Large-scale Experiment-in-the-Loop Computing, 2019

Scaling Distributed Training of Flood-Filling Networks on HPC Infrastructure for Brain Mapping.

[BibT_eX]

[DOI]

Proceedings of the Third IEEE/ACM Workshop on Deep Learning on Supercomputers, 2019

Scalable reinforcement-learning-based neural architecture search for cancer deep learning research.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2019

2018

libIS: a lightweight library for flexible in transit visualization.

[BibT_eX]

[DOI]

Proceedings of the Workshop on In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization, 2018

Topology-aware space-shared co-analysis of large-scale molecular dynamics simulations.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2018

Benchmarking Machine Learning Methods for Performance Modeling of Scientific Applications.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE/ACM Performance Modeling, 2018

Optimizing Data Aggregation by Leveraging the Deep Memory Hierarchy on Large-scale Systems.

[BibT_eX]

[DOI]

François Tessier

Paul Gressier

Venkatram Vishwanath

Proceedings of the 32nd International Conference on Supercomputing, 2018

Toward Scalable and Asynchronous Object-Centric Data Management for HPC.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE/ACM International Symposium on Cluster, 2018

2017

Data movement optimizations for independent MPI I/O on the Blue Gene/Q.

[BibT_eX]

[DOI]

Preeti Malakar

Venkatram Vishwanath

Parallel Comput., 2017

Hierarchical Read-Write Optimizations for Scientific Applications with Multi-variable Structured Datasets.

[BibT_eX]

[DOI]

Preeti Malakar

Venkatram Vishwanath

Int. J. Parallel Program., 2017

HACC: extreme scaling and performance across diverse architectures.

[BibT_eX]

[DOI]

Commun. ACM, 2017

A distributed graph approach for pre-processing linked RDF data using supercomputers.

[BibT_eX]

[DOI]

Michael J. Lewis

George K. Thiruvathukal

Venkatram Vishwanath

Michael E. Papka

Andrew E. Johnson

Proceedings of The International Workshop on Semantic Big Data, 2017

PoLiMEr: An Energy Monitoring and Power Limiting Interface for HPC Applications.

[BibT_eX]

[DOI]

Ivana Marincic

Venkatram Vishwanath

Henry Hoffmann

Proceedings of the 5th International Workshop on Energy Efficient Supercomputing, 2017

Scalable In situ Analysis of Molecular Dynamics Simulations.

[BibT_eX]

[DOI]

Proceedings of the In Situ Infrastructures on Enabling Extreme-Scale Analysis and Visualization, 2017

A Visual Analytics System for Optimizing Communications in Massively Parallel Applications.

[BibT_eX]

[DOI]

Proceedings of the 12th IEEE Conference on Visual Analytics Science and Technology, 2017

TAPIOCA: An I/O Library for Optimized Topology-Aware Data Aggregation on Large-Scale Supercomputers.

[BibT_eX]

[DOI]

Francois Tessier

Venkatram Vishwanath

Emmanuel Jeannot

Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017

Analytical Performance Modeling and Validation of Intel's Xeon Phi Architecture.

[BibT_eX]

[DOI]

Proceedings of the Computing Frontiers Conference, 2017

2016

Application power profiling on IBM Blue Gene/Q.

[BibT_eX]

[DOI]

Parallel Comput., 2016

Improving sparse data movement performance using multiple paths on the Blue Gene/Q supercomputer.

[BibT_eX]

[DOI]

Parallel Comput., 2016

Workflow performance improvement using model-based scheduling over multiple clusters and clouds.

[BibT_eX]

[DOI]

Future Gener. Comput. Syst., 2016

<i>In Situ</i> Methods, Infrastructures, and Applications on High Performance Computing Platforms.

[BibT_eX]

[DOI]

Comput. Graph. Forum, 2016

Early Investigations into Using a Remote RAM Pool with the vl3 Visualization Framework.

[BibT_eX]

[DOI]

Proceedings of the Second Workshop on In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization, 2016

A data driven scheduling approach for power management on HPC systems.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2016

Topology-Aware Data Aggregation for Intensive I/O on Large-Scale Supercomputers.

[BibT_eX]

[DOI]

Proceedings of the First International Workshop on Communication Optimizations in HPC, 2016

Optimal execution of co-analysis for large-scale molecular dynamics simulations.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2016

Performance analysis, design considerations, and applications of extreme-scale <i>in situ</i> infrastructures.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2016

Parallel distributed, GPU-accelerated, advanced lighting calculations for large-scale volume visualization.

[BibT_eX]

[DOI]

Proceedings of the 6th IEEE Symposium on Large Data Analysis and Visualization, 2016

Coupling LAMMPS and the vl3 Framework for Co-Visualization of Atomistic Simulations.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

2015

Cluster-to-cluster data transfer with data compression over wide-area networks.

[BibT_eX]

[DOI]

Eun-Sung Jung

Rajkumar Kettimuthu

Venkatram Vishwanath

J. Parallel Distributed Comput., 2015

Optimal scheduling of in-situ analysis for large-scale scientific simulations.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2015

Route-aware independent MPI I/O on the blue gene/Q.

[BibT_eX]

[DOI]

Preeti Malakar

Venkatram Vishwanath

Proceedings of the 2015 International Workshop on Data-Intensive Scalable Computing Systems, 2015

Large-scale co-visualization for LAMMPS using vl3.

[BibT_eX]

[DOI]

Proceedings of the 5th IEEE Symposium on Large Data Analysis and Visualization, 2015

Streaming ultra high resolution images to large tiled display at nearly interactive frame rate with vl3.

[BibT_eX]

[DOI]

Proceedings of the 5th IEEE Symposium on Large Data Analysis and Visualization, 2015

Modeling Cooperative Threads to Project GPU Performance for Adaptive Parallelism.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

Profiling transport performance for big data transfer over dedicated channels.

[BibT_eX]

[DOI]

Daqing Yun

Chase Qishi Wu

Nageswara S. V. Rao

Bradley W. Settlemyer

Josh Lothian

Rajkumar Kettimuthu

Venkatram Vishwanath

Proceedings of the International Conference on Computing, Networking and Communications, 2015

Improving Communication Throughput by Multipath Load Balancing on Blue Gene/Q.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Conference on High Performance Computing, 2015

Large-Scale Parallel Visualization of Particle-Based Simulations using Point Sprites and Level-Of-Detail.

[BibT_eX]

[DOI]

Proceedings of the 15th Eurographics Symposium on Parallel Graphics and Visualization, 2015

Comparison of Vendor Supplied Environmental Data Collection Mechanisms.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

Multipath Load Balancing for M × N Communication Patterns on the Blue Gene/Q Supercomputer Interconnection Network.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

TECA: Petascale Pattern Recognition for Climate Science.

[BibT_eX]

[DOI]

Proceedings of the Computer Analysis of Images and Patterns, 2015

2014

Large-Scale Simulations of Sky Surveys.

[BibT_eX]

[DOI]

Comput. Sci. Eng., 2014

DIRAQ: scalable in situ data- and resource-aware indexing for optimized query performance.

[BibT_eX]

[DOI]

Sriram Lakshminarasimhan

Clust. Comput., 2014

Fast Multiresolution Reads of Massive Simulation Datasets.

[BibT_eX]

[DOI]

Proceedings of the Supercomputing - 29th International Conference, 2014

Efficient I/O and Storage of Adaptive-Resolution Data.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2014

Distributed multipath routing algorithm for data center networks.

[BibT_eX]

[DOI]

Eun-Sung Jung

Venkatram Vishwanath

Rajkumar Kettimuthu

Proceedings of the 2014 International Workshop on Data Intensive Scalable Computing Systems, 2014

Scalable Parallel I/O on a Blue Gene/Q Supercomputer Using Compression, Topology-Aware Data Aggregation, and Subfiling.

[BibT_eX]

[DOI]

Proceedings of the 22nd Euromicro International Conference on Parallel, 2014

Improving Data Movement Performance for Sparse Data Patterns on the Blue Gene/Q Supercomputer.

[BibT_eX]

[DOI]

Proceedings of the 43rd International Conference on Parallel Processing Workshops, 2014

Improving Multisite Workflow Performance Using Model-Based Scheduling.

[BibT_eX]

[DOI]

Proceedings of the 43rd International Conference on Parallel Processing, 2014

Performance Modeling of vl3 Volume Rendering on GPU-Based Clusters.

[BibT_eX]

[DOI]

Proceedings of the 14th Eurographics Symposium on Parallel Graphics and Visualization, 2014

SKOPE: a framework for modeling and exploring workload behavior.

[BibT_eX]

[DOI]

Proceedings of the Computing Frontiers Conference, CF'14, 2014

2013

Multi-domain job coscheduling for leadership computing systems.

[BibT_eX]

[DOI]

J. Supercomput., 2013

On-demand unstructured mesh translation for reducing memory pressure during in situ analysis.

[BibT_eX]

[DOI]

Proceedings of the 8th International Workshop on Ultrascale Visualization, 2013

Characterization and modeling of PIDX parallel I/O for performance optimization.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2013

Characterization and Understanding Machine-Specific Interconnects.

[BibT_eX]

[DOI]

Proceedings of the Parallel Computing Technologies - 12th International Conference, 2013

Efficient parallel volume rendering of large-scale adaptive mesh refinement data.

[BibT_eX]

[DOI]

Proceedings of the IEEE Symposium on Large-Scale Data Analysis and Visualization, 2013

Measuring Power Consumption on IBM Blue Gene/Q.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

Early Experience on the Blue Gene/Q Supercomputing System.

[BibT_eX]

[DOI]

Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

Proactive Support for Large-Scale Data Exploration.

[BibT_eX]

[DOI]

Mark Hereld

Tanu Malik

Venkatram Vishwanath

Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

Scalable in situ scientific data encoding for analytical query processing.

[BibT_eX]

[DOI]

Sriram Lakshminarasimhan

Proceedings of the 22nd International Symposium on High-Performance Parallel and Distributed Computing, 2013

A Generic High-Performance Method for Deinterleaving Scientific Data.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2013 Parallel Processing, 2013

Application power profiling on IBM Blue Gene/Q.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Conference on Cluster Computing, 2013

Model-driven multisite workflow scheduling.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Conference on Cluster Computing, 2013

Toward optimizing disk-to-disk transfer on 100G networks.

[BibT_eX]

[DOI]

Eun-Sung Jung

Rajkumar Kettimuthu

Venkatram Vishwanath

Proceedings of the IEEE International Conference on Advanced Networks and Telecommunications Systems, 2013

2012

Accelerating Data Movement Leveraging End-System and Network Parallelism.

[BibT_eX]

[DOI]

Jun Yi

Rajkumar Kettimuthu

Venkatram Vishwanath

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Dataflow-driven GPU performance projection for multi-kernel transformations.

[BibT_eX]

[DOI]

Proceedings of the SC Conference on High Performance Computing Networking, 2012

Efficient data restructuring and aggregation for I/O acceleration in PIDX.

[BibT_eX]

[DOI]

Proceedings of the SC Conference on High Performance Computing Networking, 2012

Poster: Evaluating Communication Performance in BlueGene/Q and Cray XE6 Supercomputers.

[BibT_eX]

[DOI]

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Abstract: Evaluating Communication Performance in BlueGene/Q and Cray XE6 Supercomputers.

[BibT_eX]

[DOI]

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

ALCF MPI Benchmarks: Understanding Machine-Specific Communication Behavior.

[BibT_eX]

[DOI]

Proceedings of the 41st International Conference on Parallel Processing Workshops, 2012

Evaluating Power-Monitoring Capabilities on IBM Blue Gene/P and Blue Gene/Q.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Cluster Computing, 2012

2011

Topology-aware data movement and staging for I/O acceleration on Blue Gene/P supercomputing systems.

[BibT_eX]

[DOI]

Proceedings of the Conference on High Performance Computing Networking, 2011

Electronic poster: co-visualization of full data and in situ data extracts from unstructured grid cfd at 160k cores.

[BibT_eX]

[DOI]

Christopher D. Carothers

Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, 2011

GROPHECY: GPU performance projection from CPU code skeletons.

[BibT_eX]

[DOI]

Proceedings of the Conference on High Performance Computing Networking, 2011

Modeling early galaxies using radiation hydrodynamics.

[BibT_eX]

[DOI]

Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, 2011

Toward simulation-time data analysis and I/O acceleration on leadership-class systems.

[BibT_eX]

[DOI]

Venkatram Vishwanath

Mark Hereld

Michael E. Papka

Proceedings of the IEEE Symposium on Large Data Analysis and Visualization, 2011

Exploring large data over wide area networks.

[BibT_eX]

[DOI]

Proceedings of the IEEE Symposium on Large Data Analysis and Visualization, 2011

Job Coscheduling on Coupled High-End Computing Systems.

[BibT_eX]

[DOI]

Proceedings of the 2011 International Conference on Parallel Processing Workshops, 2011

PIDX: Efficient Parallel I/O for Multi-resolution Multi-dimensional Scientific Datasets.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE International Conference on Cluster Computing (CLUSTER), 2011

2010

Accelerating I/O Forwarding in IBM Blue Gene/P Systems.

[BibT_eX]

[DOI]

Proceedings of the Conference on High Performance Computing Networking, 2010

Multi-application inter-tile synchronization on ultra-high-resolution display walls.

[BibT_eX]

[DOI]

Proceedings of the First Annual ACM SIGMM Conference on Multimedia Systems, 2010

2009

Accelerating tropical cyclone analysis using LambdaRAM, a distributed data cache over wide-area ultra-fast networks.

[BibT_eX]

[DOI]

Future Gener. Comput. Syst., 2009

The OptIPortal, a scalable visualization, storage, and computing interface device for the OptiPuter.

[BibT_eX]

[DOI]

Future Gener. Comput. Syst., 2009

2008

Specification and Verification of LambdaRAM: A Wide-area Distributed Cache for High Performance Computing.

[BibT_eX]

[DOI]

Venkatram Vishwanath

Lenore D. Zuck

Jason Leigh

Proceedings of the 6th ACM & IEEE International Conference on Formal Methods and Models for Co-Design (MEMOCODE 2008), 2008

The Rails Toolkit - Enabling End-System Topology-Aware High End Computing.

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference on e-Science, 2008

2006

The global lambda visualization facility: An international ultra-high-definition wide-area visualization collaboratory.

[BibT_eX]

[DOI]

Future Gener. Comput. Syst., 2006

The first functional demonstration of optical virtual concatenation as a technique for achieving Terabit networking.

[BibT_eX]

[DOI]

Future Gener. Comput. Syst., 2006

AR-PIN/PDC: Flexible Advance Reservation of Intradomain and Interdomain Lightpaths.

[BibT_eX]

[DOI]

Proceedings of the Global Telecommunications Conference, 2006. GLOBECOM '06, San Francisco, CA, USA, 27 November, 2006

LambdaBridge: A Scalable Architecture for Future Generation Terabit Applications.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Broadband Communications, 2006

2004

Vol-a-Tile - A Tool for Interactive Exploration of Large Volumetric Data on Scalable Tiled Displays.

[BibT_eX]

[DOI]

Nicholas Schwarz

Shalini Venkataraman

Luc Renambot

Naveen K. Krishnaprasad

Proceedings of the 15th IEEE Visualization Conference, 2004

JuxtaView - a tool for interactive visualization of large imagery on scalable tiled displays.

[BibT_eX]

[DOI]

Naveen K. Krishnaprasad

Proceedings of the 2004 IEEE International Conference on Cluster Computing (CLUSTER 2004), 2004

Venkatram Vishwanath

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...