Doru-Thom Popovici

Orcid: 0000-0002-7271-8092

According to our database1, Doru-Thom Popovici authored at least 28 papers between 2013 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Toward Practical Superconducting Accelerators for Machine Learning Using U-SFQ.
ACM J. Emerg. Technol. Comput. Syst., April, 2024

Flexible Multi-Dimensional FFTs for Plane Wave Density Functional Theory Codes.
CoRR, 2024

SlimFit: Memory-Efficient Fine-Tuning of Transformer-based Models Using Training Dynamics.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

To Tile or not to Tile, That is the Question.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

Unlocking the Potential: Performance Portability of Graph Algorithms on Kokkos Framework.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

2023
An Area Efficient Superconducting Unary CNN Accelerator.
Proceedings of the 24th International Symposium on Quality Electronic Design, 2023

Designing Efficient SIMD Kernels for High Performance Sequence Alignment.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

Fast Community Detection in Graphs with Infomap Method using Accelerated Sparse Accumulation.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

Towards a Flexible Hardware Implementation for Mixed-Radix Fourier Transforms.
Proceedings of the IEEE High Performance Extreme Computing Conference, 2023

2021
A systematic approach to improving data locality across Fourier transforms and linear algebra operations.
Proceedings of the ICS '21: 2021 International Conference on Supercomputing, 2021

HyPC-Map: A Hybrid Parallel Community Detection Algorithm Using Information-Theoretic Approach.
Proceedings of the 2021 IEEE High Performance Extreme Computing Conference, 2021

2020
A Flexible Framework for Multidimensional DFTs.
SIAM J. Sci. Comput., 2020

A High-Throughput Solver for Marginalized Graph Kernels on GPU.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

2019
A Flexible Framework for Parallel Multi-Dimensional DFTs.
CoRR, 2019

Exploiting Symmetries of Small Prime-Sized DFTs.
Proceedings of the Parallel Processing and Applied Mathematics, 2019

A Portable GPU Framework for SNP Comparisons.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019

2018
SPIRAL: Extreme Performance Portability.
Proc. IEEE, 2018

Large Bandwidth-Efficient FFTs on Multicore and Multi-socket Systems.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices.
Proceedings of the 2018 IEEE High Performance Extreme Computing Conference, 2018

FFTX and SpectralPack: A First Look.
Proceedings of the 25th IEEE International Conference on High Performance Computing Workshops, 2018

2017
Mixed data layout kernels for vectorized complex arithmetic.
Proceedings of the 2017 IEEE High Performance Extreme Computing Conference, 2017

First look: Linear algebra-based triangle counting without matrix multiplication.
Proceedings of the 2017 IEEE High Performance Extreme Computing Conference, 2017

2016
Compilers, hands-off my hands-on optimizations.
Proceedings of the 3rd Workshop on Programming Models for SIMD/Vector Processing, 2016

Efficient Computation of Linkage Disequilibria as Dense Linear Algebra Operations.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

2015
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

2014
Algorithm/hardware co-optimized SAR image reconstruction with 3D-stacked logic in memory.
Proceedings of the IEEE High Performance Extreme Computing Conference, 2014

2013
Automatic Extraction of Behavioral Models from Distributed Systems and Services.
Proceedings of the Evaluation of Novel Approaches to Software Engineering, 2013

Extracting Behavioral Models from Service Implementations.
Proceedings of the ENASE 2013, 2013


  Loading...