In-depth analysis on parallel processing patterns for high-performance Dataframes.
Future Gener. Comput. Syst., December, 2023
Supercharging Distributed Computing Environments For High Performance Data Engineering.
CoRR, 2023
High-performance iterative dataflow abstractions in Twister2: TSet.
Concurr. Comput. Pract. Exp., 2022
Twister2 Cross-platform resource scheduler for big data.
Concurr. Comput. Pract. Exp., 2022
Stochastic gradient descent-based support vector machines training optimization on Big Data and HPC frameworks.
,
,
,
,
,
,
,
,
,
,
,
Concurr. Comput. Pract. Exp., 2022
High Performance Dataframes from Parallel Processing Patterns.
Proceedings of the Parallel Processing and Applied Mathematics, 2022
HPTMT Parallel Operators for High Performance Data Science and Data Engineering.
Frontiers Big Data, 2021
HPTMT Parallel Operators for High Performance Data Science & Data Engineering.
CoRR, 2021
HPTMT: Operator-Based Architecture for Scalable High-Performance Data-Intensive Frameworks.
Proceedings of the 14th IEEE International Conference on Cloud Computing, 2021
A Fast, Scalable, Universal Approach For Distributed Data Reductions.
CoRR, 2020
Twister2: Design of a big data toolkit.
Concurr. Comput. Pract. Exp., 2020
High Performance Data Engineering Everywhere.
Proceedings of the IEEE International Conference on Smart Data Services, 2020
Data Engineering for HPC with Python.
Proceedings of the 9th IEEE/ACM Workshop on Python for High-Performance and Scientific Computing, 2020
A Fast, Scalable, Universal Approach For Distributed Data Aggregations.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020
Scientific Image Restoration Anywhere.
Proceedings of the 1st IEEE/ACM Annual Workshop on Large-scale Experiment-in-the-Loop Computing, 2019
Performance Optimization on Model Synchronization in Parallel Stochastic Gradient Descent Based SVM.
Proceedings of the 19th IEEE/ACM International Symposium on Cluster, 2019
Streaming Machine Learning Algorithms with Big Data Systems.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019
Twister: Net - Communication Library for Big Data Processing in HPC and Cloud Environments.
Proceedings of the 11th IEEE International Conference on Cloud Computing, 2018