Jongsoo Park
Orcid: 0000-0002-4750-9440
According to our database, Jongsoo Park authored at least 64 papers between 2007 and 2024.
Bibliography
2024
Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large Scale Recommendation.
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Discovering regional digital innovation tasks to revitalize digital platform government.
Proceedings of the 25th Annual International Conference on Digital Government Research, 2024
2023
75% radiation dose reduction using deep learning reconstruction on low-dose chest CT.
BMC Medical Imaging, December, 2023
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023
RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023
Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023
2022
DHEN: A Deep and Hierarchical Ensemble Network for Large-Scale Click-Through Rate Prediction.
CoRR, 2022
Unity: Accelerating DNN Training Through Joint Optimization of Algebraic Transformations and Parallelization.
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022
Software-hardware co-design for fast and scalable training of deep learning recommendation models.
Proceedings of ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18, 2022
Efficient Soft-Error Detection for Low-precision Deep Learning Recommendation Models.
Proceedings of the IEEE International Conference on Big Data, 2022
2021
IEEE Micro, 2021
High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models.
CoRR, 2021
Proceedings of the 20th IEEE International Conference on Machine Learning and Applications, 2021
2020
Adaptive Dense-to-Sparse Paradigm for Pruning Online Recommendation System with Non-Stationary Data.
CoRR, 2020
2019
CoRR, 2019
2018
Parallel Comput., 2018
Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications.
CoRR, 2018
CoRR, 2018
Proceedings of the International Symposium on Memory Systems, 2018
2017
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017
Proceedings of the 5th International Conference on Learning Representations, 2017
2016
Optimizations in a high-performance conjugate gradient benchmark for IA-based multi- and many-core processors.
Int. J. High Perform. Comput. Appl., 2016
Proceedings of the International Conference for High Performance Computing, 2016
Proceedings of the International Conference for High Performance Computing, 2016
Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, 2016
2015
Proceedings of the High Performance Computing - 30th International Conference, 2015
Improving concurrency and asynchrony in multithreaded MPI applications using software offloading.
Proceedings of the International Conference for High Performance Computing, 2015
High-performance algebraic multigrid solver optimized for multi-core based distributed parallel systems.
Proceedings of the International Conference for High Performance Computing, 2015
Exploring Shared-Memory Optimizations for an Unstructured Mesh CFD Application on Modern Parallel Systems.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015
2014
Sparsifying Synchronization for High-Performance Shared-Memory Sparse Triangular Solver.
Proceedings of the Supercomputing - 29th International Conference, 2014
Proceedings of the International Conference on Management of Data, 2014
Efficient Shared-Memory Implementation of High-Performance Conjugate Gradient Benchmark and its Application to Unstructured Matrices.
Proceedings of the International Conference for High Performance Computing, 2014
Improving Communication Performance and Scalability of Native Applications on Intel Xeon Phi Coprocessor Clusters.
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014
Proceedings of the International Conference on Parallel Architectures and Compilation, 2014
2013
Efficient backprojection-based synthetic aperture radar computation with many-core processors.
Sci. Program., 2013
Proc. VLDB Endow., 2013
Proceedings of the International Conference for High Performance Computing, 2013
Tera-scale 1D FFT with low-communication algorithm and Intel® Xeon Phi™ coprocessors.
Proceedings of the International Conference for High Performance Computing, 2013
2012
CloudRAMSort: fast and efficient large-scale distributed RAM sort on shared-nothing cluster.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012
Billion-particle SIMD-friendly two-point correlation on large-scale HPC cluster systems.
Proceedings of the SC Conference on High Performance Computing Networking, 2012
2010
Buffer-space efficient and deadlock-free scheduling of stream applications on multi-core architectures.
Proceedings of SPAA 2010: The 22nd Annual ACM Symposium on Parallelism in Algorithms and Architectures, 2010
Proceedings of the 2010 International Conference on Compilers, 2010
2008
J. Comput. Sci. Eng., 2008
IEEE Comput. Archit. Lett., 2008
2007
Proceedings of the 2007 Design, Automation and Test in Europe Conference and Exposition, 2007