Rengan Xu
ORCID: 0000-0002-5230-5530
According to our database, Rengan Xu authored at least 18 papers between 2013 and 2024.
Bibliography
2024
Enhancing Performance and Scalability of Large-Scale Recommendation Systems with Jagged Flash Attention.
Proceedings of the 18th ACM Conference on Recommender Systems, 2024
2019
Densifying Assumed-Sparse Tensors - Improving Memory Efficiency and MPI Collective Performance During Tensor Accumulation for Parallelized Training of Neural Machine Translation Models.
Proceedings of the High Performance Computing - 34th International Conference, 2019
2018
The OpenACC data model: Preliminary study on its major challenges and implementations.
Parallel Comput., 2018
Proceedings of the 2018 IEEE/ACM Performance Modeling, 2018
2017
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017
2016
Concurr. Comput. Pract. Exp., 2016
Proceedings of the High Performance Computing - 31st International Conference, 2016
Proceedings of the 45th International Conference on Parallel Processing, 2016
2015
Sci. Program., 2015
2014
Proceedings of the First Workshop on Accelerator Programming using Directives, 2014
SPEC ACCEL: A Standard Application Suite for Measuring Hardware Accelerator Performance.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2014
Proceedings of the 2014 PPOPP International Workshop on Programming Models and Applications for Multicores and Manycores, 2014
Proceedings of the Languages and Compilers for Parallel Computing, 2014
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014
2013
Proceedings of the Languages and Compilers for Parallel Computing, 2013
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013