Yida Wang
Orcid: 0000-0001-8165-840X
Affiliations:
- Amazon Web Services, Inc., East Palo Alto, CA, USA
- Intel Corporation, Parallel Computing Lab, Santa Clara, CA, USA
- Princeton University, Department of Computer Science, NJ, USA
According to our database, Yida Wang authored at least 38 papers between 2015 and 2024.
Bibliography
2024
Fast Convolution Meets Low Precision: Exploring Efficient Quantized Winograd Convolution on Modern CPUs.
ACM Trans. Archit. Code Optim., March, 2024
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024
Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapping.
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024
Proceedings of the Nineteenth European Conference on Computer Systems, 2024
Proceedings of the 2024 ACM Symposium on Cloud Computing, 2024
Slapo: A Schedule Language for Progressive Optimization of Large Deep Learning Model Training.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024
2023
Perception and memory retrieval states are reflected in distributed patterns of background functional connectivity.
NeuroImage, August, 2023
Proceedings of the 29th Symposium on Operating Systems Principles, 2023
Grape: Practical and Efficient Graphed Execution for Dynamic Deep Neural Networks on GPUs.
Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture, 2023
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023
2022
Proc. VLDB Endow., 2022
Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning.
CoRR, 2022
Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning.
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022
Proceedings of the Fifth Conference on Machine Learning and Systems, 2022
2021
Proceedings of the Fourth Conference on Machine Learning and Systems, 2021
Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021
Proceedings of the SoCC '21: ACM Symposium on Cloud Computing, 2021
Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2021
2020
CoRR, 2020
Proceedings of the 2020 Workshop on Network Meets AI & ML, 2020
Proceedings of the International Conference for High Performance Computing, 2020
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020
2019
Proceedings of the 2019 USENIX Annual Technical Conference, 2019
Proceedings of the 48th International Conference on Parallel Processing, 2019
2018
2017
BIDS apps: Improving ease of use, accessibility, and reproducibility of neuroimaging data analysis methods.
PLoS Comput. Biol., 2017
Proceedings of the High Performance Computing - 32nd International Conference, 2017
2016
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016
2015
Proceedings of the International Conference for High Performance Computing, 2015