Mao Yang
Orcid: 0009-0009-6455-3898Affiliations:
- Microsoft Research, Beijing, China
According to our database1,
Mao Yang
authored at least 86 papers
between 2004 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Matryoshka: Optimization of Dynamic Diverse Quantum Chemistry Systems via Elastic Parallelism Transformation.
CoRR, 2024
CoRR, 2024
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024
MS MARCO Web Search: A Large-scale Information-rich Web Dataset with Millions of Real Click Labels.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024
Uncovering Nested Data Parallelism and Data Reuse in DNN Computation with FractalTensor.
Proceedings of the ACM SIGOPS 30th Symposium on Operating Systems Principles, 2024
Long Exposure: Accelerating Parameter-Efficient Fine-Tuning for LLMs under Shadowy Sparsity.
Proceedings of the International Conference for High Performance Computing, 2024
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2024
Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024
nnScaler: Constraint-Guided Parallelization Plan Generation for Deep Learning Training.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024
LitePred: Transferable and Scalable Latency Prediction for Hardware-Aware Neural Architecture Search.
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024
Empowering In-Browser Deep Learning Inference on Edge Through Just-In-Time Kernel Optimization.
Proceedings of the 22nd Annual International Conference on Mobile Systems, 2024
Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference.
Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024
Proceedings of the 46th IEEE/ACM International Conference on Software Engineering, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024
2023
Boosting LLM Reasoning: Push the Limits of Few-shot Learning with Reinforced In-Context Pruning.
CoRR, 2023
Compresso: Structured Pruning with Collaborative Prompting Learns Compact Large Language Models.
CoRR, 2023
Accelerating In-Browser Deep Learning Inference on Diverse Edge Clients through Just-in-Time Kernel Optimizations.
CoRR, 2023
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models.
CoRR, 2023
SparDA: Accelerating Dynamic Sparse Deep Neural Networks via Sparse-Dense Transformation.
CoRR, 2023
CoRR, 2023
PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023
Proceedings of the 29th Symposium on Operating Systems Principles, 2023
VBASE: Unifying Online Vector Similarity Search and Relational Queries via Relaxed Monotonicity.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023
Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
NN-Stretch: Automatic Neural Network Branching for Parallel Inference on Heterogeneous Multi-Processors.
Proceedings of the 21st Annual International Conference on Mobile Systems, 2023
LUT-NN: Empower Efficient Neural Network Inference with Centroid Learning and Table Lookup.
Proceedings of the 29th Annual International Conference on Mobile Computing and Networking, 2023
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023
Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2023
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the Eighteenth European Conference on Computer Systems, 2023
2022
Towards efficient vision transformer inference: a first study of transformers on mobile devices.
Proceedings of the HotMobile '22: The 23rd International Workshop on Mobile Computing Systems and Applications, Tempe, Arizona, USA, March 9, 2022
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022
2021
CoRR, 2021
Argus: A Fully Transparent Incentive System for Anti-Piracy Campaigns (Extended Version).
CoRR, 2021
Match Plan Generation in Web Search with Parameterized Action Reinforcement Learning.
Proceedings of the WWW '21: The Web Conference 2021, 2021
Proceedings of the 40th International Symposium on Reliable Distributed Systems, 2021
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering, 2021
2020
Proceedings of the 2020 USENIX Annual Technical Conference, 2020
Proceedings of the ESEC/FSE '20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2020
Proceedings of the ESEC/FSE '20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2020
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020
Proceedings of the ICSE '20: 42nd International Conference on Software Engineering, Seoul, South Korea, 27 June, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019
2014
Proceedings of the Ninth Eurosys Conference 2014, 2014
Network Performance Aware Graph Partitioning for Large Graph Processing Systems in the Cloud.
Proceedings of the Large Scale and Big Data - Processing and Management., 2014
2013
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013
Proceedings of the 14th Workshop on Hot Topics in Operating Systems, 2013
2012
Proceedings of the ACM Symposium on Cloud Computing, SOCC '12, 2012
2011
Proceedings of the 2011 USENIX Annual Technical Conference, 2011
2010
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010
Proceedings of the 1st ACM Symposium on Cloud Computing, 2010
2009
Proceedings of the Third IEEE International Conference on Secure Software Integration and Reliability Improvement, 2009
Proceedings of the 6th USENIX Symposium on Networked Systems Design and Implementation, 2009
Proceedings of HotOS'09: 12th Workshop on Hot Topics in Operating Systems, 2009
2008
2007
A Multi-dimensional Reputation System Combined with Trust and Incentive Mechanisms in P2P File Sharing Systems.
Proceedings of the 27th International Conference on Distributed Computing Systems Workshops (ICDCS 2007 Workshops), 2007
Proceedings of the 27th IEEE International Conference on Distributed Computing Systems (ICDCS 2007), 2007
2006
Proceedings of the 2006 International Symposium on Collaborative Technologies and Systems, 2006
2005
Proceedings of the Peer-to-Peer Systems IV, 4th International Workshop, 2005
2004
Proceedings of the First USENIX Workshop on Real, Large Distributed Systems, 2004