Fan Yang
Orcid: 0000-0002-0378-060XAffiliations:
- Microsoft Research Asia, Beijing, China
- Nanjing Universiiy, Department of Computer Science, State Key Lab for Novel Software Technology, China (former)
According to our database1,
Fan Yang
authored at least 89 papers
between 2003 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
IEEE Trans. Parallel Distributed Syst., December, 2024
Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning.
CoRR, 2024
FB-Bench: A Fine-Grained Multi-Task Benchmark for Evaluating LLMs' Responsiveness to Human Feedback.
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024
MS MARCO Web Search: A Large-scale Information-rich Web Dataset with Millions of Real Click Labels.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024
Uncovering Nested Data Parallelism and Data Reuse in DNN Computation with FractalTensor.
Proceedings of the ACM SIGOPS 30th Symposium on Operating Systems Principles, 2024
Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024
nnScaler: Constraint-Guided Parallelization Plan Generation for Deep Learning Training.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024
Understanding the Weakness of Large Language Model Agents within a Complex Android Environment.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
Tessel: Boosting Distributed Execution of Large DNN Models via Flexible Schedule Search.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2024
Proceedings of the Nineteenth European Conference on Computer Systems, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024
2023
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models.
CoRR, 2023
SparDA: Accelerating Dynamic Sparse Deep Neural Networks via Sparse-Dense Transformation.
CoRR, 2023
CoRR, 2023
PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023
Proceedings of the 29th Symposium on Operating Systems Principles, 2023
VBASE: Unifying Online Vector Similarity Search and Relational Queries via Relaxed Monotonicity.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023
Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023
OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization.
Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
Proceedings of the Eighteenth European Conference on Computer Systems, 2023
Adam Accumulation to Reduce Memory Footprints of Both Activations and Gradients for Large-Scale DNN Training.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
Distill-VQ: Learning Retrieval Oriented Vector Quantization By Distilling Knowledge from Dense Embeddings.
CoRR, 2022
Proceedings of the 2022 USENIX Annual Technical Conference, 2022
Distill-VQ: Learning Retrieval Oriented Vector Quantization By Distilling Knowledge from Dense Embeddings.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022
ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network Quantization.
Proceedings of the 55th IEEE/ACM International Symposium on Microarchitecture, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Nesting Forward Automatic Differentiation for Memory-Efficient Deep Neural Network Training.
Proceedings of the IEEE 40th International Conference on Computer Design, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
2021
2020
XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation.
CoRR, 2020
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020
InvisibleFL: Federated Learning over Non-Informative Intermediate Updates against Multimedia Privacy Leakages.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
XGLUE: A New Benchmark Datasetfor Cross-lingual Pre-training, Understanding and Generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Proceedings of the ASPLOS '20: Architectural Support for Programming Languages and Operating Systems, 2020
2019
Proceedings of the 2019 USENIX Annual Technical Conference, 2019
2018
Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation, 2018
Proceedings of the ACM Symposium on Cloud Computing, 2018
2015
ACM Trans. Storage, 2015
Proceedings of the Sixth ACM Symposium on Cloud Computing, 2015
2014
Proceedings of the Ninth Eurosys Conference 2014, 2014
2012
Proceedings of the European Conference on Computer Systems, 2012
2007
IEEE Trans. Wirel. Commun., 2007
Distributed Cooperative Rate Adaptation for Energy Efficiency in IEEE 802.11-Based Multihop Networks.
IEEE Trans. Veh. Technol., 2007
IEEE Netw., 2007
2006
Distributed Channel Assignment and Routing in Multiradio Multichannel Multihop Wireless Networks.
IEEE J. Sel. Areas Commun., 2006
Distributed cooperative rate adaptation for energy efficiency in IEEE 802.11-based multi-hop networks.
Proceedings of the 3rd International ICST Conference on Quality of Service in Heterogeneous Wired/Wireless Networks, 2006
Proceedings of the IEEE 3rd International Conference on Mobile Adhoc and Sensor Systems, 2006
Proceedings of IEEE International Conference on Communications, 2006
On Improving the Throughput of Media Delivery Applications in Heterogeneous Overlay Network.
Proceedings of the Global Telecommunications Conference, 2006. GLOBECOM '06, San Francisco, CA, USA, 27 November, 2006
2005
EURASIP J. Adv. Signal Process., 2005
Proceedings of IEEE International Conference on Communications, 2005
2004
End-to-end TCP-friendly streaming protocol and bit allocation for scalable video over wireless Internet.
IEEE J. Sel. Areas Commun., 2004
Proceedings of the Proceedings IEEE INFOCOM 2004, 2004
2003
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003