Xiaoxia Wu
According to our database, Xiaoxia Wu authored at least 67 papers between 2006 and 2024.
Bibliography
2024
Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding.
CoRR, 2024
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design.
CoRR, 2024
Quant-LLM: Accelerating the Serving of Large Language Models via FP6-Centric Algorithm-System Co-Design on Modern GPUs.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Exploring Post-training Quantization in LLMs from Comprehensive Study to Low Rank Compensation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
DeepSpeed Data Efficiency: Improving Deep Learning Model Quality and Training Efficiency via Efficient Data Sampling and Routing.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
A novel bias-alleviated hybrid ensemble model based on over-sampling and post-processing for fair classification.
Connect. Sci., December, 2023
ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks.
CoRR, 2023
ZeroQuant-HERO: Hardware-Enhanced Robust Optimized Post-Training Quantization Framework for W8A8 Transformers.
CoRR, 2023
DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies.
CoRR, 2023
DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention.
CoRR, 2023
CoRR, 2023
DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales.
CoRR, 2023
ZeroQuant-FP: A Leap Forward in LLMs Post-Training W4A8 Quantization Using Floating-Point Formats.
CoRR, 2023
CoRR, 2023
Understanding INT4 Quantization for Transformer Models: Latency Speedup, Composability, and Failure Cases.
CoRR, 2023
Understanding Int4 Quantization for Language Models: Latency Speedup, Composability, and Failure Cases.
Proceedings of the International Conference on Machine Learning, 2023
2022
DeepSpeed Data Efficiency: Improving Deep Learning Model Quality and Training Efficiency via Efficient Data Sampling and Routing.
CoRR, 2022
Random-LTD: Random and Layerwise Token Dropping Brings Efficient Training for Large-scale Transformers.
CoRR, 2022
CoRR, 2022
ZeroQuant: Efficient and Affordable Post-Training Quantization for Large-Scale Transformers.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
AdaLoss: A Computationally-Efficient and Provably Convergent Adaptive Gradient Method.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
Proceedings of the 9th International Conference on Learning Representations, 2021
2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020
2019
Global Convergence of Adaptive Gradient Methods for An Over-parameterized Neural Network.
CoRR, 2019
Appl. Math. Comput., 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the Cyber Security Intelligence and Analytics, 2019
2018
Methionine-Capped Gold Nanoclusters as a Fluorescence-Enhanced Probe for Cadmium(II) Sensing.
Sensors, 2018
Toward Transport Ecosystem Interoperability Enabled by Vendor-Diverse Coherent Optical Sources Over an Open Line System.
JOCN, 2018
AdaGrad stepsizes: Sharp convergence over nonconvex landscapes, from any initialization.
CoRR, 2018
2017
Interoperation of layer-2/3 modular switches with 8QAM/16QAM integrated coherent optics over 2000 km open line system.
Proceedings of the Optical Fiber Communications Conference and Exhibition, 2017
2014
IEEE J. Solid State Circuits, 2014
Height Probabilities in the Abelian Sandpile Model on the Generalized Trees.
Ars Comb., 2014
2013
Proceedings of the 2013 IEEE International Solid-State Circuits Conference, 2013
2012
Electrical Characterization for Intertier Connections and Timing Analysis for 3-D ICs.
IEEE Trans. Very Large Scale Integr. Syst., 2012
Estimating the Proportion of True Null Hypotheses in Nonparametric Exponential Mixture Model with Application to the Leukemia Gene Expression Data.
Commun. Stat. Simul. Comput., 2012
Small Randic Index Ordering of Trees with k Pendant Vertices.
Ars Comb., 2012
2011
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2011
Stacking magnetic random access memory atop microprocessors: an architecture-level evaluation.
IET Comput. Digit. Tech., 2011
2010
ACM Trans. Archit. Code Optim., 2010
Microelectron. J., 2010
Proceedings of the 47th Design Automation Conference, 2010
2009
ACM J. Emerg. Technol. Comput. Syst., 2009
Exploration of 3D stacked L2 cache design for high performance and efficient thermal control.
Proceedings of the 2009 International Symposium on Low Power Electronics and Design, 2009
Proceedings of the 36th International Symposium on Computer Architecture (ISCA 2009), 2009
Proceedings of the Design, Automation and Test in Europe, 2009
2008
Proceedings of the 2008 IEEE International Test Conference, 2008
Proceedings of the 26th International Conference on Computer Design, 2008
Circuit and microarchitecture evaluation of 3D stacking magnetic RAM (MRAM) as a universal memory replacement.
Proceedings of the 45th Design Automation Conference, 2008
Variability-driven module selection with joint design time optimization and post-silicon tuning.
Proceedings of the 13th Asia South Pacific Design Automation Conference, 2008
2007
Proceedings of the 25th International Conference on Computer Design, 2007
Proceedings of the 2007 International Conference on Computer-Aided Design, 2007
2006
Proceedings of the 2006 IEEE International SOC Conference, Austin, Texas, USA, 2006
Proceedings of the 2006 International Conference on Computer-Aided Design, 2006