Fuzhao Xue

According to our database¹, Fuzhao Xue authored at least 30 papers between 2019 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures.

[BibT_eX]

[DOI]

CoRR, 2024

LongVILA: Scaling Long-Context Visual Language Models for Long Videos.

[BibT_eX]

[DOI]

CoRR, 2024

Wolf: Captioning Everything with a World Summarization Framework.

[BibT_eX]

[DOI]

CoRR, 2024

MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures.

[BibT_eX]

[DOI]

CoRR, 2024

OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

Recent advances in deep learning based dialogue systems: a systematic survey.

[BibT_eX]

[DOI]

Artif. Intell. Rev., April, 2023

Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

To Repeat or Not To Repeat: Insights from Scaling LLM under Token-Crisis.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Adaptive Computation with Elastic Input Sequence.

[BibT_eX]

[DOI]

Fuzhao Xue

Valerii Likhosherstov

Proceedings of the International Conference on Machine Learning, 2023

A Study on Transformer Configuration and Training Objective.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

One Student Knows All Experts Know: From Sparse to Dense.

[BibT_eX]

[DOI]

Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

Hierarchical Dialogue Understanding with Special Tokens and Turn-level Attention.

[BibT_eX]

[DOI]

Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

Sequence Parallelism: Long Sequence Training from System Perspective.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

CowClip: Reducing CTR Prediction Model Training Time from 12 Hours to 10 Minutes on 1 GPU.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

RACP: A network with attention corrected prototype for few-shot speaker recognition using indefinite distance metric.

[BibT_eX]

[DOI]

Neurocomputing, 2022

Deeper vs Wider: A Revisit of Transformer Configuration.

[BibT_eX]

[DOI]

CoRR, 2022

CowClip: Reducing CTR Prediction Model Training Time from 12 hours to 10 minutes on 1 GPU.

[BibT_eX]

[DOI]

CoRR, 2022

An Embarrassingly Simple Model for Dialogue Relation Extraction.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Automated Audio Captioning Using Transfer Learning and Reconstruction Latent Space Similarity Regularization.

[BibT_eX]

[DOI]

Andrew Koh

Fuzhao Xue

Chng Eng Siong

Proceedings of the IEEE International Conference on Acoustics, 2022

Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Go Wider Instead of Deeper.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Large-Scale Deep Learning Optimizations: A Comprehensive Survey.

[BibT_eX]

[DOI]

CoRR, 2021

Sparse-MLP: A Fully-MLP Architecture with Conditional Computation.

[BibT_eX]

[DOI]

CoRR, 2021

Sequence Parallelism: Making 4D Parallelism Possible.

[BibT_eX]

[DOI]

CoRR, 2021

Recent Advances in Deep Learning Based Dialogue Systems: A Systematic Survey.

[BibT_eX]

[DOI]

CoRR, 2021

GDPNet: Refining Latent Multi-View Graph for Relation Extraction.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

A network model of speaker identification with new feature extraction methods and asymmetric BLSTM.

[BibT_eX]

[DOI]

Neurocomputing, 2020

An Embarrassingly Simple Model for Dialogue Relation Extraction.

[BibT_eX]

[DOI]

CoRR, 2020

Deep Graph Random Process for Relational-Thinking-Based Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

2019

Underwater Acoustic Target Recognition: A Combination of Multi-Dimensional Fusion Features and Modified Deep Neural Network.

[BibT_eX]

[DOI]

Remote. Sens., 2019

Fuzhao Xue

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...