Fuzhao Xue

According to our database1, Fuzhao Xue authored at least 30 papers between 2019 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures.
CoRR, 2024

LongVILA: Scaling Long-Context Visual Language Models for Long Videos.
CoRR, 2024

Wolf: Captioning Everything with a World Summarization Framework.
CoRR, 2024

MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures.
CoRR, 2024

OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Recent advances in deep learning based dialogue systems: a systematic survey.
Artif. Intell. Rev., April, 2023

Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

To Repeat or Not To Repeat: Insights from Scaling LLM under Token-Crisis.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Adaptive Computation with Elastic Input Sequence.
Proceedings of the International Conference on Machine Learning, 2023

A Study on Transformer Configuration and Training Objective.
Proceedings of the International Conference on Machine Learning, 2023

One Student Knows All Experts Know: From Sparse to Dense.
Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

Hierarchical Dialogue Understanding with Special Tokens and Turn-level Attention.
Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

Sequence Parallelism: Long Sequence Training from System Perspective.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

CowClip: Reducing CTR Prediction Model Training Time from 12 Hours to 10 Minutes on 1 GPU.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
RACP: A network with attention corrected prototype for few-shot speaker recognition using indefinite distance metric.
Neurocomputing, 2022

Deeper vs Wider: A Revisit of Transformer Configuration.
CoRR, 2022

CowClip: Reducing CTR Prediction Model Training Time from 12 hours to 10 minutes on 1 GPU.
CoRR, 2022

An Embarrassingly Simple Model for Dialogue Relation Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2022

Automated Audio Captioning Using Transfer Learning and Reconstruction Latent Space Similarity Regularization.
Proceedings of the IEEE International Conference on Acoustics, 2022

Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Go Wider Instead of Deeper.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Large-Scale Deep Learning Optimizations: A Comprehensive Survey.
CoRR, 2021

Sparse-MLP: A Fully-MLP Architecture with Conditional Computation.
CoRR, 2021

Sequence Parallelism: Making 4D Parallelism Possible.
CoRR, 2021

Recent Advances in Deep Learning Based Dialogue Systems: A Systematic Survey.
CoRR, 2021

GDPNet: Refining Latent Multi-View Graph for Relation Extraction.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
A network model of speaker identification with new feature extraction methods and asymmetric BLSTM.
Neurocomputing, 2020

An Embarrassingly Simple Model for Dialogue Relation Extraction.
CoRR, 2020

Deep Graph Random Process for Relational-Thinking-Based Speech Recognition.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Underwater Acoustic Target Recognition: A Combination of Multi-Dimensional Fusion Features and Modified Deep Neural Network.
Remote. Sens., 2019


  Loading...