Kaiyan Zhang

Orcid: 0000-0002-1014-8442

According to our database1, Kaiyan Zhang authored at least 19 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices.
CoRR, 2024

Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation.
CoRR, 2024

Towards Building Specialized Generalist AI with System 1 and System 2 Fusion.
CoRR, 2024

Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.
CoRR, 2024

Online DPO: Online Direct Preference Optimization with Fast-Slow Chasing.
CoRR, 2024

UltraMedical: Building Specialized Generalists in Biomedicine.
CoRR, 2024

Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process.
CoRR, 2024

PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

SMR: State Memory Replay for Long Sequence Modeling.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Generative Multi-Modal Knowledge Retrieval with Large Language Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
A Static and Dynamic Attention Framework for Multi Turn Dialogue Generation.
ACM Trans. Inf. Syst., January, 2023

A Stack-Propagation Framework for Low-Resource Personalized Dialogue Generation.
ACM Trans. Inf. Syst., 2023

Large Language Models are Zero Shot Hypothesis Proposers.
CoRR, 2023

PaD: Program-aided Distillation Specializes Large Models in Reasoning.
CoRR, 2023

Demo: Domino: A High-Precision Performance Monitoring and Analysis Platform for Client Applications.
Proceedings of the 21st Annual International Conference on Mobile Systems, 2023

CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2021
BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021


  Loading...