Yekun Chai

According to our database, Yekun Chai authored at least 28 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions.
CoRR, 2024

Tokenization Falling Short: The Curse of Tokenization.
CoRR, 2024

Dual Modalities of Text: Visual and Textual Generative Pre-training.
CoRR, 2024

On Training Data Influence of GPT Models.
CoRR, 2024

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order.
CoRR, 2024

StarCoder 2 and The Stack v2: The Next Generation.
CoRR, 2024

GiLOT: Interpreting Generative Language Models via Optimal Transport.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Tool-Augmented Reward Modeling.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Autoregressive Pre-Training on Pixels and Texts.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

On Training Data Influence of GPT Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Tokenization Falling Short: On Subword Robustness in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024

2023
Tool-Augmented Reward Modeling.
CoRR, 2023

ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models.
CoRR, 2023

M⁴: A Unified XAI Benchmark for Faithfulness Evaluation of Feature Attribution Methods across Metrics, Modalities and Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Neural Text Classification by Jointly Learning to Cluster and Align.
Proceedings of the International Joint Conference on Neural Networks, 2023

Improved Training Of Mixture-Of-Experts Language GANs.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
X-PuDu at SemEval-2022 Task 6: Multilingual Learning for English and Arabic Sarcasm Detection.
CoRR, 2022

X-PuDu at SemEval-2022 Task 6: Multilingual Learning for English and Arabic Sarcasm Detection.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

Clip-Tuning: Towards Derivative-free Prompt Learning with a Mixture of Rewards.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Predicate-Argument Based Bi-Encoder for Paraphrase Identification.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
RefineCap: Concept-Aware Refinement for Image Captioning.
CoRR, 2021

Counter-Contrastive Learning for Language GANs.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

2020
Neural Text Classification by Jointly Learning to Cluster and Align.
CoRR, 2020

Highway Transformer: Self-Gating Enhanced Self-Attentive Networks.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
How to Evaluate Word Representations of Informal Domain?
CoRR, 2019

Exponential Moving Averaged Q-Network for DDPG.
Proceedings of the Pattern Recognition and Computer Vision - Second Chinese Conference, 2019

