Yu Sun
Orcid: 0000-0002-5430-5534Affiliations:
- Baidu Inc., Beijing, China
According to our database1,
Yu Sun
authored at least 31 papers
between 2013 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
CoRR, 2024
NACL: A General and Effective KV Cache Eviction Framework for LLMs at Inference Time.
CoRR, 2024
DHA: Learning Decoupled-Head Attention from Transformer Checkpoints via Adaptive Heads Fusion.
CoRR, 2024
Proceedings of the ACM on Web Conference 2024, 2024
LEMON: Reviving Stronger and Smaller LMs from Larger LMs with Linear Parameter Fusion.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
ACM Trans. Web, February, 2023
Proceedings of the ACM Web Conference 2023, 2023
Proceedings of the 8th Workshop on Representation Learning for NLP, 2023
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
2022
X-PuDu at SemEval-2022 Task 6: Multilingual Learning for English and Arabic Sarcasm Detection.
CoRR, 2022
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding.
CoRR, 2022
CoRR, 2022
Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters.
CoRR, 2022
X-PuDu at SemEval-2022 Task 7: A Replaced Token Detection Task Pre-trained Model with Pattern-aware Ensembling for Identifying Plausible Clarifications.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022
X-PuDu at SemEval-2022 Task 6: Multilingual Learning for English and Arabic Sarcasm Detection.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Simple and Effective Relation-based Embedding Propagation for Knowledge Representation Learning.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
2021
ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation.
CoRR, 2021
CoRR, 2021
Masked Label Prediction: Unified Message Passing Model for Semi-Supervised Classification.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding.
CoRR, 2020
Masked Label Prediction: Unified Massage Passing Model for Semi-Supervised Classification.
CoRR, 2020
Kk2018 at SemEval-2020 Task 9: Adversarial Training for Code-Mixing Sentiment Classification.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020
2013
Proceedings of The 13th International Conference on Parsing Technologies, 2013