Yu Sun

Orcid: 0000-0002-5430-5534

Affiliations:
  • Baidu Inc., Beijing, China


According to our database1, Yu Sun authored at least 28 papers between 2013 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
DHA: Learning Decoupled-Head Attention from Transformer Checkpoints via Adaptive Heads Fusion.
CoRR, 2024

HFT: Half Fine-Tuning for Large Language Models.
CoRR, 2024

Dual Modalities of Text: Visual and Textual Generative Pre-training.
CoRR, 2024

Spectral Heterogeneous Graph Convolutions via Positive Noncommutative Polynomials.
Proceedings of the ACM on Web Conference 2024, 2024

2023
Pre-trained Language Model-based Retrieval and Ranking for Web Search.
ACM Trans. Web, February, 2023

Tool-Augmented Reward Modeling.
CoRR, 2023

ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models.
CoRR, 2023

Label Information Enhanced Fraud Detection against Low Homophily in Graphs.
Proceedings of the ACM Web Conference 2023, 2023

Retrieval-Augmented Domain Adaptation of Language Models.
Proceedings of the 8th Workshop on Representation Learning for NLP, 2023

ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
X-PuDu at SemEval-2022 Task 6: Multilingual Learning for English and Arabic Sarcasm Detection.
CoRR, 2022

ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding.
CoRR, 2022

ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding.
CoRR, 2022

Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters.
CoRR, 2022

X-PuDu at SemEval-2022 Task 7: A Replaced Token Detection Task Pre-trained Model with Pattern-aware Ensembling for Identifying Plausible Clarifications.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

X-PuDu at SemEval-2022 Task 6: Multilingual Learning for English and Arabic Sarcasm Detection.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

mmLayout: Multi-grained MultiModal Transformer for Document Understanding.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Simple and Effective Relation-based Embedding Propagation for Knowledge Representation Learning.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation.
CoRR, 2021

Graph4Rec: A Universal Toolkit with Graph Neural Networks for Recommender Systems.
CoRR, 2021

Masked Label Prediction: Unified Message Passing Model for Semi-Supervised Classification.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

ERNIE-ViL: Knowledge Enhanced Vision-Language Representations through Scene Graphs.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding.
CoRR, 2020

Masked Label Prediction: Unified Massage Passing Model for Semi-Supervised Classification.
CoRR, 2020

Kk2018 at SemEval-2020 Task 9: Adversarial Training for Code-Mixing Sentiment Classification.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

2013
Generalization of Words for Chinese Dependency Parsing.
Proceedings of The 13th International Conference on Parsing Technologies, 2013


  Loading...