Baosong Yang

Orcid: 0000-0001-5002-2409

According to our database1, Baosong Yang authored at least 76 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Rethinking the Exploitation of Monolingual Data for Low-Resource Neural Machine Translation.
Comput. Linguistics, March, 2024

Qwen2.5 Technical Report.
CoRR, 2024

P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs.
CoRR, 2024

ZhoBLiMP: a Systematic Assessment of Language Models with Linguistic Minimal Pairs in Chinese.
CoRR, 2024

Not All Languages are Equal: Insights into Multilingual Retrieval-Augmented Generation.
CoRR, 2024

Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation.
CoRR, 2024

Qwen2 Technical Report.
CoRR, 2024

MoE-CT: A Novel Approach For Large Language Models Training With Resistance To Catastrophic Forgetting.
CoRR, 2024

Trajectory Volatility for Out-of-Distribution Detection in Mathematical Reasoning.
CoRR, 2024

Final Submission of SJTULoveFiction to Literary Task.
Proceedings of the Ninth Conference on Machine Translation, 2024

SJTU System Description for the WMT24 Low-Resource Languages of Spain Task.
Proceedings of the Ninth Conference on Machine Translation, 2024

mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

AnyTrans: Translate AnyText in the Image with Large Scale Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Large Language Model for Multi-Domain Translation: Benchmarking and Domain CoT Fine-tuning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

MoNMT: Modularly Leveraging Monolingual and Bilingual Knowledge for Neural Machine Translation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Meta-Reasoning: Semantics-Symbol Deconstruction for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Efficient k-Nearest-Neighbor Machine Translation with Dynamic Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Towards Energy-Preserving Natural Language Understanding With Spiking Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

From statistical methods to deep learning, automatic keyphrase prediction: A survey.
Inf. Process. Manag., 2023

PolyLM: An Open Source Polyglot Large Language Model.
CoRR, 2023

Towards Fine-Grained Information: Identifying the Type and Location of Translation Errors.
CoRR, 2023

Imitation Attacks Can Steal More Than You Think from Machine Translation Systems.
Proceedings of the Natural Language Processing and Chinese Computing, 2023

EMMA-X: An EM-like Multilingual Pre-training Algorithm for Cross-lingual Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Dynamic Voting for Efficient Reasoning in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Unifying Discrete and Continuous Representations for Unsupervised Paraphrase Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

MMNMT: Modularizing Multilingual Neural Machine Translation with Flexibly Assembled MoE and Dense Blocks.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Fantastic Expressions and Where to Find Them: Chinese Simile Generation with Multiple Constraints.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Bridging the Domain Gaps in Context Representations for k-Nearest Neighbor Neural Machine Translation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Tailor: A Soft-Prompt-Based Approach to Attribute-Based Controlled Text Generation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Multi-view self-attention networks.
Knowl. Based Syst., 2022

Alibaba-Translate China's Submission for WMT 2022 Metrics Shared Task.
CoRR, 2022

Draft, Command, and Edit: Controllable Text Editing in E-Commerce.
CoRR, 2022

Tailor: A Prompt-Based Approach to Attribute-Based Controlled Text Generation.
CoRR, 2022

RMBR: A Regularized Minimum Bayes Risk Reranking Framework for Machine Translation.
CoRR, 2022

Challenges of Neural Machine Translation for Short Texts.
Comput. Linguistics, 2022

Effective Approaches to Neural Query Language Identification.
Comput. Linguistics, 2022

Alibaba-Translate China's Submission for WMT2022 Metrics Shared Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

Alibaba-Translate China's Submission for WMT 2022 Quality Estimation Shared Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

Cross-Lingual Product Retrieval in E-Commerce Search.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2022

Should We Rely on Entity Mentions for Relation Extraction? Debiasing Relation Extraction with Counterfactual Analysis.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Dangling-Aware Entity Alignment with Mixed High-Order Proximities.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Bridging the Gap between Training and Inference: Multi-Candidate Optimization for Diverse Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

WR-One2Set: Towards Well-Calibrated Keyphrase Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Competency-Aware Neural Machine Translation: Can Machine Translation Know its Own Translation Quality?
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Unsupervised Preference-Aware Language Identification.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Attention Mechanism with Energy-Friendly Operations.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

UniTE: Unified Translation Evaluation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

GCPG: A General Framework for Controllable Paraphrase Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Frequency-Aware Contrastive Learning for Neural Machine Translation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

KGR4: Retrieval, Retrospect, Refine and Rethink for Commonsense Generation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Context-aware Self-Attention Networks for Natural Language Processing.
Neurocomputing, 2021

KGR^4: Retrieval, Retrospect, Refine and Rethink for Commonsense Generation.
CoRR, 2021

Leveraging Advantages of Interactive and Non-Interactive Models for Vector-Based Cross-Lingual Information Retrieval.
CoRR, 2021

RoBLEURT Submission for WMT2021 Metrics Task.
Proceedings of the Sixth Conference on Machine Translation, 2021

Multi-Hop Transformer for Document-Level Machine Translation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language Generation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Towards User-Driven Neural Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Improving tree-based neural machine translation with dynamic lexicalized dependency encoding.
Knowl. Based Syst., 2020

Exploiting Neural Query Translation into Cross Lingual Information Retrieval.
CoRR, 2020

Constraint Translation Candidates: A Bridge between Neural Query Translation and Cross-lingual Information Retrieval.
CoRR, 2020

Self-Paced Learning for Neural Machine Translation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Domain Transfer based Data Augmentation for Neural Query Translation.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Uncertainty-Aware Curriculum Learning for Neural Machine Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Unsupervised Neural Dialect Translation with Commonality and Diversity Modeling.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Neuron Interaction Based Representation Composition for Neural Machine Translation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Convolutional Self-Attention Networks.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Information Aggregation for Multi-Head Attention with Routing-by-Agreement.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Modeling Recurrence for Transformer.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Assessing the Ability of Self-Attention Networks to Learn Word Order.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Leveraging Local and Global Patterns for Self-Attention Networks.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Context-Aware Self-Attention Networks.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Modeling Localness for Self-Attention Networks.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Multi-Head Attention with Disagreement Regularization.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2017
Towards Bidirectional Hierarchical Representations for Attention-based Neural Machine Translation.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

2015
Leveraging the Advantages of Associative Alignment Methods for PB-SMT Systems.
Proceedings of the Human Language Technology. Challenges for Computer Science and Linguistics, 2015

Sampling-based Alignment and Hierarchical Sub-sentential Alignment in Chinese-Japanese Translation of Patents.
Proceedings of the 2nd Workshop on Asian Translation, 2015


  Loading...