Runxin Xu

Orcid: 0000-0002-3876-2284

According to our database¹, Runxin Xu authored at least 34 papers between 2020 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2024

DeepSeek-V3 Technical Report.

Mingchuan Zhang

CoRR, 2024

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models.

Shanghaoran Quan

CoRR, 2024

Towards a Unified View of Preference Learning for Large Language Models: A Survey.

CoRR, 2024

Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models.

CoRR, 2024

LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback.

CoRR, 2024

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence.

CoRR, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model.

Mingchuan Zhang

CoRR, 2024

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models.

Mingchuan Zhang

CoRR, 2024

Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Making Pre-trained Language Models End-to-end Few-shot Learners with Contrastive Prompt Tuning.

Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

Knowledgeable Salient Span Mask for Enhancing Language Models as Knowledge Base.

Proceedings of the Natural Language Processing and Chinese Computing, 2023

TABLEIE: Capturing the Interactions Among Sub-Tasks in Information Extraction via Double Tables.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Bi-Drop: Enhancing Fine-tuning Generalization via Synchronous sub-net Estimation and Optimization.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

An Iteratively Parallel Generation Method with the Pre-Filling Strategy for Document-level Event Extraction.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022

On Effectively Learning of Knowledge in Continual Pre-training.

CoRR, 2022

A Double-Graph Based Framework for Frame Semantic Parsing.

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

A Two-Stream AMR-enhanced Model for Document-level Event Argument Extraction.

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

An Enhanced Span-based Decomposition Method for Few-Shot Sequence Labeling.

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

ATP: AMRize Then Parse! Enhancing AMR Parsing with PseudoAMRs.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

S$^4$-Tuning: A Simple Cross-lingual Sub-network Tuning Method.

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

Probing Structured Pruning on Multilingual Pre-trained Models: Settings, Algorithms, and Efficiency.

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Focus on the Target's Vocabulary: Masked Label Smoothing for Machine Translation.

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Explicit Interaction Network for Aspect Sentiment Triplet Extraction.

CoRR, 2021

Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning.

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Behind the Scenes: An Exploration of Trigger Biases Problem in Few-Shot Event Classification.

Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Document-level Event Extraction via Heterogeneous Graph-based Interaction Model with a Tracker.

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

ACMo: Angle-Calibrated Moment Methods for Stochastic Optimization.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Adaptive Gradient Methods Can Be Provably Faster than SGD after Finite Epochs.

CoRR, 2020

Volctrans Parallel Corpus Filtering System for WMT 2020.

Proceedings of the Fifth Conference on Machine Translation, 2020

Double Graph Based Reasoning for Document-level Relation Extraction.

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Xiaomingbot: A Multilingual Robot News Reporter.

Songcheng Jiang

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020
