Runxin Xu

Orcid: 0000-0002-3876-2284

According to our database, Runxin Xu authored at least 33 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.


Bibliography

2024
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models.
CoRR, 2024

Towards a Unified View of Preference Learning for Large Language Models: A Survey.
CoRR, 2024

Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models.
CoRR, 2024

LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback.
CoRR, 2024

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence.
CoRR, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model.
CoRR, 2024

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models.
CoRR, 2024

Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Making Pre-trained Language Models End-to-end Few-shot Learners with Contrastive Prompt Tuning.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

Knowledgeable Salient Span Mask for Enhancing Language Models as Knowledge Base.
Proceedings of the Natural Language Processing and Chinese Computing, 2023

TABLEIE: Capturing the Interactions Among Sub-Tasks in Information Extraction via Double Tables.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Bi-Drop: Enhancing Fine-tuning Generalization via Synchronous sub-net Estimation and Optimization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

An Iteratively Parallel Generation Method with the Pre-Filling Strategy for Document-level Event Extraction.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
On Effectively Learning of Knowledge in Continual Pre-training.
CoRR, 2022

A Double-Graph Based Framework for Frame Semantic Parsing.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

A Two-Stream AMR-enhanced Model for Document-level Event Argument Extraction.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

An Enhanced Span-based Decomposition Method for Few-Shot Sequence Labeling.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

ATP: AMRize Then Parse! Enhancing AMR Parsing with PseudoAMRs.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

S$^4$-Tuning: A Simple Cross-lingual Sub-network Tuning Method.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

Probing Structured Pruning on Multilingual Pre-trained Models: Settings, Algorithms, and Efficiency.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Focus on the Target's Vocabulary: Masked Label Smoothing for Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Explicit Interaction Network for Aspect Sentiment Triplet Extraction.
CoRR, 2021

Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Behind the Scenes: An Exploration of Trigger Biases Problem in Few-Shot Event Classification.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Document-level Event Extraction via Heterogeneous Graph-based Interaction Model with a Tracker.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

ACMo: Angle-Calibrated Moment Methods for Stochastic Optimization.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Adaptive Gradient Methods Can Be Provably Faster than SGD after Finite Epochs.
CoRR, 2020

Volctrans Parallel Corpus Filtering System for WMT 2020.
Proceedings of the Fifth Conference on Machine Translation, 2020

Double Graph Based Reasoning for Document-level Relation Extraction.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Xiaomingbot: A Multilingual Robot News Reporter.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

