Dian Yu

Orcid: 0000-0002-8583-8931

Affiliations:
  • Tencent AI Lab, Bellevue, WA, USA


According to our database1, Dian Yu authored at least 60 papers between 2013 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning.
CoRR, 2024

DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search.
CoRR, 2024

SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models.
CoRR, 2024

Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning.
CoRR, 2024

LiteSearch: Efficacious Tree Search for LLM.
CoRR, 2024

Scaling Synthetic Data Creation with 1,000,000,000 Personas.
CoRR, 2024

Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning.
CoRR, 2024

MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions.
CoRR, 2024

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing.
CoRR, 2024

Conceptual and Unbiased Reasoning in Language Models.
CoRR, 2024

Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models.
CoRR, 2024

Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Skills-in-Context: Unlocking Compositionality in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Alternate Diverse Teaching for Semi-supervised Medical Image Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

MinT: Boosting Generalization in Mathematical Reasoning via Multi-view Fine-tuning.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
OpenFact: Factuality Enhanced Open Knowledge Extraction.
Trans. Assoc. Comput. Linguistics, 2023

Skills-in-Context Prompting: Unlocking Compositionality in Large Language Models.
CoRR, 2023

Disco-Bench: A Discourse-Aware Evaluation Benchmark for Language Modelling.
CoRR, 2023

Findings of the WMT 2023 Shared Task on Discourse-Level Literary Translation: A Fresh Orb in the Cosmos of LLMs.
Proceedings of the Eighth Conference on Machine Translation, 2023

Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

More Than Spoken Words: Nonverbal Message Extraction and Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Document-Level Machine Translation with Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

How do Words Contribute to Sentence Semantics? Revisiting Sentence Embeddings with a Perturbation Method.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Cross-Lingual Speaker Identification Using Distant Supervision.
CoRR, 2022

End-to-End Chinese Speaker Identification.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

ZeroKBC: A Comprehensive Benchmark for Zero-Shot Knowledge Base Completion.
Proceedings of the IEEE International Conference on Data Mining Workshops, 2022

NarraSum: A Large-Scale Dataset for Abstractive Narrative Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Learning-by-Narrating: Narrative Pre-Training for Zero-Shot Dialogue Comprehension.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

C-MORE: Pretraining to Answer Open-Domain Questions by Consulting Millions of References.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

Improving Machine Reading Comprehension with Contextualized Commonsense Knowledge.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Self-Teaching Machines to Read and Comprehend with Large-Scale Multi-Subject Question-Answering Data.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Connect-the-Dots: Bridging Semantics between Words and Definitions via Aligning Word Sense Inventories.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension.
Trans. Assoc. Comput. Linguistics, 2020

CLUE: A Chinese Language Understanding Evaluation Benchmark.
CoRR, 2020


Dialogue-Based Relation Extraction.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Recurrent Chunking Mechanisms for Long-Text Machine Reading Comprehension.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

MultiSumm: Towards a Unified Model for Multi-Lingual Abstractive Summarization.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
DREAM: A Challenge Dataset and Models for Dialogue-Based Reading Comprehension.
Trans. Assoc. Comput. Linguistics, 2019

Improving Pre-Trained Multilingual Models with Vocabulary Expansion.
CoRR, 2019

Teaching Pretrained Models with Commonsense Reasoning: A Preliminary KB-Based Approach.
CoRR, 2019

Probing Prior Knowledge Needed in Challenging Chinese Machine Reading Comprehension.
CoRR, 2019

Improving Question Answering with External Knowledge.
CoRR, 2019

Improving Machine Reading Comprehension with General Reading Strategies.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Evidence Sentence Extraction for Machine Reading Comprehension.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

Improving Pre-Trained Multilingual Model with Vocabulary Expansion.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

Improving Question Answering with External Knowledge.
Proceedings of the 2nd Workshop on Machine Reading for Question Answering, 2019

2017
Unsupervised graph-based relation extraction and validation for knowledge base population.
PhD thesis, 2017

Open Relation Extraction and Grounding.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

2016
RPI BLENDER TAC-KBP2016 System Description.
Proceedings of the 2016 Text Analysis Conference, 2016

Unsupervised Person Slot Filling based on Graph Mining.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
RPI BLENDER TAC-KBP2015 System Description.
Proceedings of the 2015 Text Analysis Conference, 2015

Why Read if You Can Scan? Trigger Scoping Strategy for Biographical Fact Extraction.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Modeling Truth Existence in Truth Discovery.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Detecting Deceptive Groups Using Conversations and Network Analysis.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
The Wisdom of Minority: Unsupervised Slot Filling Validation based on Multi-dimensional Truth-Finding.
Proceedings of the COLING 2014, 2014

2013
RPI-BLENDER TAC-KBP2013 Knowledge Base Population System.
Proceedings of the Sixth Text Analysis Conference, 2013

Resolving Entity Morphs in Censored Data.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013


  Loading...