Yasheng Wang

Orcid: 0000-0002-3221-0470

According to our database1, Yasheng Wang authored at least 84 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
ToolFlow: Boosting LLM Tool-Calling Through Natural and Coherent Dialogue Synthesis.
CoRR, 2024

SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation.
CoRR, 2024

Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance.
CoRR, 2024

Learning Evolving Tools for Large Language Models.
CoRR, 2024

RevisEval: Improving LLM-as-a-Judge via Response-Adapted References.
CoRR, 2024

Flat-LoRA: Low-Rank Adaption over a Flat Loss Landscape.
CoRR, 2024

RethinkMCTS: Refining Erroneous Thoughts in Monte Carlo Tree Search for Code Generation.
CoRR, 2024

ToolACE: Winning the Points of LLM Function Calling.
CoRR, 2024

Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization.
CoRR, 2024

Entropy Law: The Story Behind Data Compression and LLM Performance.
CoRR, 2024

CoIR: A Comprehensive Benchmark for Code Information Retrieval Models.
CoRR, 2024

Chain-of-Probe: Examing the Necessity and Accuracy of CoT Step-by-Step.
CoRR, 2024

Evaluating the External and Parametric Knowledge Fusion of Large Language Models.
CoRR, 2024

CELA: Cost-Efficient Language Model Alignment for CTR Prediction.
CoRR, 2024

CodeGRAG: Extracting Composed Syntax Graphs for Retrieval Augmented Cross-Lingual Code Generation.
CoRR, 2024

WESE: Weak Exploration to Strong Exploitation for LLM Agents.
CoRR, 2024

Evaluating Robustness of Generative Search Engine on Adversarial Factual Questions.
CoRR, 2024

Understanding the planning of LLM agents: A survey.
CoRR, 2024

YODA: Teacher-Student Progressive Learning for Language Models.
CoRR, 2024

PROXYQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language Models.
CoRR, 2024

UniRetriever: Multi-task Candidates Selection for Various Context-Adaptive Conversational Retrieval.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Improving Language Model Reasoning with Self-motivated Learning.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

SINKT: A Structure-Aware Inductive Knowledge Tracing Model with Large Language Model.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

ProxyQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Evaluating Robustness of Generative Search Engine on Adversarial Factoid Questions.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Red Alarm for Pre-trained Models: Universal Vulnerability to Neuron-level Backdoor Attacks.
Mach. Intell. Res., April, 2023

Sub-Character Tokenization for Chinese Pretrained Language Models.
Trans. Assoc. Comput. Linguistics, 2023

Large Language Models as Source Planner for Personalized Knowledge-grounded Dialogue.
CoRR, 2023

Learning Summary-Worthy Visual Representation for Abstractive Summarization in Video.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

History, Present and Future: Enhancing Dialogue Generation with Few-Shot History-Future Prompt.
Proceedings of the IEEE International Conference on Acoustics, 2023

Lexicon-injected Semantic Parsing for Task-Oriented Dialog.
Proceedings of the IEEE International Conference on Acoustics, 2023

Improving Factual Consistency for Knowledge-Grounded Dialogue Systems via Knowledge Enhancement and Alignment.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Large Language Models as Source Planner for Personalized Knowledge-grounded Dialogues.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

HyperPELT: Unified Parameter-Efficient Language Model Tuning for Both Language and Vision-and-Language Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Retrieval-free Knowledge Injection through Multi-Document Traversal for Dialogue Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

One Cannot Stand for Everyone! Leveraging Multiple User Simulators to train Task-oriented Dialogue Systems.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

DecompEval: Evaluating Generated Texts as Unsupervised Decomposed Question Answering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

A Synthetic Data Generation Framework for Grounded Dialogues.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Moral Discussions.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

KPT: Keyword-Guided Pre-training for Grounded Dialog Generation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Constructing Moral Discussions.
CoRR, 2022

MultiCoder: Multi-Programming-Lingual Pre-Training for Low-Resource Code Completion.
CoRR, 2022

PanGu-Coder: Program Synthesis with Function-Level Language Modeling.
CoRR, 2022

Sparse Structure Search for Parameter-Efficient Tuning.
CoRR, 2022

Revisiting Pre-trained Language Models and their Evaluation for Arabic Natural Language Understanding.
CoRR, 2022

PANGUBOT: Efficient Generative Dialogue Pre-training from Pre-trained Language Model.
CoRR, 2022

Towards Identifying Social Bias in Dialog Systems: Frame, Datasets, and Benchmarks.
CoRR, 2022

Source Code Summarization with Structural Relative Position Guided Transformer.
Proceedings of the IEEE International Conference on Software Analysis, 2022

Sparse Structure Search for Delta Tuning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

LMTurk: Few-Shot Learners as Crowdsourcing Workers in a Language-Model-as-a-Service Framework.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

CoCA-MDD: A Coupled Cross-Attention based Framework for Streaming Mispronunciation Detection and Diagnosis.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Towards Identifying Social Bias in Dialog Systems: Framework, Dataset, and Benchmark.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Constructing Highly Inductive Contexts for Dialogue Safety through Controllable Reverse Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Leveraging Only the Category Name for Aspect Detection through Prompt-based Constrained Clustering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Momentum Contrastive Pre-training for Question Answering.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Revisiting Pre-trained Language Models and their Evaluation for Arabic Natural Language Processing.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

AEG: Argumentative Essay Generation via A Dual-Decoder Model with Content Planning.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Pan More Gold from the Sand: Refining Open-domain Dialogue Training with Noisy Self-Retrieval Generation.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Compilable Neural Code Generation with Compiler Feedback.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

UniDS: A Unified Dialogue System for Chit-Chat and Task-oriented Dialogues.
Proceedings of the Second DialDoc Workshop on Document-grounded Dialogue and Conversational Question Answering, 2022

UniMS: A Unified Framework for Multimodal Summarization with Knowledge Distillation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

CINS: Comprehensive Instruction for Few-Shot Learning in Task-Oriented Dialog Systems.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
LMTurk: Few-Shot Learners as Crowdsourcing Workers.
CoRR, 2021

JABER: Junior Arabic BERt.
CoRR, 2021

CCA-MDD: A Coupled Cross-Attention based Framework for Streaming Mispronunciation detection and diagnosis.
CoRR, 2021

CINS: Comprehensive Instruction for Few-shot Learning in Task-oriented Dialog Systems.
CoRR, 2021

CLSEBERT: Contrastive Learning for Syntax Enhanced Code Pre-Trained Model.
CoRR, 2021

Red Alarm for Pre-trained Models: Universal Vulnerabilities by Neuron-Level Backdoor Attacks.
CoRR, 2021

ServiceBERT: A Pre-trained Model for Web Service Tagging and Recommendation.
Proceedings of the Service-Oriented Computing - 19th International Conference, 2021

Better Robustness by More Coverage: Adversarial and Mixup Data Augmentation for Robust Finetuning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Improving Sequence Modeling Ability of Recurrent Neural Networks via Sememes.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Better Robustness by More Coverage: Adversarial Training with Mixup Augmentation for Robust Fine-tuning.
CoRR, 2020

Unified Mandarin TTS Front-end Based on Distilled BERT Model.
CoRR, 2020

Multi-Channel Reverse Dictionary Model.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Multi-channel Reverse Dictionary Model.
CoRR, 2019

Enhancing Recurrent Neural Networks with Sememes.
CoRR, 2019

NEZHA: Neural Contextualized Representation for Chinese Language Understanding.
CoRR, 2019

GPT-based Generation for Classical Chinese Poetry.
CoRR, 2019

2017
Sentiment Lexicon Expansion Based on Neural PU Learning, Double Dictionary Lookup, and Polarity Association.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

2010
Entity answer extraction of web table.
Proceedings of the Seventh International Conference on Fuzzy Systems and Knowledge Discovery, 2010


  Loading...