Yao Wan

Orcid: 0000-0001-6937-4180

Affiliations:
  • Huazhong University of Science and Technology, China
  • Zhejiang University, China (former)


According to our database1, Yao Wan authored at least 79 papers between 2015 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit.
ACM Comput. Surv., December, 2024

PyScribe-Learning to describe python code.
Softw. Pract. Exp., March, 2024

Collaborative Knowledge Graph Fusion by Exploiting the Open Corpus.
IEEE Trans. Knowl. Data Eng., February, 2024

FedGKD: Toward Heterogeneous Federated Learning via Global Knowledge Distillation.
IEEE Trans. Computers, January, 2024

IRCoCo: Immediate Rewards-Guided Deep Reinforcement Learning for Code Completion.
Proc. ACM Softw. Eng., 2024

Automated Data Visualization from Natural Language via Large Language Models: An Exploratory Study.
Proc. ACM Manag. Data, 2024

The Impact of Large Language Models in Academia: from Writing to Speaking.
CoRR, 2024

Sifting through the Chaff: On Utilizing Execution Feedback for Ranking the Generated Code Candidates.
CoRR, 2024

Self-Cognition in Large Language Models: An Exploratory Study.
CoRR, 2024

UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models.
CoRR, 2024

ObscurePrompt: Jailbreaking Large Language Models via Obscure Input.
CoRR, 2024

GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents.
CoRR, 2024

The Best of Both Worlds: Toward an Honest and Helpful Large Language Model.
CoRR, 2024

Does Your Neural Code Completion Model Use My Code? A Membership Inference Approach.
CoRR, 2024

VISION2UI: A Real-World Dataset with Layout for Code Generation from UI Designs.
CoRR, 2024

NL2Formula: Generating Spreadsheet Formulas from Natural Language Queries.
CoRR, 2024

I Think, Therefore I am: Awareness in Large Language Models.
CoRR, 2024

LLM-as-a-Coauthor: The Challenges of Detecting LLM-Human Mixcase.
CoRR, 2024

Hierarchical medical image report adversarial generation with hybrid discriminator.
Artif. Intell. Medicine, 2024

DIVKNOWQA: Assessing the Reasoning Ability of LLMs via Open-Domain Question Answering over Knowledge Base and Text.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected?
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

kNN-ICL: Compositional Task-Oriented Parsing Generalization with Nearest Neighbor In-Context Learning.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Graph Neural Networks for Vulnerability Detection: A Counterfactual Explanation.
Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2024

MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

CodeIP: A Grammar-Guided Multi-Bit Watermark for Large Language Models of Code.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

NL2Formula: Generating Spreadsheet Formulas from Natural Language Queries.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

Enhancing Code Generation Performance of Smaller Models by Distilling the Reasoning Ability of LLMs.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

KEEP CHATTING! An Attractive Dataset for Continuous Conversation Agents.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Iterative Refinement of Project-Level Code Context for Precise Code Generation with Compiler Feedback.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Summarizing source code with Heterogeneous Syntax Graph and dual position.
Inf. Process. Manag., September, 2023

Reinforced MOOCs Concept Recommendation in Heterogeneous Information Networks.
ACM Trans. Web, August, 2023

Diverse title generation for Stack Overflow posts with multiple-sampling-enhanced transformer.
J. Syst. Softw., June, 2023

SOR-TC: Self-attentive octave ResNet with temporal consistency for compressed video action recognition.
Neurocomputing, 2023

Localize, Retrieve and Fuse: A Generalized Framework for Free-Form Question Answering over Tables.
Proceedings of the Findings of the Association for Computational Linguistics: IJCNLP-AACL 2023, 2023

Named Entity Recognition via Machine Reading Comprehension: A Multi-Task Learning Approach.
Proceedings of the Findings of the Association for Computational Linguistics: IJCNLP-AACL 2023, 2023

SiMFy: A Simple Yet Effective Approach for Temporal Knowledge Graph Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
Reinforcement-Learning-Guided Source Code Summarization Using Hierarchical Attention.
IEEE Trans. Software Eng., 2022

XCode: Towards Cross-Language Code Representation with Large-Scale Pre-Training.
ACM Trans. Softw. Eng. Methodol., 2022

Modeling Sequential Listening Behaviors With Attentive Temporal Point Process for Next and Next New Music Recommendation.
IEEE Trans. Multim., 2022

FedBERT: When Federated Learning Meets Pre-training.
ACM Trans. Intell. Syst. Technol., 2022

Multi-triage: A multi-task learning framework for bug triage.
J. Syst. Softw., 2022

Reinforced MOOCs Concept Recommendation in Heterogeneous Information Networks.
CoRR, 2022

Cross-Language Binary-Source Code Matching with Intermediate Representations.
Proceedings of the IEEE International Conference on Software Analysis, 2022

You see what I want you to see: poisoning vulnerabilities in neural code search.
Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2022

CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

What Do They Capture? - A Structural Analysis of Pre-Trained Language Models for Source Code.
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022

NaturalCC: An Open-Source Toolkit for Code Intelligence.
Proceedings of the 44th IEEE/ACM International Conference on Software Engineering: Companion Proceedings, 2022

Rethinking the Video Sampling and Reasoning Strategies for Temporal Sentence Grounding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Compilable Neural Code Generation with Compiler Feedback.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Modeling Hierarchical Syntax Structure with Triplet Position for Source Code Summarization.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Are Pre-trained Transformers Robust in Intent Classification? A Missing Ingredient in Evaluation of Out-of-Scope Intent Detection.
Proceedings of the 4th Workshop on NLP for Conversational AI, 2022

DANets: Deep Abstract Networks for Tabular Data Classification and Regression.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
FCCA: Hybrid Code Representation for Functional Clone Detection Using Attention Networks.
IEEE Trans. Reliab., 2021

FedHM: Efficient Federated Learning for Heterogeneous Models via Low-rank Factorization.
CoRR, 2021

Are Pretrained Transformers Robust in Intent Classification? A Missing Ingredient in Evaluation of Out-of-Scope Intent Detection.
CoRR, 2021

Enriching Non-Autoregressive Transformer with Syntactic and SemanticStructures for Neural Machine Translation.
CoRR, 2021

Fix-Filter-Fix: Intuitively Connect Any Models for Effective Bug Fixing.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

HETFORMER: Heterogeneous Transformer with Sparse Attention for Long-Text Extractive Summarization.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Attend, Memorize and Generate: Towards Faithful Table-to-Text Generation in Few Shots.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Enriching Non-Autoregressive Transformer with Syntactic and Semantic Structures for Neural Machine Translation.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Disentangled Code Representation Learning for Multiple Programming Languages.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

KG-BART: Knowledge Graph-Augmented BART for Generative Commonsense Reasoning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Multi-view factorization machines for mobile app recommendation based on hierarchical attention.
Knowl. Based Syst., 2020

NaturalCC: A Toolkit to Naturalize the Source Code Corpus.
CoRR, 2020

Find or Classify? Dual Strategy for Slot-Value Predictions on Multi-Domain Dialog State Tracking.
Proceedings of the Ninth Joint Conference on Lexical and Computational Semantics, 2020

Cross-Supervised Joint-Event-Extraction with Heterogeneous Information Networks.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Discriminative Nearest Neighbor Few-Shot Intent Detection by Transferring Natural Language Inference.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019
Multi-Modal Generative Adversarial Network for Short Product Title Generation in Mobile E-Commerce.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Multi-modal Attention Network Learning for Semantic Source Code Retrieval.
Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering, 2019

Competitive Multi-agent Deep Reinforcement Learning with Counterfactual Thinking.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

2018
SCSMiner: mining social coding sites for software developer recommendation with relevance propagation.
World Wide Web, 2018

Exploiting cross-source knowledge for warming up community question answering services.
Neurocomputing, 2018

Product Title Refinement via Multi-Modal Generative Adversarial Learning.
CoRR, 2018

Improving automatic source code summarization via deep reinforcement learning.
Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering, 2018

Improved Dynamic Memory Network for Dialogue Act Classification with Adversarial Training.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

2017
Exploiting Geographical Location for Team Formation in Social Coding Sites.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2017

2016
Incorporating Heterogeneous Information for Mashup Discovery with Consistent Regularization.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2016

2015
Time-Aware API Popularity Prediction via Heterogeneous Features.
Proceedings of the 2015 IEEE International Conference on Web Services, 2015


  Loading...