Yanlin Wang

Orcid: 0000-0001-7761-7269

Affiliations:
  • Sun Yat-sen University, School of Software Engineering, Zhuhai, China
  • Microsoft Research Asia, Beijing, China (former)
  • University of Hong Kong, China (PhD 2019)


According to our database1, Yanlin Wang authored at least 65 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
KADEL: Knowledge-Aware Denoising Learning for Commit Message Generation.
ACM Trans. Softw. Eng. Methodol., June, 2024

Snippet Comment Generation Based on Code Context Expansion.
ACM Trans. Softw. Eng. Methodol., January, 2024

Efficiently Detecting Reentrancy Vulnerabilities in Complex Smart Contracts.
Proc. ACM Softw. Eng., 2024

RMCBench: Benchmarking Large Language Models' Resistance to Malicious Code.
CoRR, 2024

Hyperion: Unveiling DApp Inconsistencies using LLM and Dataflow-Guided Symbolic Execution.
CoRR, 2024

Demystifying and Detecting Cryptographic Defects in Ethereum Smart Contracts.
CoRR, 2024

MCGMark: An Encodable and Robust Online Watermark for LLM-Generated Malicious Code.
CoRR, 2024

RLCoder: Reinforcement Learning for Repository-Level Code Completion.
CoRR, 2024

Beyond Functional Correctness: Investigating Coding Style Inconsistencies in Large Language Models.
CoRR, 2024

CoSQA+: Enhancing Code Search Dataset with Matching Code.
CoRR, 2024

Towards more realistic evaluation of LLM-based code generation: an experimental study and beyond.
CoRR, 2024

An Empirical Study on Low Code Programming using Traditional vs Large Language Model Support.
CoRR, 2024

SparseCoder: Identifier-Aware Sparse Transformer for File- Level Code Summarization.
Proceedings of the IEEE International Conference on Software Analysis, 2024

MoonBit: Explore the Design of an AI-Friendly Programming Language.
LLM4CODE@ICSE, 2024

When to Stop? Towards Efficient Code Generation in LLMs with Excess Token Prevention.
Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2024

Identifying Smart Contract Security Issues in Code Snippets from Stack Overflow.
Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2024

RepoMinCoder: Improving Repository-Level Code Generation Based on Information Loss Screening.
Proceedings of the 15th Asia-Pacific Symposium on Internetware, 2024

The Dark Side of NFTs: A Large-Scale Empirical Study of Wash Trading.
Proceedings of the 15th Asia-Pacific Symposium on Internetware, 2024

Tackling Long Code Search with Splitting, Encoding, and Aggregating.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
CoCoAST: Representing Source Code via Hierarchical Splitting and Reconstruction of Abstract Syntax Trees.
Empir. Softw. Eng., November, 2023

The Dark Side of NFTs: A Large-Scale Empirical Study of Wash Trading.
CoRR, 2023

Code Search Debiasing: Improve Search Results beyond Overall Ranking Performance.
CoRR, 2023

A Survey of Large Language Models for Code: Evolution, Benchmarking, and Future Trends.
CoRR, 2023

When ChatGPT Meets Smart Contract Vulnerability Detection: How Far Are We?
CoRR, 2023

SoTaNa: The Open-Source Software Development Assistant.
CoRR, 2023

Towards an Understanding of Large Language Models in Software Engineering Tasks.
CoRR, 2023

An overview of Web3.0 Technology: Infrastructure, Applications, and Popularity.
CoRR, 2023

Revisiting Code Search in a Two-Stage Paradigm.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

MM-GNN: Mix-Moment Graph Neural Network towards Modeling Neighborhood Feature Distribution.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

Modeling Orders of User Behaviors via Differentiable Sorting: A Multi-task Framework to Predicting User Post-click Conversion.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

PrivateRec: Differentially Private Model Training and Online Serving for Federated News Recommendation.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

EALink: An Efficient and Accurate Pre-Trained Framework for Issue-Commit Link Recovery.
Proceedings of the 38th IEEE/ACM International Conference on Automated Software Engineering, 2023

Towards Efficient Fine-Tuning of Pre-trained Code Models: An Experimental Study and Beyond.
Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2023

Toward Automated Detecting Unanticipated Price Feed in Smart Contract.
Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2023

RefBERT: A Two-Stage Pre-trained Framework for Automatic Rename Refactoring.
Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2023

DeFiTainter: Detecting Price Manipulation Vulnerabilities in DeFi Protocols.
Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2023

Can Neural Networks Help Smart Contract Testing? An Empirical Study.
Proceedings of the 14th Asia-Pacific Symposium on Internetware, 2023

You Augment Me: Exploring ChatGPT-based Data Augmentation for Semantic Code Search.
Proceedings of the IEEE International Conference on Software Maintenance and Evolution, 2023

CoCoSoDa: Effective Contrastive Learning for Code Search.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023

Multi-task Learning for Recommendation over Heterogeneous Information Network (Extended abstract).
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Unveiling the Black Box of PLMs with Semantic Anchors: Towards Interpretable Neural Semantic Parsing.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Multi-Task Learning for Recommendation Over Heterogeneous Information Network.
IEEE Trans. Knowl. Data Eng., 2022

A large-scale empirical study of commit message generation: models, datasets and evaluation.
Empir. Softw. Eng., 2022

Guiding the PLMs with Semantic Anchors as Intermediate Supervision: Towards Interpretable Semantic Parsing.
CoRR, 2022

Make Heterophily Graphs Better Fit GNN: A Graph Rewiring Approach.
CoRR, 2022

Long Code for Code Search.
CoRR, 2022

PrivateRec: Differentially Private Training and Serving for Federated News Recommendation.
CoRR, 2022

Enhancing Semantic Code Search with Multimodal Contrastive Learning and Soft Data Augmentation.
CoRR, 2022

ECMG: Exemplar-based Commit Message Generation.
CoRR, 2022

Game of Privacy: Towards Better Federated Platform Collaboration under Privacy Restriction.
CoRR, 2022

LibDB: An Effective and Efficient Framework for Detecting Third-Party Libraries in Binaries.
Proceedings of the 19th IEEE/ACM International Conference on Mining Software Repositories, 2022

No One Left Behind: Inclusive Federated Learning over Heterogeneous Devices.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

On the Evaluation of Neural Code Summarization.
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022

RACE: Retrieval-augmented Commit Message Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Accelerating Code Search with Deep Hashing and Code Classification.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Neural Code Summarization: How Far Are We?
CoRR, 2021

Is a Single Model Enough? MuCoS: A Multi-Model Ensemble Learning for Semantic Code Search.
CoRR, 2021

CoCoSum: Contextual Code Summarization with Multi-Relational Graph Neural Network.
CoRR, 2021

On the Evaluation of Commit Message Generation Models: An Experimental Study.
Proceedings of the IEEE International Conference on Software Maintenance and Evolution, 2021

CAST: Enhancing Code Summarization with Hierarchical Splitting and Reconstruction of Abstract Syntax Trees.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Is a Single Model Enough? MuCoS: A Multi-Model Ensemble Learning Approach for Semantic Code Search.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Code Completion by Modeling Flattened Abstract Syntax Trees as Graphs.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2018
FHJ: A Formal Model for Hierarchical Dispatching and Overriding.
Proceedings of the 32nd European Conference on Object-Oriented Programming, 2018

2016
Classless Java.
Proceedings of the 2016 ACM SIGPLAN International Conference on Generative Programming: Concepts and Experiences, 2016

The expression problem, trivially!
Proceedings of the 15th International Conference on Modularity, 2016


  Loading...