Yanlin Wang

Orcid: 0000-0001-7761-7269

Affiliations:
  • Sun Yat-sen University, School of Software Engineering, Zhuhai, China
  • Microsoft Research Asia, Beijing, China (former)
  • University of Hong Kong (PhD 2019)


According to our database1, Yanlin Wang authored at least 109 papers between 2016 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Agents in software engineering: survey, landscape, and vision.
Autom. Softw. Eng., November, 2025

EvolMathEval: Towards Evolvable Benchmarks for Mathematical Reasoning via Evolutionary Testing.
CoRR, August, 2025

Web3 x AI Agents: Landscape, Integrations, and Foundational Challenges.
CoRR, August, 2025

SPENCER: Self-Adaptive Model Distillation for Efficient Code Retrieval.
CoRR, August, 2025

A Hierarchical and Evolvable Benchmark for Fine-Grained Code Instruction Following with Multi-Turn Feedback.
CoRR, July, 2025

FORGE: An LLM-driven Framework for Large-Scale Smart Contract Vulnerability Dataset Construction.
CoRR, June, 2025

SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks.
CoRR, June, 2025

Towards Better Code Generation: Adaptive Decoding with Uncertainty Guidance.
CoRR, June, 2025

MidPO: Dual Preference Optimization for Safety and Helpfulness in Large Language Models via a Mixture of Experts Framework.
CoRR, June, 2025

Context-aware code summarization with multi-relational graph neural network.
Autom. Softw. Eng., June, 2025

When Crypto Fails: Demystifying Cryptographic Defects in Ethereum Smart Contracts.
IEEE Trans. Software Eng., May, 2025

NumScout: Unveiling Numerical Defects in Smart Contracts Using LLM-Pruning Symbolic Execution.
IEEE Trans. Software Eng., May, 2025

When ChatGPT Meets Smart Contract Vulnerability Detection: How Far Are We?
ACM Trans. Softw. Eng. Methodol., May, 2025

Adversarial Reasoning for Repair Based on Inferred Program Intent.
CoRR, May, 2025

Towards an Understanding of Context Utilization in Code Intelligence.
CoRR, April, 2025

FeedbackEval: A Benchmark for Evaluating Large Language Models in Feedback-Driven Code Repair Tasks.
CoRR, April, 2025

Towards an understanding of large language models in software engineering tasks.
Empir. Softw. Eng., March, 2025

What to Retrieve for Effective Retrieval-Augmented Code Generation? An Empirical Study and Beyond.
CoRR, March, 2025

Enhancing the Robustness of LLM-Generated Code: Empirical Study and Framework.
CoRR, March, 2025

RustEvo<sup>2</sup>: An Evolving Benchmark for API Evolution in LLM-based Rust Code Generation.
CoRR, March, 2025

LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation.
Proc. ACM Softw. Eng., 2025

AdverIntent-Agent: Adversarial Reasoning for Repair Based on Inferred Program Intent.
Proc. ACM Softw. Eng., 2025

Beyond Functional Correctness: Investigating Coding Style Inconsistencies in Large Language Models.
Proc. ACM Softw. Eng., 2025

OmniGIRL: A Multilingual and Multimodal Benchmark for GitHub Issue Resolution.
Proc. ACM Softw. Eng., 2025

Large Language Models Are Qualified Benchmark Builders: Rebuilding Pre-Training Datasets for Advancing Code Intelligence Tasks.
Proceedings of the 33rd IEEE/ACM International Conference on Program Comprehension, 2025

HumanEvo: An Evolution-Aware Benchmark for More Realistic Evaluation of Repository-Level Code Generation.
Proceedings of the 47th IEEE/ACM International Conference on Software Engineering, 2025

Demystifying and Detecting Cryptographic Defects in Ethereum Smart Contracts.
Proceedings of the 47th IEEE/ACM International Conference on Software Engineering, 2025

Automated Test Generation For Smart Contracts via On-Chain Test Case Augmentation and Migration.
Proceedings of the 47th IEEE/ACM International Conference on Software Engineering, 2025

Hyperion: Unveiling DApp Inconsistencies Using LLM and Dataflow-Guided Symbolic Execution.
Proceedings of the 47th IEEE/ACM International Conference on Software Engineering, 2025

SECRET: Towards Scalable and Efficient Code Retrieval via Segmented Deep Hashing.
Proceedings of the 47th IEEE/ACM International Conference on Software Engineering, 2025

RLCoder: Reinforcement Learning for Repository-Level Code Completion.
Proceedings of the 47th IEEE/ACM International Conference on Software Engineering, 2025

SoTaNa: An Open-Source Software Engineering Instruction-Tuned Model.
Proceedings of the IEEE/ACM Second International Conference on AI Foundation Models and Software Engineering, 2025

2024
Make Heterophilic Graphs Better Fit GNN: A Graph Rewiring Approach.
IEEE Trans. Knowl. Data Eng., December, 2024

KADEL: Knowledge-Aware Denoising Learning for Commit Message Generation.
ACM Trans. Softw. Eng. Methodol., June, 2024

Snippet Comment Generation Based on Code Context Expansion.
ACM Trans. Softw. Eng. Methodol., January, 2024

Efficiently Detecting Reentrancy Vulnerabilities in Complex Smart Contracts.
Proc. ACM Softw. Eng., 2024

How Well Do LLMs Generate Code for Different Application Domains? Benchmark and Evaluation.
CoRR, 2024

RepoTransBench: A Real-World Benchmark for Repository-Level Code Translation.
CoRR, 2024

Are Decoder-Only Large Language Models the Silver Bullet for Code Search?
CoRR, 2024

Knowledge Probing for Graph Representation Learning.
CoRR, 2024

MCGMark: An Encodable and Robust Online Watermark for LLM-Generated Malicious Code.
CoRR, 2024

RLCoder: Reinforcement Learning for Repository-Level Code Completion.
CoRR, 2024

Beyond Functional Correctness: Investigating Coding Style Inconsistencies in Large Language Models.
CoRR, 2024

CoSQA+: Enhancing Code Search Dataset with Matching Code.
CoRR, 2024

Towards more realistic evaluation of LLM-based code generation: an experimental study and beyond.
CoRR, 2024

An Empirical Study on Low Code Programming using Traditional vs Large Language Model Support.
CoRR, 2024

YODA: Teacher-Student Progressive Learning for Language Models.
CoRR, 2024

SparseCoder: Identifier-Aware Sparse Transformer for File- Level Code Summarization.
Proceedings of the IEEE International Conference on Software Analysis, 2024

MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

MoonBit: Explore the Design of an AI-Friendly Programming Language.
LLM4CODE@ICSE, 2024

RMCBench: Benchmarking Large Language Models' Resistance to Malicious Code.
Proceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering, 2024

When to Stop? Towards Efficient Code Generation in LLMs with Excess Token Prevention.
Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2024

Identifying Smart Contract Security Issues in Code Snippets from Stack Overflow.
Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2024

RepoMinCoder: Improving Repository-Level Code Generation Based on Information Loss Screening.
Proceedings of the 15th Asia-Pacific Symposium on Internetware, 2024

The Dark Side of NFTs: A Large-Scale Empirical Study of Wash Trading.
Proceedings of the 15th Asia-Pacific Symposium on Internetware, 2024

Tackling Long Code Search with Splitting, Encoding, and Aggregating.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

CoSTV: Accelerating Code Search with Two-Stage Paradigm and Vector Retrieval.
Proceedings of the 31st Asia-Pacific Software Engineering Conference, 2024

MemoryBank: Enhancing Large Language Models with Long-Term Memory.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
CoCoAST: Representing Source Code via Hierarchical Splitting and Reconstruction of Abstract Syntax Trees.
Empir. Softw. Eng., November, 2023

Enhancing Code Intelligence Tasks with ChatGPT.
CoRR, 2023

The Dark Side of NFTs: A Large-Scale Empirical Study of Wash Trading.
CoRR, 2023

Code Search Debiasing: Improve Search Results beyond Overall Ranking Performance.
CoRR, 2023

A Survey of Large Language Models for Code: Evolution, Benchmarking, and Future Trends.
CoRR, 2023

Adaptive-Solver Framework for Dynamic Strategy Selection in Large Language Model Reasoning.
CoRR, 2023

When ChatGPT Meets Smart Contract Vulnerability Detection: How Far Are We?
CoRR, 2023

SoTaNa: The Open-Source Software Development Assistant.
CoRR, 2023

Towards an Understanding of Large Language Models in Software Engineering Tasks.
CoRR, 2023

An overview of Web3.0 Technology: Infrastructure, Applications, and Popularity.
CoRR, 2023

Revisiting Code Search in a Two-Stage Paradigm.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

MM-GNN: Mix-Moment Graph Neural Network towards Modeling Neighborhood Feature Distribution.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

Modeling Orders of User Behaviors via Differentiable Sorting: A Multi-task Framework to Predicting User Post-click Conversion.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

PrivateRec: Differentially Private Model Training and Online Serving for Federated News Recommendation.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

EALink: An Efficient and Accurate Pre-Trained Framework for Issue-Commit Link Recovery.
Proceedings of the 38th IEEE/ACM International Conference on Automated Software Engineering, 2023

Towards Efficient Fine-Tuning of Pre-trained Code Models: An Experimental Study and Beyond.
Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2023

Toward Automated Detecting Unanticipated Price Feed in Smart Contract.
Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2023

RefBERT: A Two-Stage Pre-trained Framework for Automatic Rename Refactoring.
Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2023

DeFiTainter: Detecting Price Manipulation Vulnerabilities in DeFi Protocols.
Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2023

Can Neural Networks Help Smart Contract Testing? An Empirical Study.
Proceedings of the 14th Asia-Pacific Symposium on Internetware, 2023

You Augment Me: Exploring ChatGPT-based Data Augmentation for Semantic Code Search.
Proceedings of the IEEE International Conference on Software Maintenance and Evolution, 2023

CoCoSoDa: Effective Contrastive Learning for Code Search.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023

Multi-task Learning for Recommendation over Heterogeneous Information Network (Extended abstract).
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Unveiling the Black Box of PLMs with Semantic Anchors: Towards Interpretable Neural Semantic Parsing.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Multi-Task Learning for Recommendation Over Heterogeneous Information Network.
IEEE Trans. Knowl. Data Eng., 2022

A large-scale empirical study of commit message generation: models, datasets and evaluation.
Empir. Softw. Eng., 2022

Guiding the PLMs with Semantic Anchors as Intermediate Supervision: Towards Interpretable Semantic Parsing.
CoRR, 2022

Make Heterophily Graphs Better Fit GNN: A Graph Rewiring Approach.
CoRR, 2022

Long Code for Code Search.
CoRR, 2022

PrivateRec: Differentially Private Training and Serving for Federated News Recommendation.
CoRR, 2022

Enhancing Semantic Code Search with Multimodal Contrastive Learning and Soft Data Augmentation.
CoRR, 2022

ECMG: Exemplar-based Commit Message Generation.
CoRR, 2022

Game of Privacy: Towards Better Federated Platform Collaboration under Privacy Restriction.
CoRR, 2022

LibDB: An Effective and Efficient Framework for Detecting Third-Party Libraries in Binaries.
Proceedings of the 19th IEEE/ACM International Conference on Mining Software Repositories, 2022

No One Left Behind: Inclusive Federated Learning over Heterogeneous Devices.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

On the Evaluation of Neural Code Summarization.
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022

RACE: Retrieval-augmented Commit Message Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Exploring Representation-level Augmentation for Code Search.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

UniXcoder: Unified Cross-Modal Pre-training for Code Representation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Accelerating Code Search with Deep Hashing and Code Classification.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Neural Code Summarization: How Far Are We?
CoRR, 2021

Is a Single Model Enough? MuCoS: A Multi-Model Ensemble Learning for Semantic Code Search.
CoRR, 2021

CoCoSum: Contextual Code Summarization with Multi-Relational Graph Neural Network.
CoRR, 2021

On the Evaluation of Commit Message Generation Models: An Experimental Study.
Proceedings of the IEEE International Conference on Software Maintenance and Evolution, 2021

CAST: Enhancing Code Summarization with Hierarchical Splitting and Reconstruction of Abstract Syntax Trees.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Is a Single Model Enough? MuCoS: A Multi-Model Ensemble Learning Approach for Semantic Code Search.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Code Completion by Modeling Flattened Abstract Syntax Trees as Graphs.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2018
FHJ: A Formal Model for Hierarchical Dispatching and Overriding.
Proceedings of the 32nd European Conference on Object-Oriented Programming, 2018

2016
Classless Java.
Proceedings of the 2016 ACM SIGPLAN International Conference on Generative Programming: Concepts and Experiences, 2016

The expression problem, trivially!
Proceedings of the 15th International Conference on Modularity, 2016


  Loading...