Ensheng Shi

Orcid: 0000-0002-5543-2025

According to our database, Ensheng Shi authored at least 37 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Towards an Understanding of Context Utilization in Code Intelligence.
ACM Comput. Surv., August, 2026

Yet Even Less Is Even Better For Agentic, Reasoning, and Coding LLMs.
CoRR, April, 2026

DRAINCODE: Stealthy Energy Consumption Attacks on Retrieval-Augmented Code Generation via Context Poisoning.
CoRR, January, 2026

ShortCoder: Knowledge-Augmented Syntax Optimization for Token-Efficient Code Generation.
CoRR, January, 2026

2025
UCoder: Unsupervised Code Generation by Internal Probing of Large Language Models.
CoRR, December, 2025

SimpleDevQA: Benchmarking Large Language Models on Development Knowledge QA.
CoRR, December, 2025

From Code Foundation Models to Agents and Applications: A Comprehensive Survey and Practical Guide to Code Intelligence.
CoRR, November, 2025

Agents in software engineering: survey, landscape, and vision.
Autom. Softw. Eng., November, 2025

EffiReasonTrans: RL-Optimized Reasoning for Code Translation.
CoRR, October, 2025

Context-aware code summarization with multi-relational graph neural network.
Autom. Softw. Eng., June, 2025

LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation.
Proc. ACM Softw. Eng., 2025

DrainCode: Stealthy Energy Consumption Attacks on Retrieval-Augmented Code Generation via Context Poisoning.
Proceedings of the 40th IEEE/ACM International Conference on Automated Software Engineering, 2025

AlignCoder: Aligning Retrieval with Target Intent for Repository-Level Code Completion.
Proceedings of the 40th IEEE/ACM International Conference on Automated Software Engineering, 2025

HumanEvo: An Evolution-Aware Benchmark for More Realistic Evaluation of Repository-Level Code Generation.
Proceedings of the 47th IEEE/ACM International Conference on Software Engineering, 2025

SECRET: Towards Scalable and Efficient Code Retrieval via Segmented Deep Hashing.
Proceedings of the 47th IEEE/ACM International Conference on Software Engineering, 2025

SoTaNa: An Open-Source Software Engineering Instruction-Tuned Model.
Proceedings of the IEEE/ACM Second International Conference on AI Foundation Models and Software Engineering, 2025

Speed Up Your Code: Progressive Code Acceleration Through Bidirectional Tree Editing.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
How Well Do LLMs Generate Code for Different Application Domains? Benchmark and Evaluation.
CoRR, 2024

Towards more realistic evaluation of LLM-based code generation: an experimental study and beyond.
CoRR, 2024

When to Stop? Towards Efficient Code Generation in LLMs with Excess Token Prevention.
Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2024

RepoMinCoder: Improving Repository-Level Code Generation Based on Information Loss Screening.
Proceedings of the 15th Asia-Pacific Symposium on Internetware, 2024

2023
CoCoAST: Representing Source Code via Hierarchical Splitting and Reconstruction of Abstract Syntax Trees.
Empir. Softw. Eng., November, 2023

SoTaNa: The Open-Source Software Development Assistant.
CoRR, 2023

Towards Efficient Fine-Tuning of Pre-trained Code Models: An Experimental Study and Beyond.
Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2023

You Augment Me: Exploring ChatGPT-based Data Augmentation for Semantic Code Search.
Proceedings of the IEEE International Conference on Software Maintenance and Evolution, 2023

CoCoSoDa: Effective Contrastive Learning for Code Search.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023

2022
A large-scale empirical study of commit message generation: models, datasets and evaluation.
Empir. Softw. Eng., 2022

Enhancing Semantic Code Search with Multimodal Contrastive Learning and Soft Data Augmentation.
CoRR, 2022

ECMG: Exemplar-based Commit Message Generation.
CoRR, 2022

On the Evaluation of Neural Code Summarization.
Proceedings of the 44th IEEE/ACM International Conference on Software Engineering, 2022

RACE: Retrieval-augmented Commit Message Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
Neural Code Summarization: How Far Are We?
CoRR, 2021

Is a Single Model Enough? MuCoS: A Multi-Model Ensemble Learning for Semantic Code Search.
CoRR, 2021

CoCoSum: Contextual Code Summarization with Multi-Relational Graph Neural Network.
CoRR, 2021

On the Evaluation of Commit Message Generation Models: An Experimental Study.
Proceedings of the IEEE International Conference on Software Maintenance and Evolution, 2021

CAST: Enhancing Code Summarization with Hierarchical Splitting and Reconstruction of Abstract Syntax Trees.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Is a Single Model Enough? MuCoS: A Multi-Model Ensemble Learning Approach for Semantic Code Search.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

