Ge Li
Orcid: 0000-0002-5828-0186Affiliations:
- Peking University, Key Laboratory of High Confidence Software Technologies, Bejing, China
According to our database1,
Ge Li
authored at least 193 papers
between 2004 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
ACM Trans. Softw. Eng. Methodol., September, 2025
RL-PLUS: Countering Capability Boundary Collapse of LLMs in Reinforcement Learning with Hybrid-policy Optimization.
CoRR, August, 2025
CoRR, July, 2025
Empir. Softw. Eng., June, 2025
CoRR, May, 2025
CoRR, April, 2025
ACM Trans. Softw. Eng. Methodol., March, 2025
Empir. Softw. Eng., March, 2025
LLMigrate: Transforming "Lazy" Large Language Models into Efficient Source Code Migrators.
CoRR, March, 2025
aiXcoder-7B-v2: Training LLMs to Fully Utilize the Long Context in Repository-level Code Completion.
CoRR, March, 2025
CoRR, March, 2025
ACM Trans. Softw. Eng. Methodol., February, 2025
CoRR, February, 2025
Focused-DPO: Enhancing Code Generation Through Focused Preference Optimization on Error-Prone Points.
CoRR, February, 2025
J. Comput. Sci. Technol., January, 2025
PATCH: Empowering Large Language Model with Programmer-Intent Guidance and Collaborative-Behavior Simulation for Automatic Bug Fixing.
CoRR, January, 2025
ROCODE: Integrating Backtracking Mechanism and Program Analysis in Large Language Models for Code Generation.
Proceedings of the 47th IEEE/ACM International Conference on Software Engineering, 2025
Proceedings of the 47th IEEE/ACM International Conference on Software Engineering, 2025
Proceedings of the 47th IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2025
UnCert-CoT: Uncertainty-Aware Chain-of-Thought for Code Generation with Large Language Model.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025
Focused-DPO: Enhancing Code Generation Through Focused Preference Optimization on Error-Prone Points.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
2024
ACM Trans. Softw. Eng. Methodol., November, 2024
ACM Trans. Softw. Eng. Methodol., September, 2024
ACM Trans. Softw. Eng. Methodol., September, 2024
J. Softw. Evol. Process., September, 2024
IEEE Trans. Software Eng., June, 2024
ACM Trans. Softw. Eng. Methodol., June, 2024
ACM Trans. Softw. Eng. Methodol., March, 2024
CodeBERT-Attack: Adversarial attack against source code deep learning models via pre-trained model.
J. Softw. Evol. Process., March, 2024
Inf. Softw. Technol., February, 2024
Exploring and Unleashing the Power of Large Language Models in Automated Code Translation.
Proc. ACM Softw. Eng., 2024
Proc. ACM Softw. Eng., 2024
ROCODE: Integrating Backtracking Mechanism and Program Analysis in Large Language Models for Code Generation.
CoRR, 2024
EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific Evaluations.
CoRR, 2024
CoRR, 2024
Exploring and Lifting the Robustness of LLM-powered Automated Program Repair with Metamorphic Testing.
CoRR, 2024
CoRR, 2024
Code Structure-Aware through Line-level Semantic Learning for Code Vulnerability Detection.
CoRR, 2024
DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories.
CoRR, 2024
Peer-aided Repairer: Empowering Large Language Models to Repair Advanced Student Assignments.
CoRR, 2024
EvoCodeBench: An Evolving Code Generation Benchmark Aligned with Real-World Code Repositories.
CoRR, 2024
CoRR, 2024
SEED: Customize Large Language Models with Sample-Efficient Adaptation for Code Generation.
CoRR, 2024
Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language Models.
CoRR, 2024
EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific Evaluations.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering, 2024
Sifting through the Chaff: On Utilizing Execution Feedback for Ranking the Generated Code Candidates.
Proceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering, 2024
FastFixer: An Efficient and Effective Approach for Repairing Programming Assignments.
Proceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering, 2024
Proceedings of the 32nd IEEE/ACM International Conference on Program Comprehension, 2024
Proceedings of the 2024 IEEE/ACM 46th International Conference on Software Engineering: Companion Proceedings, 2024
Large Language Models are Few-Shot Summarizers: Multi-Intent Comment Generation via In-Context Learning.
Proceedings of the 46th IEEE/ACM International Conference on Software Engineering, 2024
Enhancing Code Generation Performance of Smaller Models by Distilling the Reasoning Ability of LLMs.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
CodeAgent: Enhancing Code Generation with Tool-Integrated Agent Systems for Real-World Repo-level Coding Challenges.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Hot or Cold? Adaptive Temperature Sampling for Code Generation with Large Language Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
ACM Trans. Softw. Eng. Methodol., November, 2023
CoRR, 2023
CoRR, 2023
CoRR, 2023
Neural Program Repair with Program Dependence Analysis and Effective Filter Mechanism.
CoRR, 2023
CoRR, 2023
An Empirical Study on Using Large Language Models for Multi-Intent Comment Generation.
CoRR, 2023
Proceedings of the IEEE International Conference on Software Analysis, 2023
Proceedings of the 38th IEEE/ACM International Conference on Automated Software Engineering, 2023
Implant Global and Local Hierarchy Information to Sequence based Code Representation Models.
Proceedings of the 31st IEEE/ACM International Conference on Program Comprehension, 2023
Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2023
Proceedings of the 14th Asia-Pacific Symposium on Internetware, 2023
Proceedings of the 14th Asia-Pacific Symposium on Internetware, 2023
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023
Antecedent Predictions Are More Important Than You Think: An Effective Method for Tree-Based Code Generation.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
IEEE Trans. Software Eng., 2022
Towards Robustness of Deep Program Processing Models - Detection, Estimation, and Enhancement.
ACM Trans. Softw. Eng. Methodol., 2022
Assessing and Improving an Evaluation Dataset for Detecting Semantic Code Clones via Deep Learning.
ACM Trans. Softw. Eng. Methodol., 2022
Precise Learning of Source Code Contextual Semantics via Hierarchical Dependence Structure and Graph Attention Networks.
J. Syst. Softw., 2022
Empir. Softw. Eng., 2022
Incorporating domain knowledge through task augmentation for front-end JavaScript code generation.
Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2022
Are we building on the rock? on the importance of data preprocessing for code summarization.
Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2022
Fine-Tuning Pre-Trained Language Models Effectively by Optimizing Subnetworks Adaptively.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the 30th IEEE/ACM International Conference on Program Comprehension, 2022
Automated Assertion Generation via Information Retrieval and Its Integration with Deep learning.
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
SK2: Integrating Implicit Sentiment Knowledge and Explicit Syntax Knowledge for Aspect-Based Sentiment Analysis.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022
2021
Sci. China Inf. Sci., 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021
Proceedings of the 36th IEEE/ACM International Conference on Automated Software Engineering, 2021
2020
ACM Trans. Softw. Eng. Methodol., 2020
Empir. Softw. Eng., 2020
Detecting Code Clones with Graph Neural Networkand Flow-Augmented Abstract Syntax Tree.
CoRR, 2020
Detecting Code Clones with Graph Neural Network and Flow-Augmented Abstract Syntax Tree.
Proceedings of the 27th IEEE International Conference on Software Analysis, 2020
DeepCommenter: a deep code comment generation tool with hybrid lexical and syntactical information.
Proceedings of the ESEC/FSE '20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2020
Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering, 2020
Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering, 2020
Proceedings of the ICPC '20: 28th International Conference on Program Comprehension, 2020
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Generating Adversarial Examples for Holding Robustness of Source Code Processing Models.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the 27th International Conference on Program Comprehension, 2019
Why Do Neural Dialog Systems Generate Short and Meaningless Replies? a Comparison between Dialog and Translation.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the 43rd IEEE Annual Computer Software and Applications Conference, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
Proceedings of the 26th Conference on Program Comprehension, 2018
Proceedings of the 26th Conference on Program Comprehension, 2018
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
2017
Proceedings of the Knowledge Science, Engineering and Management, 2017
Proceedings of the Knowledge Science, Engineering and Management, 2017
Learning Sparse Overcomplete Word Vectors Without Intermediate Dense Representations.
Proceedings of the Knowledge Science, Engineering and Management, 2017
2016
Int. J. Embed. Syst., 2016
Context-Aware Tree-Based Convolutional Neural Networks for Natural Language Inference.
Proceedings of the Knowledge Science, Engineering and Management, 2016
Learning Embeddings of API Tokens to Facilitate Deep Learning Based Program Processing.
Proceedings of the Knowledge Science, Engineering and Management, 2016
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016
Improved relation classification by deep recurrent neural networks with data augmentation.
Proceedings of the COLING 2016, 2016
Sequence to Backward and Forward Sequences: A Content-Introducing Approach to Generative Short-Text Conversation.
Proceedings of the COLING 2016, 2016
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016
Convolutional Neural Networks over Tree Structures for Programming Language Processing.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016
2015
Classifying Relations via Long Short Term Memory Networks along Shortest Dependency Path.
CoRR, 2015
CoRR, 2015
Proceedings of the Knowledge Science, Engineering and Management, 2015
Classifying Relations via Long Short Term Memory Networks along Shortest Dependency Paths.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
A Comparative Study on Regularization Strategies for Embedding-based Neural Networks.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
2014
Int. J. Softw. Eng. Knowl. Eng., 2014
TBCNN: A Tree-Based Convolutional Neural Network for Programming Language Processing.
CoRR, 2014
Verification Based on Hyponymy Hierarchical Characteristics for Web-Based Hyponymy Discovery.
Proceedings of the Knowledge Science, Engineering and Management, 2014
2013
Proceedings of the Safe and Secure Software Reuse, 2013
Proceedings of the 35th International Conference on Software Engineering, 2013
Domain Hyponymy Hierarchy Discovery by Iterative Web Searching and Inferable Semantics Based Concept Selecting.
Proceedings of the 37th Annual IEEE Computer Software and Applications Conference, 2013
2012
Modeling and Analyzing the Reliability and Cost of Service Composition in the IoT: A Probabilistic Approach.
Proceedings of the 2012 IEEE 19th International Conference on Web Services, 2012
Discovering Domain Concepts and Hyponymy Relations by Text Relevance Classifying Based Iterative Web Searching.
Proceedings of the 19th Asia-Pacific Software Engineering Conference, 2012
2011
Proceedings of the IEEE 6th International Symposium on Service Oriented System Engineering, 2011
An Ontology based Method for Building Understandable Hierarchical Classification Structure for Software Assets Browsing.
Proceedings of the 23rd International Conference on Software Engineering & Knowledge Engineering (SEKE'2011), 2011
An Engineerable Ontology Based Approach for Requirements Elicitation in Process Centered Problem Domain.
Proceedings of the Knowledge Science, Engineering and Management, 2011
APIExample: An effective web search based usage example recommendation system for java APIs.
Proceedings of the 26th IEEE/ACM International Conference on Automated Software Engineering (ASE 2011), 2011
2010
Enriching Descriptions for Public Web Services Using Information Captured from Related Web Pages on the Internet.
Proceedings of the Fifth IEEE International Symposium on Service-Oriented System Engineering, 2010
Assisting Developers to Read Code Help-Documents Efficiently through Discovering Document-section Relationships.
Proceedings of the 22nd International Conference on Software Engineering & Knowledge Engineering (SEKE'2010), Redwood City, San Francisco Bay, CA, USA, July 1, 2010
2009
Assisting Trustworthiness Based Web Services Selection Using the Fidelity of Websites.
Proceedings of the Service-Oriented Computing, 7th International Joint Conference, 2009
2008
Proceedings of the High Confidence Software Reuse in Large Systems, 2008
2007
Ontology Based Classification Generating Method for Browsing-Based Component Retrieval.
Proceedings of the Nineteenth International Conference on Software Engineering & Knowledge Engineering (SEKE'2007), 2007
Proceedings of the 22nd IEEE/ACM International Conference on Automated Software Engineering (ASE 2007), 2007
Proceedings of the 2007 IEEE International Conference on Web Services (ICWS 2007), 2007
Proceedings of the 31st Annual International Computer Software and Applications Conference, 2007
Proceedings of the 2007 IEEE International Conference on Services Computing (SCC 2007), 2007
2006
Shortening retrieval sequences in browsing-based component retrieval using information entropy.
J. Syst. Softw., 2006
2004
Attribute Ranking: An Entropy-Based Approach to Accelerating Browsing-Based Component Retrieval.
Proceedings of the Software Reuse: Methods, 2004