Xiaodong Gu

Orcid: 0000-0002-0529-6408

Affiliations:
  • Shanghai Jiao Tong University, China


According to our database1, Xiaodong Gu authored at least 68 papers between 2015 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
HEART-Bench: Do LLM Agents Exhibit Human-like Psychology?
CoRR, May, 2026

ClassEval-Pro: A Cross-Domain Benchmark for Class-Level Code Generation.
CoRR, April, 2026

ShredBench: Evaluating the Semantic Reasoning Capabilities of Multimodal LLMs in Document Reconstruction.
CoRR, April, 2026

EffiSkill: Agent Skill Based Automated Code Efficiency Optimization.
CoRR, March, 2026

Test vs Mutant: Adversarial LLM Agents for Robust Unit Test Generation.
CoRR, February, 2026

Rethinking the Value of Agent-Generated Tests for LLM-Based Software Engineering Agents.
CoRR, February, 2026

Rethinking Code Complexity Through the Lens of Large Language Models.
CoRR, February, 2026

CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding.
CoRR, February, 2026

Synthetic Malware at Scale: Malicious Code Generation With Code Transplanting.
IEEE Trans. Software Eng., January, 2026

SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents.
CoRR, January, 2026

Readability-Robust Code Summarization via Meta Curriculum Learning.
CoRR, January, 2026

GlimpRouter: Efficient Collaborative Inference by Glimpsing One Token of Thoughts.
CoRR, January, 2026

CatchAll: Repository-Aware Exception Handling with Knowledge-Guided LLMs.
CoRR, January, 2026

In Line with Context: Repository-Level Code Generation via Context Inlining.
CoRR, January, 2026

Reasoning in Trees: Improving Retrieval-Augmented Generation for Multi-Hop Question Answering.
Proceedings of the Companion Proceedings of the ACM Web Conference 2026, 2026

Anti-adversarial Learning: Desensitizing Prompts for Large Language Model.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Neuron-Guided Interpretation of Code LLMs: Where, Why, and How?
CoRR, December, 2025

Beyond Language Boundaries: Uncovering Programming Language Families for Code Language Models.
CoRR, December, 2025

SALT4Decompile: Inferring Source-level Abstract Logic Tree for LLM-Based Binary Decompilation.
CoRR, September, 2025

SWE-QA: Can Language Models Answer Repository-level Code Questions?
CoRR, September, 2025

LibRec: Benchmarking Retrieval-Augmented LLMs for Library Migration Recommendations.
CoRR, August, 2025

Pruning the Unsurprising: Efficient Code Reasoning via First-Token Surprisal.
CoRR, August, 2025

EVOC2RUST: A Skeleton-guided Framework for Project-Level C-to-Rust Translation.
CoRR, August, 2025

SWE-Exp: Experience-Driven Software Issue Resolution.
CoRR, July, 2025

SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution.
CoRR, July, 2025

Anti-adversarial Learning: Desensitizing Prompts for Large Language Models.
CoRR, May, 2025

On the Effectiveness of Large Language Models in Domain-Specific Code Generation.
ACM Trans. Softw. Eng. Methodol., March, 2025

AttentionRAG: Attention-Guided Context Pruning in Retrieval-Augmented Generation.
CoRR, March, 2025

Just-in-time software defect prediction via bi-modal change representation learning.
J. Syst. Softw., 2025

LongCodeZip: Compress Long Context for Code Language Models.
Proceedings of the 40th IEEE/ACM International Conference on Automated Software Engineering, 2025

Between Lines of Code: Unraveling the Distinct Patterns of Machine and Human Programmers.
Proceedings of the 47th IEEE/ACM International Conference on Software Engineering, 2025

Transplant Then Regenerate: A New Paradigm for Text Data Augmentation.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

LastingBench: Defend Benchmarks Against Knowledge Leakage.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Empowering AI to Generate Better AI Code: Guided Generation of Deep Learning Projects with LLMs.
Proceedings of the 49th IEEE Annual Computers, Software, and Applications Conference, 2025

ApiRAT: Integrating Multi-source API Knowledge for Enhanced Code Translation with LLMs.
Proceedings of the 49th IEEE Annual Computers, Software, and Applications Conference, 2025

2024
VarGAN: Adversarial Learning of Variable Semantic Representations.
IEEE Trans. Software Eng., June, 2024

Project-specific code summarization with in-context learning.
J. Syst. Softw., 2024

Few-shot code translation via task-adapted prompt learning.
J. Syst. Softw., 2024

CodeCipher: Learning to Obfuscate Source Code Against LLMs.
CoRR, 2024

From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging.
CoRR, 2024

Zero-Shot Code Representation Learning via Prompt Tuning.
CoRR, 2024

How Effectively Do Code Language Models Understand Poor-Readability Code?
Proceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering, 2024

Unraveling the Potential of Large Language Models in Code Translation: How Far are We?
Proceedings of the 31st Asia-Pacific Software Engineering Conference, 2024

2023
DuReSE: Rewriting Incomplete Utterances via Neural Sequence Editing.
Neural Process. Lett., December, 2023

Finding the best learning to rank algorithms for effort-aware defect prediction.
Inf. Softw. Technol., May, 2023

Self-Supervised Query Reformulation for Code Search.
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

InfeRE: Step-by-Step Regex Generation via Chain of Inference.
Proceedings of the 38th IEEE/ACM International Conference on Automated Software Engineering, 2023

On the Evaluation of Neural Code Translation: Taxonomy and Benchmark.
Proceedings of the 38th IEEE/ACM International Conference on Automated Software Engineering, 2023

Influential Recommender System.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

2022
Clean and Learn: Improving Robustness to Spurious Solutions in API Question Answering.
Int. J. Softw. Eng. Knowl. Eng., 2022

Cross-Domain Deep Code Search with Few-Shot Meta Learning.
CoRR, 2022

Diet code is healthy: simplifying programs for pre-trained models of code.
Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2022

Answering Software Deployment Questions via Neural Machine Reading at Scale.
Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering, 2022

Self-supervised learning of smart contract representations.
Proceedings of the 30th IEEE/ACM International Conference on Program Comprehension, 2022

Zero-shot program representation learning.
Proceedings of the 30th IEEE/ACM International Conference on Program Comprehension, 2022

Cross-Domain Deep Code Search with Meta Learning.
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022

Continuous Decomposition of Granularity for Neural Paraphrase Generation.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Code Question Answering via Task-Adaptive Sequence-to-Sequence Pre-training.
Proceedings of the 29th Asia-Pacific Software Engineering Conference, 2022

2021
Response Generation with Context-Aware Prompt Learning.
CoRR, 2021

A Multi-Modal Transformer-based Code Summarization Approach for Smart Contracts.
Proceedings of the 29th IEEE/ACM International Conference on Program Comprehension, 2021

Do Bugs Propagate? An Empirical Analysis of Temporal Correlations Among Software Bugs.
Proceedings of the 35th European Conference on Object-Oriented Programming, 2021

DialogBERT: Discourse-Aware Response Generation via Learning to Recover and Rank Utterances.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2019
CodeKernel: A Graph Kernel Based Approach to the Selection of API Usage Examples.
Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering, 2019

DialogWAE: Multimodal Response Generation with Conditional Wasserstein Auto-Encoder.
Proceedings of the 7th International Conference on Learning Representations, 2019

2018
Deep code search.
Proceedings of the 40th International Conference on Software Engineering, 2018

2017
DeepAM: Migrate APIs with Multi-modal Sequence to Sequence Learning.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

2016
Deep API learning.
Proceedings of the 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering, 2016

2015
"What Parts of Your Apps are Loved by Users?" (T).
Proceedings of the 30th IEEE/ACM International Conference on Automated Software Engineering, 2015


  Loading...