Yongbin Li

Orcid: 0000-0002-0686-0883

Affiliations:
  • Capital Normal University, College of Life Sciences, Beijing, China
  • Alibaba Group, Tongyi Lab, China


According to our database, Yongbin Li authored at least 163 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Think Anywhere in Code Generation.
CoRR, March, 2026

P-GenRM: Personalized Generative Reward Model with Test-time User-based Scaling.
CoRR, February, 2026

Beyond Quantity: Trajectory Diversity Scaling for Code Agents.
CoRR, February, 2026

ExpSeek: Self-Triggered Experience Seeking for Web Agents.
CoRR, January, 2026

Controlling Multimodal Conversational Agents with Coverage-Enhanced Latent Actions.
CoRR, January, 2026

Selective Weak-to-Strong Generalization.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Large Language Model Unlearning for Source Code.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Understanding Generalization in Role-Playing Models via Information Theory.
CoRR, December, 2025

MOA: Multi-Objective Alignment for Role-Playing Agents.
CoRR, December, 2025

Mapping-by-Sequencing with MiModD: A Teaching Dataset Based on C. elegans N2 WS235.
Dataset, November, 2025

Empowering RepoQA-Agent based on Reinforcement Learning Driven by Monte-carlo Tree Search.
CoRR, October, 2025

CodeRL+: Improving Code Generation via Reinforcement with Execution Semantics Alignment.
CoRR, October, 2025

InspectCoder: Dynamic Analysis-Enabled Self Repair through interactive LLM-Debugger Collaboration.
CoRR, October, 2025

Saber: An Efficient Sampling with Adaptive Acceleration and Backtracking Enhanced Remasking for Diffusion Language Model.
CoRR, October, 2025

Agentic Reinforcement Learning with Implicit Step Rewards.
CoRR, September, 2025

lybCNU/xol1RI: Datasets for Aberrant X chromosome dosage compensation causes hybrid male inviability in Caenorhabditis.
Dataset, September, 2025

CPO: Addressing Reward Ambiguity in Role-playing Dialogue via Comparative Policy Optimization.
CoRR, August, 2025

RL-PLUS: Countering Capability Boundary Collapse of LLMs in Reinforcement Learning with Hybrid-policy Optimization.
CoRR, August, 2025

Format-Adapter: Improving Reasoning Capability of LLMs by Adapting Suitable Format.
CoRR, June, 2025

MeRF: Motivation-enhanced Reinforcement Finetuning for Large Reasoning Models.
CoRR, June, 2025

Large Language Model Unlearning for Source Code.
CoRR, June, 2025

TimeHC-RL: Temporal-aware Hierarchical Cognitive Reinforcement Learning for Enhancing LLMs' Social Intelligence.
CoRR, May, 2025

ChARM: Character-based Act-adaptive Reward Modeling for Advanced Role-Playing Language Agents.
CoRR, May, 2025

Socratic-PRMBench: Benchmarking Process Reward Models with Systematic Reasoning Patterns.
CoRR, May, 2025

Adaptive Thinking via Mode Policy Optimization for Social Language Agents.
CoRR, May, 2025

Thinking Longer, Not Larger: Enhancing Software Engineering Agents via Scaling Test-Time Compute.
CoRR, March, 2025

A Survey of Direct Preference Optimization.
CoRR, March, 2025

Sequence for C. briggsae alleles, C. nigoni alleles, related vectors and a C. tribulationis scaffold.
Dataset, March, 2025

EPO: Explicit Policy Optimization for Strategic Reasoning in LLMs via Reinforcement Learning.
CoRR, February, 2025

OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis.
CoRR, January, 2025

Diverse AI Feedback For Large Language Model Alignment.
Trans. Assoc. Comput. Linguistics, 2025

SWE-GPT: A Process-Centric Language Model for Automated Software Improvement.
Proc. ACM Softw. Eng., 2025

Alibaba LingmaAgent: Improving Automated Issue Resolution via Comprehensive Repository Exploration.
Proceedings of the 33rd ACM International Conference on the Foundations of Software Engineering, 2025

Transferable Post-training via Inverse Value Learning.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Do Code LLMs Understand Design Patterns?
Proceedings of the IEEE/ACM International Workshop on Large Language Models for Code, 2025

Thinking Longer, Not Larger: Enhancing Software Engineering Agents via Scaling Test-Time Compute.
Proceedings of the 40th IEEE/ACM International Conference on Automated Software Engineering, 2025

On the Role of Attention Heads in Large Language Model Safety.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

EIFBENCH: Extremely Complex Instruction Following Benchmark for Large Language Models.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

CPO: Addressing Reward Ambiguity in Role-playing Dialogue via Comparative Policy Optimization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Supervised Optimism Correction: Be Confident When LLMs Are Sure.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

OmniCharacter: Towards Immersive Role-Playing Agents with Seamless Speech-Language Personality Interaction.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

ExploraCoder: Advancing Code Generation for Multiple Unseen APIs via Planning and Chained Exploration.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

EPO: Explicit Policy Optimization for Strategic Reasoning in LLMs via Reinforcement Learning.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

SDPO: Segment-Level Direct Preference Optimization for Social Agents.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Reverse Preference Optimization for Complex Instruction Following.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Debate Helps Weak-to-Strong Generalization.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Sequence for updated Cbr-bnsp-2 gene, C. briggsae alleles, C. nigoni alleles, related vectors and a C. tribulationis scaffold.
Dataset, February, 2024

A Survey on Out-of-Distribution Detection in NLP.
Trans. Mach. Learn. Res., 2024

Unifying Structured Data as Graph for Data-to-Text Pre-Training.
Trans. Assoc. Comput. Linguistics, 2024

LLMs as Continuous Learners: Improving the Reproduction of Defective Code in Software Issues.
CoRR, 2024

Lingma SWE-GPT: An Open Development-Process-Centric Language Model for Automated Software Improvement.
CoRR, 2024

EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific Evaluations.
CoRR, 2024

On the Role of Attention Heads in Large Language Model Safety.
CoRR, 2024

In-Context Transfer Learning: Demonstration Synthesis by Transferring Similar Tasks.
CoRR, 2024

Codev-Bench: How Do LLMs Understand Developer-Centric Code Completion?
CoRR, 2024

The Imperative of Conversation Analysis in the Era of LLMs: A Survey of Tasks, Techniques, and Trends.
CoRR, 2024

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct.
CoRR, 2024

Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement.
CoRR, 2024

Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA.
CoRR, 2024

How to Understand Whole Software Repository?
CoRR, 2024

DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories.
CoRR, 2024

A Survey on Self-Evolution of Large Language Models.
CoRR, 2024

Self-Retrieval: Building an Information Retrieval System with One Large Language Model.
CoRR, 2024

Automated segmentation and recognition of C. elegans whole-body cells.
Bioinform., 2024

Self-Retrieval: End-to-End Information Retrieval with One Large Language Model.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific Evaluations.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Fine-Tuning Language Models with Reward Learning on Policy.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

DialCLIP: Empowering Clip As Multi-Modal Dialog Retriever.
Proceedings of the IEEE International Conference on Acoustics, 2024

How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden States.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Improving Factual Consistency of News Summarization by Contrastive Preference Optimization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Tree-Instruct: A Preliminary Study of the Intrinsic Relationship between Complexity and Alignment.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Out-of-Domain Intent Detection Considering Multi-Turn Dialogue Contexts.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Self-Explanation Prompting Improves Dialogue Understanding in Large Language Models.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Iterative Forward Tuning Boosts In-Context Learning in Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

SoFA: Shielded On-the-fly Alignment via Priority Rule Following.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

One-Shot Learning as Instruction Data Prospector for Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Fortify the Shortest Stave in Attention: Enhancing Context Awareness of Large Language Models for Effective Tool Use.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Preference Ranking Optimization for Human Alignment.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Schema dependency-enhanced curriculum pre-training for table semantic parsing.
Knowl. Based Syst., February, 2023

Image stacks for full-body transcription factor expression atlas with completely resolved cell identities in C. elegans.
Dataset, February, 2023

C. elegans image stacks at L1 stage with completely resolved cell identities.
Dataset, February, 2023

One Shot Learning as Instruction Data Prospector for Large Language Models.
CoRR, 2023

Improving Factual Consistency of Text Summarization by Adversarially Decoupling Comprehension and Embellishment Abilities of LLMs.
CoRR, 2023

Constructive Large Language Models Alignment with Diverse Feedback.
CoRR, 2023

Self-Explanation Prompting Improves Dialogue Understanding in Large Language Models.
CoRR, 2023

VDialogUE: A Unified Evaluation Benchmark for Visually-grounded Dialogue.
CoRR, 2023

A Preliminary Study of the Intrinsic Relationship between Complexity and Alignment.
CoRR, 2023

Wider and Deeper LLM Networks are Fairer LLM Evaluators.
CoRR, 2023

SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue in Multiple Domains.
CoRR, 2023

Iterative Forward Tuning Boosts In-context Learning in Language Models.
CoRR, 2023

Speech-Text Dialog Pre-training for Spoken Dialog Understanding with Explicit Cross-Modal Alignment.
CoRR, 2023

Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs.
CoRR, 2023

Unsupervised Dialogue Topic Segmentation with Topic-aware Utterance Representation.
CoRR, 2023

API-Bank: A Benchmark for Tool-Augmented LLMs.
CoRR, 2023

Large Language Models are Versatile Decomposers: Decompose Evidence and Questions for Table-based Reasoning.
CoRR, 2023

Large Language Models are Versatile Decomposers: Decomposing Evidence and Questions for Table-based Reasoning.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Unsupervised Dialogue Topic Segmentation with Topic-aware Contrastive Learning.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

U-NEED: A Fine-grained Dataset for User Needs-Centric E-commerce Conversational Recommendation.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue Agents.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Coarse-To-Fine Knowledge Selection for Document Grounded Dialogs.
Proceedings of the IEEE International Conference on Acoustics, 2023

Causal Document-Grounded Dialogue Pre-training.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Improving Question Generation with Multi-level Content Planning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Diversify Question Generation with Retrieval-Augmented Style Transfer.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Exploring Large Language Models for Multi-Modal Out-of-Distribution Detection.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Conversational Recommender System and Large Language Model Are Made for Each Other in E-commerce Pre-sales Dialogue.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Speech-Text Pre-training for Spoken Dialog Understanding with Explicit Cross-Modal Alignment.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Multimodal Recommendation Dialog with Subjective Preference: A New Challenge and Benchmark.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

CATS: A Pragmatic Chinese Answer-to-Sequence Dataset with Large Scale and High Quality.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Long-Tailed Question Answering in an Open World.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Domain Incremental Lifelong Learning in an Open World.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Universal Information Extraction with Meta-Pretrained Self-Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Unified Language Representation for Question Answering over Text, Tables, and Images.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

SPRING: Situated Conversation Agent Pretrained with Multimodal Questions from Incremental Layout Graph.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Graphix-T5: Mixing Pre-trained Transformers with Graph-Aware Layers for Text-to-SQL Parsing.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Towards Generalized Open Information Extraction.
CoRR, 2022

Semi-Supervised Lifelong Language Learning.
CoRR, 2022

Prompt Conditioned VAE: Enhancing Generative Replay for Lifelong Learning in Task-Oriented Dialogue.
CoRR, 2022

SPACE-3: Unified Dialog Model Pre-training for Task-Oriented Dialog Understanding and Generation.
CoRR, 2022

A Survey on Text-to-SQL Parsing: Concepts, Methods, and Future Directions.
CoRR, 2022

Proton: Probing Schema Linking Information from Pre-trained Language Models for Text-to-SQL Parsing.
CoRR, 2022

Duplex Conversation: Towards Human-like Interaction in Spoken Dialogue System.
CoRR, 2022

S²SQL: Injecting Syntax to Question-Schema Interaction Graph Encoder for Text-to-SQL Parsers.
CoRR, 2022

Unified Dialog Model Pre-training for Task-Oriented Dialog Understanding and Generation.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Layout-Aware Information Extraction for Document-Grounded Dialogue: Dataset, Method and Demonstration.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Proton: Probing Schema Linking Information from Pre-trained Language Models for Text-to-SQL Parsing.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Duplex Conversation: Towards Human-like Interaction in Spoken Dialogue Systems.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Semi-Supervised Lifelong Language Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Dial2vec: Self-Guided Contrastive Learning of Unsupervised Dialogue Embeddings.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Estimating Soft Labels for Out-of-Domain Intent Detection.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Towards Generalizable and Robust Text-to-SQL Parsing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Doc2Bot: Accessing Heterogeneous Documents via Conversational Bots.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

CGoDial: A Large-Scale Benchmark for Chinese Goal-oriented Dialog Evaluation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

STAR: SQL Guided Pre-Training for Context-dependent Text-to-SQL Parsing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

SUN: Exploring Intrinsic Uncertainties in Text-to-SQL Parsers.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

SPACE-2: Tree-Structured Semi-Supervised Contrastive Pre-training for Task-Oriented Dialog Understanding.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

A Slot Is Not Built in One Utterance: Spoken Language Dialogs with Sub-Slots.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

S²SQL: Injecting Syntax to Question-Schema Interaction Graph Encoder for Text-to-SQL Parsers.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-supervised Learning and Explicit Policy Injection.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
SDCUP: Schema Dependency-Enhanced Curriculum Pre-Training for Table Semantic Parsing.
CoRR, 2021

Preview, Attend and Review: Schema-Aware Curriculum Learning for Multi-Domain Dialog State Tracking.
CoRR, 2021

Improving Text-to-SQL with Schema Dependency Learning.
CoRR, 2021

Dynamic Hybrid Relation Network for Cross-Domain Context-Dependent Semantic Parsing.
CoRR, 2021

Preview, Attend and Review: Schema-Aware Curriculum Learning for Multi-Domain Dialogue State Tracking.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Dynamic Hybrid Relation Exploration Network for Cross-Domain Context-Dependent Semantic Parsing.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
A Survey on Dialog Management: Recent Advances and Challenges.
CoRR, 2020

Dynamic Memory Induction Networks for Few-Shot Text Classification.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Learning Low-Resource End-To-End Goal-Oriented Dialog for Fast and Reliable System Deployment.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Few-Shot Text Classification with Induction Network.
CoRR, 2019

Induction Networks for Few-Shot Text Classification.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

