Hengyi Cai

ORCID: 0000-0002-7147-5666

According to our database, Hengyi Cai authored at least 51 papers between 2017 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Erratum: Learning Discrete Identifiers and Dense Vectors for Generative Retrieval.
ACM Trans. Inf. Syst., March, 2026

Learning Discrete Identifiers and Dense Vectors for Generative Retrieval.
ACM Trans. Inf. Syst., February, 2026

AgentSkiller: Scaling Generalist Agent Intelligence through Semantically Integrated Cross-Domain Data Synthesis.
CoRR, February, 2026

Not All Preferences Are Created Equal: Stability-Aware and Gradient-Efficient Alignment for Reasoning Models.
CoRR, February, 2026

MatchTIR: Fine-Grained Supervision for Tool-Integrated Reasoning via Bipartite Matching.
CoRR, January, 2026

FlexSpec: Frozen Drafts Meet Evolving Targets in Edge-Cloud Collaborative LLM Speculative Decoding.
CoRR, January, 2026

Probe-and-Fetch: Dynamic KV Cache Pruning for Accelerated Long-Context Inference in Web-Scale AI Search.
Proceedings of the ACM Web Conference 2026, 2026

Retain to Refine: Adaptive Online Question Answering via Query Routing and Long-Short Memory.
Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1, 2026

Beyond Step Pruning: Information Theory Based Step-level Optimization for Self-Refining Large Language Models.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

AdaFuse: Accelerating Dynamic Adapter Inference via Token-Level Pre-Gating and Fused Kernel Optimization.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Efficient Thought Space Exploration Through Strategic Intervention.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

VPN: Visual Prompt Navigation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
AdaSwitch: Adaptive Switching Generation for Knowledge Distillation.
CoRR, October, 2025

Solving the Granularity Mismatch: Hierarchical Preference Learning for Long-Horizon LLM Agents.
CoRR, October, 2025

CurES: From Gradient Analysis to Efficient Curriculum Learning for Reasoning LLMs.
CoRR, October, 2025

Staying in the Sweet Spot: Responsive Reasoning Evolution via Capability-Adaptive Hint Scaffolding.
CoRR, September, 2025

Tool learning with large language models: a survey.
Frontiers Comput. Sci., August, 2025

Towards AI Search Paradigm.
CoRR, June, 2025

From Prompting to Alignment: A Generative Framework for Query Recommendation.
CoRR, April, 2025

PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

MARA: A Multimodal Adaptive Retrieval-Augmented Framework for Document Question Answering.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Multi-Agent Proactive Information Seeking with Adaptive LLM Orchestration for Non-Factoid Question Answering.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

FULTR: A Large-Scale Fusion Learning to Rank Dataset and Its Application for Satisfaction-Oriented Ranking.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

RankExpert: A Mixture of Textual-and-Behavioral Experts for Multi-Objective Learning-to-Rank in Web Search.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Uplift-RAG: Uplift-Driven Knowledge Preference Alignment for Retrieval-Augmented Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

CTR-Guided Generative Query Suggestion in Conversational Search.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Enhancing Retrieval-Augmented Generation via Evidence Tree Search.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Explainability for Large Language Models: A Survey.
ACM Trans. Intell. Syst. Technol., April, 2024

COLT: Towards Completeness-Oriented Tool Retrieval for Large Language Models.
CoRR, 2024

XL²Bench: A Benchmark for Extremely Long Context Understanding with Long-range Dependencies.
CoRR, 2024

Text-Video Retrieval via Variational Multi-Modal Hypergraph Networks.
CoRR, 2024

Text-Video Retrieval via Multi-Modal Hypergraph Networks.
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

Cross-model Control: Improving Multiple Large Language Models in One-time Training.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Towards Verifiable Text Generation with Evolving Memory and Self-Reflection.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Towards Completeness-Oriented Tool Retrieval for Large Language Models.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

2023
Pre-trained Language Model-based Retrieval and Ranking for Web Search.
ACM Trans. Web, February, 2023

Contrastive Learning with Dialogue Attributes for Neural Dialogue Generation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Answering Ambiguous Questions via Iterative Prompting.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Fast Semantic Matching via Flexible Contextualized Interaction.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

Approximated Doubly Robust Search Relevance Estimation.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

2021
Pre-trained Language Model based Ranking in Baidu Search.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

2020
Exemplar Guided Neural Dialogue Generation.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Group-wise Contrastive Learning for Neural Dialogue Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Data Manipulation: Towards Effective Instance Learning for Neural Dialogue Generation via Learning to Augment and Reweight.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Learning from Easy to Complex: Adaptive Multi-Curricula Learning for Neural Dialogue Generation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Low powered blockchain consensus protocols based on consistent hash.
Frontiers Inf. Technol. Electron. Eng., 2019

Adaptive Parameterization for Neural Dialogue Generation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

2018
KNPTC: Knowledge and Neural Machine Translation Powered Chinese Pinyin Typo Correction.
CoRR, 2018

2017
FTGWS: Forming Optimal Tutor Group for Weak Students Discovered in Educational Settings.
Proceedings of the Database and Expert Systems Applications, 2017

