Wenbo Su

Orcid: 0000-0003-3465-8284

According to our database, Wenbo Su authored at least 87 papers between 2014 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

YOCO++: Enhancing YOCO with KV Residual Connections for Efficient LLM Inference.

CoRR, April, 2026

PoC: Performance-oriented Context Compression for Large Language Models via Performance Prediction.

CoRR, March, 2026

Complementary Reinforcement Learning.

CoRR, March, 2026

Expert Divergence Learning for MoE-based Language Models.

CoRR, March, 2026

SpiralFormer: Looped Transformers Can Learn Hierarchical Dependencies via Multi-Resolution Recursion.

CoRR, February, 2026

Dissecting Outlier Dynamics in LLM NVFP4 Pretraining.

CoRR, February, 2026

PretrainRL: Alleviating Factuality Hallucination of Large Language Models at the Beginning.

CoRR, February, 2026

Read As Human: Compressing Context via Parallelizable Close Reading and Skimming.

CoRR, February, 2026

Data Distribution Matters: A Data-Centric Perspective on Context Compression for Large Language Model.

CoRR, February, 2026

CoMeT: Collaborative Memory Transformer for Efficient Long Context Modeling.

CoRR, February, 2026

COMI: Coarse-to-fine Context Compression via Marginal Information Gain.

CoRR, February, 2026

CE-RM: A Pointwise Generative Reward Model Optimized via Two-Stage Rollout and Unified Criteria.

CoRR, January, 2026

ShopSimulator: Evaluating and Exploring RL-Driven LLM Agent for Shopping Assistants.

CoRR, January, 2026

One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling.

CoRR, January, 2026

Logics-STEM: Empowering LLM Reasoning via Failure-Driven Post-Training and Document Knowledge Enhancement.

CoRR, January, 2026

Evolution of Driver Strategies Under Platform-Led Incentives: A Stackelberg-Evolutionary Game Model with Large-Scale Ride-Hailing Data.

Co-authors include Zhengfeng Huang.

Syst., 2026

NEZHA: A Zero-sacrifice and Hyperspeed Decoding Architecture for Generative Recommendations.

Proceedings of the ACM Web Conference 2026, 2026

Unlocking Scaling Law in Industrial Recommendation Systems with a Three-step Paradigm based Large User Model.

Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining, 2026

RollPacker: Taming Long-Tail Rollouts for RL Post-Training with Tail Batching.

Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation, 2026

Think-J: Learning to Think for Generative LLM-as-a-Judge.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

RollArt: Scaling Agentic RL Training via Disaggregated Infrastructure.

CoRR, December, 2025

Reasoning Palette: Modulating Reasoning via Latent Contextualization for Controllable Exploration for (V)LMs.

Co-authors include Tianqianjin Lin.

CoRR, December, 2025

Reconstructing KV Caches with Cross-layer Fusion For Enhanced Transformers.

CoRR, December, 2025

BARD: budget-aware reasoning distillation.

CoRR, November, 2025

RAVR: Reference-Answer-guided Variational Reasoning for Large Language Models.

Co-authors include Tianqianjin Lin.

CoRR, October, 2025

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization.

CoRR, October, 2025

Part II: ROLL Flash - Accelerating RLVR and Agentic Training with Asynchrony.

CoRR, October, 2025

QAgent: A modular Search Agent with Interactive Query Understanding.

CoRR, October, 2025

MeSH: Memory-as-State-Highways for Recursive Transformers.

CoRR, October, 2025

Asymmetric Proximal Policy Optimization: mini-critics boost LLM reasoning.

Co-authors include Johan S. Obando-Ceron, Pablo Samuel Castro, and Aaron C. Courville.

CoRR, October, 2025

RollPacker: Mitigating Long-Tail Rollouts for Fast, Synchronous RL Post-Training.

CoRR, September, 2025

DESIGNER: Design-Logic-Guided Multidisciplinary Data Synthesis for LLM Reasoning.

CoRR, August, 2025

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning.

CoRR, August, 2025

RecGPT Technical Report.

CoRR, July, 2025

Unified Linear Parametric Map Modeling and Perception-aware Trajectory Planning for Mobile Robotics.

CoRR, July, 2025

Comparative Analysis and Optimization of Magnetic Field Energy Harvesters Based on Split Three-Phase Power Line Joint Energy Harvesting.

IEEE Trans. Ind. Informatics, June, 2025

Multi-task Offline Reinforcement Learning for Online Advertising in Recommender Systems.

Co-authors include Zhao-Xiang Zhang.

CoRR, June, 2025

Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library.

CoRR, June, 2025

USB: A Comprehensive and Unified Safety Evaluation Benchmark for Multimodal Large Language Models.

Co-authors include Hongqiong Zhong.

CoRR, May, 2025

Weight Spectra Induced Efficient Model Adaptation.

CoRR, May, 2025

Beyond Safe Answers: A Benchmark for Evaluating True Risk Awareness in Large Reasoning Models.

CoRR, May, 2025

NAN: A Training-Free Solution to Coefficient Estimation in Model Merging.

CoRR, May, 2025

Think-J: Learning to Think for Generative LLM-as-a-Judge.

CoRR, May, 2025

A Comprehensive Survey on Long Context Language Modeling.

Co-authors include Wangchunshu Zhou and Zhaoxiang Zhang.

CoRR, March, 2025

Deconstructing Long Chain-of-Thought: A Structured Reasoning Optimization Framework for Long CoT Distillation.

CoRR, March, 2025

ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph.

Co-authors include Zhao-Xiang Zhang.

CoRR, March, 2025

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Co-authors include Zhaoxiang Zhang.

CoRR, February, 2025

AIR: Complex Instruction Generation via Automatic Iterative Refinement.

CoRR, February, 2025

ChineseSimpleVQA - "See the World, Discover Knowledge": A Chinese Factuality Evaluation for Large Vision Language Models.

CoRR, February, 2025

Equilibrate RLHF: Towards Balancing Helpfulness-Safety Trade-off in Large Language Models.

CoRR, February, 2025

MIM: Multi-modal Content Interest Modeling Paradigm for User Behavior Modeling.

CoRR, February, 2025

2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Multi-task Offline Reinforcement Learning for Online Advertising in Recommender Systems.

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

UQABench: Evaluating User Embedding for Prompting LLMs in Personalized Question Answering.

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

ChineseEcomQA: A Scalable E-commerce Concept Evaluation Benchmark for Large Language Models.

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models.

Co-authors include Zhaoxiang Zhang.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

How to inject knowledge efficiently? Knowledge Infusion Scaling Law for Pre-training Large Language Models.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

AIR: Complex Instruction Generation via Automatic Iterative Refinement.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph.

Proceedings of the 34th ACM International Conference on Information and Knowledge Management, 2025

Chinese SafetyQA: A Safety Short-form Factuality Benchmark for Large Language Models.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

ProgCo: Program Helps Self-Correction of Large Language Models.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2025

M2RC-EVAL: Massively Multilingual Repository-level Code Completion Evaluation.

Co-authors include Zekun Moore Wang and Zhaoxiang Zhang.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Co-authors include Zhaoxiang Zhang.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

See the World, Discover Knowledge: A Chinese Factuality Evaluation for Large Vision Language Models.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

Chinese SafetyQA: A Safety Short-form Factuality Benchmark for Large Language Models.

CoRR, 2024

WiS Platform: Enhancing Evaluation of LLM-Based Multi-Agent Systems Through Game-Based Analysis.

CoRR, 2024

Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models.

CoRR, 2024

Adaptive Dense Reward: Understanding the Gap Between Action and Reward Space in Alignment.

CoRR, 2024

M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation.

CoRR, 2024

MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models.

Co-authors include Zhaoxiang Zhang.

CoRR, 2024

R2C2-Coder: Enhancing and Benchmarking Real-world Repository-level Code Completion Abilities of Code Large Language Models.

CoRR, 2024

E^2-LLM: Efficient and Extreme Length Extension of Large Language Models.

CoRR, 2024

D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

DDK: Distilling Domain Knowledge for Efficient Large Language Models.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

E2-LLM: Efficient and Extreme Length Extension of Large Language Models.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Sensitivity Analysis of Free-Standing Columnar Magnetic Field Energy Harvester for Powering Wireless Monitoring Sensors.

IEEE Trans. Circuits Syst. I Regul. Pap., November, 2023

Antisaturation and Power Decoupling Control of Multiwinding Energy Harvester Based on Magnetomotive Force Compensation.

IEEE Trans. Ind. Informatics, October, 2023

2022

GBA: A Tuning-free Approach to Switch between Synchronous and Asynchronous Training for Recommendation Model.

CoRR, 2022

GBA: A Tuning-free Approach to Switch between Synchronous and Asynchronous Training for Recommendation Models.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2018

Visualizing and Understanding Deep Neural Networks in CTR Prediction.

Proceedings of the SIGIR 2018 Workshop On eCommerce co-located with the 41st International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018), 2018

2015

SLA-Aware Tenant Placement and Dynamic Resource Provision in SaaS.

Co-authors include Sherman X. Shen.

Proceedings of the 2015 IEEE International Conference on Web Services, 2015

Modeling and Analysis of Availability in Multi-Tenant SaaS.

Co-authors include Sherman X. Shen.

Proceedings of the 24th International Conference on Computer Communication and Networks, 2015

2014

Modeling and Analysis of Availability for SaaS Multi-tenant Architecture.

Proceedings of the 8th IEEE International Symposium on Service Oriented System Engineering, 2014
