Bo Zheng

Orcid: 0000-0002-4037-6315

Affiliations:
  • Alibaba Group, Beijing, China


According to our database, Bo Zheng authored at least 317 papers between 2005 and 2026.

Bibliography

2026
DDA-Thinker: Decoupled Dual-Atomic Reinforcement Learning for Reasoning-Driven Image Editing.
CoRR, April, 2026

All Languages Matter: Understanding and Mitigating Language Bias in Multilingual RAG.
CoRR, April, 2026

Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items.
CoRR, April, 2026

LoopCTR: Unlocking the Loop Scaling Power for Click-Through Rate Prediction.
CoRR, April, 2026

YOCO++: Enhancing YOCO with KV Residual Connections for Efficient LLM Inference.
CoRR, April, 2026

Geoparsing: Diagram Parsing for Plane and Solid Geometry with a Unified Formal Language.
CoRR, April, 2026

AdaSpark: Adaptive Sparsity for Efficient Long-Video Understanding.
CoRR, April, 2026

MOON3.0: Reasoning-aware Multimodal Representation Learning for E-commerce Product Understanding.
CoRR, April, 2026

AdaGen: Learning Adaptive Policy for Image Synthesis.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2026

PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference.
CoRR, March, 2026

GIFT: Global Irreplaceability Frame Targeting for Efficient Video Understanding.
CoRR, March, 2026

UniScale: Synergistic Entire Space Data and Model Scaling for Search Ranking.
CoRR, March, 2026

SpatialReward: Verifiable Spatial Reward Modeling for Fine-Grained Spatial Consistency in Text-to-Image Generation.
CoRR, March, 2026

PoC: Performance-oriented Context Compression for Large Language Models via Performance Prediction.
CoRR, March, 2026

Complementary Reinforcement Learning.
CoRR, March, 2026

ICPRL: Acquiring Physical Intuition from Interactive Control.
CoRR, March, 2026

SecAgent: Efficient Mobile GUI Agent with Semantic Context.
CoRR, March, 2026

MAC: A Conversion Rate Prediction Benchmark Featuring Labels Under Multiple Attribution Mechanisms.
CoRR, March, 2026

Cross-modal Identity Mapping: Minimizing Information Loss in Modality Conversion via Reinforcement Learning.
CoRR, March, 2026

Expert Divergence Learning for MoE-based Language Models.
CoRR, March, 2026

Pailitao-VL: Unified Embedding and Reranker for Real-Time Multi-Modal Industrial Search.
CoRR, February, 2026

SpiralFormer: Looped Transformers Can Learn Hierarchical Dependencies via Multi-Resolution Recursion.
CoRR, February, 2026

Learning from the Irrecoverable: Error-Localized Policy Optimization for Tool-Integrated LLM Reasoning.
CoRR, February, 2026

E-VAds: An E-commerce Short Videos Understanding Benchmark for MLLMs.
CoRR, February, 2026

PretrainRL: Alleviating Factuality Hallucination of Large Language Models at the Beginning.
CoRR, February, 2026

Read As Human: Compressing Context via Parallelizable Close Reading and Skimming.
CoRR, February, 2026

Data Distribution Matters: A Data-Centric Perspective on Context Compression for Large Language Model.
CoRR, February, 2026

CoMeT: Collaborative Memory Transformer for Efficient Long Context Modeling.
CoRR, February, 2026

COMI: Coarse-to-fine Context Compression via Marginal Information Gain.
CoRR, February, 2026

CE-RM: A Pointwise Generative Reward Model Optimized via Two-Stage Rollout and Unified Criteria.
CoRR, January, 2026

ShopSimulator: Evaluating and Exploring RL-Driven LLM Agent for Shopping Assistants.
CoRR, January, 2026

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models.
CoRR, January, 2026

Parallel Latent Reasoning for Sequential Recommendation.
CoRR, January, 2026

Unified Thinker: A General Reasoning Modular Core for Image Generation.
CoRR, January, 2026

One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling.
CoRR, January, 2026

ThinkRec: Thinking-based recommendation via LLM.
Proceedings of the ACM Web Conference 2026, 2026

Modeling Cascaded Delay Feedback for Online Net Conversion Rate Prediction: Benchmark, Insights and Solutions.
Proceedings of the ACM Web Conference 2026, 2026

Multi-Behavior Sequential Modeling with Transition-Aware Graph Attention Network for E-Commerce Recommendation.
Proceedings of the ACM Web Conference 2026, 2026

PolicySim: An LLM-Based Agent Social Simulation Sandbox for Proactive Policy Optimization.
Proceedings of the ACM Web Conference 2026, 2026

MOON: Generative MLLM-based Multimodal Representation Learning for E-commerce Product Understanding.
Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining, 2026

Unlocking Scaling Law in Industrial Recommendation Systems with a Three-step Paradigm based Large User Model.
Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining, 2026

RollPacker: Taming Long-Tail Rollouts for RL Post-Training with Tail Batching.
Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation, 2026

VALUE: Value-Aware Large Language Model for Query Rewriting via Weighted Trie in Sponsored Search.
Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1, 2026

LoFT-LLM: Low-Frequency Time-series Forecasting with Large Language Models.
Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1, 2026

IMPACTNet: Unifying Auto-bidding in End-to-End Merged Auctions.
Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1, 2026

Navigating the Infinite Dynamic Web Space: Effective In-Context Exploration via Cognitive Multi-Agent Collaboration.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

DeepPhy: Benchmarking Agentic VLMs on Physical Reasoning.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Contribution-aware Token Compression for Efficient Video Understanding via Reinforcement Learning.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

MIRAGE: Towards AI-Generated Image Detection in the Wild.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Trimming the Fat: Redundancy-Aware Acceleration Framework for DGNNs.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
RAIR: A Rule-Aware Benchmark Uniting Challenging Long-Tail and Visual Salience Subset for E-commerce Relevance Assessment.
CoRR, December, 2025

RollArt: Scaling Agentic RL Training via Disaggregated Infrastructure.
CoRR, December, 2025

AndroidLens: Long-latency Evaluation with Nested Sub-targets for Android GUI Agents.
CoRR, December, 2025

Reasoning Palette: Modulating Reasoning via Latent Contextualization for Controllable Exploration for (V)LMs.
CoRR, December, 2025

QuadSentinel: Sequent Safety for Machine-Checkable Control in Multi-agent Systems.
CoRR, December, 2025

LLM-Auction: Generative Auction towards LLM-Native Advertising.
CoRR, December, 2025

MUSE: A Simple Yet Effective Multimodal Search-Based Framework for Lifelong User Interest Modeling.
CoRR, December, 2025

Beyond Existing Retrievals: Cross-Scenario Incremental Sample Learning Framework.
CoRR, December, 2025

Reconstructing KV Caches with Cross-layer Fusion For Enhanced Transformers.
CoRR, December, 2025

LORE: A Large Generative Model for Search Relevance.
CoRR, December, 2025

ViT^3: Unlocking Test-Time Training in Vision.
CoRR, December, 2025

Thinking with Drafts: Speculative Temporal Reasoning for Efficient Long Video Understanding.
CoRR, December, 2025

AI Deception: Risks, Dynamics, and Controls.
CoRR, November, 2025

Enhancing Sequential Recommendation with World Knowledge from Large Language Models.
CoRR, November, 2025

ConceptGuard: Proactive Safety in Text-and-Image-to-Video Generation through Multimodal Risk Detection.
CoRR, November, 2025

From Code Foundation Models to Agents and Applications: A Comprehensive Survey and Practical Guide to Code Intelligence.
CoRR, November, 2025

AIF: Asynchronous Inference Framework for Cost-Effective Pre-Ranking.
CoRR, November, 2025

MOON2.0: Dynamic Modality-balanced Multimodal Representation Learning for E-commerce Product Understanding.
CoRR, November, 2025

From Scaling to Structured Expressivity: Rethinking Transformers for CTR Prediction.
CoRR, November, 2025

MOON Embedding: Multimodal Representation Learning for E-commerce Search Advertising.
CoRR, November, 2025

BARD: budget-aware reasoning distillation.
CoRR, November, 2025

Unbiased Platform-Level Causal Estimation for Search Systems: A Competitive Isolation PSM-DID Framework.
CoRR, November, 2025

RAVR: Reference-Answer-guided Variational Reasoning for Large Language Models.
CoRR, October, 2025

Bid2X: Revealing Dynamics of Bidding Environment in Online Advertising from A Foundation Model Lens.
CoRR, October, 2025

Think before Recommendation: Autonomous Reasoning-enhanced Recommender.
CoRR, October, 2025

REVISION: Reflective Intent Mining and Online Reasoning Auxiliary for E-commerce Visual Search System Optimization.
CoRR, October, 2025

CausalRec: A CausalBoost Attention Model for Sequential Recommendation.
CoRR, October, 2025

Identity-Preserving Image-to-Video Generation via Reward-Guided Optimization.
CoRR, October, 2025

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization.
CoRR, October, 2025

Reinforced Preference Optimization for Recommendation.
CoRR, October, 2025

Part II: ROLL Flash - Accelerating RLVR and Agentic Training with Asynchrony.
CoRR, October, 2025

QAgent: A modular Search Agent with Interactive Query Understanding.
CoRR, October, 2025

A Unified Multi-Task Learning Framework for Generative Auto-Bidding with Validation-Aligned Optimization.
CoRR, October, 2025

MeSH: Memory-as-State-Highways for Recursive Transformers.
CoRR, October, 2025

Asymmetric Proximal Policy Optimization: mini-critics boost LLM reasoning.
CoRR, October, 2025

Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs.
CoRR, October, 2025

HiDe: Rethinking The Zoom-IN method in High Resolution MLLMs via Hierarchical Decoupling.
CoRR, October, 2025

Leveraging Scene Context with Dual Networks for Sequential User Behavior Modeling.
CoRR, September, 2025

Discovering "Words" in Music: Unsupervised Learning of Compositional Sparse Code for Symbolic Music.
CoRR, September, 2025

GSID: Generative Semantic Indexing for E-Commerce Product Understanding.
CoRR, September, 2025

ReWatch-R1: Boosting Complex Video Reasoning in Large Vision-Language Models through Agentic Data Synthesis.
CoRR, September, 2025

Mixture-of-Visual-Thoughts: Exploring Context-Adaptive Reasoning Mode Selection for General Visual Reasoning.
CoRR, September, 2025

Interactive Recommendation Agent with Active User Commands.
CoRR, September, 2025

RollPacker: Mitigating Long-Tail Rollouts for Fast, Synchronous RL Post-Training.
CoRR, September, 2025

FORGE: Forming Semantic Identifiers for Generative Retrieval in Industrial Datasets.
CoRR, September, 2025

RecIS: Sparse to Dense, A Unified Training Framework for Recommendation Models.
CoRR, September, 2025

Equip Pre-ranking with Target Attention by Residual Quantization.
CoRR, September, 2025

Enhancing Generative Auto-bidding with Offline Reward Evaluation and Policy Search.
CoRR, September, 2025

VStyle: A Benchmark for Voice Style Adaptation with Spoken Instructions.
CoRR, September, 2025

InquireMobile: Teaching VLM-based Mobile Agent to Request Human Assistance via Reinforcement Fine-Tuning.
CoRR, August, 2025

Visual-CoG: Stage-Aware Reinforcement Learning with Chain of Guidance for Text-to-Image Generation.
CoRR, August, 2025

MIRAGE: Towards AI-Generated Image Detection in the Wild.
CoRR, August, 2025

DESIGNER: Design-Logic-Guided Multidisciplinary Data Synthesis for LLM Reasoning.
CoRR, August, 2025

Creative4U: MLLMs-based Advertising Creative Image Selector with Comparative Reasoning.
CoRR, August, 2025

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning.
CoRR, August, 2025

Bidding-Aware Retrieval for Multi-Stage Consistency in Online Advertising.
CoRR, August, 2025

RecGPT Technical Report.
CoRR, July, 2025

CTR-Driven Ad Text Generation via Online Feedback Preference Optimization.
CoRR, July, 2025

Multi-task Offline Reinforcement Learning for Online Advertising in Recommender Systems.
CoRR, June, 2025

Mobile-R1: Towards Interactive Reinforcement Learning for VLM-Based Mobile Agent via Task-Level Rewards.
CoRR, June, 2025

MSR-Align: Policy-Grounded Multimodal Alignment for Safety-Aware Reasoning in Vision-Language Models.
CoRR, June, 2025

GFlowGR: Fine-tuning Generative Recommendation Frameworks with Generative Flow Networks.
CoRR, June, 2025

Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library.
CoRR, June, 2025

Do not Abstain! Identify and Solve the Uncertainty.
CoRR, June, 2025

USB: A Comprehensive and Unified Safety Evaluation Benchmark for Multimodal Large Language Models.
CoRR, May, 2025

Weight Spectra Induced Efficient Model Adaptation.
CoRR, May, 2025

Beyond Safe Answers: A Benchmark for Evaluating True Risk Awareness in Large Reasoning Models.
CoRR, May, 2025

NAN: A Training-Free Solution to Coefficient Estimation in Model Merging.
CoRR, May, 2025

Think-J: Learning to Think for Generative LLM-as-a-Judge.
CoRR, May, 2025

TranSUN: A Preemptive Paradigm to Eradicate Retransformation Bias Intrinsically from Regression Models in Recommender Systems.
CoRR, May, 2025

Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free.
CoRR, May, 2025

Video Echoed in Harmony: Learning and Sampling Video-Integrated Chord Progression Sequences for Controllable Video Background Music Generation.
IEEE Trans. Comput. Soc. Syst., April, 2025

FiLA-Video: Spatio-Temporal Compression for Fine-Grained Long Video Understanding.
CoRR, April, 2025

GeoSense: Evaluating Identification and Application of Geometric Principles in Multimodal Reasoning.
CoRR, April, 2025

DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging.
CoRR, April, 2025

MMKB-RAG: A Multi-Modal Knowledge-Based Retrieval-Augmented Generation Framework.
CoRR, April, 2025

VALUE: Value-Aware Large Language Model for Query Rewriting via Weighted Trie in Sponsored Search.
CoRR, April, 2025

A Comprehensive Survey on Long Context Language Modeling.
CoRR, March, 2025

Deconstructing Long Chain-of-Thought: A Structured Reasoning Optimization Framework for Long CoT Distillation.
CoRR, March, 2025

ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph.
CoRR, March, 2025

Nash Equilibrium Constrained Auto-bidding With Bi-level Reinforcement Learning.
CoRR, March, 2025

CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games.
CoRR, March, 2025

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?
CoRR, February, 2025

AIR: Complex Instruction Generation via Automatic Iterative Refinement.
CoRR, February, 2025

HiddenDetect: Detecting Jailbreak Attacks against Large Vision-Language Models via Monitoring Hidden States.
CoRR, February, 2025

ChineseSimpleVQA - "See the World, Discover Knowledge": A Chinese Factuality Evaluation for Large Vision Language Models.
CoRR, February, 2025

Equilibrate RLHF: Towards Balancing Helpfulness-Safety Trade-off in Large Language Models.
CoRR, February, 2025

Large Language Models Are Universal Recommendation Learners.
CoRR, February, 2025

MIM: Multi-modal Content Interest Modeling Paradigm for User Behavior Modeling.
CoRR, February, 2025

PAID: A Framework of Product-Centric Advertising Image Design.
CoRR, January, 2025

Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration.
CoRR, January, 2025

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings.
CoRR, January, 2025

Explainable LLM-driven Multi-dimensional Distillation for E-Commerce Relevance Learning.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2025, 2025

Computational Advertising: Recent Advances.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2025, 2025

DMLoRA: Dynamic Multi-Subspace Low-Rank Adaptation.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2025, 2025

DAGPrompT: Pushing the Limits of Graph Prompting with a Distribution-aware Graph Prompt Tuning Approach.
Proceedings of the ACM on Web Conference 2025, 2025

Learning against Non-credible Second-Price Auctions.
Proceedings of the ACM on Web Conference 2025, 2025

Gradient Deconfliction via Orthogonal Projections onto Subspaces For Multi-task Learning.
Proceedings of the Eighteenth ACM International Conference on Web Search and Data Mining, 2025

POGS: Position-Optimized Gaussian Splatting for Reconstruction in Unevenly Illuminated Scenarios.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2025

A Generative Re-ranking Model for List-level Multi-objective Optimization at Taobao.
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

User Long-Term Multi-Interest Retrieval Model for Recommendation.
Proceedings of the Nineteenth ACM Conference on Recommender Systems, 2025

USD: A User-Intent-Driven Sampling and Dual-Debiasing Framework for Large-Scale Homepage Recommendations.
Proceedings of the Nineteenth ACM Conference on Recommender Systems, 2025

2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

DEPO: Enhancing E-commerce Image Background Generation with Short Trajectory Direct Expected Preference Optimization.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Contextual Generative Auction with Permutation-level Externalities for Online Advertising.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.1, 2025

Multi-task Offline Reinforcement Learning for Online Advertising in Recommender Systems.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

UQABench: Evaluating User Embedding for Prompting LLMs in Personalized Question Answering.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

Robust Data-Driven Auction Design.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

Bid2X: Revealing Dynamics of Bidding Environment in Online Advertising from A Foundation Model Lens.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

ChineseEcomQA: A Scalable E-commerce Concept Evaluation Benchmark for Large Language Models.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

An Adaptable Budget Planner for Enhancing Budget-Constrained Auto-Bidding in Online Advertising.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.1, 2025

Beyond Advertising: Mechanism Design for Platform-Wide Marketing Service "QuanZhanTui".
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

Differentiable Solver Search for Fast Diffusion Sampling.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Enhancing Document Understanding with Group Position Embedding: A Novel Approach to Incorporate Layout Information.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Minimal Impact ControlNet: Advancing Multi-ControlNet Integration.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

OmniKV: Dynamic Context Selection for Efficient Long-Context LLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

GSID: Generative Semantic Indexing for E-Commerce Product Understanding.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

How to inject knowledge efficiently? Knowledge Infusion Scaling Law for Pre-training Large Language Models.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

AIR: Complex Instruction Generation via Automatic Iterative Refinement.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Token Preference Optimization with Self-Calibrated Visual-Anchored Rewards for Hallucination Mitigation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

VC4VG: Optimizing Video Captions for Text-to-Video Generation.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Interpretable Word Representation Learning Framework for Modeling Semantic Relevance in E-commerce.
Proceedings of the Database Systems for Advanced Applications, 2025

LLM-Based Keyphrase-Augmented Framework for Semantic Relevance Assessment in E-Commerce.
Proceedings of the Database Systems for Advanced Applications, 2025

PosterMaker: Towards High-Quality Product Poster Generation with Accurate Text Rendering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MPPO: Multi Pair-wise Preference Optimization for LLMs with Arbitrary Negative Samples.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph.
Proceedings of the 34th ACM International Conference on Information and Knowledge Management, 2025

TBGRecall: A Generative Retrieval Model for E-commerce Recommendation Scenarios.
Proceedings of the 34th ACM International Conference on Information and Knowledge Management, 2025

T-Stars-Poster: A Framework for Product-Centric Advertising Image Design.
Proceedings of the 34th ACM International Conference on Information and Knowledge Management, 2025

See Beyond a Single View: Multi-Attribution Learning Leads to Better Conversion Rate Prediction.
Proceedings of the 34th ACM International Conference on Information and Knowledge Management, 2025

Masked Self-distilled Transducer-based Keyword Spotting with Semi-autoregressive Decoding.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

AIGuard: A Benchmark and Lightweight Detection for E-commerce AIGC Risks.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Chinese SafetyQA: A Safety Short-form Factuality Benchmark for Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

ProgCo: Program Helps Self-Correction of Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2025

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Do not Abstain! Identify and Solve the Uncertainty.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

M2RC-EVAL: Massively Multilingual Repository-level Code Completion Evaluation.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

HiddenDetect: Detecting Jailbreak Attacks against Multimodal Large Language Models via Monitoring Hidden States.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

See the World, Discover Knowledge: A Chinese Factuality Evaluation for Large Vision Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

RHanDS: Refining Malformed Hands for Generated Images with Decoupled Structure and Style Guidance.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback.
CoRR, 2024

Chinese SafetyQA: A Safety Short-form Factuality Benchmark for Large Language Models.
CoRR, 2024

Qwen2.5 Technical Report.
CoRR, 2024

LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer.
CoRR, 2024

HyViLM: Enhancing Fine-Grained Recognition with a Hybrid Encoder for Vision-Language Models.
CoRR, 2024

WiS Platform: Enhancing Evaluation of LLM-Based Multi-Agent Systems Through Game-Based Analysis.
CoRR, 2024

FocusLLaVA: A Coarse-to-Fine Approach for Efficient and Effective Visual Token Compression.
CoRR, 2024

Enhancing Vision-Language Model Safety through Progressive Concept-Bottleneck-Driven Alignment.
CoRR, 2024

Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models.
CoRR, 2024

Adaptive Dense Reward: Understanding the Gap Between Action and Reward Space in Alignment.
CoRR, 2024

FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution.
CoRR, 2024

M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation.
CoRR, 2024

MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models.
CoRR, 2024

Can VLMs Play Action Role-Playing Games? Take Black Myth Wukong as a Study Case.
CoRR, 2024

Qwen2 Technical Report.
CoRR, 2024

R2C2-Coder: Enhancing and Benchmarking Real-world Repository-level Code Completion Abilities of Code Large Language Models.
CoRR, 2024

ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models.
CoRR, 2024

Safety Alignment for Vision Language Models.
CoRR, 2024

Enhancing Prompt Following with Visual Control Through Training-Free Mask-Guided Diffusion.
CoRR, 2024

Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation.
CoRR, 2024

AtomoVideo: High Fidelity Image-to-Video Generation.
CoRR, 2024

Scalable Virtual Valuations Combinatorial Auction Design by Combining Zeroth-Order and First-Order Optimization Method.
CoRR, 2024

E^2-LLM: Efficient and Extreme Length Extension of Large Language Models.
CoRR, 2024

MusicAOG: an Energy-Based Model for Learning and Sampling a Hierarchical Representation of Symbolic Music.
CoRR, 2024

MetaSplit: Meta-Split Network for Limited-Stock Product Recommendation.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

Ad vs Organic: Revisiting Incentive Compatible Mechanism Design in E-commerce Platforms.
Proceedings of the ACM on Web Conference 2024, 2024

Unified Visual Preference Learning for User Intent Understanding.
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

Calibration-compatible Listwise Distillation of Privileged Features for CTR Prediction.
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

Exploring DCN-like architecture for fast image generation with arbitrary resolution.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale Games.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

DDK: Distilling Domain Knowledge for Efficient Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Demystify Mamba in Vision: A Linear Attention Perspective.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Deep Bag-of-Words Model: An Efficient and Interpretable Relevance Architecture for Chinese E-Commerce.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Generative Auto-bidding via Conditional Diffusion Modeling.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

GeoGPT4V: Towards Geometric Multi-modal Large Language Models with Geometric Image Generation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Accelerating Image Generation with Sub-path Linear Approximation Model.
Proceedings of the Computer Vision - ECCV 2024, 2024

Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Large Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Laner-GNN: Adapting to Long-Tail Degree Distribution with Latent Neighborhood Restorers.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

Enhancing Taobao Display Advertising with Multimodal Representations: Challenges, Approaches and Insights.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Synchronized Video Storytelling: Generating Video Narrations with Structured Storyline.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

E2-LLM: Efficient and Extreme Length Extension of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Percentile Risk-Constrained Budget Pacing for Guaranteed Display Advertising in Online Optimization.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Learning against Non-credible Auctions.
CoRR, 2023

Robust Representation Learning for Unified Online Top-K Recommendation.
CoRR, 2023

Correlative Preference Transfer with Hierarchical Hypergraph Network for Multi-Domain Recommendation.
Proceedings of the ACM Web Conference 2023, 2023

Rethinking Structural Encodings: Adaptive Graph Transformer for Node Classification Task.
Proceedings of the ACM Web Conference 2023, 2023

Fairness-aware Guaranteed Display Advertising Allocation under Traffic Cost Constraint.
Proceedings of the ACM Web Conference 2023, 2023

Gradient Coordination for Quantifying and Maximizing Knowledge Transference in Multi-Task Learning.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

FedAds: A Benchmark for Privacy-Preserving CVR Estimation with Vertical Federated Learning.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Multi-Scenario Ranking with Adaptive Feature Learning.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Combating Bilateral Edge Noise for Robust Link Prediction.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On Structural Expressive Power of Graph Transformers.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

A Personalized Automated Bidding Framework for Fairness-aware Online Advertising.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

E-commerce Search via Content Collaborative Graph Neural Network.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

RLTP: Reinforcement Learning to Pace for Delayed Impression Modeling in Preloaded Ads.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Adversarial Constrained Bidding via Minimax Regret Optimization with Causality-Aware Reinforcement Learning.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Joint Optimization of Ranking and Calibration with Contextualized Hybrid Model.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

End-to-End Inventory Prediction and Contract Allocation for Guaranteed Delivery Advertising.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Learning-Based Ad Auction Design with Externalities: The Framework and A Matching-Based Approach.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Capturing Conversion Rate Fluctuation during Sales Promotions: A Novel Historical Data Reuse Approach.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Entire Space Cascade Delayed Feedback Modeling for Effective Conversion Rate Prediction.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

COPR: Consistency-Oriented Pre-Ranking for Online Advertising.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Hybrid Contrastive Constraints for Multi-Scenario Ad Ranking.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

PS-SA: An Efficient Self-Attention via Progressive Sampling for User Behavior Sequence Modeling.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

MEBS: Multi-task End-to-end Bid Shading for Multi-slot Display Advertising.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Rec4Ad: A Free Lunch to Mitigate Sample Selection Bias for Ads CTR Prediction in Taobao.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

BOMGraph: Boosting Multi-scenario E-commerce Search with a Unified Graph Neural Network.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

2022
Correlative Preference Transfer with Hierarchical Hypergraph Network for Multi-Domain Recommendation.
CoRR, 2022

Towards Understanding the Overfitting Phenomenon of Deep Click-Through Rate Prediction Models.
CoRR, 2022

Personalized Inter-Task Contrastive Learning for CTR&CVR Joint Estimation.
CoRR, 2022

MCMF: Multi-Constraints With Merging Features Bid Optimization in Online Display Advertising.
CoRR, 2022

GBA: A Tuning-free Approach to Switch between Synchronous and Asynchronous Training for Recommendation Model.
CoRR, 2022

Visual Encoding and Debiasing for CTR Prediction.
CoRR, 2022

APG: Adaptive Parameter Generation Network for Click-Through Rate Prediction.
CoRR, 2022

UKD: Debiasing Conversion Rate Estimation via Uncertainty-regularized Knowledge Distillation.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

MBCT: Tree-Based Feature-Aware Binning for Individual Uncertainty Calibration.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

DC-GNN: Decoupled Graph Neural Networks for Improving and Accelerating Large-Scale E-commerce Retrieval.
Proceedings of the Companion of The Web Conference 2022, Virtual Event / Lyon, France, April 25, 2022

Asymptotically Unbiased Estimation for Delayed Feedback Modeling via Label Correction.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Leaving No One Behind: A Multi-Scenario Multi-Task Meta Learning Approach for Advertiser Modeling.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

Posterior Probability Matters: Doubly-Adaptive Calibration for Neural Predictions in Online Advertising.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Towards Personalized Bundle Creative Generation with Contrastive Non-Autoregressive Decoding.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Joint Optimization of Ad Ranking and Creative Selection.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Transform Cold-Start Users into Warm via Fused Behaviors in Large-Scale Recommendation.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Learning Disentangled Representations for Counterfactual Regression via Mutual Information Minimization.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

APG: Adaptive Parameter Generation Network for Click-Through Rate Prediction.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

GBA: A Tuning-free Approach to Switch between Synchronous and Asynchronous Training for Recommendation Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sustainable Online Reinforcement Learning for Auto-bidding.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

CREATER: CTR-driven Advertising Text Generation with Controlled Pre-Training and Contrastive Fine-Tuning.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Track, 2022

Adversarial Gradient Driven Exploration for Deep Click-Through Rate Prediction.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

ROI-Constrained Bidding via Curriculum-Guided Bayesian Reinforcement Learning.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Pretraining Representations of Multi-modal Multi-query E-commerce Search.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

EXTR: Click-Through Rate Prediction with Externalities in E-Commerce Sponsored Search.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

PICASSO: Unleashing the Potential of GPU-centric Training for Wide-and-deep Recommender Systems.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

AMCAD: Adaptive Mixed-Curvature Representation based Advertisement Retrieval System.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Towards Understanding the Overfitting Phenomenon of Deep Click-Through Rate Models.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

KEEP: An Industrial Pre-Training Framework for Online Recommendation via Knowledge Extraction and Plugging.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Graph-based Weakly Supervised Framework for Semantic Relevance Learning in E-commerce.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

AdaSparse: Learning Adaptively Sparse Structures for Multi-Domain Click-Through Rate Prediction.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Visual Encoding and Debiasing for CTR Prediction.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

STARDOM: Semantic Aware Deep Hierarchical Forecasting Model for Search Traffic Prediction.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Adaptive Domain Interest Network for Multi-domain Recommendation.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Hierarchically Constrained Adaptive Ad Exposure in Feeds.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Approximate Nearest Neighbor Search under Neural Similarity Metric for Large-Scale Recommendation.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

2021
Towards a Better Tradeoff between Effectiveness and Efficiency in Pre-Ranking: A Learnable Feature Selection based Approach.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Explicit Semantic Cross Feature Learning via Pre-trained Graph Neural Networks for CTR Prediction.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Multi-Agent Cooperative Bidding Games for Multi-Objective Optimization in e-Commercial Sponsored Search.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Learning Effective and Efficient Embedding via an Adaptively-Masked Twins-based Layer.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Binary Code based Hash Embedding for Web-scale Applications.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

SMAD: Scalable Multi-view Ad Retrieval System for E-Commerce Sponsored Search.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

AutoHERI: Automated Hierarchical Representation Integration for Post-Click Conversion Rate Estimation.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Heterogeneous Graph Neural Networks for Large-Scale Bid Keyword Matching.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

2020
Cross-Graph Convolution Learning for Large-Scale Text-Picture Shopping Guide in E-Commerce Search.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

2019
Aggregating E-commerce Search Results from Heterogeneous Sources via Hierarchical Reinforcement Learning.
Proceedings of the World Wide Web Conference, 2019

A Minimax Game for Instance based Selective Transfer Learning.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

2006
AM-Trie: An OC-192 Parallel Multidimensional Packet Classification Algorithm.
Proceedings of the Interdisciplinary and Multidisciplinary Research in Computer Science, 2006

AM-Trie: A High-Speed Parallel Packet Classification Algorithm for Network Processor.
Proceedings of the Computational Science, 2006

AntiWorm NPU-based Parallel Bloom filters in Giga-Ethernet LAN.
Proceedings of IEEE International Conference on Communications, 2006

2005
AntiWorm NPU-based Parallel Bloom Filters for TCP/IP Content Processing in Giga-Ethernet LAN.
Proceedings of the 30th Annual IEEE Conference on Local Computer Networks (LCN 2005), 2005

