Chi Zhang

Orcid: 0000-0001-8409-1189

Affiliations:

JD.com Silicon Valley Research Center, Mountain View, CA, USA

According to our database¹, Chi Zhang authored at least 47 papers between 2018 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2026

TelePhysics: Physics-Grounded Multi-Object Scene Generation from a Single Image with Real-Time Interaction.

CoRR, May, 2026

PRTS: A Primitive Reasoning and Tasking System via Contrastive Representations.

CoRR, April, 2026

Learning to Credit the Right Steps: Objective-aware Process Optimization for Visual Generation.

CoRR, April, 2026

TeleBoost: A Systematic Alignment Framework for High-Fidelity, Controllable, and Robust Video Generation.

CoRR, February, 2026

Point2Insert: Video Object Insertion via Sparse Point Guidance.

CoRR, February, 2026

TeleStyle: Content-Preserving Style Transfer in Images and Videos.

CoRR, January, 2026

Are LLMs Vulnerable to Preference-Undermining Attacks (PUA)? A Factorial Analysis Methodology for Diagnosing the Trade-off between Preference Alignment and Real-World Validity.

CoRR, January, 2026

QwenStyle: Content-Preserving Style Transfer with Qwen-Image-Edit.

CoRR, January, 2026

TeleWorld: Towards Dynamic Multimodal Synthesis with a 4D World Model.

[...], Chengcheng Zhou, [...]

CoRR, January, 2026

OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

When Safe Unimodal Inputs Collide: Optimizing Reasoning Chains for Cross-Modal Safety in Multimodal Large Language Models.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Generative Video Compression: Towards 0.01% Compression Rate for Video Transmission.

CoRR, December, 2025

Theoretical Foundations of Scaling Law in Familial Models.

CoRR, December, 2025

The Law of Multi-Model Collaboration: Scaling Limits of Model Ensembling for Large Language Models.

CoRR, December, 2025

TeleAI-Safety: A comprehensive LLM jailbreaking benchmark towards attacks, defenses, and evaluations.

CoRR, December, 2025

Aetheria: A multimodal interpretable content safety framework based on multi-agent debate and collaboration.

CoRR, December, 2025

CtrlVDiff: Controllable Video Generation via Unified Multimodal Video Diffusion.

CoRR, November, 2025

Growing with the Generator: Self-paced GRPO for Video Generation.

CoRR, November, 2025

Seeing What Matters: Visual Preference Policy Optimization for Visual Generation.

CoRR, November, 2025

UniModel: A Visual-Only Framework for Unified Multimodal Understanding and Generation.

CoRR, November, 2025

Information Capacity: Evaluating the Efficiency of Large Language Models via Text Compression.

CoRR, November, 2025

ScRPO: From Errors to Insights.

CoRR, November, 2025

TeleEgo: Benchmarking Egocentric AI Assistants in the Wild.

CoRR, October, 2025

A Parameter-Efficient Mixture-of-Experts Framework for Cross-Modal Geo-Localization.

CoRR, October, 2025

Towards Reliable LLM-based Robot Planning via Combined Uncertainty Estimation.

CoRR, October, 2025

RADAR: A Risk-Aware Dynamic Multi-Agent Framework for LLM Safety Evaluation via Role-Specialized Collaboration.

CoRR, September, 2025

Pipeline Parallelism is All You Need for Optimized Early-Exit Based Self-Speculative Decoding.

CoRR, September, 2025

Align-Then-stEer: Adapting the Vision-Language Action Models through Unified Latent Guidance.

CoRR, September, 2025

Integrating Reinforcement Learning with Visual Generative Models: Foundations and Advances.

CoRR, August, 2025

Conditional Video Generation for High-Efficiency Video Compression.

CoRR, July, 2025

Skill-Nav: Enhanced Navigation with Versatile Quadrupedal Locomotion via Waypoint Interface.

CoRR, June, 2025

AI Flow: Perspectives, Scenarios, and Approaches.

CoRR, June, 2025

Towards a Generalizable Bimanual Foundation Policy via Flow-based Video Prediction.

CoRR, May, 2025

Metric-Solver: Sliding Anchored Metric Depth Estimation from a Single Image.

CoRR, April, 2025

AgiBot World Colosseo: A Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems.

AgiBot-World-Contributors, [...]

CoRR, March, 2025

External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation.

[...], Xingliang Huang, [...], Amit Anand Amlesahwaram, [...], Alireza Vahdatpour, [...], Toshinari Kureha, [...], Musharaf Sultan, [...]

CoRR, February, 2025

UniForm: A Unified Diffusion Transformer for Audio-Video Generation.

CoRR, February, 2025

InterSyn: Interleaved Learning for Dynamic Motion Synthesis in the Wild.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

MAGiC: An LLM-Powered Multi-Agent Framework for Unleashing Visual Creativity.

Proceedings of the ECAI 2025 - 28th European Conference on Artificial Intelligence, 25-30 October 2025, Bologna, Italy, 2025

2024

VAST 1.0: A Unified Framework for Controllable and Consistent Video Generation.

CoRR, 2024

2022

Scenario-based Multi-product Advertising Copywriting Generation for E-Commerce.

CoRR, 2022

Automatic Generation of Product-Image Sequence in E-commerce.

Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

2021

DSGPT: Domain-Specific Generative Pre-Training of Transformers for Text Generation in E-commerce Title and Review Summarization.

Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

2019

Learning with Non-Convex Truncated Losses by SGD.

Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

Multi-Horizon Time Series Forecasting with Temporal Attention Learning.

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Heterogeneous Memory Enhanced Multimodal Attention Model for Video Question Answering.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

A Top-Down Approach to Articulated Human Pose Estimation and Tracking.

Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018
