Chi Zhang

Orcid: 0000-0001-8409-1189

Affiliations:
  • JD.com Silicon Valley Research Center, Mountain View, CA, USA


According to our database1, Chi Zhang authored at least 47 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
TelePhysics: Physics-Grounded Multi-Object Scene Generation from a Single Image with Real-Time Interaction.
CoRR, May, 2026

PRTS: A Primitive Reasoning and Tasking System via Contrastive Representations.
CoRR, April, 2026

Learning to Credit the Right Steps: Objective-aware Process Optimization for Visual Generation.
CoRR, April, 2026

TeleBoost: A Systematic Alignment Framework for High-Fidelity, Controllable, and Robust Video Generation.
CoRR, February, 2026

Point2Insert: Video Object Insertion via Sparse Point Guidance.
CoRR, February, 2026

TeleStyle: Content-Preserving Style Transfer in Images and Videos.
CoRR, January, 2026

Are LLMs Vulnerable to Preference-Undermining Attacks (PUA)? A Factorial Analysis Methodology for Diagnosing the Trade-off between Preference Alignment and Real-World Validity.
CoRR, January, 2026

QwenStyle: Content-Preserving Style Transfer with Qwen-Image-Edit.
CoRR, January, 2026

TeleWorld: Towards Dynamic Multimodal Synthesis with a 4D World Model.
CoRR, January, 2026

OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

When Safe Unimodal Inputs Collide: Optimizing Reasoning Chains for Cross-Modal Safety in Multimodal Large Language Models.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Generative Video Compression: Towards 0.01% Compression Rate for Video Transmission.
CoRR, December, 2025

Theoretical Foundations of Scaling Law in Familial Models.
CoRR, December, 2025

The Law of Multi-Model Collaboration: Scaling Limits of Model Ensembling for Large Language Models.
CoRR, December, 2025

TeleAI-Safety: A comprehensive LLM jailbreaking benchmark towards attacks, defenses, and evaluations.
CoRR, December, 2025

Aetheria: A multimodal interpretable content safety framework based on multi-agent debate and collaboration.
CoRR, December, 2025

CtrlVDiff: Controllable Video Generation via Unified Multimodal Video Diffusion.
CoRR, November, 2025

Growing with the Generator: Self-paced GRPO for Video Generation.
CoRR, November, 2025

Seeing What Matters: Visual Preference Policy Optimization for Visual Generation.
CoRR, November, 2025

UniModel: A Visual-Only Framework for Unified Multimodal Understanding and Generation.
CoRR, November, 2025

Information Capacity: Evaluating the Efficiency of Large Language Models via Text Compression.
CoRR, November, 2025

ScRPO: From Errors to Insights.
CoRR, November, 2025

TeleEgo: Benchmarking Egocentric AI Assistants in the Wild.
CoRR, October, 2025

A Parameter-Efficient Mixture-of-Experts Framework for Cross-Modal Geo-Localization.
CoRR, October, 2025

Towards Reliable LLM-based Robot Planning via Combined Uncertainty Estimation.
CoRR, October, 2025

RADAR: A Risk-Aware Dynamic Multi-Agent Framework for LLM Safety Evaluation via Role-Specialized Collaboration.
CoRR, September, 2025

Pipeline Parallelism is All You Need for Optimized Early-Exit Based Self-Speculative Decoding.
CoRR, September, 2025

Align-Then-stEer: Adapting the Vision-Language Action Models through Unified Latent Guidance.
CoRR, September, 2025

Integrating Reinforcement Learning with Visual Generative Models: Foundations and Advances.
CoRR, August, 2025

Conditional Video Generation for High-Efficiency Video Compression.
CoRR, July, 2025

Skill-Nav: Enhanced Navigation with Versatile Quadrupedal Locomotion via Waypoint Interface.
CoRR, June, 2025

AI Flow: Perspectives, Scenarios, and Approaches.
CoRR, June, 2025

Towards a Generalizable Bimanual Foundation Policy via Flow-based Video Prediction.
CoRR, May, 2025

Metric-Solver: Sliding Anchored Metric Depth Estimation from a Single Image.
CoRR, April, 2025

AgiBot World Colosseo: A Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems.
CoRR, March, 2025

External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation.
CoRR, February, 2025

UniForm: A Unified Diffusion Transformer for Audio-Video Generation.
CoRR, February, 2025

InterSyn: Interleaved Learning for Dynamic Motion Synthesis in the Wild.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

MAGiC: An LLM-Powered Multi-Agent Framework for Unleashing Visual Creativity.
Proceedings of the ECAI 2025 - 28th European Conference on Artificial Intelligence, 25-30 October 2025, Bologna, Italy, 2025

2024
VAST 1.0: A Unified Framework for Controllable and Consistent Video Generation.
CoRR, 2024

2022
Scenario-based Multi-product Advertising Copywriting Generation for E-Commerce.
CoRR, 2022

Automatic Generation of Product-Image Sequence in E-commerce.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

2021
DSGPT: Domain-Specific Generative Pre-Training of Transformers for Text Generation in E-commerce Title and Review Summarization.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

2019
Learning with Non-Convex Truncated Losses by SGD.
Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

Multi-Horizon Time Series Forecasting with Temporal Attention Learning.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Heterogeneous Memory Enhanced Multimodal Attention Model for Video Question Answering.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
A Top-Down Approach to Articulated Human Pose Estimation and Tracking.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018


  Loading...