Zhaoxi Chen

Orcid: 0000-0003-3998-7044

Affiliations:
  • Nanyang Technological University, Singaporte


According to our database1, Zhaoxi Chen authored at least 50 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
FreeTraj: Tuning-Free Trajectory Control via Noise Guided Video Diffusion.
Int. J. Comput. Vis., April, 2026

Kinema4D: Kinematic 4D World Modeling for Spatiotemporal Embodied Simulation.
CoRR, March, 2026

HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions.
CoRR, March, 2026

ArtHOI: Articulated Human-Object Interaction Synthesis by 4D Reconstruction from Video Priors.
CoRR, March, 2026

Compositional Generative Model of Unbounded 4D Cities.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2026

DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation.
CoRR, January, 2026

PI-Light: Physics-Inspired Diffusion for Full-Image Relighting.
CoRR, January, 2026

OnlineSI: Taming Large Language Model for Online 3D Understanding and Grounding.
CoRR, January, 2026

2025
LongVie 2: Multimodal Controllable Ultra-Long Video World Model.
CoRR, December, 2025

Light-X: Generative 4D Video Rendering with Camera and Illumination Control.
CoRR, December, 2025

NeAR: Coupled Neural Asset-Renderer Stack.
CoRR, November, 2025

PhysX-Anything: Simulation-Ready Physical 3D Assets from Single Image.
CoRR, November, 2025

Simulating the Visual World with Artificial Intelligence: A Roadmap.
CoRR, November, 2025

One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation.
CoRR, September, 2025

Collaborative Multi-Modal Coding for High-Quality 3D Generation.
CoRR, August, 2025

4DNeX: Feed-Forward 4D Generative Modeling Made Easy.
CoRR, August, 2025

LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation.
CoRR, August, 2025

Reconstructing 4D Spatial Intelligence: A Survey.
CoRR, July, 2025

PhysX-3D: Physical-Grounded 3D Asset Generation.
CoRR, July, 2025

Light of Normals: Unified Feature Representation for Universal Photometric Stereo.
CoRR, June, 2025

Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting.
CoRR, June, 2025

Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding.
CoRR, June, 2025

DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation.
CoRR, June, 2025

ORV: 4D Occupancy-centric Robot Video Generation.
CoRR, June, 2025

3D Scene Generation: A Survey.
CoRR, May, 2025

Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency.
CoRR, March, 2025

CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities.
CoRR, January, 2025

Dual-Expert Consistency Model for Efficient and High-Quality Video Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Free4D: Tuning-Free 4D Scene Generation with Spatial-Temporal Consistency.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Generative Gaussian Splatting for Unbounded 3D City Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
PERF: Panoramic Neural Radiance Field From a Single Panorama.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2024

ReliTalk: Relightable Talking Portrait Generation from a Single Video.
Int. J. Comput. Vis., August, 2024

FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models.
CoRR, 2024

GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation.
CoRR, 2024

FashionEngine: Interactive Generation and Editing of 3D Clothed Humans.
CoRR, 2024

3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion Priors.
CoRR, 2024

LGM: Large Multi-view Gaussian Model for High-Resolution 3D Content Creation.
Proceedings of the Computer Vision - ECCV 2024, 2024

CityDreamer: Compositional Generative Model of Unbounded 3D Cities.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024


2023
SceneDreamer: Unbounded 3D Scene Generation From 2D Image Collections.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling.
CoRR, 2023

PrimDiffusion: Volumetric Primitives Diffusion for 3D Human Generation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

EVA3D: Compositional 3D Human Generation from 2D Image Collections.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SparseNeRF: Distilling Depth Ranking for Few-shot Novel View Synthesis.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

F<sup>2</sup>-NeRF: Fast Neural Radiance Field Training with Free Camera Trajectories.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Text2Light: Zero-Shot Text-Driven HDR Panorama Generation.
ACM Trans. Graph., 2022

Relighting4D: Neural Relightable Human from Videos.
Proceedings of the Computer Vision - ECCV 2022, 2022


  Loading...