Kai Zhang

ORCID: 0000-0002-1727-1689

Affiliations:

Cornell University, Cornell Tech, Ithaca, NY, USA

According to our database, Kai Zhang authored at least 47 papers between 2016 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2026

OmniRoam: World Wandering via Long-Horizon Panoramic Video Generation.

Yannick Hold-Geoffroy

CoRR, March, 2026

2025

Self-Evaluation Unlocks Any-Step Text-to-Image Generation.

CoRR, December, 2025

Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing.

CoRR, December, 2025

E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training.

CoRR, December, 2025

SplatPainter: Interactive Authoring of 3D Gaussians from 2D Edits via Test-Time Training.

CoRR, December, 2025

pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation.

CoRR, October, 2025

Aligning Visual Foundation Encoders to Tokenizers for Diffusion Models.

CoRR, September, 2025

KnapFormer: An Online Load Balancer for Efficient Diffusion Transformers Training.

CoRR, August, 2025

Test-Time Training Done Right.

CoRR, May, 2025

RayZer: A Self-supervised Large View Synthesis Model.

CoRR, May, 2025

Neural BRDF Importance Sampling by Reparameterization.

Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2025

4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Gaussian Mixture Flow Matching Models.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

RelitLRM: Generative Relightable Radiance for Large Reconstruction Models.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RayZer: A Self-Supervised Large View Synthesis Model.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Long-LRM: Long-Sequence Large Reconstruction Model for Wide-Coverage Gaussian Splats.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Baking Gaussian Splatting Into Diffusion Denoiser for Fast and Scalable Single-Stage Image-to-3D Generation and Reconstruction.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

RandAR: Decoder-only Autoregressive Visual Generation in Random Orders.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Buffer Anytime: Zero-Shot Video Depth and Normal from Image Priors.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Turbo3D: Ultra-fast Text-to-3D Generation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Generating 3D-Consistent Videos from Unposed Internet Photos.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

EP-CFG: Energy-Preserving Classifier-Free Guidance.

CoRR, 2024

Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation.

CoRR, 2024

PBIR-NIE: Glossy Object Capture under Non-Distant Lighting.

CoRR, 2024

MeshLRM: Large Reconstruction Model for High-Quality Mesh.

CoRR, 2024

LRM-Zero: Training Large Reconstruction Models with Synthesized Data.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Neural Gaffer: Relighting Any Object via Diffusion.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

DMV3D: Denoising Multi-view Diffusion Using 3D Large Reconstruction Model.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Instant3D: Fast Text-to-3D with Sparse-view Generation and Large Reconstruction Model.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

LRM: Large Reconstruction Model for Single Image to 3D.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

GS-LRM: Large Reconstruction Model for 3D Gaussian Splatting.

Proceedings of the Computer Vision - ECCV 2024, 2024

MegaScenes: Scene-Level View Synthesis at Scale.

Proceedings of the Computer Vision - ECCV 2024, 2024

DATENeRF: Depth-Aware Text-Based Editing of NeRFs.

Proceedings of the Computer Vision - ECCV 2024, 2024

GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Neural Directional Encoding for Efficient and Accurate View-Dependent Appearance Modeling.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2022

ARF: Artistic Radiance Fields.

Proceedings of the Computer Vision - ECCV 2022, 2022

IRON: Inverse Rendering by Optimizing Neural SDFs and Materials from Photometric Images.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

PhySG: Inverse Rendering With Spherical Gaussians for Physics-Based Material Editing and Relighting.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

NeRF++: Analyzing and Improving Neural Radiance Fields.

CoRR, 2020

Depth Sensing Beyond LiDAR Range.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Leveraging Vision Reconstruction Pipelines for Satellite Imagery.

Kai Zhang

Noah Snavely

Jin Sun

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

2017

Robust Non-line-of-sight Imaging with Single Photon Detectors.

CoRR, 2017

2016

Improving fairness of network bandwidth allocation for virtual machines in cloud environment.

Zhiyuan Shao

Kai Zhang

Hai Jin

Proceedings of the 2016 IEEE International Black Sea Conference on Communications and Networking, 2016
