Kai Zhang

Orcid: 0000-0002-1727-1689

Affiliations:
  • Cornell University, Cornell Tech, Ithaca, NY, USA


According to our database, Kai Zhang authored at least 47 papers between 2016 and 2026.

Bibliography

2026
OmniRoam: World Wandering via Long-Horizon Panoramic Video Generation.
CoRR, March, 2026

2025
Self-Evaluation Unlocks Any-Step Text-to-Image Generation.
CoRR, December, 2025

Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing.
CoRR, December, 2025

E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training.
CoRR, December, 2025

SplatPainter: Interactive Authoring of 3D Gaussians from 2D Edits via Test-Time Training.
CoRR, December, 2025

pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation.
CoRR, October, 2025

Aligning Visual Foundation Encoders to Tokenizers for Diffusion Models.
CoRR, September, 2025

KnapFormer: An Online Load Balancer for Efficient Diffusion Transformers Training.
CoRR, August, 2025

4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time.
CoRR, June, 2025

Test-Time Training Done Right.
CoRR, May, 2025

RayZer: A Self-supervised Large View Synthesis Model.
CoRR, May, 2025

Neural BRDF Importance Sampling by Reparameterization.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2025

Gaussian Mixture Flow Matching Models.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

RelitLRM: Generative Relightable Radiance for Large Reconstruction Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RayZer: A Self-Supervised Large View Synthesis Model.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Long-LRM: Long-Sequence Large Reconstruction Model for Wide-Coverage Gaussian Splats.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Baking Gaussian Splatting Into Diffusion Denoiser for Fast and Scalable Single-Stage Image-to-3D Generation and Reconstruction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

RandAR: Decoder-only Autoregressive Visual Generation in Random Orders.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Buffer Anytime: Zero-Shot Video Depth and Normal from Image Priors.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Turbo3D: Ultra-fast Text-to-3D Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Generating 3D-Consistent Videos from Unposed Internet Photos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
EP-CFG: Energy-Preserving Classifier-Free Guidance.
CoRR, 2024

Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation.
CoRR, 2024

PBIR-NIE: Glossy Object Capture under Non-Distant Lighting.
CoRR, 2024

MeshLRM: Large Reconstruction Model for High-Quality Mesh.
CoRR, 2024

LRM-Zero: Training Large Reconstruction Models with Synthesized Data.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Neural Gaffer: Relighting Any Object via Diffusion.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

DMV3D: Denoising Multi-view Diffusion Using 3D Large Reconstruction Model.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Instant3D: Fast Text-to-3D with Sparse-view Generation and Large Reconstruction Model.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

LRM: Large Reconstruction Model for Single Image to 3D.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

GS-LRM: Large Reconstruction Model for 3D Gaussian Splatting.
Proceedings of the Computer Vision - ECCV 2024, 2024

MegaScenes: Scene-Level View Synthesis at Scale.
Proceedings of the Computer Vision - ECCV 2024, 2024

DATENeRF: Depth-Aware Text-Based Editing of NeRFs.
Proceedings of the Computer Vision - ECCV 2024, 2024

GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Neural Directional Encoding for Efficient and Accurate View-Dependent Appearance Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2022
ARF: Artistic Radiance Fields.
Proceedings of the Computer Vision - ECCV 2022, 2022

IRON: Inverse Rendering by Optimizing Neural SDFs and Materials from Photometric Images.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
PhySG: Inverse Rendering With Spherical Gaussians for Physics-Based Material Editing and Relighting.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
NeRF++: Analyzing and Improving Neural Radiance Fields.
CoRR, 2020

Depth Sensing Beyond LiDAR Range.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Leveraging Vision Reconstruction Pipelines for Satellite Imagery.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

2017
Robust Non-line-of-sight Imaging with Single Photon Detectors.
CoRR, 2017

2016
Improving fairness of network bandwidth allocation for virtual machines in cloud environment.
Proceedings of the 2016 IEEE International Black Sea Conference on Communications and Networking, 2016

