Yinghao Xu

ORCID: 0000-0003-2696-9664

Affiliations:
  • Stanford University, USA
  • Chinese University of Hong Kong (CUHK), Multimedia Laboratory, Shatin, China


According to our database, Yinghao Xu authored at least 61 papers between 2020 and 2025.

Bibliography

2025
Interspatial Attention for Efficient 4D Human Video Generation.
ACM Trans. Graph., August, 2025

Video World Models with Long-term Spatial Memory.
CoRR, June, 2025

CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models.
CoRR, March, 2025

RelightVid: Temporal-Consistent Diffusion Model for Video Relighting.
CoRR, January, 2025

3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

CameraCtrl: Enabling Camera Control for Video Diffusion Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

GroomLight: Hybrid Inverse Rendering for Relightable Human Hair Appearance Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Learning Naturally Aggregated Appearance for Efficient 3D Editing.
Proceedings of the International Conference on 3D Vision, 2025

2024
Representing Long Volumetric Video with Temporal Gaussian Hierarchy.
ACM Trans. Graph., December, 2024

Spatial Steerability of GANs via Self-Supervision from Discriminator.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

In-Domain GAN Inversion for Faithful Reconstruction and Editability.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Edicho: Consistent Image Editing in the Wild.
CoRR, 2024

CameraCtrl: Enabling Camera Control for Text-to-Video Generation.
CoRR, 2024

FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

DMV3D: Denoising Multi-view Diffusion Using 3D Large Reconstruction Model.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Instant3D: Fast Text-to-3D with Sparse-view Generation and Large Reconstruction Model.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Real-Time 3D-Aware Portrait Editing from a Single Image.
Proceedings of the Computer Vision - ECCV 2024, 2024

BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards Text-guided 3D Scene Composition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Gaussian Shell Maps for Efficient 3D Human Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Flow as the Cross-domain Manipulation Interface.
Proceedings of the Conference on Robot Learning, 6-9 November 2024, Munich, Germany, 2024

Efficient 3D Articulated Human Generation with Layered Surface Volumes.
Proceedings of the International Conference on 3D Vision, 2024

2023
Implicit Neural Representations With Structured Latent Codes for Human Body Modeling.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

GH-Feat: Learning Versatile Generative Hierarchical Features From GANs.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

SceneWiz3D: Towards Text-guided 3D Scene Composition.
CoRR, 2023

Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis.
CoRR, 2023

Improving Out-of-Distribution Robustness of Classifiers via Generative Interpolation.
CoRR, 2023

Spatial Steerability of GANs via Self-Supervision from Discriminator.
CoRR, 2023

Learning Modulated Transformation in GANs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Towards Smooth Video Composition.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

3D generation on ImageNet.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

One-Shot Generative Domain Adaptation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-aware Scene Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning 3D-Aware Image Synthesis with Unknown Pose Distribution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

GLeaD: Improving GANs with A Generator-Leading Task.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Deep Generative Models on 3D Representations: A Survey.
CoRR, 2022

Improving GANs with A Dynamic Discriminator.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Region-Based Semantic Factorization in GANs.
Proceedings of the International Conference on Machine Learning, 2022

Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation.
Proceedings of the Computer Vision - ECCV 2022, 2022

High-Fidelity GAN Inversion with Padding Space.
Proceedings of the Computer Vision - ECCV 2022, 2022

Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

3D-aware Image Synthesis via Learning Structural and Textural Representations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Improving GAN Equilibrium by Raising Spatial Awareness.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Decorating Your Own Bedroom: Locally Controlling Image Generation with Generative Adversarial Networks.
CoRR, 2021

Data-Efficient Instance Generation from Instance Discrimination.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

CompConv: A Compact Convolution Module for Efficient Feature Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Generative Hierarchical Features From Synthesizing Images.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Neural Body: Implicit Neural Representations With Structured Latent Codes for Novel View Synthesis of Dynamic Humans.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Unsupervised Landmark Learning from Unpaired Data.
CoRR, 2020

Video Representation Learning with Visual Tempo Consistency.
CoRR, 2020

Dense RepPoints: Representing Visual Objects with Dense Point Sets.
Proceedings of the Computer Vision - ECCV 2020, 2020

Temporal Pyramid Network for Action Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
