Yinda Zhang

Orcid: 0000-0001-5386-8872

Affiliations:
  • Google, Mountain View, CA, USA
  • Princeton University (former)


According to our database1, Yinda Zhang authored at least 74 papers between 2013 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
T-Pixel2Mesh: Combining Global and Local Transformer for 3D Mesh Generation from a Single Image.
CoRR, 2024

Pushing Auto-regressive Models for 3D Shape Generation at Capacity and Scalability.
CoRR, 2024

One2Avatar: Generative Implicit Head Avatar For Few-shot User Adaptation.
CoRR, 2024

GO-NeRF: Generating Virtual Objects in Neural Radiance Fields.
CoRR, 2024

2023
H4MER: Human 4D Modeling by Learning Neural Compositional Representation With Transformer.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Pixel2Mesh++: 3D Mesh Generation and Refinement From Multi-View Images.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Recurrent 3D Hand Pose Estimation Using Cascaded Pose-Guided 3D Alignments.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

InstructPipe: Building Visual Programming Pipelines with Human Instructions.
CoRR, 2023

MVDD: Multi-View Depth Diffusion Models.
CoRR, 2023

Gaussian3Diff: 3D Gaussian Diffusion for 3D Full Head Synthesis and Editing.
CoRR, 2023

Learning Versatile 3D Shape Generation with Improved AR Models.
CoRR, 2023

Portrait Expression Editing With Mobile Photo Sequence.
Proceedings of the SIGGRAPH Asia 2023 Technical Communications, 2023

LitNeRF: Intrinsic Radiance Decomposition for High-Quality View Synthesis and Relighting of Faces.
Proceedings of the SIGGRAPH Asia 2023 Conference Papers, 2023

Self-supervised Learning of Implicit Shape Representation with Dense Correspondence for Deformable Objects.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Spectral Graphormer: Spectral Graph-based Transformer for Egocentric Two-Hand Reconstruction using Multi-View Color Images.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning Versatile 3D Shape Generation with Improved Auto-regressive Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Multi-Modal Neural Radiance Field for Monocular Dense SLAM with a Light-Weight ToF Sensor.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent with Learned Distance Functions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Hybrid Neural Rendering for Large-Scale Scenes with Motion Blur.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SINE: Semantic-driven Image-based NeRF Editing with Prior-guided Editing Field.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Personalized High Quality Volumetric Head Avatars from Monocular RGB Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Rapsai: Accelerating Machine Learning Prototyping of Multimedia Applications through Visual Programming.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

2022
Neural rendering in a room: amodal 3D understanding and free-viewpoint rendering for the closed scene composed of pre-captured objects.
ACM Trans. Graph., 2022

OmniSyn: Synthesizing 360 Videos with Wide-baseline Panoramas.
Proceedings of the 2022 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, 2022

VoLux-GAN: A Generative Model for 3D Face Synthesis with HDRI Relighting.
Proceedings of the SIGGRAPH '22: Special Interest Group on Computer Graphics and Interactive Techniques Conference, Vancouver, BC, Canada, August 7, 2022

NeuMesh: Learning Disentangled Neural Mesh-Based Implicit Field for Geometry and Texture Editing.
Proceedings of the Computer Vision - ECCV 2022, 2022

DELTAR: Depth Estimation from a Light-Weight ToF Sensor and RGB Image.
Proceedings of the Computer Vision - ECCV 2022, 2022

LoRD: Local 4D Implicit Representation for High-Fidelity Dynamic Human Modeling.
Proceedings of the Computer Vision - ECCV 2022, 2022

PRIF: Primary Ray-Based Implicit Function.
Proceedings of the Computer Vision - ECCV 2022, 2022

H4D: Human 4D Modeling by Learning Neural Compositional Representation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Density-preserving Deep Point Cloud Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Opportunistic Interfaces for Augmented Reality: Transforming Everyday Objects into Tangible 6DoF Interfaces Using Ad hoc UI.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

Efficient Virtual View Selection for 3D Hand Pose Estimation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Weakly Supervised Learning for Single Depth-Based Hand Shape Recovery.
IEEE Trans. Image Process., 2021

Hand Pose Understanding With Large-Scale Photo-Realistic Rendering Dataset.
IEEE Trans. Image Process., 2021

Pixel2Mesh: 3D Mesh Model Generation via Image Guided Deformation.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Interacting Two-Hand 3D Pose and Shape Reconstruction from Single Color Image.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

DeepPanoContext: Panoramic 3D Scene Understanding with Holistic Scene Context Graph and Relation-based Optimization.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Deep Hybrid Self-Prior for Full 3D Mesh Generation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Multiresolution Deep Implicit Functions for 3D Shape Representation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Spatially-Varying Outdoor Lighting Estimation From Intrinsics.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Holistic 3D Scene Understanding From a Single Image With Implicit Representation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

HITNet: Hierarchical Iterative Tile Refinement Network for Real-time Stereo Matching.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

HumanGPS: Geodesic PreServing Feature for Dense Human Correspondences.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Compositional Representation for 4D Captures With Neural ODE.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Deep relightable textures: volumetric performance capture with neural rendering.
ACM Trans. Graph., 2020

PBR-Net: Imitating Physically Based Rendering Using Deep Neural Network.
IEEE Trans. Image Process., 2020

HITNet: Hierarchical Iterative Tile Refinement Network for Real-time Stereo Matching.
CoRR, 2020

Du<sup>2</sup>Net: Learning Depth Estimation from Dual-Cameras and Dual-Pixels.
Proceedings of the Computer Vision - ECCV 2020, 2020

DeepSFM: Structure from Motion via Deep Bundle Adjustment.
Proceedings of the Computer Vision - ECCV 2020, 2020

GeoLayout: Geometry Driven Room Layout Estimation Based on Depth Maps of Planes.
Proceedings of the Computer Vision - ECCV 2020, 2020

Neural Pose Transfer by Spatially Adaptive Instance Normalization.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Deep Implicit Volume Compression.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

DIST: Rendering Deep Implicit Signed Distance Function With Differentiable Sphere Tracing.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Neural Point Cloud Rendering via Multi-Plane Projection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Multi-Level Semantic Feature Augmentation for One-Shot Learning.
IEEE Trans. Image Process., 2019

Pixel2Mesh++: Multi-View 3D Mesh Generation via Deformation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

DeepLiDAR: Deep Surface Normal Guided Depth Prediction for Outdoor Scene From Sparse LiDAR Data and Single Color Image.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Joint Hand Detection and Rotation Estimation Using CNN.
IEEE Trans. Image Process., 2018

Semantic Feature Augmentation in Few-shot Learning.
CoRR, 2018

ActiveStereoNet: End-to-End Self-supervised Learning for Active Stereo Systems.
Proceedings of the Computer Vision - ECCV 2018, 2018

Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images.
Proceedings of the Computer Vision - ECCV 2018, 2018

Deep Depth Completion of a Single RGB-D Image.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Hand3D: Hand Pose Estimation using 3D Neural Network.
CoRR, 2017

DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Physically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Matterport3D: Learning from RGB-D Data in Indoor Environments.
Proceedings of the 2017 International Conference on 3D Vision, 2017

2016
Joint Hand Detection and Rotation Estimation by Using CNN.
CoRR, 2016

2015
LSUN: Construction of a Large-scale Image Dataset using Deep Learning with Humans in the Loop.
CoRR, 2015

TurkerGaze: Crowdsourcing Saliency with Webcam based Eye Tracking.
CoRR, 2015

2014
PanoContext: A Whole-Room 3D Context Model for Panoramic Scene Understanding.
Proceedings of the Computer Vision - ECCV 2014, 2014

2013
FrameBreak: Dramatic Image Extrapolation by Guided Shift-Maps.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013


  Loading...