Yuanhao Cai

Orcid: 0009-0008-2077-4915

According to our database1, Yuanhao Cai authored at least 56 papers between 2020 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Active View Selection with Perturbed Gaussian Ensemble for Tomographic Reconstruction.
CoRR, March, 2026

Contrastive Learning Feature Enhancement and High-Low Frequency Texture Interaction Networks for DIBR-Synthesized View Quality Assessment.
IEEE Trans. Artif. Intell., February, 2026

Efficient Autoregressive Video Diffusion with Dummy Head.
CoRR, January, 2026

Semi-Wavelet Attention Transformer-Based MPI Reconstruction Algorithm.
Proceedings of the 23rd IEEE International Symposium on Biomedical Imaging, 2026

Explainable Variational Networks with Unrolled Gradient Descent for Magnetic Particle Imaging Reconstruction.
Proceedings of the 23rd IEEE International Symposium on Biomedical Imaging, 2026

X-LRM: X-Ray Large Reconstruction Model for Extremely Sparse-View Computed Tomography Recovery in One Second.
Proceedings of the International Conference on 3D Visio, 2026

VideoLifter: Lifting Videos to 3D with Fast and Efficient Hierarchical Stereo Alignment.
Proceedings of the International Conference on 3D Visio, 2026

2025
PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation.
CoRR, December, 2025

DenoiseGS: Gaussian Reconstruction Model for Burst Denoising.
CoRR, November, 2025

Flow-Matching Guided Deep Unfolding for Hyperspectral Image Reconstruction.
CoRR, October, 2025

EditVerse: Unifying Image and Video Editing and Generation with In-Context Learning.
CoRR, September, 2025

OmniVCus: Feedforward Subject-driven Video Customization with Multimodal Control Conditions.
CoRR, June, 2025

Are Pixel-Wise Metrics Reliable for Sparse-View Computed Tomography Reconstruction?
CoRR, June, 2025

X<sup>2</sup>-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction.
CoRR, March, 2025

Motion-X++: A Large-Scale Multimodal 3D Whole-body Human Motion Dataset.
CoRR, January, 2025

VideoLifter: Lifting Videos to 3D with Fast Hierarchical Stereo Alignment.
CoRR, January, 2025

Study on vehicle state and tire-road friction coefficient estimation based on maximum correntropy generalized high-degree cubature Kalman filter.
Trans. Inst. Meas. Control, 2025

Research on the Stability Control of Four-Wheel Steering for Distributed Drive Electric Vehicles.
Symmetry, 2025

Robotic grinding and polishing of complex aeroengine blades based on new device design and variable impedance control.
Robotics Comput. Integr. Manuf., 2025

Blind DIBR-synthesized view quality assessment by integrating local geometry and global structure analysis.
Displays, 2025

LucidFusion: Reconstructing 3D Gaussians with Arbitrary Unposed Images.
Comput. Graph. Forum, 2025

LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

OmniVCus: Feedforward Subject-driven Video Customization with Multimodal Control Conditions.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

$\mathbf{X}^{\mathbf{2}}$-Gaussian: 4D Radiative Gaussian Splatting for Continuous-Time Tomographic Reconstruction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Baking Gaussian Splatting Into Diffusion Denoiser for Fast and Scalable Single-Stage Image-to-3D Generation and Reconstruction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024
Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation.
CoRR, 2024

LucidFusion: Generating 3D Gaussians with Arbitrary Unposed Images.
CoRR, 2024

NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results.
CoRR, 2024

R<sup>2</sup>-Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic Reconstruction.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

ITportrait: Image-Text Coupled 3D Portrait Domain Adaptation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Unveiling Advanced Frequency Disentanglement Paradigm for Low-Light Image Enhancement.
Proceedings of the Computer Vision - ECCV 2024, 2024

Radiative Gaussian Splatting for Efficient X-Ray Novel View Synthesis.
Proceedings of the Computer Vision - ECCV 2024, 2024

NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Structure-Aware Sparse-View X-Ray 3D Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
dMIL-Transformer: Multiple Instance Learning Via Integrating Morphological and Spatial Information for Lymph Node Metastasis Classification.
IEEE J. Biomed. Health Informatics, September, 2023

3D Face Arbitrary Style Transfer.
CoRR, 2023

Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Binarized Spectral Compressive Imaging.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Retinexformer: One-stage Retinex-based Transformer for Low-light Image Enhancement.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
RFormer: Transformer-Based Generative Adversarial Network for Real Fundus Image Restoration on a New Clinical Benchmark.
IEEE J. Biomed. Health Informatics, 2022

RFormer: Transformer-based Generative Adversarial Network for Real Fundus Image Restoration on A New Clinical Benchmark.
CoRR, 2022

Degradation-Aware Unfolding Half-Shuffle Transformer for Spectral Compressive Imaging.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Unsupervised Flow-Aligned Sequence-to-Sequence Learning for Video Restoration.
Proceedings of the International Conference on Machine Learning, 2022

Flow-Guided Sparse Transformer for Video Deblurring.
Proceedings of the International Conference on Machine Learning, 2022

Coarse-to-Fine Sparse Transformer for Hyperspectral Image Reconstruction.
Proceedings of the Computer Vision - ECCV 2022, 2022

HDNet: High-resolution Dual-domain Learning for Spectral Compressive Imaging.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MST++: Multi-stage Spectral-wise Transformer for Efficient Spectral Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Mask-guided Spectral-wise Transformer for Efficient Hyperspectral Image Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022


2021
Learning to Generate Realistic Noisy Images via Pixel-level Noise-aware Adversarial Training.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Multi-Scale Selective Feedback Network with Dual Loss for Real Image Denoising.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Efficient Human Pose Estimation by Learning Deeply Aggregated Representations.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Pyramid Orthogonal Attention Network based on Dual Self-Similarity for Accurate Mr Image Super-Resolution.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Pseudo 3D Auto-Correlation Network for Real Image Denoising.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Learning Delicate Local Representations for Multi-person Pose Estimation.
Proceedings of the Computer Vision - ECCV 2020, 2020


  Loading...