Dave Zhenyu Chen

Orcid: 0000-0002-3883-1905

According to our database1, Dave Zhenyu Chen authored at least 20 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
AnchorSplat: Feed-Forward 3D Gaussian Splatting with 3D Geometric Priors.
CoRR, April, 2026

Reliev3R: Relieving Feed-forward Reconstruction from Multi-View Geometric Annotations.
CoRR, April, 2026

GAP-MLLM: Geometry-Aligned Pre-training for Activating 3D Spatial Perception in Multimodal Large Language Models.
CoRR, March, 2026

VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detection.
CoRR, March, 2026

Map2Thought: Explicit 3D Spatial Reasoning via Metric Cognitive Maps.
CoRR, January, 2026

2025
WPT: World-to-Policy Transfer via Online World Model Distillation.
CoRR, November, 2025

Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs.
CoRR, June, 2025

Taming Video Diffusion Prior with Scene-Grounding Guidance for 3D Gaussian Splatting from Sparse Inputs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Grounding Natural Language to 3D Scenes.
PhD thesis, 2024

When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models.
CoRR, 2024

EchoScene: Indoor Scene Generation via Information Echo Over Scene Graph Diffusion.
Proceedings of the Computer Vision - ECCV 2024, 2024

SceneTex: High-Quality Texture Synthesis for Indoor Scenes via Diffusion Priors.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Federated Learning via Decentralized Dataset Distillation in Resource-Constrained Edge Environments.
Proceedings of the International Joint Conference on Neural Networks, 2023

Text2Tex: Text-driven Texture Synthesis via Diffusion Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Generating Context-Aware Natural Answers for Questions in 3D Scenes.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
D<sup>3</sup>Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
D3Net: A Speaker-Listener Architecture for Semi-supervised Dense Captioning and Visual Grounding in RGB-D Scans.
CoRR, 2021

Scan2Cap: Context-Aware Dense Captioning in RGB-D Scans.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
ScanRefer: 3D Object Localization in RGB-D Scans Using Natural Language.
Proceedings of the Computer Vision - ECCV 2020, 2020


  Loading...