Xuweiyi Chen

According to our database1, Xuweiyi Chen authored at least 15 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
WildRayZer: Self-supervised Large View Synthesis in Dynamic Environments.
CoRR, January, 2026

Open Vocabulary Monocular 3D Object Detection.
Proceedings of the International Conference on 3D Visio, 2026

Semantic-Free Procedural 3D Shapes are Surprisingly Good Teachers.
Proceedings of the International Conference on 3D Visio, 2026

2025
Next-Embedding Prediction Makes Strong Vision Learners.
CoRR, December, 2025

Empowering Dynamic Urban Navigation with Stereo and Mid-Level Vision.
CoRR, December, 2025

SAB3R: Semantic-Augmented Backbone in 3D Reconstruction.
CoRR, June, 2025

Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts.
CoRR, May, 2025

Frame In-N-Out: Unbounded Controllable Image-to-Video Generation.
CoRR, May, 2025

4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Probing the Mid-level Vision Capabilities of Self-Supervised Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control.
Trans. Mach. Learn. Res., 2024

Learning 3D Representations from Procedural 3D Programs.
CoRR, 2024

Multi-Object Hallucination in Vision Language Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024


  Loading...