Xin Zhou

Orcid: 0009-0009-4752-6118

Affiliations:
  • Huazhong University of Science and Technology, Wuhan, Hubei, China


According to our database1, Xin Zhou authored at least 17 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Divide and Conquer: Decoupled Representation Alignment for Multimodal World Models.
CoRR, May, 2026

HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation.
CoRR, April, 2026

When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models.
CoRR, April, 2026

PointTPA: Dynamic Network Parameter Adaptation for 3D Scene Understanding.
CoRR, April, 2026

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models.
CoRR, March, 2026

2025
Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2025

Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching.
CoRR, July, 2025

Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving.
CoRR, May, 2025

Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception.
CoRR, March, 2025

The Role of World Models in Shaping Autonomous Driving: A Comprehensive Survey.
CoRR, February, 2025

More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

MINIMA: Modality Invariant Image Matching.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
PointMamba: A Simple State Space Model for Point Cloud Analysis.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

A Unified Framework for 3D Scene Understanding.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Diffusion-Based 3D Object Detection with Random Boxes.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023


  Loading...