Song Wang

Orcid: 0000-0002-8758-7988

Affiliations:
  • Zhejiang University, Hangzhou, China


According to our database1, Song Wang authored at least 39 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
AI for Auto-Research: Roadmap & User Guide.
CoRR, May, 2026

AdaSFormer: Adaptive Serialized Transformers for Monocular Semantic Scene Completion from Indoor Environments.
CoRR, March, 2026

VisionTrim: Unified Vision Token Compression for Training-Free MLLM Acceleration.
CoRR, January, 2026

GUIDE: Gaussian Unified Instance Detection for Enhanced Obstacle Perception in Autonomous Driving.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems.
CoRR, December, 2025

Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future.
CoRR, December, 2025

RoboCOIN: An Open-Sourced Bimanual Robotic Data COllection for INtegrated Manipulation.
CoRR, November, 2025

OneOcc: Semantic Occupancy Prediction for Legged Robots with a Single Panoramic Camera.
CoRR, November, 2025

Offboard Occupancy Refinement With Hybrid Propagation for Autonomous Driving.
IEEE Trans. Intell. Transp. Syst., October, 2025

TokenPacker: Efficient Visual Projector for Multimodal LLM.
Int. J. Comput. Vis., October, 2025

RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning.
CoRR, October, 2025

3D and 4D World Modeling: A Survey.
CoRR, September, 2025

PixelThink: Towards Efficient Chain-of-Pixel Reasoning.
CoRR, May, 2025

Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps.
CoRR, May, 2025

Event-aided Semantic Scene Completion.
CoRR, February, 2025

A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

Reliable and Calibrated Semantic Occupancy Prediction by Hybrid Uncertainty Learning.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

PianoMotion10M: Dataset and Benchmark for Hand Motion Generation in Piano Performance.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SAM4D: Segment Anything in Camera and LiDAR Streams.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Monocular Semantic Scene Completion via Masked Recurrent Networks.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Domain Adaptation Transformer for Unsupervised Driving-Scene Segmentation in Adverse Conditions.
IEEE Trans. Intell. Transp. Syst., December, 2024

DTCLMapper: Dual Temporal Consistent Learning for Vectorized HD Map Construction.
IEEE Trans. Intell. Transp. Syst., December, 2024

ReliOcc: Towards Reliable Semantic Occupancy Prediction via Uncertainty Learning.
CoRR, 2024

GenMapping: Unleashing the Potential of Inverse Perspective Mapping for Robust Online HD Map Construction.
CoRR, 2024

TokenPacker: Efficient Visual Projector for Multimodal LLM.
CoRR, 2024

Not All Voxels Are Equal: Hardness-Aware Semantic Scene Completion with Self-Distillation.
CoRR, 2024

OccFiner: Offboard Occupancy Refinement with Hybrid Propagation.
CoRR, 2024

Label-efficient Semantic Scene Completion with Scribble Annotations.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Not All Voxels are Equal: Hardness-Aware Semantic Scene Completion with Self-Distillation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
LiDAR2Map: In Defense of LiDAR-Based Semantic Map Construction Using Online Camera Distillation.
CoRR, 2023

Label-efficient Segmentation via Affinity Propagation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Point2Mask: Point-supervised Panoptic Segmentation via Optimal Transport.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

LiDAR2Map: In Defense of LiDAR-Based Semantic Map Construction Using Online Camera Distillation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Meta-RangeSeg: LiDAR Sequence Semantic Segmentation Using Multiple Feature Aggregation.
IEEE Robotics Autom. Lett., 2022


  Loading...