Wentong Li

Orcid: 0000-0002-2715-0995

Affiliations:
  • Zhejiang University, College of Computer Science and Technology, Hangzhou, China


According to our database1, Wentong Li authored at least 32 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?
CoRR, June, 2025

Large Models are Good Annotators for Zero-Shot Learning.
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Box2Mask: Box-Supervised Instance Segmentation via Level-Set Evolution.
IEEE Trans. Pattern Anal. Mach. Intell., 2024

ReliOcc: Towards Reliable Semantic Occupancy Prediction via Uncertainty Learning.
CoRR, 2024

TokenPacker: Efficient Visual Projector for Multimodal LLM.
CoRR, 2024

Not All Voxels Are Equal: Hardness-Aware Semantic Scene Completion with Self-Distillation.
CoRR, 2024

Label-efficient Semantic Scene Completion with Scribble Annotations.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Osprey: Pixel Understanding with Visual Instruction Tuning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Not All Voxels are Equal: Hardness-Aware Semantic Scene Completion with Self-Distillation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Fine-Grained Multi-View Hand Reconstruction Using Inverse Rendering.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Improving Nighttime Driving-Scene Segmentation via Dual Image-Adaptive Learnable Filters.
IEEE Trans. Circuits Syst. Video Technol., October, 2023

LiDAR2Map: In Defense of LiDAR-Based Semantic Map Construction Using Online Camera Distillation.
CoRR, 2023

Label-efficient Segmentation via Affinity Propagation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Point2Mask: Point-supervised Panoptic Segmentation via Optimal Transport.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

LiDAR2Map: In Defense of LiDAR-Based Semantic Map Construction Using Online Camera Distillation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Translational Symmetry-Aware Facade Parsing for 3-D Building Reconstruction.
IEEE Multim., 2022

Box-supervised Instance Segmentation with Level Set Evolution.
CoRR, 2022

Improving Nighttime Driving-Scene Segmentation via Dual Image-adaptive Learnable Filters.
CoRR, 2022

Box-Supervised Instance Segmentation with Level Set Evolution.
Proceedings of the Computer Vision - ECCV 2022, 2022

Oriented RepPoints for Aerial Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Deep Level Set for Box-supervised Instance Segmentation in Aerial Images.
CoRR, 2021

Translational Symmetry-Aware Facade Parsing for 3D Building Reconstruction.
CoRR, 2021

Oriented RepPoints for Aerial Object Detection.
CoRR, 2021

2020
Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images.
Sensors, 2020

2019
Multi-Scale Object Detection in Satellite Imagery Based On YOLT.
Proceedings of the 2019 IEEE International Geoscience and Remote Sensing Symposium, 2019

S3OD: Single Stage Small Object Detector from Scratch for Remote Sensing Images.
Proceedings of the Image and Graphics - 10th International Conference, 2019

2018
The Analysis Between Traditional Convolution Neural Network and CapsuleNet.
Proceedings of the 2018 International Conference on Control, 2018

Unscented Particle Double Layer Filter.
Proceedings of the 21st International Conference on Information Fusion, 2018


  Loading...