Weixin Mao

Orcid: 0000-0002-0444-1079

According to our database1, Weixin Mao authored at least 24 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
BFA++: Hierarchical Best-Feature-Aware Token Prune for Multi-View Vision Language Action Model.
IEEE Robotics Autom. Lett., May, 2026

EMKG: Embodied Memory Knowledge Graphs for Object-Goal Navigation in Dynamic Open Worlds.
IEEE Robotics Autom. Lett., April, 2026

Long-Term Memory for VLA-based Agents in Open-World Task Execution.
CoRR, April, 2026

ARM: Advantage Reward Modeling for Long-Horizon Manipulation.
CoRR, April, 2026

2025
See Once, Then Act: Vision-Language-Action Model with Task Learning from One-Shot Video Demonstrations.
CoRR, December, 2025

VAT: Vision Action Transformer by Unlocking Full Representation of ViT.
CoRR, December, 2025

BFA: Best-Feature-Aware Fusion for Multi-View Fine-Grained Manipulation.
IEEE Robotics Autom. Lett., September, 2025

PADriver: Towards Personalized Autonomous Driving.
CoRR, May, 2025

PillarNeSt: Embracing Backbone Scaling and Pretraining for Pillar-Based 3D Object Detection.
IEEE Trans. Intell. Veh., April, 2025

PADriver: Towards Personalized Autonomous Driving.
Proceedings of the International Joint Conference on Neural Networks, 2025

SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Exploring Recurrent Long-Term Temporal Fusion for Multi-View 3D Perception.
IEEE Robotics Autom. Lett., July, 2024

Seeing both sides: context-aware heterogeneous graph matching networks for extracting-related arguments.
Neural Comput. Appl., 2024

Multi-GraspLLM: A Multimodal LLM for Multi-Hand Semantic Guided Grasp Generation.
CoRR, 2024

RoboMatrix: A Skill-centric Hierarchical Framework for Scalable Robot Task Planning and Execution in Open-World.
CoRR, 2024

SegGrasp: Zero-Shot Task-Oriented Grasping via Semantic and Geometric Guided Segmentation.
CoRR, 2024

Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving?
CoRR, 2024

Stream Query Denoising for Vectorized HD-Map Construction.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
ADriver-I: A General World Model for Autonomous Driving.
CoRR, 2023

GMM: Delving into Gradient Aware and Model Perceive Depth Mining for Monocular 3D Detection.
CoRR, 2023

DBQ-SSD: Dynamic Ball Query for Efficient 3D Object Detection.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Towards 3D Object Detection with 2D Supervision.
CoRR, 2022

PersDet: Monocular 3D Detection in Perspective Bird's-Eye-View.
CoRR, 2022

Dense Teacher: Dense Pseudo-Labels for Semi-supervised Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022


  Loading...