Bencheng Liao

According to our database, Bencheng Liao authored at least 29 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving.
CoRR, April, 2026

Mixture-of-Depths Attention.
CoRR, March, 2026

Better early detector for high-performance detection transformer.
Image Vis. Comput., 2026

2025
DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models.
CoRR, December, 2025

InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models.
CoRR, December, 2025

DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving.
CoRR, December, 2025

MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning.
Int. J. Comput. Vis., September, 2025

Breaking Imitation Bottlenecks: Reinforced Diffusion Powers Diverse Trajectory Generation.
CoRR, July, 2025

MapTRv2: An End-to-End Framework for Online Vectorized HD Map Construction.
Int. J. Comput. Vis., March, 2025

OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models.
CoRR, March, 2025

Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation.
CoRR, February, 2025

RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning.
CoRR, February, 2025

MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Learning accurate monocular 3D voxel representation via bilateral voxel transformer.
Image Vis. Comput., 2024

Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving.
CoRR, 2024

ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention.
CoRR, 2024

MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning.
CoRR, 2024

VADv2: End-to-End Vectorized Autonomous Driving via Probabilistic Planning.
CoRR, 2024

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Lane Graph as Path: Continuity-Preserving Path-Wise Modeling for Online Lane Graph Construction.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
VMA: Divide-and-Conquer Vectorized Map Annotation System for Large-Scale Driving Scene.
CoRR, 2023

MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

VAD: Vectorized Scene Representation for Efficient Autonomous Driving.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Perceive, Interact, Predict: Learning Dynamic and Static Clues for End-to-End Motion Prediction.
CoRR, 2022

2021
Real-time and accurate object detection in compressed video by long short-term feature aggregation.
Comput. Vis. Image Underst., 2021

You Only Look at One Sequence: Rethinking Transformer in Vision through Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
