Bencheng Liao

According to our database, Bencheng Liao authored at least 29 papers between 2021 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving.

CoRR, April, 2026

Mixture-of-Depths Attention.

CoRR, March, 2026

Better early detector for high-performance detection transformer.

Image Vis. Comput., 2026

2025

DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models.

CoRR, December, 2025

InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models.

CoRR, December, 2025

DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving.

CoRR, December, 2025

MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning.

Int. J. Comput. Vis., September, 2025

Breaking Imitation Bottlenecks: Reinforced Diffusion Powers Diverse Trajectory Generation.

CoRR, July, 2025

MapTRv2: An End-to-End Framework for Online Vectorized HD Map Construction.

Int. J. Comput. Vis., March, 2025

OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models.

CoRR, March, 2025

Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation.

CoRR, February, 2025

RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning.

CoRR, February, 2025

MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Learning accurate monocular 3D voxel representation via bilateral voxel transformer.

Image Vis. Comput., 2024

Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving.

CoRR, 2024

ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention.

CoRR, 2024

MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning.

CoRR, 2024

VADv2: End-to-End Vectorized Autonomous Driving via Probabilistic Planning.

CoRR, 2024

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Lane Graph as Path: Continuity-Preserving Path-Wise Modeling for Online Lane Graph Construction.

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

VMA: Divide-and-Conquer Vectorized Map Annotation System for Large-Scale Driving Scene.

CoRR, 2023

MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

VAD: Vectorized Scene Representation for Efficient Autonomous Driving.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022

Perceive, Interact, Predict: Learning Dynamic and Static Clues for End-to-End Motion Prediction.

CoRR, 2022

2021

Real-time and accurate object detection in compressed video by long short-term feature aggregation.

Comput. Vis. Image Underst., 2021

You Only Look at One Sequence: Rethinking Transformer in Vision through Object Detection.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
