Botian Shi

Orcid: 0000-0003-3677-7252

According to our database1, Botian Shi authored at least 41 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
SensorX2Vehicle: Online Sensors-to-Vehicle Rotation Calibration Methods in Road Scenarios.
IEEE Robotics Autom. Lett., 2024

Human-Like Decision Making at Unsignalized Intersections Using Social Value Orientation.
IEEE Intell. Transp. Syst. Mag., 2024

ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning.
CoRR, 2024

OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving.
CoRR, 2024

LimSim++: A Closed-Loop Platform for Deploying Multimodal LLMs in Autonomous Driving.
CoRR, 2024

2023
Multi-Sensor Fusion and Cooperative Perception for Autonomous Driving: A Review.
IEEE Intell. Transp. Syst. Mag., 2023

Realistic Rainy Weather Simulation for LiDARs in CARLA Simulator.
CoRR, 2023

Towards Knowledge-driven Autonomous Driving.
CoRR, 2023

SceneDM: Scene-level Multi-agent Trajectory Generation with Consistent Diffusion Models.
CoRR, 2023

On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving.
CoRR, 2023

DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models.
CoRR, 2023

StructChart: Perception, Structuring, Reasoning for Visual Chart Understanding.
CoRR, 2023

SPOT: Scalable 3D Pre-training via Occupancy Prediction for Autonomous Driving.
CoRR, 2023

ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation.
CoRR, 2023

TrafficMCTS: A Closed-Loop Traffic Flow Generation Framework with Group-Based Monte Carlo Tree Search.
CoRR, 2023

Drive Like a Human: Rethinking Autonomous Driving with Large Language Models.
CoRR, 2023

DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds.
CoRR, 2023

StreetSurf: Extending Multi-view Implicit Surface Reconstruction to Street Views.
CoRR, 2023

AD-PT: Autonomous Driving Pre-Training with Large-scale Point Cloud Dataset.
CoRR, 2023

SUG: Single-dataset Unified Generalization for 3D Point Cloud Classification.
CoRR, 2023

LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross-Modal Fusion.
CoRR, 2023

AD-PT: Autonomous Driving Pre-Training with Large-scale Point Cloud Dataset.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

RangePerception: Taming LiDAR Range View for Efficient and Accurate 3D Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

SUG: Single-dataset Unified Generalization for 3D Point Cloud Classification.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Uni3D: A Unified Baseline for Multi-Dataset 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Bi3D: Bi-Domain Active Learning for Cross-Domain 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross- Modal Fusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

LWSIS: LiDAR-Guided Weakly Supervised Instance Segmentation for Autonomous Driving.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
ADAS: A Simple Active-and-Adaptive Baseline for Cross-Domain 3D Semantic Segmentation.
CoRR, 2022

Multi-modal Sensor Fusion for Auto Driving Perception: A Survey.
CoRR, 2022

Learning Cross-Image Object Semantic Relation in Transformer for Few-Shot Fine-Grained Image Classification.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Hashing based Efficient Inference for Image-Text Matching.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
A Benchmark for Structured Procedural Knowledge Extraction from Cooking Videos.
CoRR, 2020

UniViLM: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation.
CoRR, 2020

Learning Semantic Concepts and Temporal Alignment for Narrated Video Procedural Captioning.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Functionality Discovery and Prediction of Physical Objects.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Microsoft Concept Graph: Mining Semantic Concepts for Short Text Understanding.
Data Intell., 2019

Knowledge Aware Semantic Concept Expansion for Image-Text Matching.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Dense Procedure Captioning in Narrated Instructional Videos.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019


  Loading...