Haobo Yuan

Orcid: 0000-0001-9770-7720

According to our database¹, Haobo Yuan authored at least 33 papers between 2021 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

ParaCook: On Time-Efficient Planning for Multi-Agent Systems.

[BibT_eX]

[DOI]

CoRR, October, 2025

LSVOS 2025 Challenge Report: Recent Advances in Complex Video Object Segmentation.

[BibT_eX]

[DOI]

CoRR, October, 2025

The 1st Solution for 7th LSVOS RVOS Track: SaSaSa2VA.

[BibT_eX]

[DOI]

CoRR, September, 2025

DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World.

[BibT_eX]

[DOI]

CoRR, June, 2025

PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild.

[BibT_eX]

[DOI]

CoRR, April, 2025

An Empirical Study of GPT-4o Image Generation Capabilities.

[BibT_eX]

[DOI]

CoRR, April, 2025

4th PVUW MeViS 3rd Place Report: Sa2VA.

[BibT_eX]

[DOI]

CoRR, April, 2025

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos.

[BibT_eX]

[DOI]

CoRR, January, 2025

On Path to Multimodal Generalist: General-Level and General-Bench.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Point Cloud Mamba: Point Cloud Learning via State Space Model.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Panoptic-PartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Transformer-Based Visual Segmentation: A Survey.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Multi-Task Learning With Multi-Query Transformer for Dense Prediction.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., February, 2024

Towards Open Vocabulary Learning: A Survey.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2024

LLAVADI: What Matters For Multimodal Large Language Models Distillation.

[BibT_eX]

[DOI]

CoRR, 2024

Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model.

[BibT_eX]

[DOI]

CoRR, 2024

Point Cloud Mamba: Point Cloud Learning via State Space Model.

[BibT_eX]

[DOI]

CoRR, 2024

OMG-Seg: Is One Model Good Enough For All Segmentation?

[BibT_eX]

[DOI]

CoRR, 2024

RAP-SAM: Towards Real-Time All-Purpose Segment Anything.

[BibT_eX]

[DOI]

CoRR, 2024

OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Open-Vocabulary SAM: Segment and Recognize Twenty-Thousand Classes Interactively.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

OMG-Seg: Is One Model Good Enough for all Segmentation?

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Monocular Road Planar Parallax Estimation.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2023

Neural Collapse Terminus: A Unified Solution for Class Incremental Learning and Its Variants.

[BibT_eX]

[DOI]

CoRR, 2023

Tube-Link: A Flexible Cross Tube Baseline for Universal Video Segmentation.

[BibT_eX]

[DOI]

CoRR, 2023

PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation.

[BibT_eX]

[DOI]

CoRR, 2023

Neural Collapse Inspired Feature-Classifier Alignment for Few-Shot Class-Incremental Learning.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022

Multi-Task Learning with Multi-query Transformer for Dense Prediction.

[BibT_eX]

[DOI]

CoRR, 2022

Towards Theoretically Inspired Neural Initialization Optimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

PolyphonicFormer: Unified Query Learning for Depth-Aware Video Panoptic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

BOSSA: A Decentralized System for Proofs of Data Retrievability and Replication.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2021

Haobo Yuan

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...