Shusheng Yang

According to our database, Shusheng Yang authored at least 31 papers between 2021 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

Cambrian-P: Pose-Grounded Video Understanding.

CoRR, May, 2026

Better early detector for high-performance detection transformer.

Image Vis. Comput., 2026

Multi-level reinforcement learning with agent-based simulation for dynamic concrete scheduling in high-speed railway construction.

Appl. Soft Comput., 2026

High-speed railway operational safety indicator assessment under earthquakes: a physics-informed hybrid prediction framework.

Adv. Eng. Informatics, 2026

2025

Cambrian-S: Towards Spatial Supersensing in Video.

CoRR, November, 2025

Benchmark Designers Should "Train on the Test Set" to Expose Exploitable Non-Visual Shortcuts.

CoRR, November, 2025

BLIP3o-NEXT: Next Frontier of Native Image Generation.

CoRR, October, 2025

VideoNSA: Native Sparse Attention Scales Video Understanding.

CoRR, October, 2025

Towards Ambiguity-Free Spatial Foundation Model: Rethinking and Decoupling Depth Ambiguity.

Matthew Johnson-Roberson, Xiaonan Huang

CoRR, March, 2025

Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

ViTMatte: Boosting image matting with pre-trained plain vision transformers.

Inf. Fusion, March, 2024

Biobjective optimization for railway alignment fine-grained designs with parallel existing railways.

Comput. Aided Civ. Infrastructure Eng., February, 2024

Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs.

CoRR, 2024

The static ridesharing routing problem with flexible locations: A Norwegian case study.

Jacob Nitter, Shusheng Yang, Kjetil Fagerholt, Andreas Breivik Ormevik

Comput. Oper. Res., 2024

Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Rethinking Pragmatics in Large Language Models: Towards Open-Ended Evaluation and Preference Tuning.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

MobileInst: Video Instance Segmentation on the Mobile.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

A sequential exploration algorithm for the design optimization of horizontal road alignment.

Comput. Aided Civ. Infrastructure Eng., October, 2023

Qwen Technical Report.

CoRR, 2023

TouchStone: Evaluating Vision-Language Models by Language Models.

CoRR, 2023

Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities.

CoRR, 2023

ViTMatte: Boosting Image Matting with Pretrained Plain Vision Transformers.

CoRR, 2023

Masked Visual Reconstruction in Language Semantic Space.

CoRR, 2023

Masked Image Modeling with Denoising Contrast.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

RILS: Masked Visual Reconstruction in Language Semantic Space.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Relational Surrogate Loss Learning.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Temporally Efficient Vision Transformer for Video Instance Segmentation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Tracking Instances as Queries.

CoRR, 2021

Crossover Learning for Fast Online Video Instance Segmentation.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Instances as Queries.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
