Shoubin Yu
ORCID: 0009-0006-1670-0054
According to our database, Shoubin Yu authored at least 27 papers between 2021 and 2026.
Bibliography
2026
CoRR, March, 2026
Balancing Faithfulness and Performance in Reasoning via Multi-Listener Soft Execution.
CoRR, February, 2026
When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning.
CoRR, February, 2026
A Novel Approach to Evaluating the Effectiveness of Large Language Models for Multimodal Analysis of Embodied Learning in Classrooms.
Proceedings of the LAK26: 16th International Learning Analytics and Knowledge Conference, 2026
2025
Prune-Then-Plan: Step-Level Calibration for Stable Frontier Exploration in Embodied Question Answering.
CoRR, November, 2025
CoRR, October, 2025
CoRR, June, 2025
CoRR, June, 2025
Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization.
CoRR, April, 2025
A Multimodal Classroom Video Question-Answering Framework for Automated Understanding of Collaborative Learning.
Proceedings of the 27th International Conference on Multimodal Interaction, 2025
CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2024
Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection.
IEEE Trans. Circuits Syst. Video Technol., August, 2024
CoRR, 2024
CREMA: Multimodal Compositional Video Reasoning via Efficient Modular Adaptation and Fusion.
CoRR, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
2021
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021