Zhenbo Luo
Orcid: 0009-0002-5836-0749
According to our database1,
Zhenbo Luo authored at least 51 papers
between 2014 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
CoRR, May, 2026
CoRR, April, 2026
CoRR, April, 2026
Q-Mask: Query-driven Causal Masks for Text Anchoring in OCR-Oriented Vision-Language Models.
CoRR, April, 2026
CoRR, March, 2026
IMTBench: A Multi-Scenario Cross-Modal Collaborative Evaluation Benchmark for In-Image Machine Translation.
CoRR, March, 2026
CoRR, March, 2026
EMO-R3: Reflective Reinforcement Learning for Emotional Reasoning in Multimodal Large Language Models.
CoRR, February, 2026
CoRR, February, 2026
MSJoE: Jointly Evolving MLLM and Sampler for Efficient Long-Form Video Understanding.
CoRR, February, 2026
CoRR, February, 2026
GeoFocus: Blending Efficient Global-to-Local Perception for Multimodal Geometry Problem-Solving.
CoRR, February, 2026
Video-OPD: Efficient Post-Training of Multimodal Large Language Models for Temporal Video Grounding via On-Policy Distillation.
CoRR, February, 2026
Restoring Exploration after Post-Training: Latent Exploration Decoding for Large Reasoning Models.
CoRR, February, 2026
CoRR, January, 2026
CoRR, January, 2026
AutoLink: Autonomous Schema Exploration and Expansion for Scalable Schema Linking in Text-to-SQL at Scale.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding.
CoRR, November, 2025
REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding.
CoRR, November, 2025
CoRR, October, 2025
Thinking in cocktail party: Chain-of-Thought and reinforcement learning for target speaker automatic speech recognition.
CoRR, September, 2025
Omni-CLST: Error-aware Curriculum Learning with guided Selective chain-of-Thought for audio question answering.
CoRR, September, 2025
Shuffle-R1: Efficient RL framework for Multimodal Large Language Models via Data-centric Dynamic Shuffle.
CoRR, August, 2025
CoRR, May, 2025
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
Let Your Car Listen to Your Respiration Contactlessly with Ubiquitous Acoustic Signals.
Proceedings of the Companion of the 2025 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2025
2020
Pattern Recognit., 2020
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
2018
Auto-painter: Cartoon image generation from sketch by using conditional Wasserstein generative adversarial networks.
Neurocomputing, 2018
R<sup>2</sup> CNN: Rotational Region CNN for Arbitrarily-Oriented Scene Text Detection.
Proceedings of the 24th International Conference on Pattern Recognition, 2018
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
2017
Auto-painter: Cartoon Image Generation from Sketch by Using Conditional Generative Adversarial Networks.
CoRR, 2017
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017
ICDAR2017 Robust Reading Challenge on Multi-Lingual Scene Text Detection and Script Identification - RRC-MLT.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017
Proceedings of the 4th IAPR Asian Conference on Pattern Recognition, 2017
2016
Proceedings of the 15th International Conference on Frontiers in Handwriting Recognition, 2016
Proceedings of the 15th International Conference on Frontiers in Handwriting Recognition, 2016
2014
Enhanced Non-linear Features for On-line Handwriting Recognition Using Deep Learning.
Proceedings of the Neural Information Processing - 21st International Conference, 2014