Yingsen Zeng

According to our database1, Yingsen Zeng authored at least 9 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
ViType: High-Fidelity Visual Text Rendering via Glyph-Aware Multimodal Diffusion.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
DisTime: Distribution-Based Time Representation for Video Large Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Instructseg: Unifying Instructed Visual Segmentation with Multi-Modal Large Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024
InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models.
CoRR, 2024

LinVT: Empower Your Image-level Large Language Model to Understand Videos.
CoRR, 2024

UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

2019
Efficient Dual Attention Module for Real-Time Visual Tracking.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

Enhanced Semantic Features via Attention for Real-Time Visual Tracking.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

Learning Spatial-Channel Attention for Visual Tracking.
Proceedings of the 2019 IEEE/CIC International Conference on Communications in China, 2019


  Loading...