Tongtian Yue

Orcid: 0000-0001-5774-4084

According to our database1, Tongtian Yue authored at least 15 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
LaVi: Efficient Large Vision-Language Models via Internal Feature Modulation.
CoRR, June, 2025

Prefix Grouper: Efficient GRPO Training through Shared-Prefix Forward.
CoRR, June, 2025

Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities.
CoRR, April, 2025

Efficient Motion-Aware Video MLLM.
CoRR, March, 2025

ChatSearch: A dataset and a generative retrieval model for general conversational image retrieval.
Pattern Recognit., 2025

Needle In A Video Haystack: A Scalable Synthetic Evaluator for Video MLLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Ada-K Routing: Boosting the Efficiency of MoE-based LLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Efficient Motion-Aware Video MLLM.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
EEGPT: Unleashing the Potential of EEG Generalist Foundation Model by Autoregressive Pre-training.
CoRR, 2024

Needle In A Video Haystack: A Scalable Synthetic Framework for Benchmarking Video MLLMs.
CoRR, 2024

Collaborative Training of Tiny-Large Vision Language Models.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

SC- Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

OneDiff: A Generalist Model for Image Difference Captioning.
Proceedings of the Computer Vision - ACCV 2024, 2024

2023
ChatBridge: Bridging Modalities with Large Language Model as a Language Catalyst.
CoRR, 2023


  Loading...