Tianyu Yu

ORCID: 0000-0001-9752-6655

Affiliations:

Shenzhen International Graduate School, Tsinghua University, Shenzhen, Guangdong, China

According to our database¹, Tianyu Yu authored at least 28 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Bibliography

2026

Deep Pre-Alignment for VLMs.

CoRR, May, 2026

LLaVA-UHD v4: What Makes Efficient Visual Encoding in MLLMs?

CoRR, May, 2026

MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction.

CoRR, April, 2026

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe.

CoRR, April, 2026

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models.

CoRR, January, 2026

Thinking with Blueprints: Assisting Vision-Language Models in Spatial Reasoning via Structured Object Representation.

CoRR, January, 2026

Process Reinforcement through Implicit Rewards.

Trans. Mach. Learn. Res., 2026

2025

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe.

CoRR, September, 2025

RLPR: Extrapolating RLVR to General Domains without Verifiers.

CoRR, June, 2025

Process Reinforcement through Implicit Rewards.

CoRR, February, 2025

EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents.

CoRR, January, 2025

UltraWiki: Ultra-Fine-Grained Entity Set Expansion with Negative Seed Entities.

Proceedings of the 41st IEEE International Conference on Data Engineering, 2025

RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

MiniCPM-V: A GPT-4V Level MLLM on Your Phone.

CoRR, 2024

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness.

CoRR, 2024

UltraWiki: Ultra-fine-grained Entity Set Expansion with Negative Seed Entities.

CoRR, 2024

Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-Grained Correctional Human Feedback.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SeqGPT: An Out-of-the-Box Large Language Model for Open Domain Sequence Understanding.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

MESED: A Multi-Modal Entity Set Expansion Dataset with Fine-Grained Semantic Classes and Hard Negative Entities.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Embracing ambiguity: Improving similarity-oriented tasks with contextual synonym knowledge.

Neurocomputing, October, 2023

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback.

CoRR, 2023

Reformulating Vision-Language Foundation Models and Datasets Towards Universal Multimodal Assistants.

CoRR, 2023

Knowledge-augmented Few-shot Visual Relation Detection.

CoRR, 2023

AutoMTLSpec: Learning to Generate MTL Specifications from Natural Language Contracts.

Proceedings of the 27th International Conference on Engineering of Complex Computer Systems, 2023

Visually Grounded Commonsense Knowledge Acquisition.

Cornelius Weber

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Contrastive Learning with Hard Negative Entities for Entity Set Expansion.

Proceedings of SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11-15, 2022

2020

Cross-Modal Omni Interaction Modeling for Phrase Grounding.

Proceedings of MM '20: The 28th ACM International Conference on Multimedia, 2020
