Tianyu Yu

Orcid: 0000-0001-9752-6655

Affiliations:
  • Shenzhen International Graduate School, Tsinghua University, Shenzhen, Guangdong, China


According to our database1, Tianyu Yu authored at least 25 papers between 2020 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction.
CoRR, April, 2026

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe.
CoRR, April, 2026

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models.
CoRR, January, 2026

Thinking with Blueprints: Assisting Vision-Language Models in Spatial Reasoning via Structured Object Representation.
CoRR, January, 2026

2025
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe.
CoRR, September, 2025

RLPR: Extrapolating RLVR to General Domains without Verifiers.
CoRR, June, 2025

Process Reinforcement through Implicit Rewards.
CoRR, February, 2025

EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents.
CoRR, January, 2025

UltraWiki: Ultra-Fine-Grained Entity Set Expansion with Negative Seed Entities.
Proceedings of the 41st IEEE International Conference on Data Engineering, 2025

RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
MiniCPM-V: A GPT-4V Level MLLM on Your Phone.
CoRR, 2024

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness.
CoRR, 2024

UltraWiki: Ultra-fine-grained Entity Set Expansion with Negative Seed Entities.
CoRR, 2024

Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-Grained Correctional Human Feedback.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SeqGPT: An Out-of-the-Box Large Language Model for Open Domain Sequence Understanding.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

MESED: A Multi-Modal Entity Set Expansion Dataset with Fine-Grained Semantic Classes and Hard Negative Entities.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Embracing ambiguity: Improving similarity-oriented tasks with contextual synonym knowledge.
Neurocomputing, October, 2023

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback.
CoRR, 2023

Reformulating Vision-Language Foundation Models and Datasets Towards Universal Multimodal Assistants.
CoRR, 2023

Knowledge-augmented Few-shot Visual Relation Detection.
CoRR, 2023

AutoMTLSpec: Learning to Generate MTL Specifications from Natural Language Contracts.
Proceedings of the 27th International Conference on Engineering of Complex Computer Systems, 2023

Visually Grounded Commonsense Knowledge Acquisition.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Contrastive Learning with Hard Negative Entities for Entity Set Expansion.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

2020
Cross-Modal Omni Interaction Modeling for Phrase Grounding.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020


  Loading...