Zhibin Lan

ORCID: 0009-0008-3930-3101

According to our database, Zhibin Lan authored at least 17 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Beyond Chain-of-Thought: Rewrite as a Universal Interface for Generative Multimodal Embeddings.
CoRR, April, 2026

Countering the Over-Reliance Trap: Mitigating Object Hallucination for LVLMs via a Self-Validation Framework.
CoRR, January, 2026

2025
Towards Fine-Grained Code-Switch Speech Translation with Semantic Space Alignment.
CoRR, November, 2025

UME-R1: Exploring Reasoning-Driven Generative Multimodal Embeddings.
CoRR, November, 2025

PATIMT-Bench: A Multi-Scenario Benchmark for Position-Aware Text Image Machine Translation in Large Vision-Language Models.
CoRR, September, 2025

Towards better text image machine translation with multimodal codebook and multi-stage training.
Neural Networks, 2025

A novel signature of cartilage aging-related immunophenotyping biomarkers in osteoarthritis.
Comput. Biol. Medicine, 2025

"I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

PATIMT-Bench: A Multi-Scenario Benchmark for Position-Aware Text Image Machine Translation in Large Vision-Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

AVG-LLaVA: An Efficient Large Multimodal Model with Adaptive Visual Granularity.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
AVG-LLaVA: A Large Multimodal Model with Adaptive Visual Granularity.
CoRR, 2024

A Survey on Multi-modal Machine Translation: Tasks, Methods and Challenges.
CoRR, 2024

Empowering Backbone Models for Visual Text Generation with Input Granularity Control and Glyph-Aware Training.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
FactGen: Faithful Text Generation by Factuality-aware Pre-training and Contrastive Ranking Fine-tuning.
J. Artif. Intell. Res., 2023

Exploring Better Text Image Translation with Multimodal Codebook.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
