Zhibin Lan

Orcid: 0009-0008-3930-3101

According to our database, Zhibin Lan authored at least 17 papers between 2023 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

Beyond Chain-of-Thought: Rewrite as a Universal Interface for Generative Multimodal Embeddings.

CoRR, April, 2026

Countering the Over-Reliance Trap: Mitigating Object Hallucination for LVLMs via a Self-Validation Framework.

CoRR, January, 2026

2025

Towards Fine-Grained Code-Switch Speech Translation with Semantic Space Alignment.

CoRR, November, 2025

UME-R1: Exploring Reasoning-Driven Generative Multimodal Embeddings.

CoRR, November, 2025

PATIMT-Bench: A Multi-Scenario Benchmark for Position-Aware Text Image Machine Translation in Large Vision-Language Models.

CoRR, September, 2025

Towards better text image machine translation with multimodal codebook and multi-stage training.

Neural Networks, 2025

A novel signature of cartilage aging-related immunophenotyping biomarkers in osteoarthritis.

Comput. Biol. Medicine, 2025

"I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities.

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

PATIMT-Bench: A Multi-Scenario Benchmark for Position-Aware Text Image Machine Translation in Large Vision-Language Models.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

AVG-LLaVA: An Efficient Large Multimodal Model with Adaptive Visual Granularity.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

AVG-LLaVA: A Large Multimodal Model with Adaptive Visual Granularity.

CoRR, 2024

A Survey on Multi-modal Machine Translation: Tasks, Methods and Challenges.

CoRR, 2024

Empowering Backbone Models for Visual Text Generation with Input Granularity Control and Glyph-Aware Training.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

FactGen: Faithful Text Generation by Factuality-aware Pre-training and Contrastive Ranking Fine-tuning.

J. Artif. Intell. Res., 2023

Exploring Better Text Image Translation with Multimodal Codebook.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
