Nimrod Shabtay

Orcid: 0009-0008-3241-0916

According to our database1, Nimrod Shabtay authored at least 14 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Balanced Thinking: Improving Chain of Thought Training in Vision Language Models.
CoRR, March, 2026

Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs.
CoRR, March, 2026

CLIMP: Contrastive Language-Image Mamba Pretraining.
CoRR, January, 2026

Positional Encoding Image Prior.
IEEE Trans. Image Process., 2026

2025
CARES: Context-Aware Resolution Selector for VLMs.
CoRR, October, 2025

Advancing Speech Understanding in Speech-Aware Language Models with GRPO.
CoRR, September, 2025

Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence.
CoRR, February, 2025

Spoken Question Answering for Visual Queries.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

LiveXiv - A Multi-Modal live benchmark based on Arxiv papers content.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Teaching VLMs to Localize Specific Objects from In-Context Examples.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Speech Synthesis From Continuous Features Using Per-Token Latent Diffusion.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

2024
Teaching VLMs to Localize Specific Objects from In-context Examples.
CoRR, 2024

Deep Phase Coded Image Prior.
Proceedings of the IEEE International Conference on Computational Photography, 2024

2022
PIP: Positional-encoding Image Prior.
CoRR, 2022


  Loading...