Idan Schwartz

According to our database, Idan Schwartz authored at least 27 papers between 2017 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2026

Detection-Driven Object Count Optimization for Text-to-Image Diffusion Models.

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026

2025

TempoControl: Temporal Attention Guidance for Text-to-Video Models.

Shira Schiber

Ofir Lindenbaum

Idan Schwartz

CoRR, October, 2025

Single Image Iterative Subject-driven Generation and Editing.

Yair Shpitzer

Gal Chechik

Idan Schwartz

CoRR, March, 2025

2024

Iterative Object Count Optimization for Text-to-image Diffusion Models.

Oz Zafar

Lior Wolf

Idan Schwartz

CoRR, 2024

Improving Visual Commonsense in Language Models via Multiple Image Generation.

CoRR, 2024

Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation.

CoRR, 2023

Discriminative Class Tokens for Text-to-Image Diffusion Models.

Idan Schwartz

Vésteinn Snæbjarnarson

CoRR, 2023

Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Discriminative Class Tokens for Text-to-Image Diffusion Models.

Idan Schwartz

Vésteinn Snæbjarnarson

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Zero-Shot Video Captioning by Evolving Pseudo-tokens.

Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022

Cognitive Models in Deep Learning.

Idan Schwartz

PhD thesis, 2022

Zero-Shot Video Captioning with Evolving Pseudo-Tokens.

CoRR, 2022

Video and Text Matching with Conditioned Embeddings.

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Optimizing Relevance Maps of Vision Transformers Improves Robustness.

Hila Chefer

Idan Schwartz

Lior Wolf

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Ordered Attention for Coherent Visual Storytelling.

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Describing Sets of Images with Textual-PCA.

Oded Hupert

Idan Schwartz

Lior Wolf

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Latent Space Explanation by Intervention.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic.

CoRR, 2021

Towards Coherent Visual Storytelling with Ordered Image Attention.

CoRR, 2021

Perceptual Score: What Data Modalities Does Your Model Perceive?

Itai Gat

Idan Schwartz

Alexander G. Schwing

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Ensemble of MRR and NDCG models for Visual Dialog.

Idan Schwartz

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

2020

Removing Bias in Multi-modal Classifiers: Regularization by Maximizing Functional Entropies.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019

Factor Graph Attention.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

A Simple Baseline for Audio-Visual Scene-Aware Dialog.

Idan Schwartz

Alexander G. Schwing

Tamir Hazan

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2017

High-Order Attention Models for Visual Question Answering.

Idan Schwartz

Alexander G. Schwing

Tamir Hazan

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
