Amir Bar

According to our database, Amir Bar authored at least 46 papers between 2010 and 2026.

Bibliography

2026
Lifting Embodied World Models for Planning and Control.
CoRR, April, 2026

Hierarchical Planning with Latent World Models.
CoRR, April, 2026

V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning.
CoRR, March, 2026

Beyond Language Modeling: An Exploration of Multimodal Pretraining.
CoRR, March, 2026

A Lightweight Library for Energy-Based Joint-Embedding Predictive Architectures.
CoRR, February, 2026

Grounding Generated Videos in Feasible Plans via World Models.
CoRR, February, 2026

Parallel Stochastic Gradient-Based Planning for World Models.
CoRR, February, 2026

DeFM: Learning Foundation Representations from Depth for Robotics.
CoRR, January, 2026

2025
World Models Can Leverage Human Videos for Dexterous Manipulation.
CoRR, December, 2025

TV2TV: A Unified Framework for Interleaved Language and Video Generation.
CoRR, December, 2025

From Generated Human Videos to Physically Plausible Robot Trajectories.
CoRR, December, 2025

OpenApps: Simulating Environment Variations to Measure UI-Agent Reliability.
CoRR, November, 2025

Whole-Body Conditioned Egocentric Video Prediction.
CoRR, June, 2025

Forgotten Polygons: Multimodal Large Language Models are Shape-Blind.
CoRR, February, 2025

Vision-Language Models Create Cross-Modal Task Representations.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Scaling Language-Free Visual Representation Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Pixels Versus Priors: Controlling Knowledge Priors in Vision-Language Models through Visual Counterfacts.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Navigation World Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Forgotten Polygons: Multimodal Large Language Models are Shape-Blind.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks.
Trans. Mach. Learn. Res., 2024

Task Vectors are Cross-Modal.
CoRR, 2024

Stochastic positional embeddings improve masked image modeling.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Finding Visual Task Vectors.
Proceedings of the Computer Vision - ECCV 2024, 2024

EgoPet: Egomotion and Interaction Data from an Animal's Perspective.
Proceedings of the Computer Vision - ECCV 2024, 2024

Sequential Modeling Enables Scalable Learning for Large Vision Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Object-based (yet Class-agnostic) Video Domain Adaptation.
CoRR, 2023

Predicting masked tokens in stochastic locations improves masked image modeling.
CoRR, 2023

A Cookbook of Self-Supervised Learning.
CoRR, 2023

2022
Structured Video Tokens @ Ego4D PNR Temporal Localization Challenge 2022.
CoRR, 2022

Using Machine Learning to Identify Intravenous Contrast Phases on Computed Tomography.
Comput. Methods Programs Biomed., 2022

Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Visual Prompting via Image Inpainting.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Object-Region Video Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DETReg: Unsupervised Pretraining with Region Priors for Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Learning Interclass Relations for Intravenous Contrast Phase Classification in CT.
Proceedings of the Medical Imaging with Deep Learning, 7-9 July 2021, Lübeck, Germany, 2021

Compositional Video Synthesis with Action Graphs.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Compositional Video Synthesis with Action Graphs.
CoRR, 2020

Learning Interclass Relations for Image Classification.
CoRR, 2020

3D Convolutional Sequence to Sequence Model for Vertebral Compression Fractures Identification in CT.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Learning Canonical Representations for Scene Graph to Image Generation.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
PHT-bot: a deep learning based system for automatic risk stratification of COPD patients based upon signs of pulmonary hypertension.
Proceedings of the Medical Imaging 2019: Computer-Aided Diagnosis, San Diego, 2019

Improved ICH Classification Using Task-Dependent Learning.
Proceedings of the 16th IEEE International Symposium on Biomedical Imaging, 2019

Learning Individual Styles of Conversational Gesture.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2017
Language Generation with Recurrent Generative Adversarial Networks without Pre-training.
CoRR, 2017

Compression fractures detection on CT.
Proceedings of the Medical Imaging 2017: Computer-Aided Diagnosis, 2017

2010
Widespread Compensatory Evolution Conserves DNA-Encoded Nucleosome Organization in Yeast.
PLoS Comput. Biol., 2010

