Mohamed Elhoseiny

This page is a disambiguation page, it actually contains multiple papers from persons of the same or a similar name.

Known people with the same name:

Bibliography

2026
Afford-VLA: Action-Aligned Visual Planning via Internalized Affordance.
CoRR, May, 2026

CompoSE: Compositional Synthesis and Editing of 3D Shapes via Part-Aware Control.
CoRR, May, 2026

Cross-Layer Energy Analysis of Multimodal Training on Grace Hopper Superchips.
CoRR, May, 2026

Freshness-Aware Prioritized Experience Replay for LLM/VLM Reinforcement Learning.
CoRR, April, 2026

The Second Challenge on Cross-Domain Few-Shot Object Detection at NTIRE 2026: Methods and Results.
CoRR, April, 2026

M-MiniGPT4: Multilingual VLLM Alignment via Translated Data.
CoRR, March, 2026

Sketch2Stitch: GANs for Abstract Sketch-Based Dress Synthesis.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026

2025
Step-by-step Layered Design Generation.
CoRR, December, 2025

FishNet++: Analyzing the capabilities of Multimodal Large Language Models in marine biology.
CoRR, September, 2025

Category-level Text-to-Image Retrieval Improved: Bridging the Domain Gap with Diffusion Models and Vision Encoders.
CoRR, September, 2025

Time Blindness: Why Video-Language Models Can't See What Humans Can?
CoRR, May, 2025

3DCoMPaT200: Language-Grounded Compositional Understanding of Parts and Materials of 3D Shapes.
CoRR, January, 2025

ToddlerDiffusion: Interactive Structured Image Generation with Cascaded Schrödinger Bridge.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Towards AI-Assisted Psychotherapy: Emotion-Guided Generative Interventions.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
AI Art Neural Constellation: Revealing the Collective and Contrastive State of AI-Generated and Human Art.
CoRR, 2024

3DCoMPaT200: Language Grounded Large-Scale 3D Vision Dataset for Compositional Recognition.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

CoT3DRef: Chain-of-Thoughts Data-Efficient 3D Visual Grounding.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
ToddlerDiffusion: Flash Interpretable Controllable Diffusion Model.
CoRR, 2023

2022
It is Okay to Not Be Okay: Overcoming Emotional Bias in Affective Image Captioning by Contrastive Data Collection.
CoRR, 2022

Look Around and Refer: 2D Synthetic Semantics Knowledge Distillation for 3D Visual Grounding.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Creative Walk Adversarial Networks: Novel Art Generation with Probabilistic Random Walk Deviation from Style Norms.
Proceedings of the 13th International Conference on Computational Creativity, 2022

2021
CIZSL++: Creativity Inspired Generative Zero-Shot Learning.
CoRR, 2021

Class Normalization for (Continual)? Generalized Zero-Shot Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

Aligning Latent and Image Spaces to Connect the Unconnectable.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Wölfflin's Affective Generative Analysis for Visual Art.
Proceedings of the Twelfth International Conference on Computational Creativity, 2021

2019
Video Object Segmentation using Teacher-Student Adaptation in a Human Robot Interaction (HRI) Setting.
Proceedings of the International Conference on Robotics and Automation, 2019

GDPP: Learning Diverse Generations using Determinantal Point Processes.
Proceedings of the 36th International Conference on Machine Learning, 2019

Creativity Inspired Zero-Shot Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
GDPP: Learning Diverse Generations Using Determinantal Point Process.
CoRR, 2018

Video Segmentation using Teacher-Student Adaptation in a Human Robot Interaction (HRI) Setting.
CoRR, 2018

Choose Your Neuron: Incorporating Domain Knowledge Through Neuron-Importance.
Proceedings of the Computer Vision - ECCV 2018, 2018

DesIGN: Design Inspiration from Generative Networks.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

A Generative Adversarial Approach for Zero-Shot Learning From Noisy Texts.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

The Shape of Art History in the Eyes of the Machine.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Imagine it for me: Generative Adversarial Approach for Zero-Shot Learning from Noisy Texts.
CoRR, 2017

Overlapping Cover Local Regression Machines.
CoRR, 2017

CAN: Creative Adversarial Networks, Generating "Art" by Learning About Styles and Deviating from Style Norms.
Proceedings of the Eighth International Conference on Computational Creativity, 2017

Relationship Proposal Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Link the Head to the "Beak": Zero Shot Learning from Noisy Text Description at Part Precision.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Sherlock: Scalable Fact Learning in Images.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Text to multi-level MindMaps - A novel method for hierarchical visual abstraction of natural language text.
Multim. Tools Appl., 2016

Write a Classifier: Predicting Visual Classifiers from Unstructured Text Descriptions.
CoRR, 2016

Digging Deep into the Layers of CNNs: In Search of How CNNs Achieve View Invariance.
Proceedings of the 4th International Conference on Learning Representations, 2016

Joint object recognition and pose estimation using a nonlinear view-invariant latent generative model.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

A Comparative Analysis and Study of Multiview CNN Models for Joint Object Categorization and Pose Estimation.
Proceedings of the 33nd International Conference on Machine Learning, 2016

SPDA-CNN: Unifying Semantic Part Detection and Abstraction for Fine-Grained Recognition.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Automatic Annotation of Structured Facts in Images.
Proceedings of the 5th Workshop on Vision and Language, 2016

Zero-Shot Event Detection by Multimodal Distributional Semantic Embedding of Videos.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Generalized Twin Gaussian processes using Sharma-Mittal divergence.
Mach. Learn., 2015

Tell and Predict: Kernel Classifier Prediction for Unseen Visual Classes from Unstructured Text Descriptions.
CoRR, 2015

Convolutional Models for Joint Object Categorization and Pose Estimation.
CoRR, 2015

Sherlock: Modeling Structured Knowledge in Images.
CoRR, 2015

Weather classification with deep convolutional neural networks.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Learning Hypergraph-regularized Attribute Predictors.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Overlapping Domain Cover for Scalable and Accurate Regression Kernel Machines.
Proceedings of the British Machine Vision Conference 2015, 2015

Visual Classifier Prediction by Distributional Semantic Embedding of Text Descriptions.
Proceedings of the Fourth Workshop on Vision and Language, 2015

2014
Text to Multi-level MindMaps: A New Way for Interactive Visualization and Summarization of Natural Language Text.
CoRR, 2014

SRI-Sarnoff AURORA System at TRECVID 2014 Multimedia Event Detection and Recounting.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

Improving non-negative matrix factorization via ranking its bases.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

2013
GPU-Framework for Teamwork Action Recognition.
CoRR, 2013

Low-bitrate benefits of JPEG compression on sift recognition.
Proceedings of the IEEE International Conference on Image Processing, 2013

Write a Classifier: Zero-Shot Learning Using Purely Textual Descriptions.
Proceedings of the IEEE International Conference on Computer Vision, 2013

MultiClass Object Classification in Video Surveillance Systems - Experimental Study.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
English2MindMap: An Automated System for MindMap Generation from English Text.
Proceedings of the 2012 IEEE International Symposium on Multimedia, 2012


  Loading...