Marcella Cornia

ORCID: 0000-0001-9640-9385

According to our database, Marcella Cornia authored at least 76 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Video Surveillance and Privacy: A Solvable Paradox?
Computer, March, 2024

Unveiling the Truth: Exploring Human Gaze Patterns in Fake Images.
IEEE Signal Process. Lett., 2024

Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing.
CoRR, 2024

Trends, Applications, and Challenges in Human Attention Modelling.
CoRR, 2024

The (R)Evolution of Multimodal Large Language Models: A Survey.
CoRR, 2024

2023
Fully-attentive iterative networks for region-based controllable image and video captioning.
Comput. Vis. Image Underst., December, 2023

Fashion-Oriented Image Captioning with External Knowledge Retrieval and Fully Attentive Gates.
Sensors, February, 2023

Computer Vision in Human Analysis: From Face and Body to Clothes.
Sensors, 2023

From Show to Tell: A Survey on Deep Learning-Based Image Captioning.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Removing NSFW Concepts from Vision-and-Language Models for Text-to-Image Retrieval and Generation.
CoRR, 2023

Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training.
CoRR, 2023

Multi-Class Explainable Unlearning for Image Classification via Weight Filtering.
CoRR, 2023

Parents and Children: Distinguishing Multimodal DeepFakes from Natural Images.
CoRR, 2023

Positive-Augmented Constrastive Learning for Image and Video Captioning Evaluation.
CoRR, 2023

LaDI-VTON: Latent Diffusion Textual-Inversion Enhanced Virtual Try-On.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Where Research meets Industry: New Challenges and Opportunities at AImageLab.
Proceedings of the Italia Intelligenza Artificiale, 2023

Embodied Agents for Efficient Exploration and Smart Scene Description.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Towards Explainable Navigation and Recounting.
Proceedings of the Image Analysis and Processing - ICIAP 2023, 2023

Unveiling the Impact of Image Transformations on Deepfake Detection: An Experimental Analysis.
Proceedings of the Image Analysis and Processing - ICIAP 2023, 2023

OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data.
Proceedings of the Image Analysis and Processing - ICIAP 2023, 2023

SynthCap: Augmenting Transformers with Synthetic Data for Image Captioning.
Proceedings of the Image Analysis and Processing - ICIAP 2023, 2023

With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Transform, Warp, and Dress: A New Transformation-guided Model for Virtual Try-on.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Matching Faces and Attributes Between the Artistic and the Real Domain: the PersonArt Approach.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Focus on Impact: Indoor Exploration With Intrinsic Motivation.
IEEE Robotics Autom. Lett., 2022

Boosting modern and historical handwritten text recognition with deformable convolutions.
Int. J. Document Anal. Recognit., 2022

Explaining transformer-based image captioning models: An empirical analysis.
AI Commun., 2022

Spot the Difference: A Novel Task for Embodied Agents in Changing Environments.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

The LAM Dataset: A Novel Benchmark for Line-Level Handwritten Text Recognition.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

CaMEL: Mean Teacher Learning for Image Captioning.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Investigating Bidimensional Downsampling in Vision Transformer Models.
Proceedings of the Image Analysis and Processing - ICIAP 2022, 2022

Embodied Navigation at the Art Gallery.
Proceedings of the Image Analysis and Processing - ICIAP 2022, 2022

Dress Code: High-Resolution Multi-Category Virtual Try-On.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Dual-Branch Collaborative Transformer for Virtual Try-On.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

The Unreasonable Effectiveness of CLIP Features for Image Captioning: An Experimental Analysis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Retrieval-Augmented Transformer for Image Captioning.
Proceedings of the CBMI 2022: International Conference on Content-based Multimedia Indexing, Graz, Austria, September 14, 2022

ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval.
Proceedings of the CBMI 2022: International Conference on Content-based Multimedia Indexing, Graz, Austria, September 14, 2022

2021
Working Memory Connections for LSTM.
Neural Networks, 2021

Multimodal attention networks for low-level vision-and-language navigation.
Comput. Vis. Image Underst., 2021

Universal Captioner: Long-Tail Vision-and-Language Model Training through Content-Style Separation.
CoRR, 2021

From Show to Tell: A Survey on Image Captioning.
CoRR, 2021

Learning to Select: A Fully Attentive Approach for Novel Object Captioning.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

FashionSearch++: Improving Consumer-to-Shop Clothes Retrieval with Hard Negatives.
Proceedings of the 11th Italian Information Retrieval Workshop 2021, 2021

Revisiting the Evaluation of Class Activation Mapping for Explainability: A Novel Metric and Experimental Analysis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Learning to Read L'Infinito: Handwritten Text Recognition with Synthetic Training Data.
Proceedings of the Computer Analysis of Images and Patterns, 2021

Out of the Box: Embodied Navigation in the Real World.
Proceedings of the Computer Analysis of Images and Patterns, 2021

2020
Explaining digital humanities by aligning images and textual descriptions.
Pattern Recognit. Lett., 2020

A unified cycle-consistent neural model for text and image retrieval.
Multim. Tools Appl., 2020

SMArT: Training Shallow Memory-aware Transformers for Robotic Explainability.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

A Novel Attention-based Aggregation Function to Combine Vision and Language.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

VITON-GT: An Image-based Virtual Try-On Model with Geometric Transformations.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Explore and Explain: Self-supervised Navigation and Recounting.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Meshed-Memory Transformer for Image Captioning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
M-VAD names: a dataset for video captioning with naming.
Multim. Tools Appl., 2019

M²: Meshed-Memory Transformer for Image Captioning.
CoRR, 2019

Perceive, Transform, and Act: Multi-Modal Attention Networks for Vision-and-Language Navigation.
CoRR, 2019

Image-to-Image Translation to Unfold the Reality of Artworks: An Empirical Analysis.
Proceedings of the Image Analysis and Processing - ICIAP 2019, 2019

Artpedia: A New Visual-Semantic Dataset with Visual and Contextual Sentences in the Artistic Domain.
Proceedings of the Image Analysis and Processing - ICIAP 2019, 2019

Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-To-Image Translation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Paying More Attention to Saliency: Image Captioning with Saliency and Context Attention.
ACM Trans. Multim. Comput. Commun. Appl., 2018

Predicting Human Eye Fixations via an LSTM-Based Saliency Attentive Model.
IEEE Trans. Image Process., 2018

Attentive models in vision: Computing saliency maps in the deep learning era.
Intelligenza Artificiale, 2018

Automatic Image Cropping and Selection Using Saliency: An Application to Historical Manuscripts.
Proceedings of the Digital Libraries and Multimedia Archives, 2018

Aligning Text and Document Illustrations: Towards Visually Explainable Digital Humanities.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

What Was Monet Seeing While Painting? Translating Artworks to Photo-Realistic Images.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Towards Cycle-Consistent Models for Text and Image Retrieval.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Visual-Semantic Alignment Across Domains Using a Semi-Supervised Approach.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

SAM: Pushing the Limits of Saliency Prediction Models.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

2017
Modeling multimodal cues in a deep learning-based framework for emotion recognition in the wild.
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017

Visual saliency for image captioning in new multimedia services.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Towards Video Captioning with Naming: A Novel Dataset and a Multi-modal Approach.
Proceedings of the Image Analysis and Processing - ICIAP 2017, 2017

2016
A deep multi-level network for saliency prediction.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Multi-level Net: A Visual Saliency Prediction Model.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

