We stand with Ukraine

We stand with Ukraine

Lorenzo Baraldi

Orcid: 0000-0001-5125-4957

Affiliations:

University of Modena and Reggio Emilia, Italy
University of Pisa, Pisa, Toscana, Italy - PhD student

According to our database¹, Lorenzo Baraldi authored at least 25 papers between 2023 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

on lorenzobaraldi.com
on orcid.org

On csauthors.net:

Bibliography

2025

Positive-Augmented Contrastive Learning for Vision-and-Language Evaluation and Training.

[BibT_eX]

[DOI]

,

Nicholas Moratelli

,

Marcella Cornia

,

Lorenzo Baraldi

,

Int. J. Comput. Vis., November, 2025

The Safety Challenge of World Models for Embodied AI Agents: A Review.

[BibT_eX]

[DOI]

Lorenzo Baraldi

,

,

,

,

,

,

,

,

,

,

Angelo Cangelosi

,

Lorenzo Baraldi

CoRR, October, 2025

RAID: A Dataset for Testing the Adversarial Robustness of AI-Generated Image Detectors.

[BibT_eX]

[DOI]

,

,

Federico Cocchi

,

Lorenzo Baraldi

,

,

,

Marcella Cornia

,

Lorenzo Baraldi

,

,

,

Battista Biggio

CoRR, June, 2025

What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models.

[BibT_eX]

[DOI]

Lorenzo Baraldi

,

Davide Bucciarelli

,

,

Marcella Cornia

,

Lorenzo Baraldi

,

,

CoRR, May, 2025

Learning to mask and permute visual tokens for Vision Transformer pre-training.

[BibT_eX]

[DOI]

Lorenzo Baraldi

,

Roberto Amoroso

,

Marcella Cornia

,

Lorenzo Baraldi

,

,

Comput. Vis. Image Underst., 2025

Semantically Conditioned Prompts for Visual Recognition Under Missing Modality Scenarios.

[BibT_eX]

[DOI]

Vittorio Pipoli

,

Federico Bolelli

,

,

Marcella Cornia

,

Lorenzo Baraldi

,

Costantino Grana

,

,

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

Perceive. Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries.

[BibT_eX]

[DOI]

Roberto Amoroso

,

,

,

Lorenzo Baraldi

,

,

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

AIGeN-Llama: An Adversarial Approach for Instruction Generation in VLN using Llama2 Model.

[BibT_eX]

[DOI]

,

Lorenzo Baraldi

,

Proceedings of the 21st Conference on Information and Research science Connecting to Digital and Library science, 2025

Causal Graphical Models for Vision-Language Compositional Understanding.

[BibT_eX]

[DOI]

Fiorenzo Parascandolo

,

Nicholas Moratelli

,

Enver Sangineto

,

Lorenzo Baraldi

,

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Multimodal Emotion Recognition in Conversation via Possible Speaker's Audio and Visual Sequence Selection.

[BibT_eX]

[DOI]

Rahul Singh Maharjan

,

,

,

Lorenzo Baraldi

,

,

Angelo Cangelosi

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Hyperbolic Safety-Aware Vision-Language Models.

[BibT_eX]

[DOI]

,

Tejaswi Kasarla

,

,

Lorenzo Baraldi

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering.

[BibT_eX]

[DOI]

Federico Cocchi

,

Nicholas Moratelli

,

Marcella Cornia

,

Lorenzo Baraldi

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval.

[BibT_eX]

[DOI]

Davide Caffagni

,

,

Marcella Cornia

,

Lorenzo Baraldi

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Tracing Information Flow in LLaMA Vision: A Step Toward Multimodal Understanding.

[BibT_eX]

[DOI]

Alessia Saporita

,

Vittorio Pipoli

,

Federico Bolelli

,

Lorenzo Baraldi

,

Andrea Acquaviva

,

Proceedings of the Computer Analysis of Images and Patterns, 2025

2024

Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis.

[BibT_eX]

[DOI]

Davide Bucciarelli

,

Nicholas Moratelli

,

Marcella Cornia

,

Lorenzo Baraldi

,

CoRR, 2024

Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities.

[BibT_eX]

[DOI]

Lorenzo Baraldi

,

Federico Cocchi

,

Marcella Cornia

,

Lorenzo Baraldi

,

Alessandro Nicolosi

,

CoRR, 2024

The (R)Evolution of Multimodal Large Language Models: A Survey.

[BibT_eX]

[DOI]

Davide Caffagni

,

Federico Cocchi

,

Luca Barsellotti

,

Nicholas Moratelli

,

,

Lorenzo Baraldi

,

Lorenzo Baraldi

,

Marcella Cornia

,

CoRR, 2024

Adapt to Scarcity: Few-Shot Deepfake Detection via Low-Rank Adaptation.

[BibT_eX]

[DOI]

Silvia Cappelletti

,

Lorenzo Baraldi

,

Federico Cocchi

,

Marcella Cornia

,

Lorenzo Baraldi

,

Proceedings of the Pattern Recognition - 27th International Conference, 2024

Intelligent Multimodal Artificial Agents that Talk and Express Emotions.

[BibT_eX]

[DOI]

,

Rahul Singh Maharjan

,

,

Roberto Bigazzi

,

Lorenzo Baraldi

,

,

Angelo Cangelosi

Proceedings of the Human-Friendly Robotics 2024 - HFR: 17th International Workshop on Human-Friendly Robotics, Lugano, Switzerland, 30 September, 2024

Optimizing Resource Consumption in Diffusion Models Through Hallucination Early Detection.

[BibT_eX]

[DOI]

,

Lorenzo Baraldi

,

Lorenzo Baraldi

,

,

Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

AIGeN: An Adversarial Approach for Instruction Generation in VLN.

[BibT_eX]

[DOI]

,

Roberto Bigazzi

,

Lorenzo Baraldi

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs.

[BibT_eX]

[DOI]

Davide Caffagni

,

Federico Cocchi

,

Nicholas Moratelli

,

,

Marcella Cornia

,

Lorenzo Baraldi

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization.

[BibT_eX]

[DOI]

Nicholas Moratelli

,

Davide Caffagni

,

Marcella Cornia

,

Lorenzo Baraldi

,

Proceedings of the 35th British Machine Vision Conference, 2024

2023

Let's ViCE! Mimicking Human Cognitive Behavior in Image Generation Evaluation.

[BibT_eX]

[DOI]

,

,

Lorenzo Baraldi

,

Lorenzo Baraldi

,

,

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Unveiling the Impact of Image Transformations on Deepfake Detection: An Experimental Analysis.

[BibT_eX]

[DOI]

Federico Cocchi

,

Lorenzo Baraldi

,

,

Marcella Cornia

,

Lorenzo Baraldi

,

Proceedings of the Image Analysis and Processing - ICIAP 2023, 2023

Loading...