Emanuele Vivoli

Orcid: 0000-0002-9971-8738

According to our database, Emanuele Vivoli authored at least 14 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
CoSMo: A Multimodal Transformer for Page Stream Segmentation in Comic Books.
CoRR, July, 2025

ComicsPAP: understanding comic strips by picking the correct panel.
CoRR, March, 2025

HoloMine: A Synthetic Dataset for Buried Landmines Recognition Using Microwave Holographic Imaging.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2025

2024
Deep Learning-Based Real-Time Detection of Surface Landmines Using Optical Imaging.
Remote. Sens., February, 2024

One missing piece in Vision and Language: A Survey on Comics Understanding.
CoRR, 2024

CoMix: A Comprehensive Benchmark for Multi-Task Comic Understanding.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Comics Datasets Framework: Mix of Comics Datasets for Detection Benchmarking.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 Workshops, 2024

Multimodal Transformer for Comics Text-Cloze.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

ComiCap: A VLMs Pipeline for Dense Captioning of Comic Panels.
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

Towards Generative Class Prompt Learning for Fine-grained Visual Recognition.
Proceedings of the 35th British Machine Vision Conference, 2024

2023
CTE: A Dataset for Contextualized Table Extraction.
Proceedings of the 19th Conference on Information and Research science Connecting to Digital and Library science, 2023

Deep-learning for dysgraphia detection in children handwritings.
Proceedings of the ACM Symposium on Document Engineering 2023, 2023

2022
Graph Neural Networks and Representation Embedding for Table Extraction in PDF Documents.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

MUST-VQA: MUltilingual Scene-Text VQA.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

