Giovanni Puccetti

Orcid: 0000-0002-8906-0987

Affiliations:
  • Institute of Information Science and Technology, Italy
  • Scuola Normale Superiore, Pisa, Italy (former)


According to our database, Giovanni Puccetti authored at least 26 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
Is Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AI.
CoRR, February, 2025

Automatic Extraction of Regesta for Medieval Latin Text Summarization.
ERCIM News, 2025

Digital Transformation in Legal History Through Automatic Machine Learning Annotation.
ERCIM News, 2025

Optimizing LLMs for Italian: Reducing Token Fertility and Enhancing Efficiency Through Vocabulary Adaptation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Automatic Annotation of Legal References (Allegationes) in the Liber Extra's Ordinary Gloss.
Proceedings of the 21st Conference on Information and Research science Connecting to Digital and Library science, 2025

REVERINO: REgesta generation VERsus latIN summarizatiOn.
Proceedings of the 21st Conference on Information and Research science Connecting to Digital and Library science, 2025


The Invalsi Benchmarks: measuring the Linguistic and Mathematical understanding of Large Language Models in Italian.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Black-Box Machine-Generated Text Detection.
Dataset, April, 2024

SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection.
CoRR, 2024

The Invalsi Benchmark: measuring Language Models Mathematical and Language understanding in Italian.
CoRR, 2024

M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection.
CoRR, 2024

SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection.
Proceedings of the 18th International Workshop on Semantic Evaluation, 2024

You Write like a GPT.
Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024), 2024

ABRICOT - ABstRactness and Inclusiveness in COntexT: A CALAMITA Challenge.
Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024), 2024

INVALSI - Mathematical and Language Understanding in Italian: A CALAMITA Challenge.
Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024), 2024

M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

AI 'News' Content Farms Are Easy to Make and Hard to Detect: A Case Study in Italian.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Unveiling the inventive process from patents by extracting problems, solutions and advantages with natural language processing.
Expert Syst. Appl., November, 2023

AIMH at MULTI-Fake-DetectIVE: System Report (short paper).
Proceedings of the Eighth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2023), 2023

2022
Outliers Dimensions that Disrupt Transformers Are Driven by Frequency.
CoRR, 2022

Outlier Dimensions that Disrupt Transformers are Driven by Frequency.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
A simple and fast method for Named Entity context extraction from patents.
Expert Syst. Appl., 2021

How Do BERT Embeddings Organize Linguistic Knowledge?
Proceedings of Deep Learning Inside Out: The 2nd Workshop on Knowledge Extraction and Integration for Deep Learning Architectures, 2021

2020
B4DS @ PRELEARN: Ensemble Method for Prerequisite Learning (short paper).
Proceedings of the Seventh Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2020), 2020

