Diego Alves

According to our database, Diego Alves authored at least 24 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
A Parallel Cross-Lingual Benchmark for Multimodal Idiomaticity Understanding.
CoRR, January, 2026

Self-Explaining Hate Speech Detection with Moral Rationales.
CoRR, January, 2026

Dependency Distance Effects on Eye-Tracking Measures in Brazilian Portuguese.
Proceedings of the 17th International Conference on Computational Processing of Portuguese, 2026

Cheese it up: CamemBERT Outperforms Large Language Models for Identification of French Multi-word Expressions.
Proceedings of the 22nd Workshop on Multiword Expressions, 2026

Cognitive Signatures of Multi-Word Expressions: Reading-Time and Surprisal.
Proceedings of the 22nd Workshop on Multiword Expressions, 2026

2025
MFTCXplain: A Multilingual Benchmark Dataset for Evaluating the Moral Reasoning of LLMs through Hate Speech Multi-hop Explanation.
CoRR, June, 2025

Diachronic Analysis of Phrasal Verbs in English Scientific Writing.
Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies, 2025

Syntagmatic Productivity of MWEs in Scientific English.
Proceedings of the 21st Workshop on Multiword Expressions, 2025

Surprisal Dynamics for the Detection of Multi-Word Expressions in English.
Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2025

MFTCXplain: A Multilingual Benchmark Dataset for Evaluating the Moral Reasoning of LLMs through Multi-hop Hate Speech Explanation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Information Theory and Linguistic Variation: A Study of Brazilian and European Portuguese.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

2024
An evaluation of Portuguese language models' adaptation to African Portuguese varieties.
Proceedings of the 16th International Conference on Computational Processing of Portuguese, 2024

Diachronic Analysis of Multi-word Expression Functional Categories in Scientific English.
Proceedings of the Joint Workshop on Multiword Expressions and Universal Dependencies, 2024

2022
Quotations, Coreference Resolution, and Sentiment Annotations in Croatian News Articles: An Exploratory Study.
CoRR, 2022

2021
OEKG: The Open Event Knowledge Graph.
Proceedings of the 2nd International Workshop on Cross-lingual Event-centric Open Analytics co-located with the 30th The Web Conference (WWW 2021), 2021

Building and Evaluating Universal Named-Entity Recognition English corpus.
Proceedings of the 2nd International Workshop on Cross-lingual Event-centric Open Analytics co-located with the 30th The Web Conference (WWW 2021), 2021

Quotations, Coreference Resolution, and Sentiment Annotations in Croatian News Articles: An Exploratory Study.
Proceedings of the Conference on Digital Curation Technologies (Qurator 2021), Berlin, Germany, February 8th - to, 2021

Building Multilingual Corpora for a Complex Named Entity Recognition and Classification Hierarchy using Wikipedia and DBpedia.
Proceedings of the Conference on Digital Curation Technologies (Qurator 2021), Berlin, Germany, February 8th - to, 2021

2020
UNER: Universal Named-Entity Recognition Framework.
CoRR, 2020

Natural Language Processing Chains Inside a Cross-lingual Event-Centric Knowledge Pipeline for European Union Under-resourced Languages.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

The Optimization of Portuguese Named-Entity Recognition and Classification by Combining Local Grammars and Conditional Random Fields Trained with a Parsed Corpus.
Proceedings of the Formalising Natural Languages: Applications to Natural Language Processing and Digital Humanities, 2020

Evaluating Language Tools for Fifteen EU-official Under-resourced Languages.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Data Augmentation for Pipeline-Based Speech Translation.
Proceedings of the Human Language Technologies - The Baltic Perspective, 2020

UNER: Universal Named-Entity Recognition Framework.
Proceedings of the 1st International Workshop on Cross-lingual Event-centric Open Analytics co-located with the 17th Extended Semantic Web Conference (ESWC 2020), 2020

