Marcos García

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025

Assessing lexical ambiguity resolution in language models with new WiC datasets in Galician and Spanish.

[BibT_eX]

[DOI]

Marta Vázquez Abuín

Proces. del Leng. Natural, 2025

Clasificación automática de textos por niveis de lecturabilidade: recursos e modelos para o galego.

[BibT_eX]

[DOI]

Sandra Rodríguez Rey

Linguamática, 2025

Enhancing Large Language Models for Underrepresented Varieties: Pretraining Strategies in the Galician-Portuguese Diasystem.

[BibT_eX]

[DOI]

J. Braz. Comput. Soc., 2025

Gathering Compositionality Ratings of Ambiguous Noun-Adjective Multiword Expressions in Galician.

[BibT_eX]

[DOI]

Laura Castro

Proceedings of the 21st Workshop on Multiword Expressions, 2025

2024

Training and evaluation of vector models for Galician.

[BibT_eX]

[DOI]

Lang. Resour. Evaluation, December, 2024

Towards accurate dependency parsing for Galician with limited resources.

[BibT_eX]

[DOI]

Albina Sarymsakova

Xulia Sánchez-Rodríguez

Proces. del Leng. Natural, 2024

Multi-label Discourse Function Classification of Lexical Bundles in Basque and Spanish via transformer-based models.

[BibT_eX]

[DOI]

Proces. del Leng. Natural, 2024

Open Generative Large Language Models for Galician.

[BibT_eX]

[DOI]

Proces. del Leng. Natural, 2024

Investigating Idiomaticity in Word Representations.

[BibT_eX]

[DOI]

CoRR, 2024

DeepR3: Reducing, Reusing and Recycling Large Models for Developing Responsible and Green Language Technologies.

[BibT_eX]

[DOI]

Aitor Soroa

German Rigau

Jose Maria Alonso-Moral

Maite Melero

Marta Villegas

Proceedings of the Seminar of the Spanish Society for Natural Language Processing: Projects and System Demonstrations (SEPLN-CEDI-PD 2024) co-located with the 7th Spanish Conference on Informatics (CEDI 2024), 2024

CorpusNÓS: A massive Galician corpus for training large language models.

[BibT_eX]

[DOI]

Daniel Bardanca Outeiriño

Silvia Paniagua Suárez

Cristina Carbajal-Pérez

Proceedings of the 16th International Conference on Computational Processing of Portuguese, 2024

Increasing manually annotated resources for Galician: the Parallel Universal Dependencies Treebank.

[BibT_eX]

[DOI]

Xulia Sánchez-Rodríguez

Albina Sarymsakova

Laura Castro

Proceedings of the 16th International Conference on Computational Processing of Portuguese, 2024

Compositionality and Ambiguity in Multiword Expressions: A Dataset for the Evaluation of Language Models in Galician.

[BibT_eX]

[DOI]

Laura Castro

Anna Temerko

Proceedings of the Progress in Artificial Intelligence, 2024

WordNet Expansion with Bilingual Word Embeddings and Neural Machine Translation.

[BibT_eX]

[DOI]

Marta Vázquez Abuín

Proceedings of the Progress in Artificial Intelligence, 2024

2023

Annotation of lexical bundles with discourse functions in a Spanish academic corpus.

[BibT_eX]

[DOI]

Eleonora Guzzi

Proceedings of the 19th Workshop on Multiword Expressions, 2023

Dependency resolution at the syntax-semantics interface: psycholinguistic and computational insights on control dependencies.

[BibT_eX]

[DOI]

Juan Garcia Amboage

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

A computational psycholinguistic evaluation of the syntactic abilities of Galician BERT models at the interface of dependency resolution and training time.

[BibT_eX]

[DOI]

Proces. del Leng. Natural, 2022

Evaluating Contextualized Vectors from both Large Language Models and Compositional Strategies.

[BibT_eX]

[DOI]

Proces. del Leng. Natural, 2022

Proxecto Nós: Artificial intelligence at the service of the Galician language.

[BibT_eX]

[DOI]

Manuel González González

Senén Barro

Xose Luis Regueira

Proceedings of the Annual Conference of the Spanish Association for Natural Language Processing: Projects and Demonstrations (SEPLN-PD 2022) co-located with the Conference of the Spanish Society for Natural Language Processing (SEPLN 2022), 2022

An exploration of the semantic knowledge in vector models: polysemy, synonymy and idiomaticity.

[BibT_eX]

[DOI]

Martin Pereira-Fariña

SemEval-2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding.

[BibT_eX]

[DOI]

Harish Tayyar Madabushi

Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

A Targeted Assessment of the Syntactic Abilities of Transformer Models for Galician-Portuguese.

[BibT_eX]

[DOI]

Alfredo Crespo-Otero

Proceedings of the Computational Processing of the Portuguese Language, 2022

2021

Bertinho: Galician BERT Representations.

[BibT_eX]

[DOI]

David Vilares

Proces. del Leng. Natural, 2021

Editor's Note.

[BibT_eX]

[DOI]

Amparo Alonso-Betanzos

Pedro Cabalar

Graçaliz Pereira Dimuro

José Hernández-Orallo

Raquel Hervás

Angeles Manjarés

Fernando Martínez-Plumed

Inmaculada Mora-Jiménez

Miquel Sànchez-Marrè

Int. J. Interact. Multim. Artif. Intell., 2021

Embeddings in Natural Language Processing: Theory and Advances in Vector Representations of Meaning.

[BibT_eX]

[DOI]

Comput. Linguistics, 2021

Comparing Dependency-based Compositional Models with Contextualized Word Embeddings.

[BibT_eX]

[DOI]

Manuel de Prada Corral

Proceedings of the 13th International Conference on Agents and Artificial Intelligence, 2021

Probing for idiomaticity in vector space models.

[BibT_eX]

[DOI]

Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Assessing the Representations of Idiomaticity in Vector Models with a Noun Compound Dataset Labeled at Type and Token Levels.

[BibT_eX]

[DOI]

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Exploring the Representation of Word Meanings in Context: A Case Study on Homonymy and Synonymy.

[BibT_eX]

[DOI]

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2019

Uma utilidade para o reconhecimento de topónimos em documentos medievais.

[BibT_eX]

[DOI]

Linguamática, 2019

Editorial for the Special Issue on "Natural Language Processing and Text Mining".

[BibT_eX]

[DOI]

Inf., 2019

NER and Open Information Extraction for Portuguese: Notebook for IberLEF 2019 Portuguese Named Entity Recognition and Relation Extraction Tasks.

[BibT_eX]

[DOI]

Patricia Martín-Rodilla

Proceedings of the Iberian Languages Evaluation Forum co-located with 35th Conference of the Spanish Society for Natural Language Processing, 2019

A comparison of statistical association measures for identifying dependency-based collocations in various languages.

[BibT_eX]

[DOI]

Proceedings of the Joint Workshop on Multiword Expressions and WordNet, 2019

Unsupervised Compositional Translation of Multiword Expressions.

[BibT_eX]

[DOI]

Proceedings of the Joint Workshop on Multiword Expressions and WordNet, 2019

Exploring cross-lingual word embeddings for the inference of bilingual dictionaries.

[BibT_eX]

[DOI]

Proceedings of TIAD-2019 Shared Task, 2019

Weighted Compositional Vectors for Translating Collocations Using Monolingual Corpora.

[BibT_eX]

[DOI]

Proceedings of the Computational and Corpus-Based Phraseology, 2019

Identifying Lexical Bundles for an Academic Writing Assistant in Spanish.

[BibT_eX]

[DOI]

Proceedings of the Computational and Corpus-Based Phraseology, 2019

Pay Attention when you Pay the Bills. A Multilingual Corpus with Dependency-based and Semantic Annotation of Collocations.

[BibT_eX]

[DOI]

Susana Sotelo

Estela Mosqueira Suárez

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

A Method to Automatically Identify Diachronic Variation in Collocations.

[BibT_eX]

[DOI]

Proceedings of the 1st International Workshop on Computational Approaches to Historical Language Change, 2019

2018

New treebank or repurposed? On the feasibility of cross-lingual parsing of Romance languages with Universal Dependencies.

[BibT_eX]

[DOI]

Nat. Lang. Eng., 2018

Dependency parsing with finite state transducers and compression rules.

[BibT_eX]

[DOI]

Inf. Process. Manag., 2018

Distributional semantics for diachronic search.

[BibT_eX]

[DOI]

Iván Rodríguez-Torres

Comput. Electr. Eng., 2018

LinguaKit: A Big Data-Based Multilingual Tool for Linguistic Analysis and Information Extraction.

[BibT_eX]

[DOI]

Pablo Gamallo Otero

César Piñeiro

Rodrigo Martínez-Castaño

Juan Carlos Pichel

Proceedings of the Fifth International Conference on Social Networks Analysis, 2018

Task-Oriented Evaluation of Dependency Parsing with Open Information Extraction.

[BibT_eX]

[DOI]

Proceedings of the Computational Processing of the Portuguese Language, 2018

A Lexical Tool for Academic Writing in Spanish based on Expert and Novice Corpora.

[BibT_eX]

[DOI]

Milka Villayandre-Llamazares

Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

2017

LinguaKit: uma ferramenta multilingue para a análise linguística e a extração de informação.

[BibT_eX]

[DOI]

Linguamática, 2017

Towards Syntactic Iberian Polarity Classification.

[BibT_eX]

[DOI]

David Vilares

Proceedings of the 8th Workshop on Computational Approaches to Subjectivity, 2017

Using bilingual word-embeddings for multilingual collocation extraction.

[BibT_eX]

[DOI]

Proceedings of the 13th Workshop on Multiword Expressions, 2017

A Web Interface for Diachronic Semantic Search in Spanish.

[BibT_eX]

[DOI]

Iván Rodríguez-Torres

Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

A rule-based system for cross-lingual parsing of Romance languages with Universal Dependencies.

[BibT_eX]

[DOI]

Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, 2017

2016

Creación de un treebank de dependencias universales mediante recursos existentes para lenguas próximas: el caso del gallego.

[BibT_eX]

[DOI]

Proces. del Leng. Natural, 2016

Semantic Relation Extraction. Resources, Tools and Strategies.

[BibT_eX]

[DOI]

Proceedings of the Computational Processing of the Portuguese Language, 2016

Entity Linking with Distributional Semantics.

[BibT_eX]

[DOI]

Proceedings of the Computational Processing of the Portuguese Language, 2016

Incorporating Lexico-semantic Heuristics into Coreference Resolution Sieves for Named Entity Recognition at Document-level.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2015

Exploring the effectiveness of linguistic knowledge for biographical relation extraction.

[BibT_eX]

[DOI]

Nat. Lang. Eng., 2015

Yet Another Suite of Multilingual NLP Tools.

[BibT_eX]

[DOI]

Proceedings of the Languages, Applications and Technologies - 4th International Symposium, 2015

Multilingual Open Information Extraction.

[BibT_eX]

[DOI]

Proceedings of the Progress in Artificial Intelligence, 2015

2014

PoS-tagging the Web in Portuguese. National varieties, text typologies and spelling systems.

[BibT_eX]

[DOI]

Iria Gayo

Miguel A. Pousada Cruz

Proces. del Leng. Natural, 2014

Entity-Centric Coreference Resolution of Person Entities for Open Information Extraction.

[BibT_eX]

[DOI]

Proces. del Leng. Natural, 2014

Análisis morfosintáctico y clasificación de entidades nombradas en un entorno Big Data.

[BibT_eX]

[DOI]

Proces. del Leng. Natural, 2014

Comparing Ranking-based and Naive Bayes Approaches to Language Detection on Tweets.

[BibT_eX]

[DOI]

Susana Sotelo

Proceedings of the Tweet Language Identification Workshop co-located with 30th Conference of the Spanish Society for Natural Language Processing, 2014

Citius: A Naive-Bayes Strategy for Sentiment Analysis on English Tweets.

[BibT_eX]

[DOI]

Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

Multilingual corpora with coreferential annotation of person entities.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

An Entity-Centric Coreference Resolution System for Person Entities with Rich Linguistic Information.

[BibT_eX]

[DOI]

Proceedings of the COLING 2014, 2014

Perldoop: Efficient execution of Perl scripts on Hadoop clusters.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), 2014

2013

A Method to Lexical Normalisation of Tweets.

[BibT_eX]

[DOI]

Proceedings of the Tweet Normalization Workshop co-located with 29th Conference of the Spanish Society for Natural Language Processing (SEPLN 2013), 2013

2012

Automatic Phonetic Transcription by Phonological Derivation.

[BibT_eX]

[DOI]

Isaac J. González

Proceedings of the Computational Processing of the Portuguese Language, 2012

Extraction of Bilingual Cognates from Wikipedia.

[BibT_eX]

[DOI]