Iñaki San Vicente

ORCID: 0000-0003-1765-0555

According to our database, Iñaki San Vicente authored at least 27 papers between 2008 and 2023.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2023
Not Enough Data to Pre-train Your Language Model? MT to the Rescue!
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Scaling Laws for BERT in Low-Resource Settings.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Information retrieval and question answering: A case study on COVID-19 scientific literature.
Knowl. Based Syst., 2022

BasqueGLUE: A Natural Language Understanding Benchmark for Basque.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

2021
GEPSA, a tool for monitoring social challenges in digital press.
Proceedings of the First Workshop on Language Technology for Equality, 2021

Fine-Tuning BERT for COVID-19 Domain Ad-Hoc IR by Using Pseudo-qrels.
Proceedings of the Advances in Information Retrieval, 2021

2020
Building a Task-oriented Dialog System for Languages with no Training Data: the Case for Basque.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Give your Text Representation Models some Love: the Case for Basque.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

2018
Talaia: a Real time Monitor of Social Media and Digital Press.
CoRR, 2018

2017
Q-WordNet PPV: Simple, Robust and (almost) Unsupervised Generation of Polarity Lexicons for Multiple Languages.
CoRR, 2017

2016
TweetLID: a benchmark for tweet language identification.
Lang. Resour. Evaluation, 2016

Polarity Lexicon Building: to what Extent Is the Manual Effort Worth?
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

TweetMT: A Parallel Microblog Corpus.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2015
TweetNorm: a benchmark for lexical normalization of Spanish tweets.
Lang. Resour. Evaluation, 2015

Overview of TweetMT: A Shared Task on Machine Translation of Tweets at SEPLN 2015.
Proceedings of the Tweet Translation Workshop 2015 co-located with 31st Conference of the Spanish Society for Natural Language Processing (SEPLN 2015), 2015

EliXa: A Modular and Flexible ABSA Platform.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

2014
Overview of TweetLID: Tweet Language Identification at SEPLN 2014.
Proceedings of the Tweet Language Identification Workshop co-located with 30th Conference of the Spanish Society for Natural Language Processing, 2014

TweetNorm_es: an annotated corpus for Spanish microtext normalization.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Simple, Robust and (almost) Unsupervised Generation of Polarity Lexicons for Multiple Languages.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

2013
Elhuyar at Tweet-Norm 2013.
Proceedings of the Tweet Normalization Workshop co-located with 29th Conference of the Spanish Society for Natural Language Processing (SEPLN 2013), 2013

Introducción a la Tarea Compartida Tweet-Norm 2013: Normalización Léxica de Tuits en Español.
Proceedings of the Tweet Normalization Workshop co-located with 29th Conference of the Spanish Society for Natural Language Processing (SEPLN 2013), 2013

Cross-Lingual Projections vs. Corpora Extracted Subjectivity Lexicons for Less-Resourced Languages.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2013

Automatic Comparable Web Corpora Collection and Bilingual Terminology Extraction for Specialized Dictionary Making.
Proceedings of the Building and Using Comparable Corpora, 2013

2012
PaCo2: A Fully Automated tool for gathering Parallel Corpora from the Web.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Building a Basque-Chinese Dictionary by Using English as Pivot.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

2011
Analyzing Methods for Improving Precision of Pivot Based Bilingual Dictionaries.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

2008
Mining Term Translation from Domain Restricted Comparable Corpora.
Proces. del Leng. Natural, 2008

