Iñaki Alegria

Orcid: 0000-0002-0272-1472

Affiliations:
  • University of the Basque Country, Spain


According to our database1, Iñaki Alegria authored at least 83 papers between 1992 and 2021.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2021
A Methodology to Measure the Diachronic Language Distance between Three Languages Based on Perplexity.
J. Quant. Linguistics, 2021

2020
Measuring diachronic language distance using perplexity: Application to English, Portuguese, and Spanish.
Nat. Lang. Eng., 2020

Distância diacrónica automática entre variantes diatópicas do português e do espanhol.
Linguamática, 2020

Measuring Language Distance of Isolated European Languages.
Inf., 2020

2019
Adapting NMT to caption translation inWikimedia Commons for low-resource languages.
Proces. del Leng. Natural, 2019

Cross-lingual Diachronic Distance: Application to Portuguese and Spanish.
Proces. del Leng. Natural, 2019

Weighted finite-state transducers for normalization of historical texts.
Nat. Lang. Eng., 2019

Large Scale Linguistic Processing of Tweets to Understand Social Interactions among Speakers of Less Resourced Languages: The Basque Case.
Inf., 2019

2018
QUALES: Estimación Automática de Calidad de Traducción Mediante Aprendizaje Automático Supervisado y No-Supervisado.
Proces. del Leng. Natural, 2018

Measuring language distance among historical varieties using perplexity. Application to European Portuguese.
Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects, 2018

Neural Machine Translation of Basque.
Proceedings of the 21st Annual Conference of the European Association for Machine Translation, 2018

Massively multilingual accessible audioguides via cell phones.
Proceedings of the 21st Annual Conference of the European Association for Machine Translation, 2018

Verbal Multiword Expressions in Basque Corpora.
Proceedings of the Joint Workshop on Linguistic Annotation, 2018

2017
A Perplexity-Based Method for Similar Languages Discrimination.
Proceedings of the Fourth Workshop on NLP for Similar Languages, 2017

A Comparison of Feature-Based and Neural Scansion of Poetry.
Proceedings of the International Conference Recent Advances in Natural Language Processing, 2017

2016
Tectogrammar-based machine translation for English-Spanish and English-Basque.
Proces. del Leng. Natural, 2016

TweetLID: a benchmark for tweet language identification.
Lang. Resour. Evaluation, 2016

Comparing Two Basic Methods for Discriminating Between Similar Languages and Varieties.
Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects, 2016

EHU at the SIGMORPHON 2016 Shared Task. A Simple Proposal: Grapheme-to-Phoneme for Inflection.
Proceedings of the 14th SIGMORPHON Workshop on Computational Research in Phonetics, 2016

TweetMT: A Parallel Microblog Corpus.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Evaluating Translation Quality and CLIR Performance of Query Sessions.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Domain Adaptation in MT Using Titles in Wikipedia as a Parallel Corpus: Resources and Evaluation.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Evaluating the Noisy Channel Model for the Normalization of Historical Texts: Basque, Spanish and Slovene.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Combining Phonology and Morphology for the Normalization of Historical Texts.
Proceedings of the 10th SIGHUM Workshop on Language Technology for Cultural Heritage, 2016

Machine Learning for Metrical Analysis of English Poetry.
Proceedings of the COLING 2016, 2016

2015
Lexical semantics, Basque and Spanish in QTLeap: Quality Translation by Deep Language Engineering Approaches.
Proces. del Leng. Natural, 2015

TweetNorm: a benchmark for lexical normalization of Spanish tweets.
Lang. Resour. Evaluation, 2015

Analyzing English-Spanish Named-Entity enhanced Machine Translation.
Proceedings of the Ninth Workshop on Syntax, 2015

EHU at TweetMT: Adapting MT Engines for Formal Tweets.
Proceedings of the Tweet Translation Workshop 2015 co-located with 31st Conference of the Spanish Society for Natural Language Processing (SEPLN 2015), 2015

Overview of TweetMT: A Shared Task on Machine Translation of Tweets at SEPLN 2015.
Proceedings of the Tweet Translation Workshop 2015 co-located with 31st Conference of the Spanish Society for Natural Language Processing (SEPLN 2015), 2015

2014
Learning to map variation-standard forms using a limited parallel corpus and the standard morphology.
Proces. del Leng. Natural, 2014

Overview of TweetLID: Tweet Language Identification at SEPLN 2014.
Proceedings of the Tweet Language Identification Workshop co-located with 30th Conference of the Spanish Society for Natural Language Processing, 2014

TweetNorm_es: an annotated corpus for Spanish microtext normalization.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

2013
Reciprocal Enrichment Between Basque Wikipedia and Machine Translation.
Proceedings of the People's Web Meets NLP, Collaboratively Constructed Language Resources, 2013

Morphological query expansion and language-filtering words for improving Basque web retrieval.
Lang. Resour. Evaluation, 2013

Una Cascada de Transductores Simples para Normalizar Tweets.
Proceedings of the Tweet Normalization Workshop co-located with 29th Conference of the Spanish Society for Natural Language Processing (SEPLN 2013), 2013

Introducción a la Tarea Compartida Tweet-Norm 2013: Normalización Léxica de Tuits en Español.
Proceedings of the Tweet Normalization Workshop co-located with 29th Conference of the Spanish Society for Natural Language Processing (SEPLN 2013), 2013

Combining Different Features of Idiomaticity for the Automatic Classification of Noun+Verb Expressions in Basque.
Proceedings of the 9th Workshop on Multiword Expressions, 2013

2012
Measuring the compositionality of NV expressions in Basque by means of distributional similarity techniques.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

BAD: An Assistant tool for making verses in Basque.
Proceedings of the 6th Workshop on Language Technology for Cultural Heritage, 2012

Finite-State Technology in a Verse-Making Tool.
Proceedings of the 10th International Workshop on Finite State Methods and Natural Language Processing, 2012

2011
Matxin-Informatika: versión del traductor Matxin adaptada al dominio de la informática.
Proces. del Leng. Natural, 2011

Syntactic error detection and correction in date expressions using finite-state transducers.
Nat. Lang. Eng., 2011

<i>Matxin</i>, an open-source rule-based machine translation system for Basque.
Mach. Transl., 2011

Teknologia garatzeko estrategiak baliabide urriko hizkuntzetarako: euskararen eta Ixa taldearen adibidea.
Linguamática, 2011

A multiclass/multilabel document categorization system: Combining multiple classifiers in a reduced dimension.
Appl. Soft Comput., 2011

Semantic Relatedness for Named Entity Disambiguation Using a Small Wikipedia.
Proceedings of the Text, Speech and Dialogue - 14th International Conference, 2011

Recognition and Classification of Numerical Entities in Basque.
Proceedings of the Recent Advances in Natural Language Processing, 2011

Automatic Extraction of NV Expressions in Basque: Basic Issues on Cooccurrence Techniques.
Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World, 2011

2010
Errores ortográficos y de competencia en textos de la web en euskera.
Proces. del Leng. Natural, 2010

A Morphological Processor Based on Foma for Biscayan (a Basque dialect).
Proceedings of the International Conference on Language Resources and Evaluation, 2010

2009
Evaluación de un sistema de traducción automática basado en reglas o por qué BLEU sólo sirve para lo que sirve.
Proces. del Leng. Natural, 2009

Morfología de estados finitos en software libre: aplicación al euskera.
Proces. del Leng. Natural, 2009

Valuable Language Resources and Applications Supporting the Use of Basque.
Proceedings of the Human Language Technology. Challenges for Computer Science and Linguistics, 2009

Porting Basque Morphological Grammars to <i>foma</i>, an Open-Source Tool.
Proceedings of the Finite-State Methods and Natural Language Processing, 2009

Exploring Basque Document Categorization for Educational Purposes using LSI.
Proceedings of the CSEDU 2009 - Proceedings of the First International Conference on Computer Supported Education, Lisboa, Portugal, March 23-26, 2009, 2009

Overview of ResPubliQA 2009: Question Answering Evaluation over European Legislation.
Proceedings of the Multilingual Information Access Evaluation I. Text Retrieval Experiments, 2009

2008
Chunk and Clause Identification for Basque by Filtering and Ranking with Perceptrons.
Proces. del Leng. Natural, 2008

Sylviane Cardey, Peter Greenfield, Séverine Vienney (eds): Machine Translation, Controlled Languages and Specialised Languages.
Mach. Transl., 2008

Spelling Correction: from Two-Level Morphology to Open Source.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Strategies for sustainable MT for Basque: incremental design, reusability, standardization and open-source.
Proceedings of the Third International Joint Conference on Natural Language Processing, 2008

Overview of the CLEF 2008 Multilingual Question Answering Track.
Proceedings of the Working Notes for CLEF 2008 Workshop co-located with the 12th European Conference on Digital Libraries (ECDL 2008) , 2008

Overview of the Clef 2008 Multilingual Question Answering Track.
Proceedings of the Evaluating Systems for Multilingual and Multimodal Information Access, 2008

Spanish-to-Basque MultiEngine Machine Translation for a Restricted Domain.
Proceedings of the 8th Conference of the Association for Machine Translation in the Americas: Research Papers, 2008

2007
Similitud entre Documentos Multilinges de Carácter Científico-Técnico en un Entorno Web.
Proces. del Leng. Natural, 2007

MCR for CLIR.
Proces. del Leng. Natural, 2007

Transfer-Based MT from Spanish into Basque: Reusability, Standardization and Open Source.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2007

2006
OpenTrad: Traducción automática de código abierto para las lenguas del Estado español.
Proces. del Leng. Natural, 2006

Lessons from the Development of a Named Entity Recognizer for Basque.
Proces. del Leng. Natural, 2006

Structure, Annotation and Tools in the Basque ZT Corpus.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

A Multiclassifier based Document Categorization System: profiting from the Singular Value Decomposition Dimensionality Reduction Technique.
Proceedings of the Workshop on Learning Structured Information in Natural Language Applications@EACL 2006, 2006

Using Machine Learning Techniques to Build a Comma Checker for Basque.
Proceedings of the ACL 2006, 2006

2005
An FST Grammar for Verb Chain Transfer in a Spanish-Basque MT System.
Proceedings of the Finite-State Methods and Natural Language Processing, 2005

An open-source shallow-transfer machine translation engine for the Romance languages of Spain.
Proceedings of the 10th EAMT Conference: Practical applications of machine translation, 2005

2004
A XML-Based Term Extraction Tool for Basque.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

2001
An Assistant Tool for Verse-Making in Basque Based on Two-Level Morphology.
Lit. Linguistic Comput., 2001

Using Finite State Technology in Natural Language Processing of Basque.
Proceedings of the Implementation and Application of Automata, 2001

2000
A word-grammar based morphological analyzer for agglutinative languages.
Proceedings of the COLING 2000, 18th International Conference on Computational Linguistics, Proceedings of the Conference, 2 Volumes, July 31, 2000

1999
Designing spelling correctors for inflected languages using lexical transducers.
Proceedings of the EACL 1999, 1999

1998
Combining Stochastic and Rule-Based Methods for Disambiguation in Agglutinative Languages.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

1995
Different Issues in the Design of a Lemmatizer/Tagger for Basque
CoRR, 1995

1993
A Morphological Analysis Based Method for Spelling Correction.
Proceedings of the Sixth Conference of the European Chapter of the Association for Computational Linguistics, 1993

1992
XUXEN: A Spelling Checker/Corrector for Basque Based on Two-Level Morphology.
Proceedings of the 3rd Applied Natural Language Processing Conference, 1992


  Loading...