Tomaz Erjavec

Lang. Resour. Evaluation, 2015

The slWaC Corpus of the SloveneWeb.

[BibT_eX]

[DOI]

Natasa Logar

Informatica (Slovenia), 2015

Predicting the Level of Text Standardness in User-generated Content.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Natural Language Processing, 2015

2014

TweetCaT: a tool for building Twitter corpora of smaller languages.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

sloWCrowd: A crowdsourcing tool for lexicographic tasks.

[BibT_eX]

[DOI]

Ales Tavcar

Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Standardizing Tweets with Character-Level Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the Computational Linguistics and Intelligent Text Processing, 2014

2013

Modernizing historical Slovene words with character-based SMT.

[BibT_eX]

[DOI]

Yves Scherrer

Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing, 2013

2012

MULTEXT-East: morphosyntactic resources for Central and Eastern European languages.

[BibT_eX]

[DOI]

Lang. Resour. Evaluation, 2012

NLP Web Services for Slovene and English: Morphosyntactic Tagging, Lemmatisation and Definition Extraction.

[BibT_eX]

[DOI]

Informatica (Slovenia), 2012

The goo300k corpus of historical Slovene.

[BibT_eX]

[DOI]

Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Lexicon Construction and Corpus Annotation of Historical Language with the CoBaLT Editor.

[BibT_eX]

[DOI]

Proceedings of the 6th Workshop on Language Technology for Cultural Heritage, 2012

2011

hrWaC and slWac: Compiling Web Corpora for Croatian and Slovene.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue - 14th International Conference, 2011

Automatic linguistic annotation of historical language: ToTrTaLe and XIX century Slovene.

[BibT_eX]

[DOI]

Proceedings of the 5th ACL Workshop on Language Technology for Cultural Heritage, 2011

OWL/DL formalization of the MULTEXT-East morphosyntactic specifications.

[BibT_eX]

[DOI]

Christian Chiarcos

Proceedings of the Fifth Linguistic Annotation Workshop, 2011

2010

LemmaGen: Multilingual Lemmatisation with Induced Ripple-Down Rules.

[BibT_eX]

[DOI]

J. Univers. Comput. Sci., 2010

Experimental Deployment of a Grid Virtual Organization for Human Language Technologies.

[BibT_eX]

[DOI]

Jan Jona Javorsek

Proceedings of the International Conference on Language Resources and Evaluation, 2010

The JOS Linguistically Tagged Corpus of Slovene.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Language Resources and Evaluation, 2010

MULTEXT-East Version 4: Multilingual Morphosyntactic Specifications, Lexicons and Corpora.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Language Resources and Evaluation, 2010

2009

A Common XML-based Framework for Syntactic Annotations

[BibT_eX]

[DOI]

Laurent Romary

CoRR, 2009

2008

Improving Morphosyntactic Tagging of Slovene Language through Meta-tagging.

[BibT_eX]

[DOI]

Jan Rupnik

Miha Grcar

Informatica (Slovenia), 2008

A Web Corpus and Word Sketches for Japanese.

[BibT_eX]

[DOI]

Irena Srdanovic Erjavec

Adam Kilgarriff

Inf. Media Technol., 2008

Ripple Down Rule learning for automated word lemmatisation.

[BibT_eX]

[DOI]

AI Commun., 2008

Designing and Evaluating a Russian Tagset.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Language Resources and Evaluation, 2008

The JOS Morphosyntactically Tagged Corpus of Slovene.

[BibT_eX]

[DOI]

Simon Krek

Proceedings of the International Conference on Language Resources and Evaluation, 2008

2007

Quantifying the MULTEXT-East morphosyntactic resources.

[BibT_eX]

[DOI]

Proceedings of the Exact Methods in the Study of Language and Text, 2007

2006

Morphosyntactic Tagging of Slovene Legal Language.

[BibT_eX]

[DOI]

Bence Sárossy

Informatica (Slovenia), 2006

A tool set for the quick and efficient exploration of large document collections

[BibT_eX]

[DOI]

CoRR, 2006

The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Building Slovene WordNet.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

The English-Slovene ACQUIS corpus.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Towards a Slovene Dependency Treebank.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

TEI and Microsoft: a marriage made in....

[BibT_eX]

[DOI]

Proceedings of the Digital Historical Corpora - Architecture, Annotation, and Retrieval, 03.12., 2006

2005

The VoiceTRAN Speech-to-Speech Communicator.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue, 8th International Conference, 2005

Digital Critical Editions of Slovenian Literature: an Application of Collaborative Work Using Open Standards.

[BibT_eX]

[DOI]

Matija Ogrin

Proceedings of the From Author to Reader: Challenges for the Digital Content Chain: Proceedings of the 9th ICCC International Conference on Electronic Publishing held at Katholieke Universiteit Leuven, 2005

Initial considerations in building a speech-to-speech translation system for the Slovenian-English language pair.

[BibT_eX]

[DOI]

Proceedings of the 10th EAMT Conference: Practical applications of machine translation, 2005

2004

Morpho-Syntactic Descriptions in MULTEXT-East - the Case of Serbian.

[BibT_eX]

Cvetana Krstev

Dusko Vitas

Informatica (Slovenia), 2004

Machine Learning of Morphosyntactic Structure: Lemmatizing Unknown Slovene Words.

[BibT_eX]

[DOI]

Éric Villemonte de la Clergerie

Appl. Artif. Intell., 2004

Towards an International Standard on Feature Structure Representation.

[BibT_eX]

[DOI]

Kiyong Lee

Lou Burnard

Laurent Romary

Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Making an XML-based Japanese-Slovene Learners' Dictionary.

[BibT_eX]

[DOI]

Kristina Hmeljak Sangawa

Irena Srdanovic

Anton ml. Vahcic

Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

MULTEXT-East Version 3: Multilingual Morphosyntactic Specifications, Lexicons and Corpora.

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Migrating Language Resources from SGML to XML: The Text Encoding Initiative Recommendations.

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

2003

Encoding Biomedical Resources in TEI: The Case of the GENIA Corpus.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Natural Language Processing in Biomedicine, 2003

Stretching TEI: Converting the Genia Corpus.

[BibT_eX]

[DOI]

Proceedings of 4th International Workshop on Linguistically Interpreted Corpora, 2003

2002

Compiling and Using the IJS-ELAN Parallel Corpus.

[BibT_eX]

Informatica (Slovenia), 2002

Sense Discrimination with Parallel Corpora.

[BibT_eX]

[DOI]

Dan Tufis

Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions, 2002

2001

Automatic Sense Tagging Using Parallel Corpora.

[BibT_eX]

[DOI]

Dan Tufis

Proceedings of the Sixth Natural Language Processing Pacific Rim Symposium, 2001

Harmonised Morphosyntactic Tagging for Seven Languages and Orwell's 1984.

[BibT_eX]

[DOI]

Proceedings of the Sixth Natural Language Processing Pacific Rim Symposium, 2001

2000

Rules for Automatic Grapheme-to-Allophone Transcription in Slovene.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue - Third International Workshop, 2000

Corpora of Slovene Spoken Language for Multi-lingual Applications.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

The Concede Model for Lexical Databases.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

Morphosyntactic Tagging of Slovene: Evaluating Taggers and Tagsets.

[BibT_eX]

[DOI]

Jakub Zavrel

Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

1999

The ELAN Slovene-English aligned corpus.

[BibT_eX]

[DOI]

Proceedings of Machine Translation Summit VII, 1999

Learning to Lemmatise Slovene Words.

[BibT_eX]

[DOI]

Proceedings of the Learning Language in Logic, 1999

Learning Word Segmentation Rules for Tag Prediction.

[BibT_eX]

[DOI]

Dimitar Kazakov

Suresh Manandhar

Proceedings of the Inductive Logic Programming, 9th International Workshop, 1999

Morphosyntactic Tagging of Slovene Using Progol.

[BibT_eX]

[DOI]

James Cussens

Proceedings of the Inductive Logic Programming, 9th International Workshop, 1999

1998

Standardised specifications, development and assessment of large morpho-lexical resources for six central and eastern european languages.

[BibT_eX]

Dan Tufis

Proceedings of the First International Conference on Language Resources and Evaluation, 1998

East meets West: multilingual resources in a European context.

[BibT_eX]

Ann Lawson

Laurent Romary

Proceedings of the First International Conference on Language Resources and Evaluation, 1998

The MULTEXT East corpus.

[BibT_eX]

Proceedings of the First International Conference on Language Resources and Evaluation, 1998

Learning Multilingual Morphology with CLOG.

[BibT_eX]

[DOI]

Suresh Manandhar

Proceedings of the Inductive Logic Programming, 8th International Workshop, 1998

Multext-East: Parallel and Comparable Corpora and Lexicons for Six Central and Eastern European Languages.

[BibT_eX]

[DOI]

Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

1997

Induction of Slovene Nominal Paradigms.

[BibT_eX]

[DOI]

Proceedings of the Inductive Logic Programming, 7th International Workshop, 1997

1990

An Integrated System For Morphological Analysis Of The Slovene Language.

[BibT_eX]

[DOI]