Mathias Creutz

According to our database1, Mathias Creutz authored at least 40 papers between 2002 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
On Using Distribution-Based Compositionality Assessment to Evaluate Compositional Generalisation in Machine Translation.
CoRR, 2023

Guiding Zero-Shot Paraphrase Generation with Fine-Grained Control Tokens.
Proceedings of the The 12th Joint Conference on Lexical and Computational Semantics, 2023

Evaluating Morphological Generalisation in Machine Translation by Distribution-Based Compositionality Assessment.
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023

2022
GEMv2: Multilingual NLG Benchmarking in a Single Line of Code.
CoRR, 2022

Helsinki-NLP at SemEval-2022 Task 2: A Feature-Based Approach to Multilingual Idiomaticity Detection.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

Modeling Noise in Paraphrase Detection.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

A Closer Look at Parameter Contributions When Training Neural Language and Translation Models.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

It Is Not Easy To Detect Paraphrases: Analysing Semantic Similarity With Antonyms and Negation Using the New SemAntoNeg Benchmark.
Proceedings of the Fifth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2022

2021
Semantic Search as Extractive Paraphrase Span Detection.
CoRR, 2021

Grammatical Error Generation Based on Translated Fragments.
Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

An Empirical Investigation of Word Alignment Supervision for Zero-Shot Multilingual Neural Machine Translation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Coping with Noisy Training Data Labels in Paraphrase Detection.
Proceedings of the Seventh Workshop on Noisy User-generated Text, 2021

On the differences between BERT and MT encoder spaces and how to address them in translation tasks.
Proceedings of the ACL-IJCNLP 2021 Student Research Workshop, 2021

2020
A Systematic Study of Inner-Attention-Based Sentence Representations in Multilingual Neural Machine Translation.
Comput. Linguistics, 2020

Paraphrase Generation and Evaluation on Colloquial-Style Sentences.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

2019
Multilingual NMT with a Language-Independent Attention Bridge.
Proceedings of the 4th Workshop on Representation Learning for NLP, 2019

An Evaluation of Language-Agnostic Inner-Attention-Based Representations in Machine Translation.
Proceedings of the 4th Workshop on Representation Learning for NLP, 2019

Annotation of subtitle paraphrases using a new web tool.
Proceedings of the Digital Humanities in the Nordic Countries 4th Conference, 2019

2018
Open Subtitles Paraphrase Corpus for Six Languages.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Paraphrase Detection on Noisy Subtitles in Six Languages.
Proceedings of the 4th Workshop on Noisy User-generated Text, 2018

2009
Web Augmentation of Language Models for Continuous Speech Recognition of SMS Text Messages.
Proceedings of the EACL 2009, 12th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, Athens, Greece, March 30, 2009

2008
Speech to speech machine translation: Biblical chatter from Finnish to English.
Proceedings of the Third International Joint Conference on Natural Language Processing, 2008

Morpho Challenge Evaluation by Information Retrieval Experiments.
Proceedings of the Evaluating Systems for Multilingual and Multimodal Information Access, 2008

2007
Unsupervised models for morpheme segmentation and morphology learning.
ACM Trans. Speech Lang. Process., 2007

Morph-based speech recognition and modeling of out-of-vocabulary words across languages.
ACM Trans. Speech Lang. Process., 2007

Analysis of Morph-Based Speech Recognition and the Modeling of Out-of-Vocabulary Words Across Languages.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

Morphology-aware statistical machine translation based on morphs induced in an unsupervised manner.
Proceedings of Machine Translation Summit XI: Papers, 2007

Morfessor and variKN machine learning tools for speech and language technology.
Proceedings of the INTERSPEECH 2007, 2007

Unsupervised Morpheme Analysis Evaluation by a Comparison to a Linguistic Gold Standard - Morpho Challenge 2007.
Proceedings of the Working Notes for CLEF 2007 Workshop co-located with the 11th European Conference on Digital Libraries (ECDL 2007), 2007

Morpho Challenge Evaluation Using a Linguistic Gold Standard.
Proceedings of the Advances in Multilingual and Multimodal Information Retrieval, 2007

Unsupervised Morpheme Analysis Evaluation by IR experiments - Morpho Challenge 2007.
Proceedings of the Working Notes for CLEF 2007 Workshop co-located with the 11th European Conference on Digital Libraries (ECDL 2007), 2007

Overview of Morpho Challenge in CLEF 2007.
Proceedings of the Working Notes for CLEF 2007 Workshop co-located with the 11th European Conference on Digital Libraries (ECDL 2007), 2007

2006
Unlimited vocabulary speech recognition with morph language models applied to Finnish.
Comput. Speech Lang., 2006

Unsupervised segmentation of words into morphemes - morpho challenge 2005 application to automatic speech recognition.
Proceedings of the INTERSPEECH 2006, 2006

2005
Unsupervised Morphology Induction Using Morfessor.
Proceedings of the Finite-State Methods and Natural Language Processing, 2005

2004
Induction of a Simple Morphology for Highly-Inflecting Languages.
Proceedings of the 7th Meeting of the ACL Special Interest Group in Computational Phonology: Current Themes in Computational Phonology and Morphology, 2004

2003
Unlimited vocabulary speech recognition based on morphs discovered in an unsupervised manner.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

On lexicon creation for turkish LVCSR.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Unsupervised Segmentation of Words Using Prior Distributions of Morph Length and Frequency.
Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, 2003

2002
Unsupervised Discovery of Morphemes.
Proceedings of the ACL-02 Workshop on Morphological and Phonological Learning, 2002


  Loading...