Mikko Kurimo

Anja Virkkunen

Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, 2023

Advancing Audio Emotion and Intent Recognition with Large Pre-Trained Models and Bayesian Inference.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Investigating wav2vec2 context representations and the effects of fine-tuning, a case-study of a Finnish model.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Topic Identification for Spontaneous Speech: Enriching Audio Features with Embedded Linguistic Information.

Proceedings of the 31st European Signal Processing Conference, 2023

2022

Data Augmentation Using Spectral Warping for Low Resource Children ASR.

Virender Kadyan

J. Signal Process. Syst., December, 2022

A formant modification method for improved ASR of children's speech.

Paavo Alku

Speech Commun., 2022

End-to-end Ensemble-based Feature Selection for Paralinguistics Tasks.

CoRR, 2022

Finnish Parliament ASR corpus - Analysis, benchmarks and statistics.

CoRR, 2022

Lahjoita puhetta - a large-scale corpus of spoken Finnish with some benchmarks.

CoRR, 2022

Automatic Rating of Spontaneous Speech for Low-Resource Languages.

Ragheb Al-Ghezi

Yaroslav Getman

Ekaterina Voskoboinik

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Wav2vec2-based Paralinguistic Systems to Recognise Vocalised Emotions and Stuttering.

Yaroslav Getman

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Low Resource Comparison of Attention-based and Hybrid ASR Exploiting wav2vec 2.0.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Comparison and Analysis of New Curriculum Criteria for End-to-End ASR.

Georgios Karakasidis

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

wav2vec2-based Speech Rating System for Children with Speech Sound Disorder.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Tracing Signs of Urbanity in the Finnish Fiction Film of the 1950s: Toward a Multimodal Analysis of Audiovisual Data.

Proceedings of the 6th Digital Humanities in the Nordic and Baltic Countries Conference (DHNB 2022), 2022

When to Laugh and How Hard? A Multimodal Approach to Detecting Humor and Its Intensity.

Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021

Morphologically motivated word classes for very large vocabulary speech recognition of Finnish and Estonian.

Comput. Speech Lang., 2021

Advances in subword-based HMM-DNN speech recognition across languages.

Comput. Speech Lang., 2021

Attention-Based End-to-End Named Entity Recognition from Speech.

Juho Leinonen

Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021

LSTM-XL: Attention Enhanced Long-Term Memory for LSTM Cells.

Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021

An Equal Data Setting for Attention-Based Encoder-Decoder and HMM/DNN Models: A Case Study in Finnish ASR.

Proceedings of the Speech and Computer - 23rd International Conference, 2021

Synthesis Speech Based Data Augmentation for Low Resource Children ASR.

Virender Kadyan

Prajjval Govil

Proceedings of the Speech and Computer - 23rd International Conference, 2021

Grapheme-Based Cross-Language Forced Alignment: Results with Uralic Languages.

Juho Leinonen

Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

Spectral modification for recognition of children's speech under mismatched conditions.

Paavo Alku

Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

Speaker Verification Experiments for Adults and Children Using Shared Embedding Spaces.

Tuomas Kaseva

Aku Rouhe

Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

Self-Supervised End-to-End ASR for Low Resource L2 Swedish.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Vowel Non-Vowel Based Spectral Warping and Time Scale Modification for Improvement in Children's ASR.

Avinash Kumar

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Brain activity reflects the predictability of word sequences in listened continuous speech.

NeuroImage, 2020

Transfer learning and subword sampling for asymmetric-resource one-to-many neural translation.

Mach. Transl., 2020

Aalto's End-to-End DNN systems for the INTERSPEECH 2020 Computational Paralinguistics Challenge.

CoRR, 2020

Finnish Language Modeling with Deep Transformer Models.

CoRR, 2020

Using Fan-Made Content, Subtitles and Face Recognition for Character-Centric Video Summarization.

Proceedings of the 2020 TREC Video Retrieval Evaluation, 2020

Effects of Language Relatedness for Cross-lingual Transfer Learning in Character-Based Language Models.

Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

Named Entity Recognition for Spoken Finnish.

Juho Leinonen

Proceedings of the AI4TV '20: Proceedings of the 2nd International Workshop on AI for Smart TV Content Production, 2020

Visual Interpretation of DNN-based Acoustic Models using Deep Autoencoders.

Proceedings of the 3rd Workshop on Machine Learning Methods in Visualisation for Big Data, 2020

Morfessor EM+Prune: Improved Subword Segmentation with Expectation Maximization and Pruning.

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Releasing a Toolkit and Comparing the Performance of Language Embeddings Across Various Spoken Language Identification Datasets.

Matias Lindgren

Tommi Jauhiainen

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

FinChat: Corpus and Evaluation Setup for Finnish Chat Conversations on Everyday Topics.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Data Augmentation Using Prosody and False Starts to Recognize Non-Native Children's Speech.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Finnish ASR with Deep Transformer Models.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Speaker-Aware Training of Attention-Based End-to-End Speech Recognition Using Neural Speaker Embeddings.

Aku Rouhe

Tuomas Kaseva

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Study of Formant Modification for Children ASR.

Paavo Alku

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Service registration chatbot: collecting and comparing dialogues from AMT workers and service's users.

Proceedings of the Sixth Workshop on Noisy User-generated Text, 2020

2019

A user study to compare two conversational assistants designed for people with hearing impairments.

Proceedings of the Eighth Workshop on Speech and Language Processing for Assistive Technologies, 2019

Computer-supported form design using keystroke-level modeling with reinforcement learning.

Proceedings of the 24th International Conference on Intelligent User Interfaces: Companion, 2019

RL-KLM: automating keystroke-level modeling with reinforcement learning.

Katri Leino

Antti Oulasvirta

Proceedings of the 24th International Conference on Intelligent User Interfaces, 2019

Subword RNNLM Approximations for Out-Of-Vocabulary Keyword Search.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Transparent Pronunciation Scoring Using Articulatorily Weighted Phoneme Edit Distance.

Sari Ylinen

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Spherediar: An Effective Speaker Diarization System for Meeting Data.

Tuomas Kaseva

Aku Rouhe

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018

User Experiences from L2 Children Using a Speech Learning Application: Implications for Developing Speech Training Applications for Children.

Maria Uther

Adv. Hum. Comput. Interact., 2018

Cognate-aware morphological segmentation for multilingual neural translation.

Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

The MeMAD Submission to the WMT18 Multimodal Translation Task.

Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

First-Pass Techniques for Very Large Vocabulary Speech Recognition of Morphologically Rich Languages.

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

The MeMAD Submission to the IWSLT 2018 Speech Translation Task.

Proceedings of the 15th International Conference on Spoken Language Translation, 2018

Captaina: Integrated Pronunciation Practice and Data Collection Portal.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

The Aalto system based on fine-tuned AudioSet features for DCASE2018 task2 - general purpose audio tagging.

Zhicun Xu

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

2017

Automatic Speech Recognition With Very Large Conversational Finnish and Estonian Vocabularies.

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Modeling under-resourced languages for speech recognition.

Lang. Resour. Evaluation, 2017

Extending hybrid word-character neural machine translation with multi-task learning of morphological analysis.

Proceedings of the Second Conference on Machine Translation, 2017

A pipeline for automatic assessment of foreign language pronunciation.

Proceedings of the 7th ISCA International Workshop on Speech and Language Technology in Education, 2017

Acoustic Model Compression with MAP adaptation.

Katri Leino

Proceedings of the 21st Nordic Conference on Computational Linguistics, 2017

Improved Subword Modeling for WFST-Based Speech Recognition.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Reading Validation for Pronunciation Evaluation in the Digitala Project.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Automatic Construction of the Finnish Parliament Speech Corpus.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

SIAK - A Game for Foreign Language Pronunciation Learning.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

LDA-based context dependent recurrent neural network language model using document-based topic distribution of words.

Md. Akmal Haidar

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

ASR in Classroom Today: Automatic Visualization of Conceptual Network in Science Classrooms.

Proceedings of the Data Driven Approaches in Digital Education, 2017

Aalto system for the 2017 Arabic multi-genre broadcast challenge.

Siva Reddy Gangireddy

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Character-based units for unlimited vocabulary continuous speech recognition.

Siva Reddy Gangireddy

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016

FinnPos: an open-source morphological tagging and lemmatization toolkit for Finnish.

Lang. Resour. Evaluation, 2016

Comparing human and automatic speech recognition in a perceptual restoration experiment.

Comput. Speech Lang., 2016

A Comparative Study of Minimally Supervised Morphological Segmentation.

Comput. Linguistics, 2016

Hybrid Morphological Segmentation for Phrase-Based Machine Translation.

Proceedings of the First Conference on Machine Translation, 2016

In-Document Adaptation for a Human Guided Automatic Transcription Service.

Krister Lindén

Proceedings of the Speech and Computer - 18th International Conference, 2016

Class n-Gram Models for Very Large Vocabulary Speech Recognition of Finnish and Estonian.

Proceedings of the Statistical Language and Speech Processing, 2016

Towards SamiTalk: A Sami-Speaking Robot Linked to Sami Wikipedia.

Proceedings of the Dialogues with Social Robots, 2016

Digitala: An Augmented Test and Review Process Prototype for High-Stakes Spoken Foreign Language Examination.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Recurrent Neural Network Language Model with Incremental Updated Context Information Generated Using Bag-of-Words Representation.

Md. Akmal Haidar

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

TheanoLM - An Extensible Toolkit for Neural Network Language Modeling.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Experiments on Adaptation Methods to Improve Acoustic Modeling for French Speech Recognition.

Proceedings of the 5th International Conference on Pattern Recognition Applications and Methods, 2016

2015

Bounded Conditional Mean Imputation with Observation Uncertainties and Acoustic Model Adaptation.

IEEE ACM Trans. Audio Speech Lang. Process., 2015

Adaptation of Morph-Based Speech Recognition for Foreign Names and Acronyms.

IEEE ACM Trans. Audio Speech Lang. Process., 2015

Tuning Phrase-Based Segmented Translation for a Morphologically Complex Target Language.

Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

Unsupervised and User Feedback Based Lexicon Adaptation for Foreign Names and Acronyms.

Olli Philippe Lautenbacher

Proceedings of the Statistical Language and Speech Processing, 2015

Designing multichannel source separation based on single-channel source separation.

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Towards Reliable Automatic Multimodal Content Analysis.

Proceedings of the Fourth Workshop on Vision and Language, 2015

2014

Noise in HMM-Based Speech Synthesis Adaptation: Analysis, Evaluation Methods and Experiments.

IEEE J. Sel. Top. Signal Process., 2014

A word-level token-passing decoder for subword n-gram LVCSR.

Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

A Toolkit for Efficient Learning of Lexical Units for Speech Recognition.

Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Spectral tilt modelling with extrapolated GMMs for intelligibility enhancement of narrowband telephone speech.

Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014

Spectral tilt modelling with GMMs for intelligibility enhancement of narrowband telephone speech.

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

On the role of missing data imputation and NMF feature enhancement in building synthetic voices using reverberant speech.

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Unsupervised feature extraction for multimedia event detection and ranking using audio content.

Proceedings of the IEEE International Conference on Acoustics, 2014

Morfessor 2.0: Toolkit for statistical morphological segmentation.

Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

Accelerated Estimation of Conditional Random Fields using a Pseudo-Likelihood-inspired Perceptron Variant.

Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

Painless Semi-Supervised Morphological Segmentation using Conditional Random Fields.

Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

Morfessor FlatCat: An HMM-Based Method for Unsupervised and Semi-Supervised Learning of Morphology.

Proceedings of the COLING 2014, 2014

Part-of-Speech Tagging using Conditional Random Fields: Exploiting Sub-Label Dependencies for Improved Accuracy.

Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013

Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis.

Comput. Speech Lang., 2013

PicSOM Experiments in TRECVID 2013.

Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013

Objective evaluation measures for speaker-adaptive HMM-TTS systems.

Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Robust spectral representation using group delay function and stabilized weighted linear prediction for additive noise degradations.

Proceedings of the 7th Conference on Speech Technology and Human-Computer Dialogue, 2013

A novel discriminative method for pruning pronunciation dictionary entries.

Proceedings of the 7th Conference on Speech Technology and Human-Computer Dialogue, 2013

Results for Variable Speaker and Recording Conditions on Spoken IR in Finnish.

Sami Keronen

Proceedings of the Speech and Computer - 15th International Conference, 2013

Studies on training text selection for conversational Finnish language modeling.

Proceedings of the 10th International Workshop on Spoken Language Translation: Papers, 2013

Lombard modified text-to-speech synthesis for improved intelligibility: submission for the Hurricane Challenge 2013.

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Unsupervised topic adaptation for morph-based speech recognition.

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Robust formant detection using group delay function and stabilized weighted linear prediction.

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Analysis of breathy, modal and pressed phonation based on low frequency spectral density.

Dhananjaya N. Gowda

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

HMM-based speech synthesis adaptation using noisy data: Analysis and evaluation methods.

Proceedings of the IEEE International Conference on Acoustics, 2013

Supervised Morphological Segmentation in a Low-Resource Learning Setting using Conditional Random Fields.

Proceedings of the Seventeenth Conference on Computational Natural Language Learning, 2013

Learning a subword vocabulary based on unigram likelihood.

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012

Analysis of Extended Baum-Welch and Constrained Optimization for Discriminative Training of HMMs.

IEEE Trans. Speech Audio Process., 2012

Bandwidth Extension of Telephone Speech to Low Frequencies Using Sinusoidal Synthesis and a Gaussian Mixture Model.

IEEE Trans. Speech Audio Process., 2012

Unsupervised Vocabulary Adaptation for Morph-based Language Models.

Proceedings of the Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT, 2012

Optimization-Based Control for the Extended Baum-Welch Algorithm.

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Improving Discriminative Training for Robust Acoustic Models in Large Vocabulary Continuous Speech Recognition.

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Creating synthetic voices for children by adapting adult average voice using stacked transformations and VTLN.

Doddipatla Rama Sanand

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Adaptation of Morpheme-based Speech Recognition for Foreign Entity Names.

Proceedings of the Human Language Technologies - The Baltic Perspective, 2012

2011

An augmented reality interface to contextual information.

Virtual Real., 2011

Speech retrieval from unsegmented Finnish audio using statistical morpheme-like units for segmentation, recognition, and retrieval.

ACM Trans. Speech Lang. Process., 2011

Empirical Comparison of Evaluation Methods for Unsupervised Learning of Morphology.

Trait. Autom. des Langues, 2011

Missing-Feature Reconstruction With a Bounded Nonlinear State-Space Model.

IEEE Signal Process. Lett., 2011

A Study on Combining VTLN and SAT to Improve the Performance of Automatic Speech Recognition.

Doddipatla Rama Sanand

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Low-Frequency Bandwidth Extension of Telephone Speech Using Sinusoidal Synthesis and Gaussian Mixture Model.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Noise Robust Feature Extraction Based on Extended Weighted Linear Prediction in LVCSR.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Using stacked transformations for recognizing foreign accented speech.

Proceedings of the IEEE International Conference on Acoustics, 2011

Speech bandwidth extension using Gaussian mixture model-based estimation of the highband mel spectrum.

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

Thousands of Voices for HMM-Based Speech Synthesis - Analysis and Application of TTS Systems Built on Various ASR Corpora.

IEEE Trans. Speech Audio Process., 2010

Applying Morphological Decompositions to Statistical Machine Translation.

Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, 2010

Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project.

Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

Unsupervised cross-lingual speaker adaptation for accented speech recognition.

Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Morpho Challenge 2005-2010: Evaluations and Results.

Proceedings of the 11th Meeting of the ACL Special Interest Group on Computational Morphology and Phonology, 2010

Efficient estimation of maximum entropy language models with n-gram features: an SRILM extension.

Tanel Alumäe

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using two-pass decision tree construction.

Proceedings of the IEEE International Conference on Acoustics, 2010

Comparison of noise robust methods in large vocabulary speech recognition.

Proceedings of the 18th European Signal Processing Conference, 2010

Personalising Speech-To-Speech Translation in the EMIME Project.

Proceedings of the ACL 2010, 2010

Domain Adaptation of Maximum Entropy Language Models.

Tanel Alumäe

Proceedings of the ACL 2010, 2010

2009

Importance of High-Order N-Gram Models in Morph-Based Speech Recognition.

Teemu Hirsimäki

IEEE Trans. Speech Audio Process., 2009

Morpho Challenge - Evaluation of algorithms for unsupervised learning of morphology in various tasks and languages.

Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Analysing Recognition Errors in Unlimited-Vocabulary Speech Recognition.

Teemu Hirsimäki

Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Minimum Bayes Risk Combination of Translation Hypotheses from Alternative Morphological Decompositions.

Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Thousands of voices for HMM-based speech synthesis.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Weighted linear prediction for speech analysis in noisy conditions.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Robust automatic speech recognition using acoustic model adaptation prior to missing feature reconstruction.

Kalle J. Palomäki

Proceedings of the 17th European Signal Processing Conference, 2009

Overview and Results of Morpho Challenge 2009.

Proceedings of the Multilingual Information Access Evaluation I. Text Retrieval Experiments, 2009

2008

Speech to speech machine translation: Biblical chatter from Finnish to English.

Proceedings of the Third International Joint Conference on Natural Language Processing, 2008

Missing feature reconstruction and acoustic model adaptation combined for large vocabulary continuous speech recognition.

Kalle J. Palomäki

Proceedings of the 2008 16th European Signal Processing Conference, 2008

Unsupervised Morpheme Analysis Evaluation by a Comparison to a Linguistic Gold Standard - Morpho Challenge 2008.

Proceedings of the Working Notes for CLEF 2008 Workshop co-located with the 12th European Conference on Digital Libraries (ECDL 2008), 2008

Overview of Morpho Challenge 2008.

Proceedings of the Evaluating Systems for Multilingual and Multimodal Information Access, 2008

Unsupervised Morpheme Analysis Evaluation by IR experiments - Morpho Challenge 2008.

Proceedings of the Working Notes for CLEF 2008 Workshop co-located with the 12th European Conference on Digital Libraries (ECDL 2008), 2008

Morpho Challenge Evaluation by Information Retrieval Experiments.

Proceedings of the Evaluating Systems for Multilingual and Multimodal Information Access, 2008

2007

Morph-based speech recognition and modeling of out-of-vocabulary words across languages.

ACM Trans. Speech Lang. Process., 2007

Indexing confusion networks for morph-based spoken document retrieval.

Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

Analysis of Morph-Based Speech Recognition and the Modeling of Out-of-Vocabulary Words Across Languages.

Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

Comparison of subspace methods for Gaussian mixture models in speech recognition.

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Morfessor and variKN machine learning tools for speech and language technology.

Vesa Siivola

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Segregation of Speakers for Speaker Adaptation in TV News Audio.

Proceedings of the IEEE International Conference on Acoustics, 2007

Unsupervised Morpheme Analysis Evaluation by a Comparison to a Linguistic Gold Standard - Morpho Challenge 2007.

Proceedings of the Working Notes for CLEF 2007 Workshop co-located with the 11th European Conference on Digital Libraries (ECDL 2007), 2007

Morpho Challenge Evaluation Using a Linguistic Gold Standard.

Proceedings of the Advances in Multilingual and Multimodal Information Retrieval, 2007

Unsupervised Morpheme Analysis Evaluation by IR experiments - Morpho Challenge 2007.

Proceedings of the Working Notes for CLEF 2007 Workshop co-located with the 11th European Conference on Digital Libraries (ECDL 2007), 2007

Overview of Morpho Challenge in CLEF 2007.

Proceedings of the Working Notes for CLEF 2007 Workshop co-located with the 11th European Conference on Digital Libraries (ECDL 2007), 2007

Vocabulary Decomposition for Estonian Open Vocabulary Speech Recognition.

Antti Puurula

Proceedings of the ACL 2007, 2007

2006

Unlimited vocabulary speech recognition with morph language models applied to Finnish.

Comput. Speech Lang., 2006

Unlimited vocabulary speech recognition for agglutinative languages.

Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

Compact n-gram models by incremental growing and clustering of histories.

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Using latent semantic indexing for morph-based spoken document retrieval.

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Unsupervised segmentation of words into morphemes - Morpho Challenge 2005 application to automatic speech recognition.

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

2005

SpeechFind: Advances in Spoken Document Retrieval for a National Gallery of the Spoken Word.

Pongtep Angkititrakul

IEEE Trans. Speech Audio Process., 2005

To recover from speech recognition errors in spoken document retrieval.

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Methods for combining language models in speech recognition.

Simo Broman

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004

Speech Transcription and Spoken Document Retrieval in Finnish.

Inger Ekman

Proceedings of the Machine Learning for Multimodal Interaction, 2004

Duration modeling techniques for continuous speech recognition.

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

An evaluation of a spoken document retrieval baseline system in Finnish.

Inger Ekman

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Language modeling structures in audio transcription for retrieval of historical speeches.

Proceedings of the 2004 12th European Signal Processing Conference, 2004

2003

Unlimited vocabulary speech recognition based on morphs discovered in an unsupervised manner.

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

On lexicon creation for Turkish LVCSR.

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002

Thematic indexing of spoken documents by using self-organizing maps.

Speech Commun., 2002

Language model adaptation in speech recognition using document maps.

Krista Lagus

Proceedings of the 12th IEEE Workshop on Neural Networks for Signal Processing, 2002

An Efficiently Focusing Large Vocabulary Language Model.

Krista Lagus

Proceedings of the Artificial Neural Networks, 2002

2001

Large vocabulary statistical language modeling for continuous speech recognition in Finnish.

Vesa Siivola

Krista Lagus

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000

Fast latent semantic indexing of spoken documents by using self-organizing maps.

Proceedings of the IEEE International Conference on Acoustics, 2000

Indexing spoken audio by LSA and SOMs.

Proceedings of the 10th European Signal Processing Conference, 2000

1998

Improving vocabulary independent HMM decoding results by using the dynamically expanding context.

Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Self-organization in mixture densities of HMM based speech recognition.

Proceedings of the 6th European Symposium on Artificial Neural Networks, 1998

1997

Training mixture density HMMs with SOM and LVQ.

Comput. Speech Lang., 1997

Comparison results for segmental training algorithms for mixture density HMMs.

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1996

Using the self-organizing map to speed up the probability density estimation for speech recognition with mixture density HMMs.

Panu Somervuo

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Segmental LVQ3 training for phoneme-wise tied mixture density HMMs.

Proceedings of the 8th European Signal Processing Conference, 1996

1993

Using LVQ to enhance semi-continuous hidden Markov models for phonemes.

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

1992

Application of self-organizing maps and LVQ in training continuous density hidden Markov models for phonemes.
