Okko Johannes Räsänen

Speech Commun., 2019

Automatic Posture and Movement Tracking of Infants with Wearable Movement Sensors.

CoRR, 2019

Vocal Effort Based Speaking Style Conversion Using Vocoder Features and Parallel Learning.

IEEE Access, 2019

Augmented CycleGANs for Continuous Scale Normal-to-Lombard Speaking Style Conversion.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

A Computational Model of Early Language Acquisition from Audiovisual Experiences of Young Infants.

Khazar Khorrami

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Cycle-consistent Adversarial Networks for Non-parallel Vocal Effort Based Speaking Style Conversion.

Proceedings of the IEEE International Conference on Acoustics, 2019

Data Augmentation Strategies for Neural Network F0 Estimation.

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Comparison of spectral tilt measures for sentence prominence in speech - Effects of dimensionality and adverse noise conditions.

Paavo Alku

Speech Commun., 2018

Comparison of Syllabification Algorithms and Training Strategies for Robust Word Count Estimation across Different Languages and Recording Conditions.

Marisa Casillas

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Time-regularized Linear Prediction for Noise-robust Extraction of the Spectral Envelope of Speech.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017

An online model for vowel imitation learning.

Speech Commun., 2017

Comparison of Non-Parametric Bayesian Mixture Models for Syllable Clustering and Zero-Resource Speech Processing.

Ulpu Remes

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Speaking Style Conversion from Normal to Lombard Speech Using a Glottal Vocoder and Bayesian GMMs.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Evaluation of Spectral Tilt Measures for Sentence Prominence Under Different Noise Conditions.

Paavo Alku

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Dirichlet process mixture models for clustering i-vector data.

Ulpu Remes

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Connecting stimulus-driven attention to the properties of infant-directed speech - Is exaggerated intonation also more surprising?

Melanie Soderstrom

Proceedings of the 39th Annual Meeting of the Cognitive Science Society, 2017

Blind Phoneme Segmentation With Temporal Prediction Errors.

Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016

Sequence Prediction With Sparse Distributed Hyperdimensional Coding Applied to the Analysis of Mobile Phone Use Patterns.

Jukka P. Saarinen

IEEE Trans. Neural Networks Learn. Syst., 2016

3PRO - An unsupervised method for the automatic detection of sentence prominence in speech.

Speech Commun., 2016

Improving Phoneme Segmentation with Recurrent Neural Networks.

CoRR, 2016

Perception of Sentence Stress in Speech Correlates With the Temporal Unpredictability of Prosodic Features.

Cogn. Sci., 2016

Analyzing the Contribution of Top-Down Lexical and Bottom-Up Acoustic Cues in the Detection of Sentence Prominence.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Analyzing distributional learning of phonemic categories in unsupervised deep neural networks.

Tasha Nagamine

Nima Mesgarani

Proceedings of the 38th Annual Meeting of the Cognitive Science Society, 2016

A Cognitive Approach to Modeling Sentence Level Prominence Based on Stimulus Unpredictability.

Proceedings of the 38th Annual Meeting of the Cognitive Science Society, 2016

Statistical Learning of Prosodic Patterns and Reversal of Perceptual Cues for Sentence Prominence.

Proceedings of the 38th Annual Meeting of the Cognitive Science Society, 2016

2015

Feature selection methods and their combinations in high-dimensional classification of speaker likability, intelligibility and personality traits.

Jouni Pohjalainen

Serdar Kadioglu

Comput. Speech Lang., 2015

Weakly-supervised word learning is improved by an active online algorithm.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Unsupervised word discovery from speech using automatic segmentation into syllable-like units.

Gabriel Doyle

Michael C. Frank

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Automatic detection of sentence prominence in speech using predictability of word-level acoustic features.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Data-driven metric representing the maturation of preterm EEG.

Ninah Koolen

Anneleen Dereymaeker

Proceedings of the 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2015

Computational evidence for effects of memory decay, familiarity preference and mutual exclusivity in cross-situational learning.

Proceedings of the 37th Annual Meeting of the Cognitive Science Society, 2015

Cross-situational cues are relevant for early word segmentation.

Proceedings of the 37th Annual Meeting of the Cognitive Science Society, 2015

Generating Hyperdimensional Distributed Representations from Continuous-Valued Multivariate Sensory Input.

Proceedings of the 37th Annual Meeting of the Cognitive Science Society, 2015

Analyzing the Predictability of Lexeme-specific Prosodic Features as a Cue to Sentence Prominence.

Proceedings of the 37th Annual Meeting of the Cognitive Science Society, 2015

2014

Modeling Dependencies in Multiple Parallel Data Streams with Hyperdimensional Computing.

IEEE Signal Process. Lett., 2014

Perception of sentence stress in English infant directed speech.

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Basic cuts revisited: Temporal segmentation of speech into phone-like units with statistical learning at a pre-linguistic level.

Proceedings of the 36th Annual Meeting of the Cognitive Science Society, 2014

Statistical Unpredictability of F0 Trajectories as a Cue to Sentence Stress.

Proceedings of the 36th Annual Meeting of the Cognitive Science Society, 2014

2013

Feedback and imitation by a caregiver guides a virtual infant to learn native phonemes and the skill of speech inversion.

Speech Commun., 2013

Development of a novel robust measure for interhemispheric synchrony in the neonatal EEG: Activation Synchrony Index (ASI).

Marjo Metsäranta

Sampsa Vanhatalo

NeuroImage, 2013

Random subset feature selection in automatic recognition of developmental disorders, affective states, and level of conflict from speech.

Jouni Pohjalainen

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Automatic self-supervised learning of associations between speech and text.

Juha Knuuttila

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Attention based temporal filtering of sensory signals for data redundancy reduction.

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Computational modeling of phonetic and lexical learning in early language acquisition: Existing models and future directions.

Speech Commun., 2012

A method for noise-robust context-aware pattern discovery and recognition from categorical sequences.

Pattern Recognit., 2012

Modeling spoken language acquisition with a generic cognitive architecture for associative learning.

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Average Spectrotemporal Structure of Continuous Speech Matches with the Frequency Resolution of Human Hearing.

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Non-auditory cognitive capabilities in computational modeling of early language acquisition.

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Feature Selection for Speaker Traits.

Jouni Pohjalainen

Serdar Kadioglu

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Context induced merging of synonymous word models in computational modeling of early language acquisition.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Hierarchical unsupervised discovery of user context from multivariate sensory data.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Acoustic analysis supports the existence of a single distributional learning mechanism in structural rule learning from an artificial language.

Proceedings of the 34th Annual Meeting of the Cognitive Science Society, 2012

2011

Method for Speech Inversion with Large Scale Statistical Evaluation.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Comparison of classifiers in audio and acceleration based context classification in mobile phones.

Jussi Leppänen

Jukka P. Saarinen

Proceedings of the 19th European Signal Processing Conference, 2011

2010

Estimation studies of vocal tract shape trajectory using a variable length and lossy Kelly-Lochbaum model.

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Fully unsupervised word learning from continuous speech using transitional probabilities of atomic acoustic events.

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009

Learning meaningful units from multimodal input - the effect of interaction strategies.

Louis ten Bosch

Lou Boves

Proceedings of the Second Workshop on Child, Computer and Interaction, 2009

A comparison and combination of segmental and fixed-frame signal representations in NMF-based word recognition.

Joris Driesen

Proceedings of the 17th Nordic Conference of Computational Linguistics, 2009

Indirect estimation of formant frequencies through mean spectral variance with application to automatic gender recognition.

Proceedings of the Sixth International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2009

A noise robust method for pattern discovery in quantized time series: the concept matrix approach.

Unto Kalervo Laine

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

An improved speech segmentation quality measure: the r-value.

Unto Kalervo Laine

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Self-learning vector quantization for pattern discovery from speech.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Do multiple caregivers speed up language acquisition?

Louis ten Bosch

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Discovering keywords from cross-modal input: ecological vs. engineering methods for enhancing acoustic repetitions.

Guillaume Aimetti

Roger K. Moore

Louis ten Bosch

Unto Kalervo Laine

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008

Computational language acquisition by statistical bottom-up processing.
