Aku Rouhe

Orcid: 0009-0000-0977-609X

According to our database1, Aku Rouhe authored at least 20 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Principled Comparisons for End-to-End Speech Recognition: Attention vs Hybrid at the 1000-Hour Scale.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

2023
Finnish parliament ASR corpus.
Lang. Resour. Evaluation, December, 2023

Lahjoita puhetta: a large-scale corpus of spoken Finnish with some benchmarks.
Lang. Resour. Evaluation, September, 2023

2022
Finnish Parliament ASR corpus - Analysis, benchmarks and statistics.
CoRR, 2022

Lahjoita puhetta - a large-scale corpus of spoken Finnish with some benchmarks.
CoRR, 2022

Low Resource Comparison of Attention-based and Hybrid ASR Exploiting wav2vec 2.0.
Proceedings of the Interspeech 2022, 2022

2021
SpeechBrain: A General-Purpose Speech Toolkit.
CoRR, 2021

An Equal Data Setting for Attention-Based Encoder-Decoder and HMM/DNN Models: A Case Study in Finnish ASR.
Proceedings of the Speech and Computer - 23rd International Conference, 2021

Speaker Verification Experiments for Adults and Children Using Shared Embedding Spaces.
Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

Self-Supervised End-to-End ASR for Low Resource L2 Swedish.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

2020
Multimodal machine translation through visuals and speech.
Mach. Transl., 2020

Finnish Language Modeling with Deep Transformer Models.
CoRR, 2020

Finnish ASR with Deep Transformer Models.
Proceedings of the Interspeech 2020, 2020

Speaker-Aware Training of Attention-Based End-to-End Speech Recognition Using Neural Speaker Embeddings.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Spherediar: An Effective Speaker Diarization System for Meeting Data.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
The MeMAD Submission to the IWSLT 2018 Speech Translation Task.
Proceedings of the 15th International Conference on Spoken Language Translation, 2018

Captaina: Integrated Pronunciation Practice and Data Collection Portal.
Proceedings of the Interspeech 2018, 2018

2017
A pipeline for automatic assessment of foreign language pronunciation.
Proceedings of the 7th ISCA International Workshop on Speech and Language Technology in Education, 2017

Reading Validation for Pronunciation Evaluation in the Digitala Project.
Proceedings of the Interspeech 2017, 2017

2016
Digitala: An Augmented Test and Review Process Prototype for High-Stakes Spoken Foreign Language Examination.
Proceedings of the Interspeech 2016, 2016


  Loading...