Ilja Baumann

According to our database, Ilja Baumann authored at least 22 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of five.

Bibliography

2025
Personalized Fine-Tuning with Controllable Synthetic Speech from LLM-Generated Transcripts for Dysarthric Speech Recognition.
CoRR, May, 2025

Vocoder-Free Non-parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks.
Proceedings of the Text, Speech, and Dialogue - 28th International Conference, 2025

Digital Operating Mode Classification of Real-World Amateur Radio Transmissions.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Optimized Self-supervised Training with BEST-RQ for Speech Recognition.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
Generative adversarial networks for whispered to voiced speech conversion: a comparative study.
Int. J. Speech Technol., December, 2024

A Survey of Music Generation in the Context of Interaction.
CoRR, 2024

Personalizing Large Sequence-to-Sequence Speech Foundation Models With Speaker Representations.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

It's Time to Take Action: Acoustic Modeling of Motor Verbs to Detect Parkinson's Disease.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Automatic Evaluation of a Sentence Memory Test for Preschool Children.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Towards Self-Attention Understanding for Automatic Articulatory Processes Analysis in Cleft Lip and Palate Speech.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Outlier Reduction with Gated Attention for Improved Post-training Quantization in Large Sequence-to-sequence Speech Foundation Models.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Large Language Models for Dysfluency Detection in Stuttered Speech.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Towards Interpretability of Automatic Phoneme Analysis in Cleft Lip and Palate Speech.
Proceedings of the IEEE International Conference on Acoustics, 2024

Optimized Speculative Sampling for GPU Hardware Accelerators.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
Multi-class Detection of Pathological Speech with Latent Features: How does it perform on unseen data?
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

A Stutter Seldom Comes Alone - Cross-Corpus Stuttering Detection as a Multi-label Problem.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Influence of Utterance and Speaker Characteristics on the Classification of Children with Cleft Lip and Palate.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Speaker Adaptation for End-to-End Speech Recognition Systems in Noisy Environments.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Detection of Vowel Errors in Children's Speech using Synthetic Phonetic Transcripts.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
The Importance of Speech Stimuli for Pathologic Speech Classification.
CoRR, 2022

Detecting Vocal Fatigue with Neural Embeddings.
CoRR, 2022

Nonwords Pronunciation Classification in Language Development Tests for Preschool Children.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

