Ales Prazák

Orcid: 0000-0001-9453-0034

According to our database¹, Ales Prazák authored at least 36 papers between 2006 and 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of five.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2025

Lightweight Target-Speaker-Based Overlap Transcription for Practical Streaming ASR.

[BibT_eX]

[DOI]

Ales Prazák

Marie Kunesová

Josef V. Psutka

CoRR, June, 2025

Lightweight Target-Speaker-Based Overlap Transcription for Practical Streaming ASR.

[BibT_eX]

[DOI]

Ales Prazák

Marie Kunesová

Josef Psutka

Proceedings of the Text, Speech, and Dialogue - 28th International Conference, 2025

2022

Exploring Capabilities of Monolingual Audio Transformers using Large Datasets in Automatic Speech Recognition of Czech.

[BibT_eX]

[DOI]

CoRR, 2022

Exploring Capabilities of Monolingual Audio Transformers using Large Datasets in Automatic Speech Recognition of Czech.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021

CNN-TDNN-Based Architecture for Speech Recognition Using Grapheme Models in Bilingual Czech-Slovak Task.

[BibT_eX]

[DOI]

Josef V. Psutka

Jan Svec

Ales Prazák

Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021

Recognition of Heavily Accented and Emotional Speech of English and Czech Holocaust Survivors Using Various DNN Architectures.

[BibT_eX]

[DOI]

Josef V. Psutka

Ales Prazák

Jan Vanek

Proceedings of the Speech and Computer - 23rd International Conference, 2021

Various DNN-HMM Architectures Used in Acoustic Modeling with Single-Speaker and Single-Channel.

[BibT_eX]

[DOI]

Josef V. Psutka

Jan Vanek

Ales Prazák

Proceedings of the Statistical Language and Speech Processing, 2021

Spoken Term Detection and Relevance Score Estimation Using Dot-Product of Pronunciation Embeddings.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Live TV Subtitling Through Respeaking.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020

Live TV subtitling through respeaking with remote cutting-edge technology.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2020

Complexity of the TDNN Acoustic Model with Respect to the HMM Topology.

[BibT_eX]

[DOI]

Josef V. Psutka

Jan Vanek

Ales Prazák

Proceedings of the Text, Speech, and Dialogue, 2020

2018

Online LDA-Based Language Model Adaptation.

[BibT_eX]

[DOI]

Jan Lehecka

Ales Prazák

Proceedings of the Text, Speech, and Dialogue - 21st International Conference, 2018

First Insight into the Processing of the Language Consulting Center Data.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 20th International Conference, 2018

Semi-Supervised Training of DNN-Based Acoustic Model for ATC Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 20th International Conference, 2018

Towards Processing of the Oral History Interviews and Related Printed Documents.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Multimodal Name Recognition in Live TV Subtitling.

[BibT_eX]

[DOI]

Marek Hrúz

Ales Prazák

Michal Busta

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2014

General framework for mining, processing and storing large amounts of electronic texts for language modeling purposes.

[BibT_eX]

[DOI]

Lang. Resour. Evaluation, 2014

Captioning of Live TV Commentaries from the Olympic Games in Sochi: Some Interesting Insights.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

2013

Online Speaker Adaptation of an Acoustic Model Using Face Recognition.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Towards Live Subtitling of TV Ice-hockey Commentary.

[BibT_eX]

[DOI]

Proceedings of the SIGMAP and WINSYS 2013, 2013

2012

Neural Network Language Model with Cache.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

Captioning of Live TV Programs through Speech Recognition and Re-speaking.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

Novel Approach to Live Captioning Through Re-speaking: Tailoring Speech Recognition to Re-speaker's Needs.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011

System for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2011

Automatic Topic Identification for Large Scale Language Modeling Data Filtering.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue - 14th International Conference, 2011

Four-phase Re-speaker Training System.

[BibT_eX]

Proceedings of the SIGMAP 2011, 2011

2010

Online TV Captioning of Czech Parliamentary Sessions.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

Fast Phonetic/Lexical Searching in the Archives of the Czech Holocaust Testimonies: Advancing Towards the MALACH Project Visions.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

2009

Discriminative Training of Gender-Dependent Acoustic Models.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue, 12th International Conference, 2009

Training of Speaker-clustered Acoustic Models for use in Real-time Recognizers.

[BibT_eX]

Proceedings of the SIGMAP 2009, 2009

Fast Speaker Adaptation in Automatic Online Subtitling.

[BibT_eX]

Proceedings of the SIGMAP 2009, 2009

2007

Searching for a Robust MFCC-Based Parameterization for ASR Application.

[BibT_eX]

Josef V. Psutka

Lubos Smídl

Ales Prazák

Proceedings of the SIGMAP 2007, 2007

Live TV Subtitling - Fast 2-pass LVCSR System for Online Subtitling.

[BibT_eX]

Proceedings of the SIGMAP 2007, 2007

2006

Automatic Online Subtitling of the Czech Parliament Meetings.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

Benefit of a Class-based Language Model for Real-time Closed-captioning of TV Ice-hockey Commentaries.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Adaptive language model in automatic online subtitling.

[BibT_eX]

Proceedings of the Second IASTED International Conference on Computational Intelligence, 2006

Ales Prazák

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...