Jan Svec

Orcid: 0000-0001-8362-5927

According to our database1, Jan Svec authored at least 77 papers between 2005 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Asking Questions Framework for Oral History Archives.
Proceedings of the Advances in Information Retrieval, 2024

2023
Ensemble of Deep Neural Network Models for MOS Prediction.
Proceedings of the IEEE International Conference on Acoustics, 2023

The System for Efficient Indexing and Search in the Large Archives of Scanned Historical Documents.
Proceedings of the Advances in Information Retrieval, 2023

Transformer-Based Encoder-Encoder Architecture for Spoken Term Detection.
Proceedings of the Pattern Recognition - 7th Asian Conference, 2023

T5G2P: Multilingual Grapheme-to-Phoneme Conversion with Text-to-Text Transfer Transformer.
Proceedings of the Pattern Recognition - 7th Asian Conference, 2023

Will XAI Provide Real Explanation or Just a Plausible Rationalization?
Proceedings of the Pattern Recognition - 7th Asian Conference, 2023

Voice-Interactive Learning Dialogue on a Low-Cost Device.
Proceedings of the Pattern Recognition - 7th Asian Conference, 2023

2022
Exploring Capabilities of Monolingual Audio Transformers using Large Datasets in Automatic Speech Recognition of Czech.
CoRR, 2022

Evaluation of Wav2Vec Speech Recognition for Speakers with Cognitive Disorders.
Proceedings of the Text, Speech, and Dialogue - 25th International Conference, 2022

Transfer Learning of Transformers for Spoken Language Understanding.
Proceedings of the Text, Speech, and Dialogue - 25th International Conference, 2022

Automatic Grammar Correction of Commas in Czech Written Texts: Comparative Study.
Proceedings of the Text, Speech, and Dialogue - 25th International Conference, 2022

Analysis of Impact of Emotions on Target Speech Extraction and Speech Separation.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022

Deep LSTM Spoken Term Detection using Wav2Vec 2.0 Recognizer.
Proceedings of the Interspeech 2022, 2022

Exploring Capabilities of Monolingual Audio Transformers using Large Datasets in Automatic Speech Recognition of Czech.
Proceedings of the Interspeech 2022, 2022

Revisiting joint decoding based multi-talker speech recognition with DNN acoustic model.
Proceedings of the Interspeech 2022, 2022

2021
Transformer-Based Automatic Punctuation Prediction and Word Casing Reconstruction of the ASR Output.
Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021

CNN-TDNN-Based Architecture for Speech Recognition Using Grapheme Models in Bilingual Czech-Slovak Task.
Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021

OCR Improvements for Images of Multi-page Historical Documents.
Proceedings of the Speech and Computer - 23rd International Conference, 2021

Initial Experiments on Question Answering from the Intrinsic Structure of Oral History Archives.
Proceedings of the Speech and Computer - 23rd International Conference, 2021

Comparison of Czech Transformers on Text Classification Tasks.
Proceedings of the Statistical Language and Speech Processing, 2021

Spoken Term Detection and Relevance Score Estimation Using Dot-Product of Pronunciation Embeddings.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

T5G2P: Using Text-to-Text Transfer Transformer for Grapheme-to-Phoneme Conversion.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Live TV Subtitling Through Respeaking.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

2020
Automatic Correction of i/y Spelling in Czech ASR Output.
Proceedings of the Text, Speech, and Dialogue, 2020

Adjusting BERT's Pooling Layer for Large-Scale Multi-Label Text Classification.
Proceedings of the Text, Speech, and Dialogue, 2020

An Automated Pipeline for Robust Image Processing and Optical Character Recognition of Historical Documents.
Proceedings of the Speech and Computer - 22nd International Conference, 2020

BERT-Based Sentiment Analysis Using Distillation.
Proceedings of the Statistical Language and Speech Processing, 2020

2019
Air traffic control communication (ATCC) speech corpora and their use for ASR and TTS development.
Lang. Resour. Evaluation, 2019

Question-Answering Dialog System for Large Audiovisual Archives.
Proceedings of the Text, Speech, and Dialogue - 22nd International Conference, 2019

On Using Stateful LSTM Networks for Key-Phrase Detection.
Proceedings of the Text, Speech, and Dialogue - 22nd International Conference, 2019

Increasing DER Hosting Capacity in LV Grids in the Czech Republic in Terms of European Project InterFlex.
Proceedings of the 2019 IEEE PES Innovative Smart Grid Technologies Europe, 2019

Analysis of Smart Technical Measures Impacts on DER and EV Hosting Capacity Increase in LV and MV Grids in the Czech Republic in Terms of European Project InterFlex.
Proceedings of the 2019 IEEE PES Innovative Smart Grid Technologies Europe, 2019

Multimodal Dialog with the MALACH Audiovisual Archive.
Proceedings of the Interspeech 2019, 2019

2018
Learning to Interrupt the User at the Right Time in Incremental Dialogue Systems.
Proceedings of the Text, Speech, and Dialogue - 21st International Conference, 2018

Semi-Supervised Training of DNN-Based Acoustic Model for ATC Speech Recognition.
Proceedings of the Speech and Computer - 20th International Conference, 2018

Towards Network Simplification for Low-Cost Devices by Removing Synapses.
Proceedings of the Speech and Computer - 20th International Conference, 2018

Towards Processing of the Oral History Interviews and Related Printed Documents.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Design and Development of Speech Corpora for Air Traffic Control Training.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

On the Use of Grapheme Models for Searching in Large Spoken Archives.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
An Analysis of the RNN-Based Spoken Term Detection Training.
Proceedings of the Speech and Computer - 19th International Conference, 2017

A Relevance Score Estimation for Spoken Term Detection Based on RNN-Generated Pronunciation Embeddings.
Proceedings of the Interspeech 2017, 2017

Combining Textual and Speech Features in the NLI Task Using State-of-the-Art Machine Learning Techniques.
Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications, 2017

Fast Subsequence Matching in Motion Capture Data.
Proceedings of the Advances in Databases and Information Systems, 2017

2016
Building Corpora for Stylometric Research.
Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016

An Automatic Training Tool for Air Traffic Control Training.
Proceedings of the Interspeech 2016, 2016

An Engine for Online Video Search in Large Archives of the Holocaust Testimonies.
Proceedings of the Interspeech 2016, 2016

A Multimodal Dialogue System for Air Traffic Control Trainees Based on Discrete-Event Simulation.
Proceedings of the Interspeech 2016, 2016

A study of different weighting schemes for spoken language understanding based on convolutional neural networks.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Improving Multi-label Document Classification of Czech News Articles.
Proceedings of the Text, Speech, and Dialogue - 18th International Conference, 2015

Slavonic Corpus for Stylometry Research.
Proceedings of the 9th Workshop on Recent Advances in Slavonic Natural Languages Processing, 2015

Hierarchical discriminative model for spoken language understanding based on convolutional neural network.
Proceedings of the INTERSPEECH 2015, 2015

Word-semantic lattices for spoken language understanding.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
General framework for mining, processing and storing large amounts of electronic texts for language modeling purposes.
Lang. Resour. Evaluation, 2014

WISE 2014 Challenge: Multi-label Classification of Print Media Articles to Topics.
Proceedings of the Web Information Systems Engineering - WISE 2014, 2014

Semi-supervised Learning Algorithm for Binary Relevance Multi-label Classification.
Proceedings of the Web Information Systems Engineering - WISE 2014 Workshops, 2014

Inter-Annotator Agreement on Spontaneous Czech Language - Limits of Automatic Speech Recognition Accuracy.
Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

Two-Layer Semantic Entity Detection and Utterance Validation for Spoken Dialogue Systems.
Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

Semantic Entity Detection in the Spoken Air Traffic Control Data.
Proceedings of the Speech and Computer - 16th International Conference, 2014

2013
Phonetic Spoken Term Detection in Large Audio Archive Using the WFST Framework.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

On the Use of Phoneme Lattices in Spoken Language Understanding.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Improving Speech Recognition by Detecting Foreign Inclusions and Generating Pronunciations.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Hierarchical discriminative model for spoken language understanding.
Proceedings of the IEEE International Conference on Acoustics, 2013

Efficient algorithm for rational kernel evaluation in large lattice sets.
Proceedings of the IEEE International Conference on Acoustics, 2013

Semantic entity detection from multiple ASR hypotheses within the WFST framework.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Spoken Dialogue System Design in 3 Weeks.
Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

Unsupervised Synchronization of Hidden Subtitles with Audio Track Using Keyword Spotting Algorithm.
Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

2011
System for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive.
EURASIP J. Audio Speech Music. Process., 2011

Web Text Data Mining for Building Large Scale Language Modelling Corpus.
Proceedings of the Text, Speech and Dialogue - 14th International Conference, 2011

2010
Prototype of Czech Spoken Dialog System with Mixed Initiative for Railway Information Service.
Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

Fast Phonetic/Lexical Searching in the Archives of the Czech Holocaust Testimonies: Advancing Towards the MALACH Project Visions.
Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

2009
Extended Hidden Vector State Parser.
Proceedings of the Text, Speech and Dialogue, 12th International Conference, 2009

The Czech Broadcast Conversation Corpus.
Proceedings of the Text, Speech and Dialogue, 12th International Conference, 2009

2008
Structural Metadata Annotation of Speech Corpora: Comparing Broadcast News and Broadcast Conversations.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Extension of HVS semantic parser by allowing left-right branching.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Parameterization of the Input in Training the HVS Semantic Parser.
Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

2006
Use of Negative Examples in Training the HVS Semantic Model.
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

2005
Czech spontaneous speech corpus with structural metadata.
Proceedings of the INTERSPEECH 2005, 2005


  Loading...