Jan Silovský

According to our database¹, Jan Silovský authored at least 40 papers between 2006 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2026

Neural Network Conversion of Machine Learning Pipelines.

[DOI]

,

,

,

,

Chinnu Pittapally

CoRR, March, 2026

2023

Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers.

[DOI]

,

,

,

,

,

Sasha Kuznietsov

,

,

,

CoRR, 2023

2020

Learning from Noisy Labels with Noise Modeling Network.

[DOI]

,

,

,

William Hartmann

,

,

CoRR, 2020

Improving Language Identification for Multilingual Speakers.

[DOI]

,

,

,

,

,

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Towards a New Understanding of the Training of Neural Networks with Mislabeled Training Data.

[DOI]

,

,

,

,

William Hartmann

,

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2018

Individual Ship Detection Using Underwater Acoustics.

[DOI]

Damianos G. Karakos

,

,

Richard M. Schwartz

,

William Hartmann

,

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2016

Search for speaker identity in historical oral archives.

[DOI]

,

,

Michaela Kucharová

Multim. Tools Appl., 2016

BBN Technologies' OpenSAD system.

[DOI]

,

Damianos G. Karakos

,

,

Richard M. Schwartz

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Sage: The New BBN Speech Processing Platform.

[DOI]

,

,

,

Zhongqiang Huang

,

,

,

,

,

William Hartmann

,

,

,

,

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015

Compensation of nonlinear distortions in speech for automatic recognition.

[DOI]

,

,

,

Zbynek Koldovský

,

,

Jindrich Zdánský

Proceedings of the 38th International Conference on Telecommunications and Signal Processing, 2015

Phone speech detection and recognition in the task of historical radio broadcast transcription.

[DOI]

Josef Chaloupka

,

,

,

Proceedings of the 38th International Conference on Telecommunications and Signal Processing, 2015

Large-scale speaker search using PLDA on mismatched conditions.

[DOI]

,

,

,

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014

Speech-to-text technology to transcribe and disclose 100,000+ hours of bilingual documents from historical Czech and Czechoslovak radio archive.

[DOI]

,

,

Jindrich Zdánský

,

,

,

,

Josef Chaloupka

,

Michaela Kucharová

,

,

,

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013

Speaker-adaptive speech recognition using speaker diarization for improved transcription of large spoken archives.

[DOI]

,

,

Jindrich Zdánský

,

,

Speech Commun., 2013

Dealing with Bilingualism in Automatic Transcription of Historical Archive of Czech Radio.

[DOI]

,

,

Proceedings of the New Trends in Image Analysis and Processing - ICIAP 2013, 2013

Adding controlled amount of noise to improve recognition of compressed and spectrally distorted speech.

[DOI]

,

,

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Making Czech Historical Radio Archive Accessible and Searchable for Wide Public.

[DOI]

,

,

,

Jindrich Zdánský

,

,

,

J. Multim., 2012

Incorporation of the ASR output in speaker segmentation and clustering within the task of speaker diarization of broadcast streams.

[DOI]

,

Jindrich Zdánský

,

,

,

Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012

Large-scale processing, indexing and search system for Czech audio-visual cultural heritage archives.

[DOI]

,

,

Jindrich Zdánský

,

,

,

,

Josef Chaloupka

,

Michaela Kucharová

,

Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012

Browsing, indexing and automatic transcription of lectures for distance learning.

[DOI]

,

,

Jindrich Zdánský

,

,

,

,

,

Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012

Study on Integration of Speaker Diarization with Speaker Adaptive Speech Recognition for Broadcast Transcription.

[DOI]

,

,

Jindrich Zdánský

,

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Real-Time Lecture Transcription using ASR for Czech Hearing Impaired or Deaf Students.

[DOI]

,

,

Jindrich Zdánský

,

,

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Speaker diarization of broadcast streams using two-stage clustering based on i-vectors and cosine distance scoring.

[DOI]

,

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

Enhancement of emotion detection in spoken dialogue systems by combining several information sources.

[DOI]

Ramón López-Cózar

,

,

Speech Commun., 2011

Voice Technology to Enable Sophisticated Access to Historical Audio Archive of the Czech Radio.

[DOI]

,

,

,

,

Jindrich Zdánský

,

,

Proceedings of the Multimedia for Cultural Heritage - First International Workshop, 2011

PLDA-Based Clustering for Speaker Diarization of Broadcast Streams.

[DOI]

,

,

,

Jindrich Zdánský

,

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Using Unsupervised Feature-Based Speaker Adaptation for Improved Transcription of Spoken Archives.

[DOI]

,

,

,

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Speaker diarization using PLDA-based speaker clustering.

[DOI]

,

Proceedings of the IEEE 6th International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications, 2011

2010

Mejora del Funcionamiento de Sistemas de Diálogo Hablado Mediante Reconocimiento del Estado Emocional de Usuarios.

[DOI]

Ramón López-Cózar

,

,

Proces. del Leng. Natural, 2010

Adapting Lexical and Language Models for Transcription of Highly Spontaneous Spoken Czech.

[DOI]

,

Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

F² - New Technique for Recognition of User Emotional States in Spoken Dialogue Systems.

[DOI]

Ramón López-Cózar

,

,

Proceedings of the SIGDIAL 2010 Conference, 2010

Comparison of Segmentation and Clustering Methods for Speaker Diarization of Broadcast Stream Audio.

[DOI]

,

Proceedings of the Analysis of Verbal and Nonverbal Communication and Enactment. The Processing Issues, 2010

Study on Cross-Lingual Adaptation of a Czech LVCSR System towards Slovak.

[DOI]

,

,

Proceedings of the Analysis of Verbal and Nonverbal Communication and Enactment. The Processing Issues, 2010

2009

Challenges in Speech Processing of Slavic Languages (Case Studies in Speech Recognition of Czech and Slovak).

[DOI]

,

Jindrich Zdánský

,

,

Proceedings of the Development of Multimodal Interfaces: Active Listening and Synchrony, 2009

2008

Two-Level Fusion to Improve Emotion Classification in Spoken Dialogue Systems.

[DOI]

Ramón López-Cózar

,

Zoraida Callejas

,

,

,

Proceedings of the Text, Speech and Dialogue, 11th International Conference, 2008

Study on Speaker Adaptation Methods in the Broadcast News Transcription Task.

[DOI]

,

Jindrich Zdánský

,

,

Proceedings of the Text, Speech and Dialogue, 11th International Conference, 2008

Czech-to-Slovak adapted broadcast news transcription system.

[DOI]

,

,

Jindrich Zdánský

,

,

,

Josef Chaloupka

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

MLLR Transforms Based Speaker Recognition in Broadcast Streams.

[DOI]

,

,

Jindrich Zdánský

Proceedings of the Cross-Modal Analysis of Speech, Gestures, Gaze and Facial Expressions, 2008

Voice Technology Applied for Building a Prototype Smart Room.

[DOI]

Josef Chaloupka

,

,

Jindrich Zdánský

,

,

,

Proceedings of the Multimodal Signals: Cognitive and Algorithmic Issues, 2008

2006

Two-step unsupervised speaker adaptation based on speaker and gender recognition and HMM combination.

[DOI]

,

,

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
