Jan Silovský

According to our database, Jan Silovský authored at least 39 papers between 2006 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers.
CoRR, 2023

2020
Learning from Noisy Labels with Noise Modeling Network.
CoRR, 2020

Improving Language Identification for Multilingual Speakers.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Towards a New Understanding of the Training of Neural Networks with Mislabeled Training Data.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2018
Individual Ship Detection Using Underwater Acoustics.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2016
Search for speaker identity in historical oral archives.
Multim. Tools Appl., 2016

BBN Technologies' OpenSAD system.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Sage: The New BBN Speech Processing Platform.
Proceedings of the Interspeech 2016, 2016

2015
Compensation of nonlinear distortions in speech for automatic recognition.
Proceedings of the 38th International Conference on Telecommunications and Signal Processing, 2015

Phone speech detection and recognition in the task of historical radio broadcast transcription.
Proceedings of the 38th International Conference on Telecommunications and Signal Processing, 2015

Large-scale speaker search using PLDA on mismatched conditions.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Speech-to-text technology to transcribe and disclose 100,000+ hours of bilingual documents from historical Czech and Czechoslovak radio archive.
Proceedings of the INTERSPEECH 2014, 2014

2013
Speaker-adaptive speech recognition using speaker diarization for improved transcription of large spoken archives.
Speech Commun., 2013

Dealing with Bilingualism in Automatic Transcription of Historical Archive of Czech Radio.
Proceedings of the New Trends in Image Analysis and Processing - ICIAP 2013, 2013

Adding controlled amount of noise to improve recognition of compressed and spectrally distorted speech.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Making Czech Historical Radio Archive Accessible and Searchable for Wide Public.
J. Multim., 2012

Incorporation of the ASR output in speaker segmentation and clustering within the task of speaker diarization of broadcast streams.
Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012

Large-scale processing, indexing and search system for Czech audio-visual cultural heritage archives.
Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012

Browsing, indexing and automatic transcription of lectures for distance learning.
Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012

Study on Integration of Speaker Diarization with Speaker Adaptive Speech Recognition for Broadcast Transcription.
Proceedings of the INTERSPEECH 2012, 2012

Real-Time Lecture Transcription using ASR for Czech Hearing Impaired or Deaf Students.
Proceedings of the INTERSPEECH 2012, 2012

Speaker diarization of broadcast streams using two-stage clustering based on i-vectors and cosine distance scoring.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Enhancement of emotion detection in spoken dialogue systems by combining several information sources.
Speech Commun., 2011

Voice Technology to Enable Sophisticated Access to Historical Audio Archive of the Czech Radio.
Proceedings of the Multimedia for Cultural Heritage - First International Workshop, 2011

PLDA-Based Clustering for Speaker Diarization of Broadcast Streams.
Proceedings of the INTERSPEECH 2011, 2011

Using Unsupervised Feature-Based Speaker Adaptation for Improved Transcription of Spoken Archives.
Proceedings of the INTERSPEECH 2011, 2011

Speaker diarization using PLDA-based speaker clustering.
Proceedings of the IEEE 6th International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications, 2011

2010
Mejora del Funcionamiento de Sistemas de Diálogo Hablado Mediante Reconocimiento del Estado Emocional de Usuarios.
Proces. del Leng. Natural, 2010

Adapting Lexical and Language Models for Transcription of Highly Spontaneous Spoken Czech.
Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

F² - New Technique for Recognition of User Emotional States in Spoken Dialogue Systems.
Proceedings of the SIGDIAL 2010 Conference, 2010

Comparison of Segmentation and Clustering Methods for Speaker Diarization of Broadcast Stream Audio.
Proceedings of the Analysis of Verbal and Nonverbal Communication and Enactment. The Processing Issues, 2010

Study on Cross-Lingual Adaptation of a Czech LVCSR System towards Slovak.
Proceedings of the Analysis of Verbal and Nonverbal Communication and Enactment. The Processing Issues, 2010

2009
Challenges in Speech Processing of Slavic Languages (Case Studies in Speech Recognition of Czech and Slovak).
Proceedings of the Development of Multimodal Interfaces: Active Listening and Synchrony, 2009

2008
Two-Level Fusion to Improve Emotion Classification in Spoken Dialogue Systems.
Proceedings of the Text, Speech and Dialogue, 11th International Conference, 2008

Study on Speaker Adaptation Methods in the Broadcast News Transcription Task.
Proceedings of the Text, Speech and Dialogue, 11th International Conference, 2008

Czech-to-Slovak adapted broadcast news transcription system.
Proceedings of the INTERSPEECH 2008, 2008

MLLR Transforms Based Speaker Recognition in Broadcast Streams.
Proceedings of the Cross-Modal Analysis of Speech, Gestures, Gaze and Facial Expressions, 2008

Voice Technology Applied for Building a Prototype Smart Room.
Proceedings of the Multimodal Signals: Cognitive and Algorithmic Issues, 2008

2006
Two-step unsupervised speaker adaptation based on speaker and gender recognition and HMM combination.
Proceedings of the INTERSPEECH 2006, 2006

