Petr Cerva

Orcid: 0000-0003-0767-0106

According to our database1, Petr Cerva authored at least 59 papers between 2005 and 2023.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of five.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Developing State-of-the-Art End-to-End ASR for Norwegian.
Proceedings of the Text, Speech, and Dialogue - 26th International Conference, 2023

Online Speaker Diarization Using Optimized SE-ResNet Architecture.
Proceedings of the Text, Speech, and Dialogue - 26th International Conference, 2023

2022
Lexicon-based vs. Lexicon-free ASR for Norwegian Parliament Speech Transcription.
Proceedings of the Text, Speech, and Dialogue - 25th International Conference, 2022

Overlapped Speech Detection in Broadcast Streams Using X-vectors.
Proceedings of the Interspeech 2022, 2022

2021
Identification of related languages from spoken data: Moving from off-line to on-line scenario.
Comput. Speech Lang., 2021

Identification of Scandinavian Languages from Speech Using Bottleneck Features and X-Vectors.
Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021

Using X-Vectors for Speech Activity Detection in Broadcast Streams.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

2020
Very Fast Keyword Spotting System with Real Time Factor Below 0.01.
Proceedings of the Text, Speech, and Dialogue, 2020

Dealing with Newly Emerging OOVs in Broadcast Programs by Daily Updates of the Lexicon and Language Model.
Proceedings of the Speech and Computer - 22nd International Conference, 2020

Optical Character Recognition for Audio-Visual Broadcast Transcription System.
Proceedings of the 11th IEEE International Conference on Cognitive Infocommunications, 2020

2019
An Approach to Online Speaker Change Point Detection Using DNNs and WFSTs.
Proceedings of the Interspeech 2019, 2019

2018
Robust Recognition of Conversational Telephone Speech via Multi-condition Training and Data Augmentation.
Proceedings of the Text, Speech, and Dialogue - 21st International Conference, 2018

Using Deep Neural Networks for Identification of Slavic Languages from Acoustic Signal.
Proceedings of the Interspeech 2018, 2018

Robust Recognition of Speech with Background Music in Acoustically Under-Resourced Scenarios.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Speech Activity Detection in online broadcast transcription using Deep Neural Networks and Weighted Finite State Transducers.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Robust Automatic Recognition of Speech with background music.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Speech-to-Text Summarization Using Automatic Phrase Extraction from Recognized Text.
Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016

Study on the Use of Deep Neural Networks for Speech Activity Detection in Broadcast Recordings.
Proceedings of the 13th International Joint Conference on e-Business and Telecommunications (ICETE 2016), 2016

Study on the Use and Adaptation of Bottleneck Features for Robust Speech Recognition of Nonlinearly Distorted Speech.
Proceedings of the 13th International Joint Conference on e-Business and Telecommunications (ICETE 2016), 2016

ASR for South Slavic Languages Developed in Almost Automated Way.
Proceedings of the Interspeech 2016, 2016

Investigation into the Use of WFSTs and DNNs for Speech Activity Detection in Broadcast Data Transcription.
Proceedings of the E-Business and Telecommunications - 13th International Joint Conference, 2016

2015
System for producing subtitles to internet audio-visual documents.
Proceedings of the 38th International Conference on Telecommunications and Signal Processing, 2015

Compensation of nonlinear distortions in speech for automatic recognition.
Proceedings of the 38th International Conference on Telecommunications and Signal Processing, 2015

Cross-Lingual Adaptation of Broadcast Transcription System to Polish Language Using Public Data Sources.
Proceedings of the Human Language Technology. Challenges for Computer Science and Linguistics, 2015

2014
A cross-lingual adaptation approach for rapid development of speech recognizers for learning disabled users.
EURASIP J. Audio Speech Music. Process., 2014

Investigation of deep neural networks for robust recognition of nonlinearly distorted speech.
Proceedings of the INTERSPEECH 2014, 2014

Speech-to-text technology to transcribe and disclose 100, 000+ hours of bilingual documents from historical Czech and Czechoslovak radio archive.
Proceedings of the INTERSPEECH 2014, 2014

Investigation of Latent Semantic Analysis for Clustering of Czech News Articles.
Proceedings of the 25th International Workshop on Database and Expert Systems Applications, 2014

2013
Speaker-adaptive speech recognition using speaker diarization for improved transcription of large spoken archives.
Speech Commun., 2013

Impact of microphone on computer applications with voice input modality.
Proceedings of the 36th International Conference on Telecommunications and Signal Processing, 2013

SummEC: A Summarization Engine for Czech.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Downdating Lexicon and Language Model for Automatic Transcription of Czech Historical Spoken Documents.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Dealing with Bilingualism in Automatic Transcription of Historical Archive of Czech Radio.
Proceedings of the New Trends in Image Analysis and Processing - ICIAP 2013, 2013

Adding controlled amount of noise to improve recognition of compressed and spectrally distorted speech.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Making Czech Historical Radio Archive Accessible and Searchable for Wide Public.
J. Multim., 2012

Incorporation of the ASR output in speaker segmentation and clustering within the task of speaker diarization of broadcast streams.
Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012

Large-scale processing, indexing and search system for Czech audio-visual cultural heritage archives.
Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012

Browsing, indexing and automatic transcription of lectures for distance learning.
Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012

Study on Integration of Speaker Diarization with Speaker Adaptive Speech Recognition for Broadcast Transcription.
Proceedings of the INTERSPEECH 2012, 2012

Real-Time Lecture Transcription using ASR for Czech Hearing Impaired or Deaf Students.
Proceedings of the INTERSPEECH 2012, 2012

2011
Voice Technology to Enable Sophisticated Access to Historical Audio Archive of the Czech Radio.
Proceedings of the Multimedia for Cultural Heritage - First International Workshop, 2011

PLDA-Based Clustering for Speaker Diarization of Broadcast Streams.
Proceedings of the INTERSPEECH 2011, 2011

Using Unsupervised Feature-Based Speaker Adaptation for Improved Transcription of Spoken Archives.
Proceedings of the INTERSPEECH 2011, 2011

Rainbow Bridge - Training Center based on Voice Technology for People with Physical Disabilities.
Proceedings of the HEALTHINF 2011, 2011

2010
Study on Cross-Lingual Adaptation of a Czech LVCSR System towards Slovak.
Proceedings of the Analysis of Verbal and Nonverbal Communication and Enactment. The Processing Issues, 2010

2009
Cost-Efficient Cross-Lingual Adaptation of a Speech Recognition System.
Proceedings of the Computer Recognition Systems 3, 2009

Very large vocabulary voice dictation for mobile devices.
Proceedings of the INTERSPEECH 2009, 2009

Challenges in Speech Processing of Slavic Languages (Case Studies in Speech Recognition of Czech and Slovak).
Proceedings of the Development of Multimodal Interfaces: Active Listening and Synchrony, 2009

2008
Study on Speaker Adaptation Methods in the Broadcast News Transcription Task.
Proceedings of the Text, Speech and Dialogue, 11th International Conference, 2008

Czech-to-slovak adapted broadcast news transcription system.
Proceedings of the INTERSPEECH 2008, 2008

MLLR Transforms Based Speaker Recognition in Broadcast Streams.
Proceedings of the Cross-Modal Analysis of Speech, Gestures, Gaze and Facial Expressions, 2008

Voice Technology Applied for Building a Prototype Smart Room.
Proceedings of the Multimodal Signals: Cognitive and Algorithmic Issues, 2008

2007
MyVoice goes Spanish. Cross-lingual Adaptation of a Voice Controlled PC Tool for Handicapped People.
Proces. del Leng. Natural, 2007

Design and development of voice controlled aids for motor-handicapped persons.
Proceedings of the INTERSPEECH 2007, 2007

2006
A System for Information Retrieval from Large Records of Czech Spoken Data.
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

Continual on-line monitoring of Czech spoken broadcast programs.
Proceedings of the INTERSPEECH 2006, 2006

Two-step unsupervised speaker adaptation based on speaker and gender recognition and HMM combination.
Proceedings of the INTERSPEECH 2006, 2006

2005
Supervised and Unsupervised Speaker Adaptation in Large Vocabulary Continuous Speech Recognition of Czech.
Proceedings of the Text, Speech and Dialogue, 8th International Conference, 2005

Fully automated system for Czech spoken broadcast transcription with very large (300k+) lexicon.
Proceedings of the INTERSPEECH 2005, 2005


  Loading...