Jan Cernocký

According to our database, Jan Cernocký authored at least 145 papers between 1997 and 2019.

Bibliography

2019
SpeakerBeam: Speaker Aware Neural Network for Target Speaker Extraction in Speech Mixtures.
J. Sel. Topics Signal Processing, 2019

Building and Evaluation of a Real Room Impulse Response Dataset.
J. Sel. Topics Signal Processing, 2019

Analysis of DNN Speech Signal Enhancement for Robust Speaker Recognition.
Computer Speech & Language, 2019

Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge.
CoRR, 2019

Acoustic Scene Classification Using Fusion of Attentive Convolutional Neural Networks for DCASE2019 Challenge.
CoRR, 2019

Self-supervised Sequence-to-sequence ASR using Unpaired Speech and Text.
CoRR, 2019

Bayesian Subspace Hidden Markov Model for Acoustic Unit Discovery.
CoRR, 2019

How to Improve Your Speaker Embeddings Extractor in Generic Toolkits.
Proceedings of the IEEE International Conference on Acoustics, 2019

Promising Accurate Prefix Boosting for Sequence-to-sequence ASR.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Analysis of DNN Speech Signal Enhancement for Robust Speaker Recognition.
CoRR, 2018

Analysis of Multilingual Sequence-to-Sequence speech recognition systems.
CoRR, 2018

Promising Accurate Prefix Boosting for sequence-to-sequence ASR.
CoRR, 2018

How to Improve Your Speaker Embeddings Extractor in Generic Toolkits.
CoRR, 2018

Convolutional Neural Networks and x-vector Embedding for DCASE2018 Acoustic Scene Classification Challenge.
CoRR, 2018

Spoken Pass-Phrase Verification in the i-vector Space.
CoRR, 2018

Residual Memory Networks: Feed-forward approach to learn long temporal dependencies.
CoRR, 2018

Spoken Pass-Phrase Verification in the i-vector Space.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

BUT/Phonexia Bottleneck Feature Extractor.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Lightly Supervised vs. Semi-supervised Training of Acoustic Model on Luxembourgish for Low-resource Automatic Speech Recognition.
Proceedings of the Interspeech 2018, 2018

BUT System for Low Resource Indian Language ASR.
Proceedings of the Interspeech 2018, 2018

Dereverberation and Beamforming in Robust Far-Field Speaker Recognition.
Proceedings of the Interspeech 2018, 2018

BUT OpenSAT 2017 Speech Recognition System.
Proceedings of the Interspeech 2018, 2018

Optimization of Speaker-Aware Multichannel Speech Extraction with ASR Criterion.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Dereverberation and Beamforming in Far-Field Speaker Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Analysis of Multilingual BLSTM Acoustic Model on Low and High Resource Languages.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Text-dependent speaker verification based on i-vectors, Neural Networks and Hidden Markov Models.
Computer Speech & Language, 2017

Multilingually trained bottleneck features in spoken language recognition.
Computer Speech & Language, 2017

Semi-Supervised DNN Training with Word Selection for ASR.
Proceedings of the Interspeech 2017, 2017

Alternative Approaches to Neural Network Based Speaker Verification.
Proceedings of the Interspeech 2017, 2017

Analysis of Score Normalization in Multilingual Speaker Recognition.
Proceedings of the Interspeech 2017, 2017

2016 BUT Babel System: Multilingual BLSTM Acoustic Model with i-Vector Based Adaptation.
Proceedings of the Interspeech 2017, 2017

Bayesian phonotactic Language Model for Acoustic Unit Discovery.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Topic identification of spoken documents using unsupervised acoustic unit discovery.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Residual memory networks: Feed-forward approach to learn long-term temporal dependencies.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Training Data Augmentation and Data Selection.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning, 2017

2016
Investigation of Bottle-Neck Features for Emotion Recognition.
Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016

Variational Inference for Acoustic Unit Discovery.
Proceedings of the SLTU-2016, 2016

Analysis of the DNN-based SRE systems in multi-language conditions.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Multilingual BLSTM and speaker-specific vector adaptation in 2016 BUT Babel system.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Data Selection by Sequence Summarizing Neural Network in Mismatch Condition Training.
Proceedings of the Interspeech 2016, 2016

i-Vector/HMM Based Text-Dependent Speaker Verification System for RedDots Challenge.
Proceedings of the Interspeech 2016, 2016

Sequence Summarizing Neural Networks for Spoken Language Recognition.
Proceedings of the Interspeech 2016, 2016

Analysis of Speaker Recognition Systems in Realistic Scenarios of the SITW 2016 Challenge.
Proceedings of the Interspeech 2016, 2016

Learning Document Representations Using Subspace Multinomial Model.
Proceedings of the Interspeech 2016, 2016

Sequence summarizing neural network for speaker adaptation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Analysis of DNN approaches to speaker identification.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Multilingual region-dependent transforms.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Three ways to adapt a CTS recognizer to unseen reverberated speech in BUT system for the ASpIRE challenge.
Proceedings of the INTERSPEECH 2015, 2015

Multilingual bottleneck features for language recognition.
Proceedings of the INTERSPEECH 2015, 2015

Coping with channel mismatch in Query-by-Example - BUT QUESST 2014.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Robust speech recognition in unknown reverberant and noisy conditions.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
BUT ASR system for BABEL Surprise evaluation 2014.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

BUT 2014 Babel system: analysis of adaptation in NN based systems.
Proceedings of the INTERSPEECH 2014, 2014

Calibration and fusion of query-by-example systems - BUT SWS 2013.
Proceedings of the IEEE International Conference on Acoustics, 2014

BUT neural network features for spontaneous Vietnamese in BABEL.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Regularized subspace n-gram model for phonotactic ivector extraction.
Proceedings of the INTERSPEECH 2013, 2013

Improved feature processing for deep neural networks.
Proceedings of the INTERSPEECH 2013, 2013

A region-specific feature-space transformation for speaker adaptation and singularity analysis of Jacobian matrix.
Proceedings of the INTERSPEECH 2013, 2013

Frequency warping and robust speaker verification: a comparison of alternative mel-scale representations.
Proceedings of the INTERSPEECH 2013, 2013

BUT BABEL system for spontaneous Cantonese.
Proceedings of the INTERSPEECH 2013, 2013

Manual and semi-automatic approaches to building a multilingual phoneme set.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Comparison of methods for language-dependent and language-independent query-by-example spoken term detection.
ACM Trans. Inf. Syst., 2012

Dealing with Numbers in Grapheme-Based Speech Recognition.
Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

Speaker vectors from subspace Gaussian mixture model as complementary features for language identification.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Description and analysis of the Brno276 system for LRE2011.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

A factorized representation of FMLLR transform based on QR-decomposition.
Proceedings of the INTERSPEECH 2012, 2012

Phonotactic Language Recognition using i-vectors and Phoneme Posteriogram Counts.
Proceedings of the INTERSPEECH 2012, 2012

Bi-Modal Person Recognition on a Mobile Phone: Using Mobile Phone Data.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, 2012

Discriminative classifiers for phonotactic language recognition with iVectors.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Region dependent linear transforms in multilingual speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Application of speaker- and language identification state-of-the-art techniques for emotion recognition.
Speech Communication, 2011

Empirical Evaluation and Combination of Advanced Language Modeling Techniques.
Proceedings of the INTERSPEECH 2011, 2011

iVector Fusion of Prosodic and Cepstral Features for Speaker Verification.
Proceedings of the INTERSPEECH 2011, 2011

General chair's message.
Proceedings of the IEEE International Conference on Acoustics, 2011

Extensions of recurrent neural network language model.
Proceedings of the IEEE International Conference on Acoustics, 2011

Full-covariance UBM and heavy-tailed PLDA in i-vector speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2011

Recent progress in prosodic speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2011

Strategies for training large scale neural network language models.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

iVector-based discriminative adaptation for automatic speech recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Using Gradient Descent Optimization for Acoustics Training from Heterogeneous Data.
Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

Acoustic keyword spotter - optimization from end-user perspective.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Speech@FIT lecture browser.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

PCA-based Feature Extraction for Phonotactic Language Recognition.
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010

Data selection and calibration issues in automatic language recognition - investigation with BUT-AGNITIO NIST LRE 2009 system.
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010

Recurrent neural network based language model.
Proceedings of the INTERSPEECH 2010, 2010

Prosodic speaker verification using subspace multinomial models with intersession compensation.
Proceedings of the INTERSPEECH 2010, 2010

Brno University of Technology system for Interspeech 2010 Paralinguistic Challenge.
Proceedings of the INTERSPEECH 2010, 2010


Tuning phone decoders for language identification.
Proceedings of the IEEE International Conference on Acoustics, 2010

Investigations into prosodic syllable contour features for speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Brno University of Technology system for Interspeech 2009 emotion challenge.
Proceedings of the INTERSPEECH 2009, 2009

Investigation into variants of joint factor analysis for speaker recognition.
Proceedings of the INTERSPEECH 2009, 2009

BUT system for NIST 2008 speaker recognition evaluation.
Proceedings of the INTERSPEECH 2009, 2009

Neural network based language models for highly inflective languages.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Sub-word modeling of out of vocabulary words in spoken term detection.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Morphological random forests for language modeling of inflectional languages.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

BUT language recognition system for NIST 2007 evaluations.
Proceedings of the INTERSPEECH 2008, 2008

Discriminative training of narrow band - wide band adapted systems for meeting recognition.
Proceedings of the INTERSPEECH 2008, 2008

Combination of strongly and weakly constrained recognizers for reliable detection of OOVs.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Analysis of Feature Extraction and Channel Compensation in a GMM Speaker Recognition System.
IEEE Trans. Audio, Speech & Language Processing, 2007

Fusion of Heterogeneous Speaker Recognition Systems in the STBU Submission for the NIST Speaker Recognition Evaluation 2006.
IEEE Trans. Audio, Speech & Language Processing, 2007

Maximum Likelihood and Maximum Mutual Information Training in Gender and Age Recognition System.
Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

TRAP-Based Techniques for Recognition of Noisy Speech.
Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

Spoken Term Detection System Based on Combination of LVCSR and Phonetic Search.
Proceedings of the Machine Learning for Multimodal Interaction, 2007

Application of CMLLR in narrow band wide band adapted systems.
Proceedings of the INTERSPEECH 2007, 2007

STBU System for the NIST 2006 Speaker Recognition Evaluation.
Proceedings of the IEEE International Conference on Acoustics, 2007

Probabilistic and Bottle-Neck Features for LVCSR of Meetings.
Proceedings of the IEEE International Conference on Acoustics, 2007

On Some Directions in Security-Oriented Research.
Proceedings of the 2007 ECSIS Symposium on Bio-inspired, 2007

2006
Indexing and Search Methods for Spoken Documents.
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

Brno University of Technology System for NIST 2005 Language Recognition Evaluation.
Proceedings of the Odyssey 2006, 2006

Robust Heteroscedastic Linear Discriminant Analysis and LCRC Posterior Features in Meeting Data Recognition.
Proceedings of the Machine Learning for Multimodal Interaction, 2006


Hierarchical Structures of Neural Networks for Phoneme Recognition.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Use of Anti-Models to Further Improve State-of-the-Art PRLM Language Recognition System.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Discriminative Training Techniques for Acoustic Language Identification.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Information Retrieval from Spoken Documents.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2006

2005
Phoneme Based Acoustics Keyword Spotting in Informal Continuous Speech.
Proceedings of the Text, Speech and Dialogue, 8th International Conference, 2005

Comparison of keyword spotting approaches for informal continuous speech.
Proceedings of the INTERSPEECH 2005, 2005

Non-parametric speaker turn segmentation of meeting data.
Proceedings of the INTERSPEECH 2005, 2005

Phonotactic language identification using high quality phoneme recognition.
Proceedings of the INTERSPEECH 2005, 2005

2004
Towards Lower Error Rates in Phoneme Recognition.
Proceedings of the Text, Speech and Dialogue, 7th International Conference, 2004

Multimodal Phoneme Recognition of Meeting Data.
Proceedings of the Text, Speech and Dialogue, 7th International Conference, 2004

Automatic Language Identification Using Phoneme and Automatically Derived Unit Strings.
Proceedings of the Text, Speech and Dialogue, 7th International Conference, 2004

Orthographic and Phonetic Annotation of Very Large Czech Corpora with Quality Assessment.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

TRAP based features for LVCSR of meeting data.
Proceedings of the INTERSPEECH 2004, 2004

2003
All-Pole Modeling for Definition of Speech Features in Aurora3 DSR Task.
Proceedings of the Text, Speech and Dialogue, 6th International Conference, 2003

Phoneme Recognition Using Temporal Patterns.
Proceedings of the Text, Speech and Dialogue, 6th International Conference, 2003

Recognition of Speech with Non-random Attributes.
Proceedings of the Text, Speech and Dialogue, 6th International Conference, 2003

Recognition of phoneme strings using TRAP technique.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Autoregressive modeling based feature extraction for Aurora3 DSR task.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Time-domain based temporal processing with application of orthogonal transformations.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Some Like It Gaussian....
Proceedings of the Text, Speech and Dialogue, 5th International Conference, 2002

Advances in Very Low Bit Rate Speech Coding Using Recognition and Synthesis Techniques.
Proceedings of the Text, Speech and Dialogue, 5th International Conference, 2002

2001
Minimization of Transition Noise and HNM Synthesis in Very Low Bit Rate Speech Coding.
Proceedings of the Text, Speech and Dialogue, 4th International Conference, 2001

SpeechDat-E: five Eastern European speech databases for voice-operated teleservices completed.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Segmental Approaches for Automatic Speaker Verification.
Digital Signal Processing, 2000

Codage de la parole à bas et très bas débits.
Annales des Télécommunications, 2000

Optimal Pitch Path Tracking for More Reliable Pitch Detection.
Proceedings of the Text, Speech and Dialogue - Third International Workshop, 2000

1999
Recording of Czech and Slovak Telephone Databases within SpeechDat-E.
Proceedings of the Text, Speech and Dialogue - Second International Workshop, 1999

Very Low Bit Rate Speech Coding: Comparison of Data-Driven Units with Syllable Segments.
Proceedings of the Text, Speech and Dialogue - Second International Workshop, 1999

A segmental approach to text-independent speaker verification.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
Text-independent speaker verification using automatically labelled acoustic segments.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Segmental vocoder-going beyond the phonetic approach.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Quantization of spectral sequences using variable length spectral segments for speech coding at very low bit rate.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Speech spectrum representation and coding using multigrams with distance.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

