Jan Cernocký

Orcid: 0000-0002-8800-0210

Affiliations:
  • Brno University of Technology


According to our database1, Jan Cernocký authored at least 189 papers between 1997 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Beyond the Labels: Unveiling Text-Dependency in Paralinguistic Speech Recognition Datasets.
CoRR, 2024

Probing Self-supervised Learning Models with Target Speech Extraction.
CoRR, 2024

Target Speech Extraction with Pre-trained Self-supervised Learning Models.
CoRR, 2024

2023
Twenty-Five Years of Evolution in Speech and Language Processing.
IEEE Signal Process. Mag., July, 2023

Neural Target Speech Extraction: An overview.
IEEE Signal Process. Mag., May, 2023

End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

DiaCorrect: Error Correction Back-end For Speaker Diarization.
CoRR, 2023

BUT Systems for IWSLT 2023 Marathi - Hindi Low Resource Speech Translation Task.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Spelling-Aware Word-Based End-to-End ASR.
IEEE Signal Process. Lett., 2022

ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications.
CoRR, 2022

Extracting Speaker and Emotion Information from Self-Supervised Speech Models via Channel-Wise Correlations.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

An Attention-Based Backend Allowing Efficient Fine-Tuning of Transformer Models for Speaker Verification.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Progressive Contrastive Learning for Self-Supervised Text-Independent Speaker Verification.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

Analysis of Impact of Emotions on Target Speech Extraction and Speech Separation.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022

Training speaker embedding extractors using multi-speaker audio with unknown speaker boundaries.
Proceedings of the Interspeech 2022, 2022

Learnable Sparse Filterbank for Speaker Verification.
Proceedings of the Interspeech 2022, 2022

Revisiting joint decoding based multi-talker speech recognition with DNN acoustic model.
Proceedings of the Interspeech 2022, 2022

Speaker adaptation for Wav2vec2 based dysarthric ASR.
Proceedings of the Interspeech 2022, 2022

Multi-Channel Speaker Verification with Conv-Tasnet Based Beamformer.
Proceedings of the IEEE International Conference on Acoustics, 2022

Multisv: Dataset for Far-Field Multi-Channel Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2022

DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation and Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Integration of Variational Autoencoder and Spatial Clustering for Adaptive Multi-Channel Neural Speech Separation.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Towards a Versatile Intelligent Conversational Agent as Personal Assistant for Migrants.
Proceedings of the Advances in Practical Applications of Agents, Multi-Agent Systems, and Social Good. The PAAMS Collection, 2021

The IWSLT 2021 BUT Speech Translation Systems.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

Auxiliary Loss Function for Target Speech Extraction and Recognition with Weak Supervision Based on Speaker Characteristics.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Detecting English Speech in the Air Traffic Control Voice Communication.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Effective Phase Encoding for End-To-End Speaker Verification.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Boosting of Contextual Information in ASR for Air-Traffic Call-Sign Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Out-of-Vocabulary Words Detection with Attention and CTC Alignments in an End-to-End ASR System.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

A Hierarchical Subspace Model for Language-Attuned Acoustic Unit Discovery.
Proceedings of the IEEE International Conference on Acoustics, 2021

Jointly Trained Transformers Models for Spoken Language Translation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Analysis of X-Vectors for Low-Resource Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Eat: Enhanced ASR-TTS for Self-Supervised Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge.
Proceedings of the Fifth International Conference, 2021

2020
Analysis of Speaker Diarization Based on Bayesian HMM With Eigenvoice Priors.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE.
Comput. Speech Lang., 2020

A Technical Report: BUT Speech Translation Systems.
CoRR, 2020

BUT Opensat 2019 Speech Recognition System.
CoRR, 2020

Utilizing VOiCES Dataset for Multichannel Speaker Verification with Beamforming.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Optimizing Bayesian Hmm Based X-Vector Clustering for the Second Dihard Speech Diarization Challenge.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Investigation of Specaugment for Deep Speaker Embedding Learning.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
SpeakerBeam: Speaker Aware Neural Network for Target Speaker Extraction in Speech Mixtures.
IEEE J. Sel. Top. Signal Process., 2019

Building and Evaluation of a Real Room Impulse Response Dataset.
IEEE J. Sel. Top. Signal Process., 2019

Analysis of DNN Speech Signal Enhancement for Robust Speaker Recognition.
Comput. Speech Lang., 2019

Acoustic Scene Classification Using Fusion of Attentive Convolutional Neural Networks for DCASE2019 Challenge.
CoRR, 2019

Self-supervised Sequence-to-sequence ASR using Unpaired Speech and Text.
CoRR, 2019

Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge.
Proceedings of the Interspeech 2019, 2019

On the Usage of Phonetic Information for Text-Independent Speaker Embedding Extraction.
Proceedings of the Interspeech 2019, 2019

Bayesian Subspace Hidden Markov Model for Acoustic Unit Discovery.
Proceedings of the Interspeech 2019, 2019

Analysis of Multilingual Sequence-to-Sequence Speech Recognition Systems.
Proceedings of the Interspeech 2019, 2019

Bayesian HMM Based x-Vector Clustering for Speaker Diarization.
Proceedings of the Interspeech 2019, 2019

Semi-Supervised Sequence-to-Sequence ASR Using Unpaired Speech and Text.
Proceedings of the Interspeech 2019, 2019

How to Improve Your Speaker Embeddings Extractor in Generic Toolkits.
Proceedings of the IEEE International Conference on Acoustics, 2019

Promising Accurate Prefix Boosting for Sequence-to-sequence ASR.
Proceedings of the IEEE International Conference on Acoustics, 2019

A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: The Deepmine Database.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Speaker Verification with Application-Aware Beamforming.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Residual Memory Networks: Feed-forward approach to learn long temporal dependencies.
CoRR, 2018

Spoken Pass-Phrase Verification in the i-vector Space.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

BUT/Phonexia Bottleneck Feature Extractor.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Lightly Supervised vs. Semi-supervised Training of Acoustic Model on Luxembourgish for Low-resource Automatic Speech Recognition.
Proceedings of the Interspeech 2018, 2018

BUT System for Low Resource Indian Language ASR.
Proceedings of the Interspeech 2018, 2018

Dereverberation and Beamforming in Robust Far-Field Speaker Recognition.
Proceedings of the Interspeech 2018, 2018

BUT OpenSAT 2017 Speech Recognition System.
Proceedings of the Interspeech 2018, 2018

Optimization of Speaker-Aware Multichannel Speech Extraction with ASR Criterion.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Dereverberation and Beamforming in Far-Field Speaker Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Analysis of Multilingual Blstm Acoustic Model on Low and High Resource Languages.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Convolutional neural networks and x-vector embedding for DCASE2018 Acoustic Scene Classification challenge.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

2017
Text-dependent speaker verification based on i-vectors, Neural Networks and Hidden Markov Models.
Comput. Speech Lang., 2017

Multilingually trained bottleneck features in spoken language recognition.
Comput. Speech Lang., 2017

Semi-Supervised DNN Training with Word Selection for ASR.
Proceedings of the Interspeech 2017, 2017

Alternative Approaches to Neural Network Based Speaker Verification.
Proceedings of the Interspeech 2017, 2017

Analysis of Score Normalization in Multilingual Speaker Recognition.
Proceedings of the Interspeech 2017, 2017

2016 BUT Babel System: Multilingual BLSTM Acoustic Model with i-Vector Based Adaptation.
Proceedings of the Interspeech 2017, 2017

Bayesian phonotactic Language Model for Acoustic Unit Discovery.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Topic identification of spoken documents using unsupervised acoustic unit discovery.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Residual memory networks: Feed-forward approach to learn long-term temporal dependencies.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Training Data Augmentation and Data Selection.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016
Investigation of Bottle-Neck Features for Emotion Recognition.
Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016

Variational Inference for Acoustic Unit Discovery.
Proceedings of the SLTU-2016, 2016

Analysis of the DNN-based SRE systems in multi-language conditions.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Multilingual BLSTM and speaker-specific vector adaptation in 2016 but babel system.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Data Selection by Sequence Summarizing Neural Network in Mismatch Condition Training.
Proceedings of the Interspeech 2016, 2016

i-Vector/HMM Based Text-Dependent Speaker Verification System for RedDots Challenge.
Proceedings of the Interspeech 2016, 2016

Sequence Summarizing Neural Networks for Spoken Language Recognition.
Proceedings of the Interspeech 2016, 2016

Analysis of Speaker Recognition Systems in Realistic Scenarios of the SITW 2016 Challenge.
Proceedings of the Interspeech 2016, 2016

Learning Document Representations Using Subspace Multinomial Model.
Proceedings of the Interspeech 2016, 2016

Sequence summarizing neural network for speaker adaptation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Analysis of DNN approaches to speaker identification.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Multilingual region-dependent transforms.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Three ways to adapt a CTS recognizer to unseen reverberated speech in BUT system for the ASpIRE challenge.
Proceedings of the INTERSPEECH 2015, 2015

Multilingual bottleneck features for language recognition.
Proceedings of the INTERSPEECH 2015, 2015

Copingwith channel mismatch in Query-by-Example - But QUESST 2014.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Robust speech recognition in unknown reverberant and noisy conditions.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
But ASR system for BABEL Surprise evaluation 2014.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

BUT 2014 Babel system: analysis of adaptation in NN based systems.
Proceedings of the INTERSPEECH 2014, 2014

Calibration and fusion of query-by-example systems - But SWS 2013.
Proceedings of the IEEE International Conference on Acoustics, 2014

But neural network features for spontaneous Vietnamese in BABEL.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Regularized subspace n-gram model for phonotactic ivector extraction.
Proceedings of the INTERSPEECH 2013, 2013

Improved feature processing for deep neural networks.
Proceedings of the INTERSPEECH 2013, 2013

A region-specific feature-space transformation for speaker adaptation and singularity analysis of jacobian matrix.
Proceedings of the INTERSPEECH 2013, 2013

Frequency warping and robust speaker verification: a comparison of alternative mel-scale representations.
Proceedings of the INTERSPEECH 2013, 2013

BUT BABEL system for spontaneous Cantonese.
Proceedings of the INTERSPEECH 2013, 2013

Manual and semi-automatic approaches to building a multilingual phoneme set.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Comparison of methods for language-dependent and language-independent query-by-example spoken term detection.
ACM Trans. Inf. Syst., 2012

Dealing with Numbers in Grapheme-Based Speech Recognition.
Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

Speaker vectors from subspace Gaussian mixture model as complementary features for language identification.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Description and analysis of the Brno276 system for LRE2011.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

A factorized representation of FMLLR transform based on QR-decomposition.
Proceedings of the INTERSPEECH 2012, 2012

Phonotactic Language Recognition using i-vectors and Phoneme Posteriogram Counts.
Proceedings of the INTERSPEECH 2012, 2012

Bi-Modal Person Recognition on a Mobile Phone: Using Mobile Phone Data.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, 2012

Discriminative classifiers for phonotactic language recognition with iVectors.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Region dependent linear transforms in multilingual speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Application of speaker- and language identification state-of-the-art techniques for emotion recognition.
Speech Commun., 2011

Empirical Evaluation and Combination of Advanced Language Modeling Techniques.
Proceedings of the INTERSPEECH 2011, 2011

iVector Fusion of Prosodic and Cepstral Features for Speaker Verification.
Proceedings of the INTERSPEECH 2011, 2011

General chair's message.
Proceedings of the IEEE International Conference on Acoustics, 2011

Extensions of recurrent neural network language model.
Proceedings of the IEEE International Conference on Acoustics, 2011

Full-covariance UBM and heavy-tailed PLDA in i-vector speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2011

Recent progress in prosodic speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2011

Strategies for training large scale neural network language models.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

iVector-based discriminative adaptation for automatic speech recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Using Gradient Descent Optimization for Acoustics Training from Heterogeneous Data.
Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

Acoustic keyword spotter - optimization from end-user perspective.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Speech@FIT lecture browser.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

PCA-based Feature Extraction for Phonotactic Language Recognition.
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010

Data selection and calibration issues in automatic language recognition - investigation with BUT-AGNITIO NIST LRE 2009 system.
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010

Recurrent neural network based language model.
Proceedings of the INTERSPEECH 2010, 2010

Prosodic speaker verification using subspace multinomial models with intersession compensation.
Proceedings of the INTERSPEECH 2010, 2010

Brno university of technology system for interspeech 2010 paralinguistic challenge.
Proceedings of the INTERSPEECH 2010, 2010


Tuning phone decoders for language identification.
Proceedings of the IEEE International Conference on Acoustics, 2010

Investigations into prosodic syllable contour features for speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Brno University of Technology system for Interspeech 2009 emotion challenge.
Proceedings of the INTERSPEECH 2009, 2009

Investigation into variants of joint factor analysis for speaker recognition.
Proceedings of the INTERSPEECH 2009, 2009

BUT system for NIST 2008 speaker recognition evaluation.
Proceedings of the INTERSPEECH 2009, 2009

Neural network based language models for highly inflective languages.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Sub-word modeling of out of vocabulary words in spoken term detection.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Morphological random forests for language modeling of inflectional languages.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

BUT language recognition system for NIST 2007 evaluations.
Proceedings of the INTERSPEECH 2008, 2008

Discrimininative training of narrow band - wide band adapted systems for meeting recognition.
Proceedings of the INTERSPEECH 2008, 2008

Combination of strongly and weakly constrained recognizers for reliable detection of OOVS.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Analysis of Feature Extraction and Channel Compensation in a GMM Speaker Recognition System.
IEEE Trans. Speech Audio Process., 2007

Fusion of Heterogeneous Speaker Recognition Systems in the STBU Submission for the NIST Speaker Recognition Evaluation 2006.
IEEE Trans. Speech Audio Process., 2007

Maximum Likelihood and Maximum Mutual Information Training in Gender and Age Recognition System.
Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

TRAP-Based Techniques for Recognition of Noisy Speech.
Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

Spoken Term Detection System Based on Combination of LVCSR and Phonetic Search.
Proceedings of the Machine Learning for Multimodal Interaction , 2007

Application of CMLLR in narrow band wide band adapted systems.
Proceedings of the INTERSPEECH 2007, 2007

STBU System for the NIST 2006 Speaker Recognition Evaluation.
Proceedings of the IEEE International Conference on Acoustics, 2007

Probabilistic and Bottle-Neck Features for LVCSR of Meetings.
Proceedings of the IEEE International Conference on Acoustics, 2007

On Some Directions in Security-Oriented Research.
Proceedings of the 2007 ECSIS Symposium on Bio-inspired, 2007

2006
Indexing and Search Methods for Spoken Documents.
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

Brno University of Technology System for NIST 2005 Language Recognition Evaluation.
Proceedings of the Odyssey 2006, 2006

Robust Heteroscedastic Linear Discriminant Analysis and LCRC Posterior Features in Meeting Data Recognition.
Proceedings of the Machine Learning for Multimodal Interaction, 2006


Hierarchical Structures of Neural Networks for Phoneme Recognition.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Use of Anti-Models to Further Improve State-of-the-Art PRLM Language Recognition System.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Discriminative Training Techniques for Acoustic Language Identification.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Information Retrieval from Spoken Documents.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2006

2005
Phoneme Based Acoustics Keyword Spotting in Informal Continuous Speech.
Proceedings of the Text, Speech and Dialogue, 8th International Conference, 2005

Comparison of keyword spotting approaches for informal continuous speech.
Proceedings of the INTERSPEECH 2005, 2005

Non-parametric speaker turn segmentation of meeting data.
Proceedings of the INTERSPEECH 2005, 2005

Phonotactic language identification using high quality phoneme recognition.
Proceedings of the INTERSPEECH 2005, 2005

2004
Towards Lower Error Rates in Phoneme Recognition.
Proceedings of the Text, Speech and Dialogue, 7th International Conference, 2004

Multimodal Phoneme Recognition of Meeting Data.
Proceedings of the Text, Speech and Dialogue, 7th International Conference, 2004

Automatic Language Identification Using Phoneme and Automatically Derived Unit Strings.
Proceedings of the Text, Speech and Dialogue, 7th International Conference, 2004

Orthographic and Phonetic Annotation of Very Large Czech Corpora with Quality Assessment.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

TRAP based features for LVCSR of meting data.
Proceedings of the INTERSPEECH 2004, 2004

2003
All-Pole Modeling for Definition of Speech Features in Aurora3 DSR Task.
Proceedings of the Text, Speech and Dialogue, 6th International Conference, 2003

Phoneme Recognition Using Temporal Patterns.
Proceedings of the Text, Speech and Dialogue, 6th International Conference, 2003

Recognition of Speech with Non-random Attributes.
Proceedings of the Text, Speech and Dialogue, 6th International Conference, 2003

Recognition of phoneme strings using TRAP technique.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Autoregressive modeling based feature extraction for Aurora3 DSR task.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Time-domain based temporal processing with application of orthogonal transformations.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Some Like It Gaussian....
Proceedings of the Text, Speech and Dialogue, 5th International Conference, 2002

Advances in Very Low Bit Rate Speech Coding Using Recognition and Synthesis Techniques.
Proceedings of the Text, Speech and Dialogue, 5th International Conference, 2002

2001
Minimization of Transition Noise and HNM Synthesis in Very Low Bit Rate Speech Coding.
Proceedings of the Text, Speech and Dialogue, 4th International Conference, 2001

Speechdat-e: five eastern european speech databases for voice-operated teleservices completed.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Segmental Approaches for Automatic Speaker Verification.
Digit. Signal Process., 2000

Codage de la parole a bas et tres bas debits.
Ann. des Télécommunications, 2000

Optimal Pitch Path Tracking for More Reliable Pitch Detection.
Proceedings of the Text, Speech and Dialogue - Third International Workshop, 2000

1999
Recording of Czech and Slovak Telephone Databases within SpeechDat-E.
Proceedings of the Text, Speech and Dialogue - Second International Workshop, 1999

Very Low Bit Rate Speech Coding: Comparison of Data-Driven Units with Syllable Segments.
Proceedings of the Text, Speech and Dialogue - Second International Workshop, 1999

A segmental approach to text-independent speaker verification.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
Text-independent speaker verification using automatically labelled acoustic segments.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Segmental vocoder-going beyond the phonetic approach.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Quantization of spectral sequences using variable length spectral segments for speech coding at very low bit rate.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Speech spectrum representation and coding using multigrams with distance.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997


  Loading...