Lukás Burget

Orcid: 0000-0002-4951-5908

According to our database1, Lukás Burget authored at least 231 papers between 2001 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Beyond the Labels: Unveiling Text-Dependency in Paralinguistic Speech Recognition Datasets.
CoRR, 2024

Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information?
CoRR, 2024

2023
Twenty-Five Years of Evolution in Speech and Language Processing.
IEEE Signal Process. Mag., July, 2023

DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors.
CoRR, 2023

Discriminative Training of VBx Diarization.
CoRR, 2023

DiaCorrect: Error Correction Back-end For Speaker Diarization.
CoRR, 2023

Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization.
CoRR, 2023

Hystoc: Obtaining word confidences for fusion of end-to-end ASR systems.
CoRR, 2023

Stabilized training of joint energy-based models and their practical applications.
CoRR, 2023

Toroidal Probabilistic Spherical Discriminant Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2023

Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters.
Proceedings of the IEEE International Conference on Acoustics, 2023

Multi-Speaker and Wide-Band Simulated Conversations as Training Data for End-to-End Neural Diarization.
Proceedings of the IEEE International Conference on Acoustics, 2023

Speech-Based Emotion Recognition with Self-Supervised Models Using Attentive Channel-Wise Correlations and Label Smoothing.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Non-Parametric Bayesian Subspace Models for Acoustic Unit Discovery.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Spelling-Aware Word-Based End-to-End ASR.
IEEE Signal Process. Lett., 2022

Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: Theory, implementation and analysis on standard tasks.
Comput. Speech Lang., 2022

Extracting Speaker and Emotion Information from Self-Supervised Speech Models via Channel-Wise Correlations.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

An Attention-Based Backend Allowing Efficient Fine-Tuning of Transformer Models for Speaker Verification.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Analyzing Speaker Verification Embedding Extractors and Back-Ends Under Language and Channel Mismatch.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

Development of ABC Systems for the 2021 Edition of NIST Speaker Recognition Evaluation.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

Training speaker embedding extractors using multi-speaker audio with unknown speaker boundaries.
Proceedings of the Interspeech 2022, 2022

Learnable Sparse Filterbank for Speaker Verification.
Proceedings of the Interspeech 2022, 2022

From Simulated Mixtures to Simulated Conversations as Training Data for End-to-End Neural Diarization.
Proceedings of the Interspeech 2022, 2022

Revisiting joint decoding based multi-talker speech recognition with DNN acoustic model.
Proceedings of the Interspeech 2022, 2022

Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings.
Proceedings of the Interspeech 2022, 2022

Speaker adaptation for Wav2vec2 based dysarthric ASR.
Proceedings of the Interspeech 2022, 2022

GPU-Accelerated Forward-Backward Algorithm with Application to Lattice-Free MMI.
Proceedings of the IEEE International Conference on Acoustics, 2022

Multi-Channel Speaker Verification with Conv-Tasnet Based Beamformer.
Proceedings of the IEEE International Conference on Acoustics, 2022

Multisv: Dataset for Far-Field Multi-Channel Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2022

DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation and Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Integration of Variational Autoencoder and Spatial Clustering for Adaptive Multi-Channel Neural Speech Separation.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

The IWSLT 2021 BUT Speech Translation Systems.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

Speaker Embeddings by Modeling Channel-Wise Correlations.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Effective Phase Encoding for End-To-End Speaker Verification.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Out-of-Vocabulary Words Detection with Attention and CTC Alignments in an End-to-End ASR System.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Text Augmentation for Language Models in High Error Recognition Scenario.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

A Hierarchical Subspace Model for Language-Attuned Acoustic Unit Discovery.
Proceedings of the IEEE International Conference on Acoustics, 2021

Jointly Trained Transformers Models for Spoken Language Translation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Analysis of the but Diarization System for Voxconverse Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2021

Eat: Enhanced ASR-TTS for Self-Supervised Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Learning Document Embeddings Along With Their Uncertainties.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Analysis of Speaker Diarization Based on Bayesian HMM With Eigenvoice Priors.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

End-to-end DNN based text-independent speaker recognition for long and short utterances.
Comput. Speech Lang., 2020

13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE.
Comput. Speech Lang., 2020

A Technical Report: BUT Speech Translation Systems.
CoRR, 2020

Probabilistic Embeddings for Speaker Diarization.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020


SdSV Challenge 2020: Large-Scale Evaluation of Short-Duration Speaker Verification.
Proceedings of the Interspeech 2020, 2020

BUT Text-Dependent Speaker Verification System for SdSV Challenge 2020.
Proceedings of the Interspeech 2020, 2020

But System for the Second Dihard Speech Diarization Challenge.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Optimizing Bayesian Hmm Based X-Vector Clustering for the Second Dihard Speech Diarization Challenge.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Investigation of Specaugment for Deep Speaker Embedding Learning.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
SpeakerBeam: Speaker Aware Neural Network for Target Speaker Extraction in Speech Mixtures.
IEEE J. Sel. Top. Signal Process., 2019

Analysis of DNN Speech Signal Enhancement for Robust Speaker Recognition.
Comput. Speech Lang., 2019

Short-duration Speaker Verification (SdSV) Challenge 2020: the Challenge Evaluation Plan.
CoRR, 2019

Acoustic Scene Classification Using Fusion of Attentive Convolutional Neural Networks for DCASE2019 Challenge.
CoRR, 2019

BUT VOiCES 2019 System Description.
CoRR, 2019

Self-supervised Sequence-to-sequence ASR using Unpaired Speech and Text.
CoRR, 2019

BUT-FIT at SemEval-2019 Task 7: Determining the Rumour Stance with Pre-Trained Deep Bidirectional Transformers.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge.
Proceedings of the Interspeech 2019, 2019

On the Usage of Phonetic Information for Text-Independent Speaker Embedding Extraction.
Proceedings of the Interspeech 2019, 2019

Self-Supervised Speaker Embeddings.
Proceedings of the Interspeech 2019, 2019

Bayesian Subspace Hidden Markov Model for Acoustic Unit Discovery.
Proceedings of the Interspeech 2019, 2019

Factorization of Discriminatively Trained i-Vector Extractor for Speaker Recognition.
Proceedings of the Interspeech 2019, 2019

Analysis of BUT Submission in Far-Field Scenarios of VOiCES 2019 Challenge.
Proceedings of the Interspeech 2019, 2019

Bayesian HMM Based x-Vector Clustering for Speaker Diarization.
Proceedings of the Interspeech 2019, 2019

Semi-Supervised Sequence-to-Sequence ASR Using Unpaired Speech and Text.
Proceedings of the Interspeech 2019, 2019

How to Improve Your Speaker Embeddings Extractor in Generic Toolkits.
Proceedings of the IEEE International Conference on Acoustics, 2019

Speaker Verification Using End-to-end Adversarial Language Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Discriminatively Re-trained I-vector Extractor for Speaker Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Promising Accurate Prefix Boosting for Sequence-to-sequence ASR.
Proceedings of the IEEE International Conference on Acoustics, 2019

A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: The Deepmine Database.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Speaker Verification with Application-Aware Beamforming.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Residual Memory Networks: Feed-forward approach to learn long temporal dependencies.
CoRR, 2018

Spoken Pass-Phrase Verification in the i-vector Space.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

BUT/Phonexia Bottleneck Feature Extractor.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Analysis of BUT-PT Submission for NIST LRE 2017.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Speaker Diarization based on Bayesian HMM with Eigenvoice Priors.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA model.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Fast Variational Bayes for Heavy-tailed PLDA Applied to i-vectors and x-vectors.
Proceedings of the Interspeech 2018, 2018

BUT System for Low Resource Indian Language ASR.
Proceedings of the Interspeech 2018, 2018

BUT OpenSAT 2017 Speech Recognition System.
Proceedings of the Interspeech 2018, 2018

BUT System for DIHARD Speech Diarization Challenge 2018.
Proceedings of the Interspeech 2018, 2018

i-Vectors in Language Modeling: An Efficient Way of Domain Adaptation for Feed-Forward Models.
Proceedings of the Interspeech 2018, 2018

End-to-End DNN Based Speaker Recognition Inspired by I-Vector and PLDA.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Bayesian Models for Unit Discovery on a Very Low Resource Language.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Analysis of Multilingual Blstm Acoustic Model on Low and High Resource Languages.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Out-of-Vocabulary Word Recovery using FST-Based Subword Unit Clustering in a Hybrid ASR System.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Convolutional neural networks and x-vector embedding for DCASE2018 Acoustic Scene Classification challenge.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

2017
HMM-Based Phrase-Independent i-Vector Extractor for Text-Dependent Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Text-dependent speaker verification based on i-vectors, Neural Networks and Hidden Markov Models.
Comput. Speech Lang., 2017

Semi-Supervised DNN Training with Word Selection for ASR.
Proceedings of the Interspeech 2017, 2017

Alternative Approaches to Neural Network Based Speaker Verification.
Proceedings of the Interspeech 2017, 2017


Analysis of Score Normalization in Multilingual Speaker Recognition.
Proceedings of the Interspeech 2017, 2017

2016 BUT Babel System: Multilingual BLSTM Acoustic Model with i-Vector Based Adaptation.
Proceedings of the Interspeech 2017, 2017

Residual Memory Networks in Language Modeling: Improving the Reputation of Feed-Forward Networks.
Proceedings of the Interspeech 2017, 2017

Bayesian phonotactic Language Model for Acoustic Unit Discovery.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

An empirical evaluation of zero resource acoustic unit discovery.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Topic identification of spoken documents using unsupervised acoustic unit discovery.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Bayesian joint-sequence models for grapheme-to-phoneme conversion.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Residual memory networks: Feed-forward approach to learn long-term temporal dependencies.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Training Data Augmentation and Data Selection.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016
Variational Inference for Acoustic Unit Discovery.
Proceedings of the SLTU-2016, 2016

Analysis of the DNN-based SRE systems in multi-language conditions.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Deep Neural Networks and Hidden Markov Models in i-vector-based Text-Dependent Speaker Verification.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

BAT System Description for NIST LRE 2015.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Analysis and Optimization of Bottleneck Features for Speaker Recognition.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Data Selection by Sequence Summarizing Neural Network in Mismatch Condition Training.
Proceedings of the Interspeech 2016, 2016

i-Vector/HMM Based Text-Dependent Speaker Verification System for RedDots Challenge.
Proceedings of the Interspeech 2016, 2016

Sequence Summarizing Neural Networks for Spoken Language Recognition.
Proceedings of the Interspeech 2016, 2016

Analysis of Speaker Recognition Systems in Realistic Scenarios of the SITW 2016 Challenge.
Proceedings of the Interspeech 2016, 2016

Exploiting Hidden-Layer Responses of Deep Neural Networks for Language Recognition.
Proceedings of the Interspeech 2016, 2016

Learning Document Representations Using Subspace Multinomial Model.
Proceedings of the Interspeech 2016, 2016

Sequence summarizing neural network for speaker adaptation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Audio enhancing with DNN autoencoder for speaker recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Analysis of DNN approaches to speaker identification.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Multilingual region-dependent transforms.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
DNN derived filters for processing of modulation spectrum of speech.
Proceedings of the INTERSPEECH 2015, 2015

Three ways to adapt a CTS recognizer to unseen reverberated speech in BUT system for the ASpIRE challenge.
Proceedings of the INTERSPEECH 2015, 2015

Migrating i-vectors between speaker recognition systems using regression neural networks.
Proceedings of the INTERSPEECH 2015, 2015

Copingwith channel mismatch in Query-by-Example - But QUESST 2014.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Employment of Subspace Gaussian Mixture Models in speaker recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Robust speech recognition in unknown reverberant and noisy conditions.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Non-Negative Factor Analysis of Gaussian Mixture Model Weight Adaptation for Language and Dialect Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

But ASR system for BABEL Surprise evaluation 2014.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

GMM Weights Adaptation Based on Subspace Approaches for Speaker Verification.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

BUT QUESST 2014 system description.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

PLLR features in language recognition system for RATS.
Proceedings of the INTERSPEECH 2014, 2014

Calibration and fusion of query-by-example systems - But SWS 2013.
Proceedings of the IEEE International Conference on Acoustics, 2014

Unscented transform for ivector-based noisy speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Domain adaptation via within-class covariance correction in I-vector based speaker recognition systems.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Pairwise Discriminative Speaker Verification in the 𝕀-Vector Space.
IEEE Trans. Speech Audio Process., 2013

BUT SWS 2013 - Massive Parallel Approach.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

Sequence-discriminative training of deep neural networks.
Proceedings of the INTERSPEECH 2013, 2013

Regularized subspace n-gram model for phonotactic ivector extraction.
Proceedings of the INTERSPEECH 2013, 2013

A region-specific feature-space transformation for speaker adaptation and singularity analysis of jacobian matrix.
Proceedings of the INTERSPEECH 2013, 2013

A noise robust i-vector extractor using vector taylor series for speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Rich system combination for keyword spotting in noisy and acoustically heterogeneous audio streams.
Proceedings of the IEEE International Conference on Acoustics, 2013

Semi-supervised training of Deep Neural Networks.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Out-of-Vocabulary Word Detection and Beyond.
Proceedings of the Detection and Identification of Rare Audiovisual Cues, 2012

Transcribing Meetings With the AMIDA Systems.
IEEE Trans. Speech Audio Process., 2012

A unified approach for audio characterization and its application to speaker recognition.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Bilinear Factor Analysis for iVector Based Speaker Verification.
Proceedings of the INTERSPEECH 2012, 2012

Discriminatively trained phoneme confusion model for keyword spotting.
Proceedings of the INTERSPEECH 2012, 2012

Discriminative classifiers for phonotactic language recognition with iVectors.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Generating exact lattices in the WFST framework.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Towards noise-robust speaker recognition using probabilistic linear discriminant analysis.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Improving language models for ASR using translated in-domain data.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Region dependent linear transforms in multilingual speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

iVector-based prosodic system for language identification.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Application of speaker- and language identification state-of-the-art techniques for emotion recognition.
Speech Commun., 2011

The subspace Gaussian mixture model - A structured model for speech recognition.
Comput. Speech Lang., 2011

iVector Approach to Phonotactic Language Recognition.
Proceedings of the INTERSPEECH 2011, 2011

Empirical Evaluation and Combination of Advanced Language Modeling Techniques.
Proceedings of the INTERSPEECH 2011, 2011

Language Recognition in iVectors Space.
Proceedings of the INTERSPEECH 2011, 2011

Recurrent Neural Network Based Language Modeling in Meeting Recognition.
Proceedings of the INTERSPEECH 2011, 2011

iVector Fusion of Prosodic and Cepstral Features for Speaker Verification.
Proceedings of the INTERSPEECH 2011, 2011

Discriminatively Trained i-vector Extractor for Speaker Verification.
Proceedings of the INTERSPEECH 2011, 2011

Extensions of recurrent neural network language model.
Proceedings of the IEEE International Conference on Acoustics, 2011

Full-covariance UBM and heavy-tailed PLDA in i-vector speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2011

Recent progress in prosodic speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2011

Simplification and optimization of i-vector extraction.
Proceedings of the IEEE International Conference on Acoustics, 2011

Fast discriminative speaker verification in the i-vector space.
Proceedings of the IEEE International Conference on Acoustics, 2011

Discriminatively trained Probabilistic Linear Discriminant Analysis for speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2011

Strategies for training large scale neural network language models.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

iVector-based discriminative adaptation for automatic speech recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Recovery of Rare Words in Lecture Speech.
Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

PCA-based Feature Extraction for Phonotactic Language Recognition.
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010

Data selection and calibration issues in automatic language recognition - investigation with BUT-AGNITIO NIST LRE 2009 system.
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010

Parallel training of neural networks for speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

Recurrent neural network based language model.
Proceedings of the INTERSPEECH 2010, 2010

Prosodic speaker verification using subspace multinomial models with intersession compensation.
Proceedings of the INTERSPEECH 2010, 2010

Brno university of technology system for interspeech 2010 paralinguistic challenge.
Proceedings of the INTERSPEECH 2010, 2010

Similarity scoring for recognizing repeated out-of-vocabulary words.
Proceedings of the INTERSPEECH 2010, 2010

The AMIDA 2009 meeting transcription system.
Proceedings of the INTERSPEECH 2010, 2010

Subspace Gaussian Mixture Models for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

Tuning phone decoders for language identification.
Proceedings of the IEEE International Conference on Acoustics, 2010

Investigations into prosodic syllable contour features for speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

Approaches to automatic lexicon learning with limited training examples.
Proceedings of the IEEE International Conference on Acoustics, 2010

A novel estimation of feature-space MLLR for full-covariance models.
Proceedings of the IEEE International Conference on Acoustics, 2010

Multilingual acoustic modeling for speech recognition based on subspace Gaussian Mixture Models.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Posterior-based out of vocabulary word detection in telephone speech.
Proceedings of the INTERSPEECH 2009, 2009

Brno University of Technology system for Interspeech 2009 emotion challenge.
Proceedings of the INTERSPEECH 2009, 2009

Investigation into bottle-neck features for meeting speech recognition.
Proceedings of the INTERSPEECH 2009, 2009

Investigation into variants of joint factor analysis for speaker recognition.
Proceedings of the INTERSPEECH 2009, 2009

BUT system for NIST 2008 speaker recognition evaluation.
Proceedings of the INTERSPEECH 2009, 2009

Discriminative acoustic language recognition via channel-compensated GMM statistics.
Proceedings of the INTERSPEECH 2009, 2009

Neural network based language models for highly inflective languages.
Proceedings of the IEEE International Conference on Acoustics, 2009

Comparison of scoring methods used in speaker recognition with Joint Factor Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2009

Support vector machines and Joint Factor Analysis for speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Acquisition of Telephone Data from Radio Broadcasts with Applications to Language Recognition.
Proceedings of the Text, Speech and Dialogue, 11th International Conference, 2008

Sub-word modeling of out of vocabulary words in spoken term detection.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Morphological random forests for language modeling of inflectional languages.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Contour modeling of prosodic and acoustic features for speaker recognition.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

BUT language recognition system for NIST 2007 evaluations.
Proceedings of the INTERSPEECH 2008, 2008

Discrimininative training of narrow band - wide band adapted systems for meeting recognition.
Proceedings of the INTERSPEECH 2008, 2008

Discriminative training and channel compensation for acoustic language recognition.
Proceedings of the INTERSPEECH 2008, 2008

Advances in phonotactic language recognition.
Proceedings of the INTERSPEECH 2008, 2008

Confidence estimation, OOV detection and language ID using phone-to-word transduction and phone-level alignments.
Proceedings of the IEEE International Conference on Acoustics, 2008

Combination of strongly and weakly constrained recognizers for reliable detection of OOVS.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Analysis of Feature Extraction and Channel Compensation in a GMM Speaker Recognition System.
IEEE Trans. Speech Audio Process., 2007

Fusion of Heterogeneous Speaker Recognition Systems in the STBU Submission for the NIST Speaker Recognition Evaluation 2006.
IEEE Trans. Speech Audio Process., 2007

Maximum Likelihood and Maximum Mutual Information Training in Gender and Age Recognition System.
Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

Spoken Term Detection System Based on Combination of LVCSR and Phonetic Search.
Proceedings of the Machine Learning for Multimodal Interaction , 2007

Application of CMLLR in narrow band wide band adapted systems.
Proceedings of the INTERSPEECH 2007, 2007

STBU System for the NIST 2006 Speaker Recognition Evaluation.
Proceedings of the IEEE International Conference on Acoustics, 2007

The AMI System for the Transcription of Speech in Meetings.
Proceedings of the IEEE International Conference on Acoustics, 2007

The 2007 AMI(DA) System for Meeting Transcription.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

2006
Indexing and Search Methods for Spoken Documents.
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

Brno University of Technology System for NIST 2005 Language Recognition Evaluation.
Proceedings of the Odyssey 2006, 2006

Robust Heteroscedastic Linear Discriminant Analysis and LCRC Posterior Features in Meeting Data Recognition.
Proceedings of the Machine Learning for Multimodal Interaction, 2006

The AMI Meeting Transcription System: Progress and Performance.
Proceedings of the Machine Learning for Multimodal Interaction, 2006

Use of Anti-Models to Further Improve State-of-the-Art PRLM Language Recognition System.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Discriminative Training Techniques for Acoustic Language Identification.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Information Retrieval from Spoken Documents.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2006

2005
Phoneme Based Acoustics Keyword Spotting in Informal Continuous Speech.
Proceedings of the Text, Speech and Dialogue, 8th International Conference, 2005

The Development of the AMI System for the Transcription of Speech in Meetings.
Proceedings of the Machine Learning for Multimodal Interaction, 2005

The 2005 AMI System for the Transcription of Speech in Meetings.
Proceedings of the Machine Learning for Multimodal Interaction, 2005

Comparison of keyword spotting approaches for informal continuous speech.
Proceedings of the INTERSPEECH 2005, 2005

Non-parametric speaker turn segmentation of meeting data.
Proceedings of the INTERSPEECH 2005, 2005

2004
Measurement of Complementarity of Recognition Systems.
Proceedings of the Text, Speech and Dialogue, 7th International Conference, 2004

Combination of speech features using smoothed heteroscedastic linear discriminant analysis.
Proceedings of the INTERSPEECH 2004, 2004

2003
Recognition of Speech with Non-random Attributes.
Proceedings of the Text, Speech and Dialogue, 6th International Conference, 2003

2002
Efficient Noise Estimation and Its Application for Robust Speech Recognition.
Proceedings of the Text, Speech and Dialogue, 5th International Conference, 2002

Noise estimation for efficient speech enhancement and robust speech recognition.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Qualcomm-ICSI-OGI features for ASR.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001
Data Driven Design of Filter Bank for Speech Recognition.
Proceedings of the Text, Speech and Dialogue, 4th International Conference, 2001

Robust ASR front-end using spectral-based and discriminant features: experiments on the Aurora tasks.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001


  Loading...