Lukás Burget

Martin Kocour

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Twenty-Five Years of Evolution in Speech and Language Processing.

[BibT_eX]

[DOI]

IEEE Signal Process. Mag., July, 2023

Stabilized training of joint energy-based models and their practical applications.

[BibT_eX]

[DOI]

Laureano Moro-Velázquez

Najim Dehak

CoRR, 2023

Improving Speaker Verification with Self-Pretrained Transformer Models.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Multi-Channel Speech Separation with Cross-Attention and Beamforming.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Description and Analysis of ABC Submission to NIST LRE 2022.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Toroidal Probabilistic Spherical Discriminant Analysis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Multi-Speaker and Wide-Band Simulated Conversations as Training Data for End-to-End Neural Diarization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Speech-Based Emotion Recognition with Self-Supervised Models Using Attentive Channel-Wise Correlations and Label Smoothing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Non-Parametric Bayesian Subspace Models for Acoustic Unit Discovery.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2022

Spelling-Aware Word-Based End-to-End ASR.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2022

Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: Theory, implementation and analysis on standard tasks.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2022

Extracting Speaker and Emotion Information from Self-Supervised Speech Models via Channel-Wise Correlations.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

An Attention-Based Backend Allowing Efficient Fine-Tuning of Transformer Models for Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Analyzing Speaker Verification Embedding Extractors and Back-Ends Under Language and Channel Mismatch.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

Development of ABC Systems for the 2021 Edition of NIST Speaker Recognition Evaluation.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

Training speaker embedding extractors using multi-speaker audio with unknown speaker boundaries.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Learnable Sparse Filterbank for Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

From Simulated Mixtures to Simulated Conversations as Training Data for End-to-End Neural Diarization.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Revisiting joint decoding based multi-talker speech recognition with DNN acoustic model.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Speaker adaptation for Wav2vec2 based dysarthric ASR.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

GPU-Accelerated Forward-Backward Algorithm with Application to Lattice-Free MMI.

[BibT_eX]

[DOI]

Lucas Ondel

Léa-Marie Lam-Yee-Mui

Martin Kocour

Caio Filippo Corro

Proceedings of the IEEE International Conference on Acoustics, 2022

Multi-Channel Speaker Verification with Conv-Tasnet Based Beamformer.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Multisv: Dataset for Far-Field Multi-Channel Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation and Extraction.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

BCN2BRNO: ASR System Fusion for Albayzin 2022 Speech to Text Challenge.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference, 2022

2021

Integration of Variational Autoencoder and Spatial Clustering for Adaptive Multi-Channel Neural Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

The IWSLT 2021 BUT Speech Translation Systems.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Spoken Language Translation, 2021

Speaker Embeddings by Modeling Channel-Wise Correlations.

[BibT_eX]

[DOI]

Themos Stafylakis

Johan Rohdin

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Effective Phase Encoding for End-To-End Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Out-of-Vocabulary Words Detection with Attention and CTC Alignments in an End-to-End ASR System.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Text Augmentation for Language Models in High Error Recognition Scenario.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

A Hierarchical Subspace Model for Language-Attuned Acoustic Unit Discovery.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Jointly Trained Transformers Models for Spoken Language Translation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Analysis of the but Diarization System for Voxconverse Challenge.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Eat: Enhanced ASR-TTS for Self-Supervised Speech Recognition.

[BibT_eX]

[DOI]

Ramón Fernandez Astudillo

Shinji Watanabe

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Learning Document Embeddings Along With Their Uncertainties.

[BibT_eX]

[DOI]

Santosh Kesiraju

Oldrich Plchot

Suryakanth V. Gangashetty

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Analysis of Speaker Diarization Based on Bayesian HMM With Eigenvoice Priors.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

End-to-end DNN based text-independent speaker recognition for long and short utterances.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2020

13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2020

A Technical Report: BUT Speech Translation Systems.

[BibT_eX]

[DOI]

Hari Krishna Vydana

CoRR, 2020

Probabilistic Embeddings for Speaker Diarization.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

SdSV Challenge 2020: Large-Scale Evaluation of Short-Duration Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

BUT Text-Dependent Speaker Verification System for SdSV Challenge 2020.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

But System for the Second Dihard Speech Diarization Challenge.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Optimizing Bayesian Hmm Based X-Vector Clustering for the Second Dihard Speech Diarization Challenge.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Investigation of Specaugment for Deep Speaker Embedding Learning.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

SpeakerBeam: Speaker Aware Neural Network for Target Speaker Extraction in Speech Mixtures.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., 2019

Analysis of DNN Speech Signal Enhancement for Robust Speaker Recognition.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2019

Short-duration Speaker Verification (SdSV) Challenge 2020: the Challenge Evaluation Plan.

[BibT_eX]

[DOI]

CoRR, 2019

Acoustic Scene Classification Using Fusion of Attentive Convolutional Neural Networks for DCASE2019 Challenge.

[BibT_eX]

[DOI]

CoRR, 2019

BUT VOiCES 2019 System Description.

[BibT_eX]

[DOI]

CoRR, 2019

Self-supervised Sequence-to-sequence ASR using Unpaired Speech and Text.

[BibT_eX]

[DOI]

Ramón Fernandez Astudillo

Shinji Watanabe

Takaaki Hori

CoRR, 2019

BUT-FIT at SemEval-2019 Task 7: Determining the Rumour Stance with Pre-Trained Deep Bidirectional Transformers.

[BibT_eX]

[DOI]

Martin Fajcik

Pavel Smrz

Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge.

[BibT_eX]

[DOI]

Themos Stafylakis

Georgia Athanasopoulou

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

On the Usage of Phonetic Information for Text-Independent Speaker Embedding Extraction.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Self-Supervised Speaker Embeddings.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Bayesian Subspace Hidden Markov Model for Acoustic Unit Discovery.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Factorization of Discriminatively Trained i-Vector Extractor for Speaker Recognition.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Analysis of BUT Submission in Far-Field Scenarios of VOiCES 2019 Challenge.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Bayesian HMM Based x-Vector Clustering for Speaker Diarization.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Semi-Supervised Sequence-to-Sequence ASR Using Unpaired Speech and Text.

[BibT_eX]

[DOI]

Ramón Fernandez Astudillo

Shinji Watanabe

Takaaki Hori

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

How to Improve Your Speaker Embeddings Extractor in Generic Toolkits.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Speaker Verification Using End-to-end Adversarial Language Adaptation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Discriminatively Re-trained I-vector Extractor for Speaker Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Promising Accurate Prefix Boosting for Sequence-to-sequence ASR.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: The Deepmine Database.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Speaker Verification with Application-Aware Beamforming.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018

Residual Memory Networks: Feed-forward approach to learn long temporal dependencies.

[BibT_eX]

[DOI]

CoRR, 2018

Spoken Pass-Phrase Verification in the i-vector Space.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

BUT/Phonexia Bottleneck Feature Extractor.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Analysis of BUT-PT Submission for NIST LRE 2017.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Speaker Diarization based on Bayesian HMM with Eigenvoice Priors.

[BibT_eX]

[DOI]

Mireia Díez

Pavel Matejka

Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA model.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Fast Variational Bayes for Heavy-tailed PLDA Applied to i-vectors and x-vectors.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

BUT System for Low Resource Indian Language ASR.

[BibT_eX]

[DOI]

Bhargav Pulugundla

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

BUT OpenSAT 2017 Speech Recognition System.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

BUT System for DIHARD Speech Diarization Challenge 2018.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

i-Vectors in Language Modeling: An Efficient Way of Domain Adaptation for Feed-Forward Models.

[BibT_eX]

[DOI]

Santosh Kesiraju

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

End-to-End DNN Based Speaker Recognition Inspired by I-Vector and PLDA.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Bayesian Models for Unit Discovery on a Very Low Resource Language.

[BibT_eX]

[DOI]

Mark Hasegawa-Johnson

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Analysis of Multilingual Blstm Acoustic Model on Low and High Resource Languages.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Out-of-Vocabulary Word Recovery using FST-Based Subword Unit Clustering in a Hybrid ASR System.

[BibT_eX]

[DOI]

Ekaterina Egorova

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Convolutional neural networks and x-vector embedding for DCASE2018 Acoustic Scene Classification challenge.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

2017

HMM-Based Phrase-Independent i-Vector Extractor for Text-Dependent Speaker Verification.

[BibT_eX]

[DOI]

Hossein Sameti

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Text-dependent speaker verification based on i-vectors, Neural Networks and Hidden Markov Models.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2017

Semi-Supervised DNN Training with Word Selection for ASR.

[BibT_eX]

[DOI]

Karel Veselý

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Alternative Approaches to Neural Network Based Speaker Verification.

[BibT_eX]

[DOI]

Anna Silnova

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Team ELISA System for DARPA LORELEI Speech Evaluation 2016.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Analysis of Score Normalization in Multilingual Speaker Recognition.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016 BUT Babel System: Multilingual BLSTM Acoustic Model with i-Vector Based Adaptation.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Residual Memory Networks in Language Modeling: Improving the Reputation of Feed-Forward Networks.

[BibT_eX]

[DOI]

Suryakanth V. Gangashetty

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Bayesian phonotactic Language Model for Acoustic Unit Discovery.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

An empirical evaluation of zero resource acoustic unit discovery.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Topic identification of spoken documents using unsupervised acoustic unit discovery.

[BibT_eX]

[DOI]

Santosh Kesiraju

Raghavendra Pappagari

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Bayesian joint-sequence models for grapheme-to-phoneme conversion.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Residual memory networks: Feed-forward approach to learn long-term temporal dependencies.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Training Data Augmentation and Data Selection.

[BibT_eX]

[DOI]

Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016

Variational Inference for Acoustic Unit Discovery.

[BibT_eX]

[DOI]

Lucas Ondel

Joaquin Gonzalez-Rodriguez

Proceedings of the SLTU-2016, 2016

Analysis of the DNN-based SRE systems in multi-language conditions.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Deep Neural Networks and Hidden Markov Models in i-vector-based Text-Dependent Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

BAT System Description for NIST LRE 2015.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Analysis and Optimization of Bottleneck Features for Speaker Recognition.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Data Selection by Sequence Summarizing Neural Network in Mismatch Condition Training.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

i-Vector/HMM Based Text-Dependent Speaker Verification System for RedDots Challenge.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Sequence Summarizing Neural Networks for Spoken Language Recognition.

[BibT_eX]

[DOI]

Jan Pesán

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Analysis of Speaker Recognition Systems in Realistic Scenarios of the SITW 2016 Challenge.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Exploiting Hidden-Layer Responses of Deep Neural Networks for Language Recognition.

[BibT_eX]

[DOI]

Ruizhi Li

Sri Harish Reddy Mallidi

Oldrich Plchot

Najim Dehak

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Learning Document Representations Using Subspace Multinomial Model.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Sequence summarizing neural network for speaker adaptation.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Audio enhancing with DNN autoencoder for speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Analysis of DNN approaches to speaker identification.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Multilingual region-dependent transforms.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

DNN derived filters for processing of modulation spectrum of speech.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Three ways to adapt a CTS recognizer to unseen reverberated speech in BUT system for the ASpIRE challenge.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Migrating i-vectors between speaker recognition systems using regression neural networks.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Copingwith channel mismatch in Query-by-Example - But QUESST 2014.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Employment of Subspace Gaussian Mixture Models in speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop.

[BibT_eX]

[DOI]

Sri Harish Reddy Mallidi

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Robust speech recognition in unknown reverberant and noisy conditions.

[BibT_eX]

[DOI]

Sri Harish Reddy Mallidi

Hynek Hermansky

Stavros Tsakalidis

Richard M. Schwartz

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

Non-Negative Factor Analysis of Gaussian Mixture Model Weight Adaptation for Language and Dialect Recognition.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2014

But ASR system for BABEL Surprise evaluation 2014.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

GMM Weights Adaptation Based on Subspace Approaches for Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

BUT QUESST 2014 system description.

[BibT_eX]

[DOI]

Igor Szöke

Miroslav Skácel

Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

PLLR features in language recognition system for RATS.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Calibration and fusion of query-by-example systems - But SWS 2013.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Unscented transform for ivector-based noisy speaker recognition.

[BibT_eX]

[DOI]

David Martínez González

Proceedings of the IEEE International Conference on Acoustics, 2014

Domain adaptation via within-class covariance correction in I-vector based speaker recognition systems.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Pairwise Discriminative Speaker Verification in the 𝕀-Vector Space.

[BibT_eX]

[DOI]

Vasileios Vasilakakis

IEEE Trans. Speech Audio Process., 2013

BUT SWS 2013 - Massive Parallel Approach.

[BibT_eX]

[DOI]

Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

Sequence-discriminative training of deep neural networks.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Regularized subspace n-gram model for phonotactic ivector extraction.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A region-specific feature-space transformation for speaker adaptation and singularity analysis of jacobian matrix.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A noise robust i-vector extractor using vector taylor series for speaker recognition.

[BibT_eX]

[DOI]

Yun Lei

Nicolas Scheffer

Proceedings of the IEEE International Conference on Acoustics, 2013

Rich system combination for keyword spotting in noisy and acoustically heterogeneous audio streams.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Semi-supervised training of Deep Neural Networks.

[BibT_eX]

[DOI]

Karel Veselý

Mirko Hannemann

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012

Out-of-Vocabulary Word Detection and Beyond.

[BibT_eX]

[DOI]

Stefan Kombrink

Mirko Hannemann

Proceedings of the Detection and Identification of Rare Audiovisual Cues, 2012

Transcribing Meetings With the AMIDA Systems.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2012

A unified approach for audio characterization and its application to speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Bilinear Factor Analysis for iVector Based Speaker Verification.

[BibT_eX]

[DOI]

Yun Lei

Nicolas Scheffer

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Discriminatively trained phoneme confusion model for keyword spotting.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Discriminative classifiers for phonotactic language recognition with iVectors.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Generating exact lattices in the WFST framework.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Towards noise-robust speaker recognition using probabilistic linear discriminant analysis.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Improving language models for ASR using translated in-domain data.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Region dependent linear transforms in multilingual speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

iVector-based prosodic system for language identification.

[BibT_eX]

[DOI]

David Martínez González

Luciana Ferrer

Nicolas Scheffer

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

Application of speaker- and language identification state-of-the-art techniques for emotion recognition.

[BibT_eX]

[DOI]

Speech Commun., 2011

The subspace Gaussian mixture model - A structured model for speech recognition.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2011

iVector Approach to Phonotactic Language Recognition.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Empirical Evaluation and Combination of Advanced Language Modeling Techniques.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Language Recognition in iVectors Space.

[BibT_eX]

[DOI]

David Martínez González

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Recurrent Neural Network Based Language Modeling in Meeting Recognition.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

iVector Fusion of Prosodic and Cepstral Features for Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Discriminatively Trained i-vector Extractor for Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Extensions of recurrent neural network language model.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Full-covariance UBM and heavy-tailed PLDA in i-vector speaker verification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Recent progress in prosodic speaker verification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Simplification and optimization of i-vector extraction.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Fast discriminative speaker verification in the i-vector space.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Discriminatively trained Probabilistic Linear Discriminant Analysis for speaker verification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Strategies for training large scale neural network language models.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

iVector-based discriminative adaptation for automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010

Parallel Training of Neural Networks for Speech Recognition.

[BibT_eX]

[DOI]

Karel Veselý

Frantisek Grézl

Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

Recovery of Rare Words in Lecture Speech.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

PCA-based Feature Extraction for Phonotactic Language Recognition.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010

Data selection and calibration issues in automatic language recognition - investigation with BUT-AGNITIO NIST LRE 2009 system.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010

Recurrent neural network based language model.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Prosodic speaker verification using subspace multinomial models with intersession compensation.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Brno university of technology system for interspeech 2010 paralinguistic challenge.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Similarity scoring for recognizing repeated out-of-vocabulary words.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

The AMIDA 2009 meeting transcription system.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Subspace Gaussian Mixture Models for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Tuning phone decoders for language identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Investigations into prosodic syllable contour features for speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Approaches to automatic lexicon learning with limited training examples.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

A novel estimation of feature-space MLLR for full-covariance models.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Multilingual acoustic modeling for speech recognition based on subspace Gaussian Mixture Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Posterior-based out of vocabulary word detection in telephone speech.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Brno University of Technology system for Interspeech 2009 emotion challenge.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Investigation into bottle-neck features for meeting speech recognition.

[BibT_eX]

[DOI]

Frantisek Grézl

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Investigation into variants of joint factor analysis for speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

BUT system for NIST 2008 speaker recognition evaluation.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Discriminative acoustic language recognition via channel-compensated GMM statistics.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Neural network based language models for highly inflective languages.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

Comparison of scoring methods used in speaker recognition with Joint Factor Analysis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

Support vector machines and Joint Factor Analysis for speaker verification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

Acquisition of Telephone Data from Radio Broadcasts with Applications to Language Recognition.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue, 11th International Conference, 2008

Sub-word modeling of out of vocabulary words in spoken term detection.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Morphological random forests for language modeling of inflectional languages.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Contour modeling of prosodic and acoustic features for speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

BUT language recognition system for NIST 2007 evaluations.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Discrimininative training of narrow band - wide band adapted systems for meeting recognition.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Discriminative training and channel compensation for acoustic language recognition.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Advances in phonotactic language recognition.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Confidence estimation, OOV detection and language ID using phone-to-word transduction and phone-level alignments.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

Combination of strongly and weakly constrained recognizers for reliable detection of OOVS.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

Analysis of Feature Extraction and Channel Compensation in a GMM Speaker Recognition System.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2007

Fusion of Heterogeneous Speaker Recognition Systems in the STBU Submission for the NIST Speaker Recognition Evaluation 2006.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2007

Maximum Likelihood and Maximum Mutual Information Training in Gender and Age Recognition System.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

Spoken Term Detection System Based on Combination of LVCSR and Phonetic Search.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning for Multimodal Interaction , 2007

Application of CMLLR in narrow band wide band adapted systems.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

STBU System for the NIST 2006 Speaker Recognition Evaluation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

The AMI System for the Transcription of Speech in Meetings.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

The 2007 AMI(DA) System for Meeting Transcription.

[BibT_eX]

[DOI]

Proceedings of the Multimodal Technologies for Perception of Humans, 2007

2006

Indexing and Search Methods for Spoken Documents.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

Brno University of Technology System for NIST 2005 Language Recognition Evaluation.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2006: The Speaker and Language Recognition Workshop, 2006

Robust Heteroscedastic Linear Discriminant Analysis and LCRC Posterior Features in Meeting Data Recognition.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning for Multimodal Interaction, 2006

The AMI Meeting Transcription System: Progress and Performance.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning for Multimodal Interaction, 2006

Use of Anti-Models to Further Improve State-of-the-Art PRLM Language Recognition System.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Discriminative Training Techniques for Acoustic Language Identification.

[BibT_eX]

[DOI]

Pavel Matejka

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Information Retrieval from Spoken Documents.

[BibT_eX]

[DOI]

Proceedings of the Computational Linguistics and Intelligent Text Processing, 2006

2005

Phoneme Based Acoustics Keyword Spotting in Informal Continuous Speech.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue, 8th International Conference, 2005

The Development of the AMI System for the Transcription of Speech in Meetings.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning for Multimodal Interaction, 2005

The 2005 AMI System for the Transcription of Speech in Meetings.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning for Multimodal Interaction, 2005

Comparison of keyword spotting approaches for informal continuous speech.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Non-parametric speaker turn segmentation of meeting data.

[BibT_eX]

[DOI]

Petr Motlícek

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004

Measurement of Complementarity of Recognition Systems.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue, 7th International Conference, 2004

Combination of speech features using smoothed heteroscedastic linear discriminant analysis.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2003

Recognition of Speech with Non-random Attributes.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue, 6th International Conference, 2003

2002

Efficient Noise Estimation and Its Application for Robust Speech Recognition.

[BibT_eX]

[DOI]

Petr Motlícek

Proceedings of the Text, Speech and Dialogue, 5th International Conference, 2002

Noise estimation for efficient speech enhancement and robust speech recognition.

[BibT_eX]

[DOI]

Petr Motlícek

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Qualcomm-ICSI-OGI features for ASR.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001

Data Driven Design of Filter Bank for Speech Recognition.

[BibT_eX]

[DOI]