Martin Karafiát

Orcid: 0000-0001-6474-8366

According to our database1, Martin Karafiát authored at least 95 papers between 2002 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
BUT CHiME-7 system description.
CoRR, 2023

2021
The IWSLT 2021 BUT Speech Translation Systems.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

Jointly Trained Transformers Models for Spoken Language Translation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Analysis of X-Vectors for Low-Resource Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge.
Proceedings of the Fifth International Conference, 2021

2020
BUT Opensat 2019 Speech Recognition System.
CoRR, 2020

2019
Analysis of Multilingual Sequence-to-Sequence Speech Recognition Systems.
Proceedings of the Interspeech 2019, 2019

Promising Accurate Prefix Boosting for Sequence-to-sequence ASR.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Residual Memory Networks: Feed-forward approach to learn long temporal dependencies.
CoRR, 2018

Multilingual Sequence-to-Sequence Speech Recognition: Architecture, Transfer Learning, and Language Modeling.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

BUT System for Low Resource Indian Language ASR.
Proceedings of the Interspeech 2018, 2018

BUT OpenSAT 2017 Speech Recognition System.
Proceedings of the Interspeech 2018, 2018

Analysis of Multilingual Blstm Acoustic Model on Low and High Resource Languages.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

2016 BUT Babel System: Multilingual BLSTM Acoustic Model with i-Vector Based Adaptation.
Proceedings of the Interspeech 2017, 2017

Residual memory networks: Feed-forward approach to learn long-term temporal dependencies.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Training Data Augmentation and Data Selection.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016
Bottle-Neck Feature Extraction Structures for Multilingual Training and Porting.
Proceedings of the SLTU-2016, 2016

Study of Large Data Resources for Multilingual Training and System Porting.
Proceedings of the SLTU-2016, 2016

Multilingual BLSTM and speaker-specific vector adaptation in 2016 but babel system.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Boosting performance on low-resource languages by standard corpora: An analysis.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

BUT Zero-Cost Speech Recognition 2016 System Description.
Proceedings of the Working Notes Proceedings of the MediaEval 2016 Workshop, 2016

Data Selection by Sequence Summarizing Neural Network in Mismatch Condition Training.
Proceedings of the Interspeech 2016, 2016

Sequence summarizing neural network for speaker adaptation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Multilingual region-dependent transforms.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Three ways to adapt a CTS recognizer to unseen reverberated speech in BUT system for the ASpIRE challenge.
Proceedings of the INTERSPEECH 2015, 2015

Robust speech recognition in unknown reverberant and noisy conditions.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Adapting multilingual neural network hierarchy to a new language.
Proceedings of the 4th Workshop on Spoken Language Technologies for Under-resourced Languages, 2014

But ASR system for BABEL Surprise evaluation 2014.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Further investigation into multilingual training and adaptation of stacked bottle-neck neural network structure.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Progress in the BBN keyword search system for the DARPA RATS program.
Proceedings of the INTERSPEECH 2014, 2014

BUT 2014 Babel system: analysis of adaptation in NN based systems.
Proceedings of the INTERSPEECH 2014, 2014

Combination of multilingual and semi-supervised training for under-resourced languages.
Proceedings of the INTERSPEECH 2014, 2014

But neural network features for spontaneous Vietnamese in BABEL.
Proceedings of the IEEE International Conference on Acoustics, 2014

Adaptation of multilingual stacked bottle-neck neural network structure for new language.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
A region-specific feature-space transformation for speaker adaptation and singularity analysis of jacobian matrix.
Proceedings of the INTERSPEECH 2013, 2013

BUT BABEL system for spontaneous Cantonese.
Proceedings of the INTERSPEECH 2013, 2013

Feature and score level combination of subspace Gaussinas in LVCSR task.
Proceedings of the IEEE International Conference on Acoustics, 2013

Manual and semi-automatic approaches to building a multilingual phoneme set.
Proceedings of the IEEE International Conference on Acoustics, 2013

Score normalization and system combination for improved keyword spotting.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Semi-supervised bootstrapping approach for neural network feature extractor training.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Transcribing Meetings With the AMIDA Systems.
IEEE Trans. Speech Audio Process., 2012

Dealing with Numbers in Grapheme-Based Speech Recognition.
Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

The language-independent bottleneck features.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Speaker vectors from subspace Gaussian mixture model as complementary features for language identification.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Description and analysis of the Brno276 system for LRE2011.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

A factorized representation of FMLLR transform based on QR-decomposition.
Proceedings of the INTERSPEECH 2012, 2012

Generating exact lattices in the WFST framework.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Improving language models for ASR using translated in-domain data.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Region dependent linear transforms in multilingual speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Independent component analysis and MLLR transforms for speaker identification.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
The subspace Gaussian mixture model - A structured model for speech recognition.
Comput. Speech Lang., 2011

Recurrent Neural Network Based Language Modeling in Meeting Recognition.
Proceedings of the INTERSPEECH 2011, 2011

Integrating Recent MLP Feature Extraction Techniques into TRAP Architecture.
Proceedings of the INTERSPEECH 2011, 2011

A symmetrization of the Subspace Gaussian Mixture Model.
Proceedings of the IEEE International Conference on Acoustics, 2011

Simplification and optimization of i-vector extraction.
Proceedings of the IEEE International Conference on Acoustics, 2011

Variational approximation of long-span language models for lvcsr.
Proceedings of the IEEE International Conference on Acoustics, 2011

Convolutive Bottleneck Network features for LVCSR.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

iVector-based discriminative adaptation for automatic speech recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

Study of probabilistic and Bottle-Neck features in multilingual environment.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Using Gradient Descent Optimization for Acoustics Training from Heterogeneous Data.
Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

Data selection and calibration issues in automatic language recognition - investigation with BUT-AGNITIO NIST LRE 2009 system.
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010

Recurrent neural network based language model.
Proceedings of the INTERSPEECH 2010, 2010

Similarity scoring for recognizing repeated out-of-vocabulary words.
Proceedings of the INTERSPEECH 2010, 2010

The AMIDA 2009 meeting transcription system.
Proceedings of the INTERSPEECH 2010, 2010

Hierarchical neural net architectures for feature extraction in ASR.
Proceedings of the INTERSPEECH 2010, 2010

Subword-based spoken term detection in audio course lectures.
Proceedings of the IEEE International Conference on Acoustics, 2010

Subspace Gaussian Mixture Models for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

Approaches to automatic lexicon learning with limited training examples.
Proceedings of the IEEE International Conference on Acoustics, 2010

A novel estimation of feature-space MLLR for full-covariance models.
Proceedings of the IEEE International Conference on Acoustics, 2010

Multilingual acoustic modeling for speech recognition based on subspace Gaussian Mixture Models.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Posterior-based out of vocabulary word detection in telephone speech.
Proceedings of the INTERSPEECH 2009, 2009

Investigation into bottle-neck features for meeting speech recognition.
Proceedings of the INTERSPEECH 2009, 2009

Real-time ASR from meetings.
Proceedings of the INTERSPEECH 2009, 2009

BUT system for NIST 2008 speaker recognition evaluation.
Proceedings of the INTERSPEECH 2009, 2009

2008
Advances in Acoustic Modeling for the Recognition of Czech.
Proceedings of the Text, Speech and Dialogue, 11th International Conference, 2008

Discrimininative training of narrow band - wide band adapted systems for meeting recognition.
Proceedings of the INTERSPEECH 2008, 2008

2007
Fusion of Heterogeneous Speaker Recognition Systems in the STBU Submission for the NIST Speaker Recognition Evaluation 2006.
IEEE Trans. Speech Audio Process., 2007

Spoken Term Detection System Based on Combination of LVCSR and Phonetic Search.
Proceedings of the Machine Learning for Multimodal Interaction , 2007

Application of CMLLR in narrow band wide band adapted systems.
Proceedings of the INTERSPEECH 2007, 2007

STBU System for the NIST 2006 Speaker Recognition Evaluation.
Proceedings of the IEEE International Conference on Acoustics, 2007

The AMI System for the Transcription of Speech in Meetings.
Proceedings of the IEEE International Conference on Acoustics, 2007

Probabilistic and Bottle-Neck Features for LVCSR of Meetings.
Proceedings of the IEEE International Conference on Acoustics, 2007

The 2007 AMI(DA) System for Meeting Transcription.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

2006
Indexing and Search Methods for Spoken Documents.
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

Robust Heteroscedastic Linear Discriminant Analysis and LCRC Posterior Features in Meeting Data Recognition.
Proceedings of the Machine Learning for Multimodal Interaction, 2006

The AMI Meeting Transcription System: Progress and Performance.
Proceedings of the Machine Learning for Multimodal Interaction, 2006

Information Retrieval from Spoken Documents.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2006

2005
Phoneme Based Acoustics Keyword Spotting in Informal Continuous Speech.
Proceedings of the Text, Speech and Dialogue, 8th International Conference, 2005

The Development of the AMI System for the Transcription of Speech in Meetings.
Proceedings of the Machine Learning for Multimodal Interaction, 2005

The 2005 AMI System for the Transcription of Speech in Meetings.
Proceedings of the Machine Learning for Multimodal Interaction, 2005

Comparison of keyword spotting approaches for informal continuous speech.
Proceedings of the INTERSPEECH 2005, 2005

Transcription of conference room meetings: an investigation.
Proceedings of the INTERSPEECH 2005, 2005

2004
TRAP based features for LVCSR of meting data.
Proceedings of the INTERSPEECH 2004, 2004

2002
Some Like It Gaussian....
Proceedings of the Text, Speech and Dialogue, 5th International Conference, 2002


  Loading...