Ralf Schlüter

According to our database1, Ralf Schlüter authored at least 181 papers between 1997 and 2019.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepages:

On csauthors.net:

Bibliography

2019
Training of reduced-rank linear transformations for multi-layer polynomial acoustic features for speech recognition.
Speech Communication, 2019

Upper and Lower Tight Error Bounds for Feature Omission with an Extension to Context Reduction.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Investigation into Joint Optimization of Single Channel Speech Enhancement and Acoustic Modeling for Robust ASR.
Proceedings of the IEEE International Conference on Acoustics, 2019

On Using 2D Sequence-to-sequence Models for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Speaker Adapted Beamforming for Multi-Channel Automatic Speech Recognition.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Improved Training of End-to-end Attention Models for Speech Recognition.
Proceedings of the Interspeech 2018, 2018

Investigation on LSTM Recurrent N-gram Language Models for Speech Recognition.
Proceedings of the Interspeech 2018, 2018

Comparison of BLSTM-Layer-Specific Affine Transformations for Speaker Adaptation.
Proceedings of the Interspeech 2018, 2018

Investigation on Estimation of Sentence Probability by Combining Forward, Backward and Bi-directional LSTM-RNNs.
Proceedings of the Interspeech 2018, 2018

Segmental Encoder-Decoder Models for Large Vocabulary Automatic Speech Recognition.
Proceedings of the Interspeech 2018, 2018

Acoustic Modeling of Speech Waveform Based on Multi-Resolution, Neural Network Signal Processing.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Prediction of LSTM-RNN Full Context States as a Subtask for N-Gram Feedforward Language Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Inverted Alignments for End-to-End Automatic Speech Recognition.
J. Sel. Topics Signal Processing, 2017

The 2016 RWTH Keyword Search System for Low-Resource Languages.
Proceedings of the Speech and Computer - 19th International Conference, 2017

CTC in the Context of Generalized Full-Sum HMM Training.
Proceedings of the Interspeech 2017, 2017

Parallel Neural Network Features for Improved Tandem Acoustic Modeling.
Proceedings of the Interspeech 2017, 2017

Faster sequence training.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

A comprehensive study of deep bidirectional LSTM RNNS for acoustic modeling in speech recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Noisy objective functions based on the f-divergence.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Investigations on byte-level convolutional neural networks for language modeling in low resource speech recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Returnn: The RWTH extensible training framework for universal recurrent neural networks.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Automatic Speech Recognition Based on Neural Networks.
Proceedings of the Speech and Computer - 18th International Conference, 2016

Towards Online-Recognition with Deep Bidirectional LSTM Acoustic Models.
Proceedings of the Interspeech 2016, 2016

LSTM, GRU, Highway and a Bit of Attention: An Empirical Overview for Language Modeling in Speech Recognition.
Proceedings of the Interspeech 2016, 2016

Investigation on log-linear interpolation of multi-domain neural network language model.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Robust Online Multi-Channel Speech Recognition.
Proceedings of the 12. ITG Symposium on Speech Communication, 2016

2015
From Feedforward to Recurrent LSTM Neural Networks for Language Modeling.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2015

Improvements in RWTH LVCSR evaluation systems for Polish, Portuguese, English, urdu, and Arabic.
Proceedings of the INTERSPEECH 2015, 2015

Bag-of-words input for long history representation in neural network-based language models for speech recognition.
Proceedings of the INTERSPEECH 2015, 2015

Multilingual features based keyword search for very low-resource languages.
Proceedings of the INTERSPEECH 2015, 2015

Convolutional neural networks for acoustic modeling of raw time signal in LVCSR.
Proceedings of the INTERSPEECH 2015, 2015

Error bounds for context reduction and feature omission.
Proceedings of the INTERSPEECH 2015, 2015

Investigations on sequence training of neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Sequence-discriminative training of recurrent neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Integrating Gaussian mixtures into deep neural networks: Softmax layer with hidden variables.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Investigation of mixture splitting concept for training linear bottlenecks of deep neural network acoustic models.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Improved strategies for a zero oov rate LVCSR system.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Unsupervised adaptation of a denoising autoencoder by Bayesian Feature Enhancement for reverberant asr under mismatch conditions.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Speaker adaptive joint training of Gaussian mixture models and bottleneck features.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Multilingual representations for low resource speech recognition and keyword search.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Acoustic modeling with deep neural networks using raw time signal for LVCSR.
Proceedings of the INTERSPEECH 2014, 2014

Data augmentation, feature combination, and multilingual neural networks to improve ASR and KWS performance for low-resource languages.
Proceedings of the INTERSPEECH 2014, 2014

Lattice decoding and rescoring with long-Span neural network language models.
Proceedings of the INTERSPEECH 2014, 2014

rwthlm - the RWTH aachen university neural network language modeling toolkit.
Proceedings of the INTERSPEECH 2014, 2014

RWTH LVCSR systems for quaero and EU-bridge: German, Polish, Spanish and Portuguese.
Proceedings of the INTERSPEECH 2014, 2014

Word pair approximation for more efficient decoding with high-order language models.
Proceedings of the INTERSPEECH 2014, 2014

Open-Lexicon Language Modeling Combining Word and Character Levels.
Proceedings of the 14th International Conference on Frontiers in Handwriting Recognition, 2014

Mean-normalized stochastic gradient for large-scale deep learning.
Proceedings of the IEEE International Conference on Acoustics, 2014

RASR/NN: The RWTH neural network toolkit for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

The RWTH English lecture recognition system.
Proceedings of the IEEE International Conference on Acoustics, 2014

Multilingual MRASTA features for low-resource keyword search and speech recognition systems.
Proceedings of the IEEE International Conference on Acoustics, 2014

A family of discriminative training criteria based on the F-divergence for deep neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Lexical Prefix Tree and WFST: A Comparison of Two Dynamic Search Concepts for LVCSR.
IEEE Trans. Audio, Speech & Language Processing, 2013

Novel tight classification error bounds under mismatch conditions based on f-Divergence.
Proceedings of the 2013 IEEE Information Theory Workshop, 2013

Multilingual hierarchical MRASTA features for ASR.
Proceedings of the INTERSPEECH 2013, 2013

Training log-linear acoustic models in higher-order polynomial feature space for speech recognition.
Proceedings of the INTERSPEECH 2013, 2013

Feature-rich sub-lexical language models using a maximum entropy approach for German LVCSR.
Proceedings of the INTERSPEECH 2013, 2013

Relative error bounds for statistical classifiers based on the f-divergence.
Proceedings of the INTERSPEECH 2013, 2013

Morpheme level hierarchical pitman-yor class-based language models for LVCSR of morphologically rich languages.
Proceedings of the INTERSPEECH 2013, 2013

Improving LVCSR with hidden conditional random fields for grapheme-to-phoneme conversion.
Proceedings of the INTERSPEECH 2013, 2013

Development of the RWTH transcription system for slovenian.
Proceedings of the INTERSPEECH 2013, 2013

A critical evaluation of stochastic algorithms for convex optimization.
Proceedings of the IEEE International Conference on Acoustics, 2013

Deep hierarchical bottleneck MRASTA features for LVCSR.
Proceedings of the IEEE International Conference on Acoustics, 2013

Investigation on cross- and multilingual MLP features under matched and mismatched acoustical conditions.
Proceedings of the IEEE International Conference on Acoustics, 2013

Comparison of feedforward and recurrent neural network language models.
Proceedings of the IEEE International Conference on Acoustics, 2013

Feature combination and stacking of recurrent and non-recurrent neural networks for LVCSR.
Proceedings of the IEEE International Conference on Acoustics, 2013

Advanced search space pruning with acoustic look-ahead for WFST based LVCSR.
Proceedings of the IEEE International Conference on Acoustics, 2013

System combination and score normalization for spoken term detection.
Proceedings of the IEEE International Conference on Acoustics, 2013

Open vocabulary handwriting recognition using combined word-level and character-level language models.
Proceedings of the IEEE International Conference on Acoustics, 2013

A high-performance Cantonese keyword search system.
Proceedings of the IEEE International Conference on Acoustics, 2013

Efficient nearly error-less LVCSR decoding based on incremental forward and backward passes.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
WFST Enabled Solutions to ASR Problems: Beyond HMM Decoding.
IEEE Trans. Audio, Speech & Language Processing, 2012

Discriminative Training for Automatic Speech Recognition: Modeling, Criteria, Optimization, Implementation, and Performance.
IEEE Signal Process. Mag., 2012

Does the Cost Function Matter in Bayes Decision Rule?
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Phase difference of filter-stable part-tones as acoustic feature.
Proceedings of the IEEE Statistical Signal Processing Workshop, 2012

Accelerated Batch Learning of Convex Log-linear Models for LVCSR.
Proceedings of the INTERSPEECH 2012, 2012

Context-Dependent MLPs for LVCSR: TANDEM, Hybrid or Both?
Proceedings of the INTERSPEECH 2012, 2012

Non-stationary signal processing and its application in speech recognition.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012

Simultaneous Discriminative Training and Mixture Splitting of HMMs for Speech Recognition.
Proceedings of the INTERSPEECH 2012, 2012

LSTM Neural Networks for Language Modeling.
Proceedings of the INTERSPEECH 2012, 2012

Hierarchical hybrid language models for open vocabulary continuous speech recognition using WFST.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012

Investigation of Maximum Entropy Hybrid Language Models for Open Vocabulary German and Polish LVCSR.
Proceedings of the INTERSPEECH 2012, 2012

Posterior-Scaled MPE: Novel Discriminative Training Criteria.
Proceedings of the INTERSPEECH 2012, 2012

Search Space Pruning Based on Anticipated Path Recombination in LVCSR.
Proceedings of the INTERSPEECH 2012, 2012

Morpheme Level Feature-based Language Models for German LVCSR.
Proceedings of the INTERSPEECH 2012, 2012

Comparison and combination of different CRBE based MLP features for LVCSR.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Silence is golden: Modeling non-speech events in WFST-based dynamic network decoders.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Extended search space pruning in LVCSR.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Joining advantages of word-conditioned and token-passing decoding.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Investigations on the use of morpheme level features in Language Models for Arabic LVCSR.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Basis vector orthogonalization for an improved kernel gradient matching pursuit method.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
On the Relationship Between Bayes Risk and Word Error Rate in ASR.
IEEE Trans. Audio, Speech & Language Processing, 2011

Equivalence of Generative and Log-Linear Models.
IEEE Trans. Audio, Speech & Language Processing, 2011


A Study on Speaker Normalized MLP Features in LVCSR.
Proceedings of the INTERSPEECH 2011, 2011

Log-Linear Optimization of Second-Order Polynomial Features with Subsequent Dimension Reduction for Speech Recognition.
Proceedings of the INTERSPEECH 2011, 2011

On the Estimation of Discount Parameters for Language Model Smoothing.
Proceedings of the INTERSPEECH 2011, 2011

Hybrid Language Models Using Mixed Types of Sub-Lexical Units for Open Vocabulary German LVCSR.
Proceedings of the INTERSPEECH 2011, 2011

Improved Acoustic Feature Combination for LVCSR by Neural Networks.
Proceedings of the INTERSPEECH 2011, 2011

Compound Word Recombination for German LVCSR.
Proceedings of the INTERSPEECH 2011, 2011

Acoustic Look-Ahead for More Efficient Decoding in LVCSR.
Proceedings of the INTERSPEECH 2011, 2011

Morpheme Based Factored Language Models for German LVCSR.
Proceedings of the INTERSPEECH 2011, 2011

Feature selection for log-linear acoustic models.
Proceedings of the IEEE International Conference on Acoustics, 2011

Non-stationary feature extraction for automatic speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

The RWTH 2010 Quaero ASR evaluation system for English, French, and German.
Proceedings of the IEEE International Conference on Acoustics, 2011

Using morpheme and syllable based sub-words for polish LVCSR.
Proceedings of the IEEE International Conference on Acoustics, 2011

A comparative analysis of dynamic network decoding.
Proceedings of the IEEE International Conference on Acoustics, 2011

Exploiting sparseness of backing-off language models for efficient look-ahead in LVCSR.
Proceedings of the IEEE International Conference on Acoustics, 2011

Subspace pursuit method for kernel-log-linear models.
Proceedings of the IEEE International Conference on Acoustics, 2011

A convergence analysis of log-linear training and its application to speech recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

Discriminative splitting of Gaussian/log-linear mixture HMMs for speech recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

Cross-lingual portability of Chinese and english neural network features for French and German LVCSR.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Margin-Based Discriminative Training for String Recognition.
J. Sel. Topics Signal Processing, 2010

Sub-lexical language models for German LVCSR.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Evaluation of automatic transcription systems for the judicial domain.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

A Hybrid Morphologically Decomposed Factored Language Models for Arabic LVCSR.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

A discriminative splitting criterion for phonetic decision trees.
Proceedings of the INTERSPEECH 2010, 2010

On the relation of Bayes risk, word error, and word posteriors in ASR.
Proceedings of the INTERSPEECH 2010, 2010

Revisiting VTLN using linear transformation on conventional MFCC.
Proceedings of the INTERSPEECH 2010, 2010

Hierarchical bottle neck features for LVCSR.
Proceedings of the INTERSPEECH 2010, 2010

Parallel lexical-tree based LVCSR on multi-core processors.
Proceedings of the INTERSPEECH 2010, 2010

The RWTH 2009 quaero ASR evaluation system for English and German.
Proceedings of the INTERSPEECH 2010, 2010

Time conditioned search in automatic speech recognition reconsidered.
Proceedings of the INTERSPEECH 2010, 2010

Discriminative adaptation for log-linear acoustic models.
Proceedings of the INTERSPEECH 2010, 2010

Discriminative HMMS, log-linear models, and CRFS: What is the difference?
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
The RWTH aachen university open source speech recognition system.
Proceedings of the INTERSPEECH 2009, 2009

Development of the GALE 2008 Mandarin LVCSR system.
Proceedings of the INTERSPEECH 2009, 2009

Parallel fast likelihood computation for LVCSR using mixture decomposition.
Proceedings of the INTERSPEECH 2009, 2009

Bayes risk approximations using time overlap with an application to system combination.
Proceedings of the INTERSPEECH 2009, 2009

Log-linear model combination with word-dependent scaling factors.
Proceedings of the INTERSPEECH 2009, 2009

Investigations on convex optimization using log-linear HMMs for digit string recognition.
Proceedings of the INTERSPEECH 2009, 2009

Investigating the use of morphological decomposition and diacritization for improving Arabic LVCSR.
Proceedings of the INTERSPEECH 2009, 2009

Automatic Transcription of Courtroom Recordings in the JUMAS project.
Proceedings of the 2<sup>nd</sup> International Conference on ICT Solutions for Justice, 2009

Audio segmentation for speech recognition using segment features.
Proceedings of the IEEE International Conference on Acoustics, 2009

Modified MPE/MMI in a transducer-based framework.
Proceedings of the IEEE International Conference on Acoustics, 2009

Investigations on features for log-linear acoustic models in continuous speech recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Generalized likelihood ratio discriminant analysis.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
Development of the SRI/nightingale Arabic ASR system.
Proceedings of the INTERSPEECH 2008, 2008

Recent improvements of the RWTH GALE Mandarin LVCSR system.
Proceedings of the INTERSPEECH 2008, 2008

iCNC and iROVER: the limits of improving system combination with classification?
Proceedings of the INTERSPEECH 2008, 2008

On the equivalence of Gaussian and log-linear HMMs.
Proceedings of the INTERSPEECH 2008, 2008

Modified MMI/MPE: a direct evaluation of the margin in speech recognition.
Proceedings of the Machine Learning, 2008

A GIS-like training algorithm for log-linear models with hidden variables.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Using multiple acoustic feature sets for speech recognition.
Speech Communication, 2007

iROVER: Improving System Combination with Classification.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

Hierarchical neural networks feature extraction for LVCSR system.
Proceedings of the INTERSPEECH 2007, 2007

Efficient estimation of speaker-specific projecting feature transforms.
Proceedings of the INTERSPEECH 2007, 2007

The RWTH 2007 TC-STAR evaluation system for european English and Spanish.
Proceedings of the INTERSPEECH 2007, 2007

On the equivalence of Gaussian HMM and Gaussian HMM-like hidden conditional random fields.
Proceedings of the INTERSPEECH 2007, 2007

An improved method for unsupervised training of LVCSR systems.
Proceedings of the INTERSPEECH 2007, 2007

Gammatone Features and Feature Combination for Large Vocabulary Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2007

Cross-Site and Intra-Site ASR System Combination: Comparisons on Lattice and 1-Best Methods.
Proceedings of the IEEE International Conference on Acoustics, 2007

Advances in Arabic broadcast news transcription at RWTH.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Development of the 2007 RWTH Mandarin LVCSR system.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
Feature combination using linear discriminant analysis and its pitfalls.
Proceedings of the INTERSPEECH 2006, 2006

The 2006 RWTH parliamentary speeches transcription system.
Proceedings of the INTERSPEECH 2006, 2006

Frame based system combination and a comparison with weighted ROVER and CNC.
Proceedings of the INTERSPEECH 2006, 2006

2005
Bayes risk minimization using metric loss functions.
Proceedings of the INTERSPEECH 2005, 2005

Investigations on error minimizing training criteria for discriminative training in automatic speech recognition.
Proceedings of the INTERSPEECH 2005, 2005

Articulatory motivated acoustic features for speech recognition.
Proceedings of the INTERSPEECH 2005, 2005

Acoustic Feature Combination for Robust Speech Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Cross Domain Automatic Transcription on the TC-STAR EPPS Corpus.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Discriminative training with tied covariance matrices.
Proceedings of the INTERSPEECH 2004, 2004

2003
Extraction methods of voicing feature for robust speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Robust speech recognition using a voiced-unvoiced feature.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001
Confidence measures for large vocabulary continuous speech recognition.
IEEE Trans. Speech and Audio Processing, 2001

Comparison of discriminative training criteria and optimization methods for speech recognition.
Speech Communication, 2001

Vocal tract normalization equals linear transformation in cepstral space.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Explicit word error minimization using word hypothesis posterior probabilities.
Proceedings of the IEEE International Conference on Acoustics, 2001

Using phase spectrum information for improved speech recognition performance.
Proceedings of the IEEE International Conference on Acoustics, 2001

Computing Mel-frequency cepstral coefficients on the power spectrum.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
Investigations on discriminative training criteria.
PhD thesis, 2000

The RWTH Large Vocabulary Speech Recognition System for Spontaneous Speech.
Proceedings of the KONVENS 2000 / Sprachkommunikation, 2000

Speech recognition using context conditional word posterior probabilities.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Using posterior word probabilities for improved speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2000

Recent improvements of the RWTH large vocabulary speech recognition system on spontaneous speech.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
A combined maximum mutual information and maximum likelihood approach for mixture density splitting.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Discriminative Training of Gaussian Mixtures for Image Object Recognition.
Proceedings of the Mustererkennung 1999, 1999

1998
Using word probabilities as confidence measures.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Comparison of discriminative training criteria.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Comparison of optimization methods for discriminative training criteria.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997


  Loading...