Georg Heigold

According to our database, Georg Heigold authored at least 64 papers between 2006 and 2023.

Bibliography

2023
Video OWL-ViT: Temporally-consistent open-world localization in video.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Conditional Object-Centric Learning from Video.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale.
Proceedings of the 9th International Conference on Learning Representations, 2021

ViViT: A Video Vision Transformer.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Object-Centric Learning with Slot Attention.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2018
How Robust Are Character-Based Word Embeddings in Tagging and MT Against Wrod Scramlbing or Randdm Nouse?
Proceedings of the 13th Conference of the Association for Machine Translation in the Americas, 2018

2017
A Linguistic Evaluation of Rule-Based, Phrase-Based, and Neural MT Engines.
Prague Bull. Math. Linguistics, 2017

How Robust Are Character-Based Word Embeddings in Tagging and MT Against Wrod Scramlbing or Randdm Nouse?
CoRR, 2017

Cross-lingual Character-Level Neural Morphological Tagging.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Common Round: Application of Language Technologies to Large-Scale Web Debates.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

An Extensive Empirical Evaluation of Character-Based Morphological Tagging for 14 Languages.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

2016
Neural Morphological Tagging from Characters for Morphologically Rich Languages.
CoRR, 2016

End-to-end text-dependent speaker verification.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Scaling character-based morphological tagging to fourteen languages.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

2015
A Gaussian Mixture Model layer jointly optimized with discriminative features within a Deep Neural Network architecture.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Sequence discriminative distributed training of long short-term memory recurrent neural networks.
Proceedings of the INTERSPEECH 2014, 2014

Asynchronous stochastic optimization for sequence training of deep neural networks: towards big data.
Proceedings of the INTERSPEECH 2014, 2014

Word embeddings for speech recognition.
Proceedings of the INTERSPEECH 2014, 2014

Asynchronous, online, GMM-free training of a context dependent acoustic model for speech recognition.
Proceedings of the INTERSPEECH 2014, 2014

GMM-free DNN acoustic model training.
Proceedings of the IEEE International Conference on Acoustics, 2014

Asynchronous stochastic optimization for sequence training of deep neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

Small-footprint keyword spotting using deep neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Optimization Algorithms and Applications for Speech and Language Processing.
IEEE Trans. Speech Audio Process., 2013

Investigations on an EM-Style Optimization Algorithm for Discriminative Training of HMMs.
IEEE ACM Trans. Audio Speech Lang. Process., 2013

Multiframe deep neural networks for acoustic modeling.
Proceedings of the IEEE International Conference on Acoustics, 2013

An empirical study of learning rates in deep neural networks for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Deep neural networks with auxiliary Gaussian mixture models for real-time speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Multilingual acoustic models using distributed deep neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
WFST Enabled Solutions to ASR Problems: Beyond HMM Decoding.
IEEE Trans. Speech Audio Process., 2012

Discriminative Training for Automatic Speech Recognition: Modeling, Criteria, Optimization, Implementation, and Performance.
IEEE Signal Process. Mag., 2012

Latent Log-Linear Models for Handwritten Digit Classification.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Exemplar-based speech recognition in a rescoring approach.
Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012

Posterior-Scaled MPE: Novel Discriminative Training Criteria.
Proceedings of the INTERSPEECH 2012, 2012

Overview of large scale optimization for discriminative training in speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Investigations on exemplar-based features for speech recognition towards thousands of hours of unsupervised, noisy data.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Equivalence of Generative and Log-Linear Models.
IEEE Trans. Speech Audio Process., 2011

Confidence- and margin-based MMI/MPE discriminative training for off-line handwriting recognition.
Int. J. Document Anal. Recognit., 2011

EM-style optimization of hidden conditional random fields for grapheme-to-phoneme conversion.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
A log-linear discriminative modeling framework for speech recognition.
PhD thesis, 2010

Object classification by fusing SVMs and Gaussian mixtures.
Pattern Recognit., 2010

Speech Recognition With Flat Direct Models.
IEEE J. Sel. Top. Signal Process., 2010

Margin-Based Discriminative Training for String Recognition.
IEEE J. Sel. Top. Signal Process., 2010

A discriminative splitting criterion for phonetic decision trees.
Proceedings of the INTERSPEECH 2010, 2010

Discriminative HMMs, log-linear models, and CRFs: What is the difference?
Proceedings of the IEEE International Conference on Acoustics, 2010

Eine Formulierung für den log-linearen, diskriminativen Ansatz in der Spracherkennung.
Proceedings of the Ausgezeichnete Informatikdissertationen 2010, 2010

2009
The RWTH Aachen University open source speech recognition system.
Proceedings of the INTERSPEECH 2009, 2009

Development of the GALE 2008 Mandarin LVCSR system.
Proceedings of the INTERSPEECH 2009, 2009

Investigations on convex optimization using log-linear HMMs for digit string recognition.
Proceedings of the INTERSPEECH 2009, 2009

Optimizing CRFs for SLU tasks in various languages using modified training criteria.
Proceedings of the INTERSPEECH 2009, 2009

Confidence-Based Discriminative Training for Model Adaptation in Offline Arabic Handwriting Recognition.
Proceedings of the 10th International Conference on Document Analysis and Recognition, 2009

A flat direct model for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

Modified MPE/MMI in a transducer-based framework.
Proceedings of the IEEE International Conference on Acoustics, 2009

Investigations on features for log-linear acoustic models in continuous speech recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Generalized likelihood ratio discriminant analysis.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
Recent improvements of the RWTH GALE Mandarin LVCSR system.
Proceedings of the INTERSPEECH 2008, 2008

On the equivalence of Gaussian and log-linear HMMs.
Proceedings of the INTERSPEECH 2008, 2008

SVMs, Gaussian mixtures, and their generative/discriminative fusion.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Modified MMI/MPE: a direct evaluation of the margin in speech recognition.
Proceedings of the 25th International Conference on Machine Learning (ICML 2008), 2008

A GIS-like training algorithm for log-linear models with hidden variables.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
The RWTH 2007 TC-STAR evaluation system for European English and Spanish.
Proceedings of the INTERSPEECH 2007, 2007

On the equivalence of Gaussian HMM and Gaussian HMM-like hidden conditional random fields.
Proceedings of the INTERSPEECH 2007, 2007

Speech recognition with state-based nearest neighbour classifiers.
Proceedings of the INTERSPEECH 2007, 2007

Development of the 2007 RWTH Mandarin LVCSR system.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
The 2006 RWTH parliamentary speeches transcription system.
Proceedings of the INTERSPEECH 2006, 2006

