David Rybach

According to our database1, David Rybach authored at least 61 papers between 2005 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Multi-Output RNN-T Joint Networks for Multi-Task Learning of ASR and Auxiliary Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2023

Alignment Entropy Regularization.
Proceedings of the IEEE International Conference on Acoustics, 2023

E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model.
Proceedings of the IEEE International Conference on Acoustics, 2023

Large-Scale Language Model Rescoring on Long-Form Data.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Handling Compounding in Mobile Keyboard Input.
CoRR, 2022

Global Normalization for Streaming Speech Recognition in a Modular Framework.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Improving Rare Word Recognition with LM-aware MWER Training.
Proceedings of the Interspeech 2022, 2022

On Adaptive Weight Interpolation of the Hybrid Autoregressive Transducer.
Proceedings of the Interspeech 2022, 2022

E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR.
Proceedings of the Interspeech 2022, 2022


2021
An Efficient Streaming Non-Recurrent On-Device End-to-End Model with Improvements to Rare-Word Modeling.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Lookup-Table Recurrent Language Models for Long Tail Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

A Hybrid Seq-2-Seq ASR Design for On-Device and Server Applications.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Less is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency.
CoRR, 2020

Emitting Word Timings with End-to-End Models.
Proceedings of the Interspeech 2020, 2020

Low Latency Speech Recognition Using End-to-End Prefetching.
Proceedings of the Interspeech 2020, 2020

Hybrid Autoregressive Transducer (HAT).
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020


2019
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling.
CoRR, 2019

Model Unit Exploration for Sequence-to-Sequence Speech Recognition.
CoRR, 2019

Shallow-Fusion End-to-End Contextual Biasing.
Proceedings of the Interspeech 2019, 2019


On the Choice of Modeling Unit for Sequence-to-Sequence Speech Recognition.
Proceedings of the Interspeech 2019, 2019


Recognizing Long-Form Speech Using Streaming End-to-End Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Contextual Speech Recognition in End-to-end Neural Network Systems Using Beam Search.
Proceedings of the Interspeech 2018, 2018

No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica in End-to-End Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Mobile Keyboard Input Decoding with Finite-State Transducers.
CoRR, 2017

Transliterated Mobile Keyboard Input via Weighted Finite-State Transducers.
Proceedings of the 13th International Conference on Finite State Methods and Natural Language Processing, 2017

On lattice generation for large vocabulary speech recognition.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Personalized speech recognition on mobile devices.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Composition-based on-the-fly rescoring for salient n-gram biasing.
Proceedings of the INTERSPEECH 2015, 2015

Bringing contextual information to google speech recognition.
Proceedings of the INTERSPEECH 2015, 2015

Multitask learning and system combination for automatic speech recognition.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Investigations on search methods for speech recognition using weighted finite-state transducers.
PhD thesis, 2014

Direct construction of compact context-dependency transducers from data.
Comput. Speech Lang., 2014

Context dependent state tying for speech recognition using deep neural network acoustic models.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Lexical Prefix Tree and WFST: A Comparison of Two Dynamic Search Concepts for LVCSR.
IEEE Trans. Speech Audio Process., 2013

Open vocabulary handwriting recognition using combined word-level and character-level language models.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
WFST Enabled Solutions to ASR Problems: Beyond HMM Decoding.
IEEE Trans. Speech Audio Process., 2012

Hierarchical hybrid language models for open vocabulary continuous speech recognition using WFST.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012

Silence is golden: Modeling non-speech events in WFST-based dynamic network decoders.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Joining advantages of word-conditioned and token-passing decoding.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
A comparative analysis of dynamic network decoding.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Direct construction of compact context-dependency transducers from data.
Proceedings of the INTERSPEECH 2010, 2010

Parallel lexical-tree based LVCSR on multi-core processors.
Proceedings of the INTERSPEECH 2010, 2010

2009
Transcription of Catalan Broadcast Conversation.
Proceedings of the Text, Speech and Dialogue, 12th International Conference, 2009

The RWTH aachen university open source speech recognition system.
Proceedings of the INTERSPEECH 2009, 2009

Parallel fast likelihood computation for LVCSR using mixture decomposition.
Proceedings of the INTERSPEECH 2009, 2009

Investigations on convex optimization using log-linear HMMs for digit string recognition.
Proceedings of the INTERSPEECH 2009, 2009

Investigating the use of morphological decomposition and diacritization for improving Arabic LVCSR.
Proceedings of the INTERSPEECH 2009, 2009

Writer Adaptive Training and Writing Variant Model Refinement for Offline Arabic Handwriting Recognition.
Proceedings of the 10th International Conference on Document Analysis and Recognition, 2009

Audio segmentation for speech recognition using segment features.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Development of the SRI/nightingale Arabic ASR system.
Proceedings of the INTERSPEECH 2008, 2008

2007
The RWTH 2007 TC-STAR evaluation system for european English and Spanish.
Proceedings of the INTERSPEECH 2007, 2007

Speech recognition techniques for a sign language recognition system.
Proceedings of the INTERSPEECH 2007, 2007

Advances in Arabic broadcast news transcription at RWTH.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
Tracking Using Dynamic Programming for Appearance-Based Sign Language Recognition.
Proceedings of the Seventh IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2006), 2006

Geometric Features for Improving Continuous Appearance-based Sign Language Recognition.
Proceedings of the British Machine Vision Conference 2006, 2006

2005
Erweiterung eines holistischen statistischen Bilderkenners zur Verwendung von mehreren Merkmalen.
Proceedings of the Informatiktage 2005, 2005


  Loading...