David Rybach

Orcid: 0009-0006-6720-1335

According to our database¹, David Rybach authored at least 61 papers between 2005 and 2023.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2023

Multi-Output RNN-T Joint Networks for Multi-Task Learning of ASR and Auxiliary Tasks.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Alignment Entropy Regularization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Large-Scale Language Model Rescoring on Long-Form Data.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Handling Compounding in Mobile Keyboard Input.

[BibT_eX]

[DOI]

CoRR, 2022

Global Normalization for Streaming Speech Recognition in a Modular Framework.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Improving Rare Word Recognition with LM-aware MWER Training.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

On Adaptive Weight Interpolation of the Hybrid Autoregressive Transducer.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Improving The Latency And Quality Of Cascaded Encoders.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

An Efficient Streaming Non-Recurrent On-Device End-to-End Model with Improvements to Rare-Word Modeling.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Lookup-Table Recurrent Language Models for Long Tail Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

A Hybrid Seq-2-Seq ASR Design for On-Device and Server Applications.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Less is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency.

[BibT_eX]

[DOI]

CoRR, 2020

Emitting Word Timings with End-to-End Models.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Low Latency Speech Recognition Using End-to-End Prefetching.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Hybrid Autoregressive Transducer (HAT).

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Streaming On-Device End-To-End Model Surpassing Server-Side Conventional Model Quality and Latency.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling.

[BibT_eX]

[DOI]

CoRR, 2019

Model Unit Exploration for Sequence-to-Sequence Speech Recognition.

[BibT_eX]

[DOI]

CoRR, 2019

Shallow-Fusion End-to-End Contextual Biasing.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Two-Pass End-to-End Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

On the Choice of Modeling Unit for Sequence-to-Sequence Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Streaming End-to-end Speech Recognition for Mobile Devices.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Recognizing Long-Form Speech Using Streaming End-to-End Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018

Contextual Speech Recognition in End-to-end Neural Network Systems Using Beam Search.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica in End-to-End Models.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Mobile Keyboard Input Decoding with Finite-State Transducers.

[BibT_eX]

[DOI]

CoRR, 2017

Transliterated Mobile Keyboard Input via Weighted Finite-State Transducers.

[BibT_eX]

[DOI]

Proceedings of the 13th International Conference on Finite State Methods and Natural Language Processing, 2017

On lattice generation for large vocabulary speech recognition.

[BibT_eX]

[DOI]

David Rybach

Michael Riley

Johan Schalkwyk

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016

Personalized speech recognition on mobile devices.

[BibT_eX]

[DOI]

Ian McGraw

Rohit Prabhavalkar

Raziel Alvarez

Montse Gonzalez Arenas

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Composition-based on-the-fly rescoring for salient n-gram biasing.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Bringing contextual information to google speech recognition.

[BibT_eX]

[DOI]

Petar S. Aleksic

Mohammadreza Ghodsi

Assaf Hurwitz Michaely

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Multitask learning and system combination for automatic speech recognition.

[BibT_eX]

[DOI]

Olivier Siohan

David Rybach

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

Investigations on search methods for speech recognition using weighted finite-state transducers.

[BibT_eX]

[DOI]

David Rybach

PhD thesis, 2014

Direct construction of compact context-dependency transducers from data.

[BibT_eX]

[DOI]

David Rybach

Michael Riley

Chris Alberti

Comput. Speech Lang., 2014

Context dependent state tying for speech recognition using deep neural network acoustic models.

[BibT_eX]

[DOI]

Michiel Bacchiani

David Rybach

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Lexical Prefix Tree and WFST: A Comparison of Two Dynamic Search Concepts for LVCSR.

[BibT_eX]

[DOI]

David Rybach

Hermann Ney

Ralf Schlüter

IEEE Trans. Speech Audio Process., 2013

Open vocabulary handwriting recognition using combined word-level and character-level language models.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

WFST Enabled Solutions to ASR Problems: Beyond HMM Decoding.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2012

Hierarchical hybrid language models for open vocabulary continuous speech recognition using WFST.

[BibT_eX]

[DOI]

Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012

Silence is golden: Modeling non-speech events in WFST-based dynamic network decoders.

[BibT_eX]

[DOI]

David Rybach

Ralf Schlüter

Hermann Ney

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Joining advantages of word-conditioned and token-passing decoding.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

A comparative analysis of dynamic network decoding.

[BibT_eX]

[DOI]

David Rybach

Ralf Schlüter

Hermann Ney

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

Direct construction of compact context-dependency transducers from data.

[BibT_eX]

[DOI]

David Rybach

Michael Riley

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Parallel lexical-tree based LVCSR on multi-core processors.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009

Transcription of Catalan Broadcast Conversation.

[BibT_eX]

[DOI]

Henrik Schulz

José A. R. Fonollosa

David Rybach

Proceedings of the Text, Speech and Dialogue, 12th International Conference, 2009

The RWTH aachen university open source speech recognition system.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Parallel fast likelihood computation for LVCSR using mixture decomposition.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Investigations on convex optimization using log-linear HMMs for digit string recognition.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Investigating the use of morphological decomposition and diacritization for improving Arabic LVCSR.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Writer Adaptive Training and Writing Variant Model Refinement for Offline Arabic Handwriting Recognition.

[BibT_eX]

[DOI]

Proceedings of the 10th International Conference on Document Analysis and Recognition, 2009

Audio segmentation for speech recognition using segment features.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

Development of the SRI/nightingale Arabic ASR system.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007

The RWTH 2007 TC-STAR evaluation system for european English and Spanish.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Speech recognition techniques for a sign language recognition system.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Advances in Arabic broadcast news transcription at RWTH.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006

Tracking Using Dynamic Programming for Appearance-Based Sign Language Recognition.

[BibT_eX]

[DOI]

Proceedings of the Seventh IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2006), 2006

Geometric Features for Improving Continuous Appearance-based Sign Language Recognition.

[BibT_eX]

[DOI]

Proceedings of the British Machine Vision Conference 2006, 2006

2005

Erweiterung eines holistischen statistischen Bilderkenners zur Verwendung von mehreren Merkmalen.

[BibT_eX]

David Rybach

Daniel Keysers

Hermann Ney

Proceedings of the Informatiktage 2005, 2005

David Rybach

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...