Johan Schalkwyk

According to our database, Johan Schalkwyk authored at least 33 papers between 1994 and 2024.

Bibliography

2024
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context.
CoRR, 2024

2023
Gemini: A Family of Highly Capable Multimodal Models.
CoRR, 2023

SLM: Bridge the thin gap between speech and text foundation models.
CoRR, 2023

AudioPaLM: A Large Language Model That Can Speak and Listen.
CoRR, 2023

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages.
CoRR, 2023

Lego-Features: Exporting Modular Encoder Features for Streaming and Deliberation ASR.
Proceedings of the IEEE International Conference on Acoustics, 2023

SLM: Bridge the Thin Gap Between Speech and Text Foundation Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2017
On lattice generation for large vocabulary speech recognition.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Speech Research at Google to Enable Universal Speech Interfaces.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning, 2017

2015
Learning acoustic frame labeling for speech recognition with recurrent neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Long short term memory neural network for keyboard gesture decoding.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2012
Voice Query Refinement.
Proceedings of the INTERSPEECH 2012, 2012

2011
A Filter-Based Algorithm for Efficient Composition of Finite-State Transducers.
Int. J. Found. Comput. Sci., 2011

2010
Filters for Efficient Composition of Weighted Finite-State Transducers.
Proceedings of the Implementation and Application of Automata, 2010

Query language modeling for voice search.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Voice search for development.
Proceedings of the INTERSPEECH 2010, 2010

On-demand language model interpolation for mobile speech input.
Proceedings of the INTERSPEECH 2010, 2010

2009
Semantic context effects in the recognition of acoustically unreduced and reduced words.
Proceedings of the INTERSPEECH 2009, 2009

Language modeling for what-with-where on GOOG-411.
Proceedings of the INTERSPEECH 2009, 2009

A generalized composition algorithm for weighted finite-state transducers.
Proceedings of the INTERSPEECH 2009, 2009

Mobile media search.
Proceedings of the IEEE International Conference on Acoustics, 2009

OpenFst.
Proceedings of the Finite-State Methods and Natural Language Processing, 2009

2008
Deploying GOOG-411: Early lessons in data, measurement, and testing.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
OpenFst: A General and Efficient Weighted Finite-State Transducer Library.
Proceedings of the Implementation and Application of Automata, 2007

2003
Speech recognition with dynamic grammars using finite-state transducers.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

1998
Universal speech tools: the CSLU toolkit.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1997
Experiments with a spoken dialogue system for taking the US census.
Speech Commun., 1997

CSLUsh: an extendible research environment.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1996
Building 10,000 spoken dialogue systems.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Speech recognition using syllable-like units.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Speaker verification with low storage requirements.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1994
A prototype voice-response questionnaire for the U.S. census.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Detecting an imposter in telephone speech.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

