Hagen Soltau

According to our database, Hagen Soltau authored at least 72 papers between 1996 and 2024.

Bibliography

2024
Retrieval Augmented End-to-End Spoken Dialog Models.
CoRR, 2024

2023
SLM: Bridge the thin gap between speech and text foundation models.
CoRR, 2023

Efficient Adapters for Giant Speech Models.
CoRR, 2023

Speech-to-Text Adapter and Speech-to-Entity Retriever Augmented LLMs for Speech Understanding.
CoRR, 2023

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages.
CoRR, 2023

AnyTOD: A Programmable Task-Oriented Dialog System.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

SLM: Bridge the Thin Gap Between Speech and Text Foundation Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Detecting Speech Abnormalities With a Perceiver-Based Sequence Classifier that Leverages a Universal Speech Model.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Speech Aware Dialog System Technology Challenge (DSTC11).
CoRR, 2022

RNN Transducers for Nested Named Entity Recognition with constraints on alignment for long sequences.
CoRR, 2022

Unsupervised Slot Schema Induction for Task-oriented Dialog.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

RNN Transducers for Named Entity Recognition with constraints on alignment for understanding medical conversations.
Proceedings of the Interspeech 2022, 2022

Knowledge-grounded Dialog State Tracking.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Understanding Medical Conversations: Rich Transcription, Confidence Scores & Information Extraction.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Word-Level Confidence Estimation for RNN Transducers.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
The Medical Scribe: Corpus Development and Model Performance Analyses.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

2019
Joint Speech Recognition and Speaker Diarization via Sequence Transduction.
Proceedings of the Interspeech 2019, 2019

Monotonic Recurrent Neural Network Transducer and Decoding Strategies.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2017
Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition.
Proceedings of the Interspeech 2017, 2017

Reducing the computational complexity for whole word models.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2015
Deep Convolutional Neural Networks for Large-scale Speech Tasks.
Neural Networks, 2015

2014
Automatic Speech Recognition.
Proceedings of the Natural Language Processing of Semitic Languages, 2014

Unfolded recurrent neural networks for speech recognition.
Proceedings of the INTERSPEECH 2014, 2014

Removing redundancy from lattices.
Proceedings of the INTERSPEECH 2014, 2014

Analyzing convolutional neural networks for speech activity detection in mismatched acoustic conditions.
Proceedings of the IEEE International Conference on Acoustics, 2014

Joint training of convolutional and non-convolutional neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

A comparison of two optimization techniques for sequence discriminative training of deep neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

Progress in dynamic network decoding.
Proceedings of the IEEE International Conference on Acoustics, 2014

Efficient spoken term detection using confusion networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

Out-of-vocabulary word detection in a speech-to-speech translation system.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Optimization Techniques to Improve Training Speed of Deep Neural Networks for Large Speech Tasks.
IEEE Trans. Speech Audio Process., 2013

Neural network acoustic models for the DARPA RATS program.
Proceedings of the INTERSPEECH 2013, 2013

The IBM speech activity detection system for the DARPA RATS program.
Proceedings of the INTERSPEECH 2013, 2013

Morpheme-based feature-rich language models using Deep Neural Networks for LVCSR of Egyptian Arabic.
Proceedings of the IEEE International Conference on Acoustics, 2013

Exploiting diversity for spoken term detection.
Proceedings of the IEEE International Conference on Acoustics, 2013

Speaker adaptation of neural network acoustic models using i-vectors.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Improvements to Deep Convolutional Neural Networks for LVCSR.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

The IBM keyword search system for the DARPA RATS program.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Boosting systems for large vocabulary continuous speech recognition.
Speech Commun., 2012

Scalable Minimum Bayes Risk Training of Deep Neural Network Acoustic Models Using Distributed Hessian-free Optimization.
Proceedings of the INTERSPEECH 2012, 2012

2011
The IBM 2009 GALE Arabic speech transcription system.
Proceedings of the IEEE International Conference on Acoustics, 2011

From Modern Standard Arabic to Levantine ASR: Leveraging GALE for dialects.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

The IBM 2011 GALE Arabic speech transcription system.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
The IBM Attila speech recognition toolkit.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Discriminative Phonotactics for Dialect Recognition Using Context-Dependent Phone Classifiers.
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010

Boosting systems for LVCSR.
Proceedings of the INTERSPEECH 2010, 2010

Decoding with shrinkage-based language models.
Proceedings of the INTERSPEECH 2010, 2010

The IBM 2008 GALE Arabic speech transcription system.
Proceedings of the IEEE International Conference on Acoustics, 2010

A comparative study on system combination schemes for LVCSR.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Advances in Arabic Speech Transcription at IBM Under the DARPA GALE Program.
IEEE Trans. Speech Audio Process., 2009

Large margin semi-tied covariance transforms for discriminative training.
Proceedings of the IEEE International Conference on Acoustics, 2009

Dynamic network decoding revisited.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
Fast speaker adaptive training for speech recognition.
Proceedings of the INTERSPEECH 2008, 2008

2007
The IBM 2006 Gale Arabic ASR System.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Advances in speech transcription at IBM under the DARPA EARS program.
IEEE Trans. Speech Audio Process., 2006

2005
Compensating hyperarticulation for automatic speech recognition.
PhD thesis, 2005

The IBM 2004 Conversational Telephony System for Rich Transcription.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

fMPE: Discriminatively Trained Features for Speech Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
The 2003 ISL rich transcription system for conversational telephony speech.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2002
Compensating for hyperarticulation by modeling articulatory properties.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Efficient language model lookahead through polymorphic linguistic context assignment.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Advances in meeting recognition.
Proceedings of the First International Conference on Human Language Technology Research, 2001

Speech recognition over netmeeting connections.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Advances in automatic meeting record creation and access.
Proceedings of the IEEE International Conference on Acoustics, 2001

The ISL evaluation system for Verbmobil-II.
Proceedings of the IEEE International Conference on Acoustics, 2001

Speaker compensation with sine-log all-pass transforms.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
Phone dependent modeling of hyperarticulated effects.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Specialized acoustic models for hyperarticulated speech.
Proceedings of the IEEE International Conference on Acoustics, 2000

Confidence measure based language identification.
Proceedings of the IEEE International Conference on Acoustics, 2000

1998
On the influence of hyperarticulated speech on recognition performance.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Recognition of music types.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1996
Automatische Identifizierung spontan gesprochener Sprachen mit neuronalen Netzen.
Proceedings of the Natural Language Processing and Speech Technology, 1996

