Sebastian Stüker

Affiliations:
  • Karlsruhe Institute of Technology, IFA, Germany


According to our database1, Sebastian Stüker authored at least 124 papers between 2003 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Multi-stage Large Language Model Correction for Speech Recognition.
CoRR, 2023


2022

2021
Text and Synthetic Data for Domain Adaptation in End-to-End Speech Recognition.
Proceedings of the Speech and Computer - 23rd International Conference, 2021

Multilingual Speech Translation KIT @ IWSLT2021.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

KIT's IWSLT 2021 Offline Speech Translation System.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021


Efficient Weight Factorization for Multilingual Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Super-Human Performance in Online Low-Latency Recognition of Conversational Speech.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

ELITR Multilingual Live Subtitling: Demo and Strategy.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, 2021

Instant One-Shot Word-Learning for Context-Specific Neural Sequence-to-Sequence Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Speech Technology for Unwritten Languages.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Low Latency ASR for Simultaneous Speech Translation.
CoRR, 2020

Toward Cross-Domain Speech Recognition with End-to-End Models.
CoRR, 2020

German-Arabic Speech-to-Speech Translation for Psychiatric Diagnosis.
Proceedings of the Fifth Arabic Natural Language Processing Workshop, 2020

DaCToR: A Data Collection Tool for the RELATER Project.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Removing European Language Barriers with Innovative Machine Translation Technology.
Proceedings of the 1st International Workshop on Language Technology Platforms, 2020

KIT's IWSLT 2020 SLT Translation System.
Proceedings of the 17th International Conference on Spoken Language Translation, 2020


Relative Positional Encoding for Speech Recognition and Direct Translation.
Proceedings of the Interspeech 2020, 2020

High Performance Sequence-to-Sequence Model for Streaming Speech Recognition.
Proceedings of the Interspeech 2020, 2020

Improving Sequence-To-Sequence Speech Recognition Training with On-The-Fly Data Augmentation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

ELITR: European Live Translator.
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2020

2019
Learning Shared Encoding Representation for End-to-End Speech Recognition Models.
CoRR, 2019

The IWSLT 2019 KIT Speech Translation System.
Proceedings of the 16th International Conference on Spoken Language Translation, 2019

The IWSLT 2019 Evaluation Campaign.
Proceedings of the 16th International Conference on Spoken Language Translation, 2019

Neural Codes to Factor Language in Multilingual Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Open Source Toolkit for Speech to Text Translation.
Prague Bull. Math. Linguistics, 2018

Building Real-Time Speech Recognition Without CMVN.
Proceedings of the Speech and Computer - 20th International Conference, 2018

BULBasaa: A Bilingual Basaa-French Speech Corpus for the Evaluation of Language Documentation Tools.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

A Very Low Resource Language Speech Corpus for Computational Language Documentation Experiments.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

KIT's IWSLT 2018 SLT Translation System.
Proceedings of the 15th International Conference on Spoken Language Translation, 2018

The IWSLT 2018 Evaluation Campaign.
Proceedings of the 15th International Conference on Spoken Language Translation, 2018

Self-Attentional Acoustic Models.
Proceedings of the Interspeech 2018, 2018

Term Extraction via Neural Sequence Labeling a Comparative Evaluation of Strategies Using Recurrent Neural Networks.
Proceedings of the Interspeech 2018, 2018

Neural Language Codes for Multilingual Acoustic Models.
Proceedings of the Interspeech 2018, 2018

Linguistic Unit Discovery from Multi-Modal Inputs in Unwritten Languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multilingual Adaptation of RNN Based ASR Systems.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

KIT Lecture Translator: Multilingual Speech Translation with One-Shot Learning.
Proceedings of the COLING 2018, 2018

Parameter Optimization for CTC Acoustic Models in a Less-resourced Scenario: An Empirical Study.
Proceedings of the 13th ITG Symposium on Speech Communication, 2018

2017
Phonemic and Graphemic Multilingual CTC Based Speech Recognition.
CoRR, 2017

Language Adaptive Multilingual CTC Speech Recognition.
Proceedings of the Speech and Computer - 19th International Conference, 2017

The 2017 KIT IWSLT Speech-to-Text Systems for English and German.
Proceedings of the 14th International Conference on Spoken Language Translation, 2017

Overview of the IWSLT 2017 Evaluation Campaign.
Proceedings of the 14th International Conference on Spoken Language Translation, 2017

Yeah, Right, Uh-Huh: A Deep Learning Backchannel Predictor.
Proceedings of the Advanced Social Interaction with Agents, 2017

Comparison of Decoding Strategies for CTC Acoustic Models.
Proceedings of the Interspeech 2017, 2017

Enhancing Backchannel Prediction Using Word Embeddings.
Proceedings of the Interspeech 2017, 2017

Towards phoneme inventory discovery for documentation of unwritten languages.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

DBLSTM based multilingual articulatory feature extraction for language documentation.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016

Lecture Translator - Speech translation framework for simultaneous lecture translation.
Proceedings of the Demonstrations Session, 2016

Evaluation of the KIT Lecture Translation System.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

The 2016 KIT IWSLT Speech-to-Text Systems for English and German.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

The IWSLT 2016 Evaluation Campaign.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

Towards Improving Low-Resource Speech Recognition Using Articulatory and Language Features.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

Unsupervised Phoneme Segmentation of Previously Unseen Languages.
Proceedings of the Interspeech 2016, 2016

Dynamic Transcription for Low-Latency Speech Translation.
Proceedings of the Interspeech 2016, 2016

Language Adaptive DNNs for Improved Low Resource Speech Recognition.
Proceedings of the Interspeech 2016, 2016

Lightly Supervised Quality Estimation.
Proceedings of the COLING 2016, 2016

Training Deep Neural Networks for Reverberation Robust Speech Recognition.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

Language Feature Vectors for Resource Constraint Speech Recognition.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

Growing a Deep Neural Network Acoustic Model with Singular Value Decomposition.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

Phoneme Boundary Detection using Deep Bidirectional LSTMs.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

2015
Preparing children's writing database for automated processing.
Proceedings of the Language Teaching, 2015

Evaluation of Crowdsourced User Input Data for Spoken Dialog Systems.
Proceedings of the SIGDIAL 2015 Conference, 2015

The 2015 KIT IWSLT speech-to-text systems for English and German.
Proceedings of the 12th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2015, 2015

The IWSLT 2015 Evaluation Campaign.
Proceedings of the 12th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2015, 2015

Gaussian free cluster tree construction using deep neural network.
Proceedings of the INTERSPEECH 2015, 2015

Semi-supervised training in low-resource ASR and KWS.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Using Neural Networks for Data-Driven Backchannel Prediction: A Survey on Input Features and Training Techniques.
Proceedings of the Human-Computer Interaction: Interaction Technologies, 2015

A Semi-Automatic Word-Level Annotation and Transcription Tool for Spelling Error Categories.
Proceedings of the HCI International 2015 - Posters' Extended Abstracts, 2015

2014
An automatic system for the simultaneous translation of lectures.
J. Cheminformatics, 2014

A Corpus of Spontaneous Speech in Lectures: The KIT Lecture Corpus for Spoken Language Processing and Translation.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

A Database of Freely Written Texts of German School Students for the Purpose of Automatic Spelling Error Classification.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

The 2014 KIT IWSLT speech-to-text systems for English, German and Italian.
Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014

Report on the 11th IWSLT evaluation campaign.
Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014

Multilingual deep bottle neck features: a study on language selection and training techniques.
Proceedings of the 11th International Workshop on Spoken Language Translation: Papers, 2014

Training time reduction and performance improvements from multilingual techniques on the BABEL ASR task.
Proceedings of the IEEE International Conference on Acoustics, 2014

Multilingual shifting deep bottleneck features for low-resource ASR.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Segmentation of Telephone Speech Based on Speech and Non-speech Models.
Proceedings of the Speech and Computer - 15th International Conference, 2013

The 2013 KIT Quaero speech-to-text system for French.
Proceedings of the 10th International Workshop on Spoken Language Translation: Papers, 2013

Maximum entropy language modeling for Russian ASR.
Proceedings of the 10th International Workshop on Spoken Language Translation: Papers, 2013

The 2013 KIT IWSLT speech-to-text systems for German and English.
Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013

Incremental unsupervised training for university lecture recognition.
Proceedings of the 10th International Workshop on Spoken Language Translation: Papers, 2013

Report on the 10th IWSLT evaluation campaign.
Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013

Slightly Supervised Adaptation of Acoustic Models on Captioned BBC Weather Forecasts.
Proceedings of the First Workshop on Speech, 2013

A real-world system for simultaneous translation of German lectures.
Proceedings of the INTERSPEECH 2013, 2013

2012
The KIT Lecture Corpus for Speech Translation.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

The 2012 KIT and KIT-NAIST English ASR systems for the IWSLT evaluation.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

Evaluation of interactive user corrections for lecture transcription.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

The KIT-NAIST (contrastive) English ASR system for IWSLT 2012.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

Overview of the IWSLT 2012 evaluation campaign.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

A hybrid phonotactic language identification system with an SVM back-end for simultaneous lecture translation.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Speech technology-based framework for quantitative analysis of German spelling errors in freely composed children's texts.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2011

The 2011 KIT English ASR system for the IWSLT evaluation.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011


The 2011 KIT QUAERO speech-to-text system for Spanish.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

Overview of the IWSLT 2011 evaluation campaign.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

Towards Context-Dependent Phonetic Spelling Error Correction in Children's Freely Composed Text for Diagnostic and Pedagogical Purposes.
Proceedings of the INTERSPEECH 2011, 2011

Quaero 2010 Speech-to-Text Evaluation Systems.
Proceedings of the High Performance Computing in Science and Engineering '11, 2011

2010
Spoken news queries over the world wide web.
Proceedings of the 2010 International Workshop on Searching Spontaneous Conversational Speech, 2010

Overview of the IWSLT 2010 evaluation campaign.
Proceedings of the 2010 International Workshop on Spoken Language Translation, 2010

Towards social integration of humanoid robots by conversational concept learning.
Proceedings of the 10th IEEE-RAS International Conference on Humanoid Robots, 2010

Quaero Speech-to-Text and Text Translation Evaluation Systems.
Proceedings of the High Performance Computing in Science and Engineering '10, 2010

2009
Acoustic modelling for under-resourced languages.
PhD thesis, 2009

Human translations guided language discovery for ASR systems.
Proceedings of the INTERSPEECH 2009, 2009

2008
Towards human translations guided language discovery for ASR systems.
Proceedings of the First International Workshop on Spoken Languages Technologies for Under-Resourced Languages, 2008

Integrating Thai grapheme based acoustic models into the ML-MIX framework - for language independent and cross-language ASR.
Proceedings of the First International Workshop on Spoken Languages Technologies for Under-Resourced Languages, 2008

Modified polyphone decision tree specialization for porting multilingual Grapheme based ASR systems to new languages.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
The ISL 2007 English speech transcription system for european parliament speeches.
Proceedings of the INTERSPEECH 2007, 2007

Speech Translation Enhanced ASR for European Parliament Speeches - On the Influence of ASR Performance on Speech Translation.
Proceedings of the IEEE International Conference on Acoustics, 2007

The ISL RT-07 Speech-to-Text System.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

2006
Speech-to-Speech Translation Services for the Olympic Games 2008.
Proceedings of the Machine Learning for Multimodal Interaction, 2006

The ISL RT-06S Speech-to-Text System.
Proceedings of the Machine Learning for Multimodal Interaction, 2006

Cross-system adaptation and combination for continuous speech recognition: the influence of phoneme set and acoustic front-end.
Proceedings of the INTERSPEECH 2006, 2006

Advances in lecture recognition: the ISL RT-06s evaluation system.
Proceedings of the INTERSPEECH 2006, 2006

Open Domain Speech Recognition & Translation: Lectures and Speeches.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Document driven machine translation enhanced ASR.
Proceedings of the INTERSPEECH 2005, 2005

Rapid porting of ASR-systems to mobile devices.
Proceedings of the INTERSPEECH 2005, 2005

2004
Towards language portability in statistical speech translation.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Integrating multilingual articulatory features into speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Grapheme based speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Multilingual articulatory features.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003


  Loading...