Shigeki Karita

According to our database, Shigeki Karita authored at least 27 papers between 2015 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
Lenient Evaluation of Japanese Speech Recognition: Modeling Naturally Occurring Spelling Inconsistency.
CoRR, 2023

LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus.
CoRR, 2023

Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

2022
SNRi Target Training for Joint Speech Enhancement and Recognition.
Proceedings of the Interspeech 2022, 2022

Knowledge Transfer from Large-Scale Pretrained Language Models to End-To-End Speech Recognizers.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
DF-Conformer: Integrated Architecture of Conv-Tasnet and Conformer Using Linear Complexity Self-Attention for Speech Enhancement.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Unsupervised Learning of Disentangled Speech Content and Style Representation.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

A Comparative Study on Neural Architectures and Training Methods for Japanese Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

2020
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans.
CoRR, 2020

Self-Distillation for Improving CTC-Transformer-Based ASR Systems.
Proceedings of the Interspeech 2020, 2020

ESPnet-ST: All-in-One Speech Translation Toolkit.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

2019
Improved Deep Duel Model for Rescoring N-Best Speech Recognition List Using Backward LSTMLM and Ensemble Encoders.
Proceedings of the Interspeech 2019, 2019

Improving Transformer-Based End-to-End Speech Recognition with Connectionist Temporal Classification and Language Model Integration.
Proceedings of the Interspeech 2019, 2019

End-to-End SpeakerBeam for Single Channel Target Speech Recognition.
Proceedings of the Interspeech 2019, 2019

Semi-supervised End-to-end Speech Recognition Using Text-to-speech and Autoencoders.
Proceedings of the IEEE International Conference on Acoustics, 2019

A Comparative Study on Transformer vs RNN in Speech Applications.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Semi-Supervised End-to-End Speech Recognition.
Proceedings of the Interspeech 2018, 2018

Auxiliary Feature Based Adaptation of End-to-end ASR Systems.
Proceedings of the Interspeech 2018, 2018

Rescoring N-Best Speech Recognition List Based on One-on-One Hypothesis Comparison Using Encoder-Classifier Model.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Sequence Training of Encoder-Decoder Model Using Policy Gradient for End-to-End Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Frame-by-Frame Closed-Form Update for Mask-Based Adaptive MVDR Beamforming.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Unfolded Deep Recurrent Convolutional Neural Network with Jump Ahead Connections for Acoustic Modeling.
Proceedings of the Interspeech 2017, 2017

Forward-Backward Convolutional LSTM for Acoustic Modeling.
Proceedings of the Interspeech 2017, 2017

Online meeting recognition in noisy environments with time-frequency mask based MVDR beamforming.
Proceedings of the Hands-free Speech Communications and Microphone Arrays, 2017

2015
Owner authentication for mobile devices using motion gestures based on multi-owner template update.
Proceedings of the 2015 IEEE International Conference on Multimedia & Expo Workshops, 2015

Far-field speech recognition using CNN-DNN-HMM with convolution in time.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

