Shinsuke Sakai

According to our database, Shinsuke Sakai authored at least 50 papers between 1990 and 2022.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2022
Distilling the Knowledge of BERT for CTC-based ASR.
CoRR, 2022

Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM.
Proceedings of the Interspeech 2022, 2022

2021
Data Augmentation for ASR Using TTS Via a Discrete Representation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

ASR Rescoring and Confidence Estimation with Electra.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

An End-To-End Model from Speech to Clean Transcript for Parliamentary Meetings.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Generative Adversarial Training Data Adaptation for Very Low-Resource Automatic Speech Recognition.
Proceedings of the Interspeech 2020, 2020

Distilling the Knowledge of BERT for Sequence-to-Sequence ASR.
Proceedings of the Interspeech 2020, 2020

2019
Multi-speaker Sequence-to-sequence Speech Synthesis for Data Augmentation in Acoustic-to-word Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Leveraging Sequence-to-Sequence Speech Synthesis for Enhancing Acoustic-to-Word Speech Recognition.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Encoder Transfer for Attention-based Acoustic-to-word Speech Recognition.
Proceedings of the Interspeech 2018, 2018

Forward-Backward Attention Decoder.
Proceedings of the Interspeech 2018, 2018

2017
Combined Multi-Channel NMF-Based Robust Beamforming for Noisy Speech Recognition.
Proceedings of the Interspeech 2017, 2017

Semi-supervised ensemble DNN acoustic model training.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Cross-domain speech recognition using nonparallel corpora with cycle-consistent adversarial networks.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Joint Optimization of Denoising Autoencoder and DNN Acoustic Model Based on Multi-Target Learning for Noisy Speech Recognition.
Proceedings of the Interspeech 2016, 2016

2015
Reverberant speech recognition combining deep neural networks and deep autoencoders augmented with a phone-class feature.
EURASIP J. Adv. Signal Process., 2015

Speech dereverberation using long short-term memory.
Proceedings of the INTERSPEECH 2015, 2015

Deep autoencoders augmented with phone-class feature for reverberant speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Exploring deep neural networks and deep autoencoders in reverberant speech recognition.
Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014

2013
Admissible Stopping in Viterbi Beam Search for Unit Selection Speech Synthesis.
IEICE Trans. Inf. Syst., 2013

A-STAR: Toward translating Asian spoken languages.
Comput. Speech Lang., 2013

2011
Probabilistic Concatenation Modeling for Corpus-Based Speech Synthesis.
IEICE Trans. Inf. Syst., 2011

A sampling-based environment population projection approach for rapid acoustic model adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Improved training of excitation for HMM-based parametric speech synthesis.
Proceedings of the INTERSPEECH 2010, 2010

2009
Hyperbolic structure of fundamental frequency contour.
Proceedings of the 3rd International Universal Communication Symposium, 2009

A close look into the probabilistic concatenation model for corpus-based speech synthesis.
Proceedings of the INTERSPEECH 2009, 2009

A decision tree-based clustering approach to state definition in an excitation modeling framework for HMM-based speech synthesis.
Proceedings of the INTERSPEECH 2009, 2009

Optimal learning of P-Layer additive F0 models with cross-validation.
Proceedings of the IEEE International Conference on Acoustics, 2009

CART-based modeling of Chinese tonal patterns with a functional model tracing the fundamental frequency trajectories.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Prosody Modeling from Tone to Intonation in Chinese using a Functional F0 Model.
Proceedings of the ISUC 2008, 2008

Simultaneous Acoustic, Prosodic, and Phrasing Model Training for TTS Conversion Systems.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Frequency Modulation Technique for Prosodic Modification.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Development of Indonesian Large Vocabulary Continuous Speech Recognition System within A-STAR Project.
Proceedings of the Third International Joint Conference on Natural Language Processing, 2008

2007
Communicative speech synthesis with XIMERA: a first step.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

2006
Decision tree-based training of probabilistic concatenation models for corpus-based speech synthesis.
Proceedings of the INTERSPEECH 2006, 2006

2005
Fundamental Frequency Modeling for Speech Synthesis Based on a Statistical Learning Technique.
IEICE Trans. Inf. Syst., 2005

A probabilistic approach to unit selection for corpus-based speech synthesis.
Proceedings of the INTERSPEECH 2005, 2005

Additive Modeling of English F0 Contour for Speech Synthesis.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
F0 modeling with multi-layer additive modeling based on a statistical learning technique.
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004

2000
An automatic interpretation system for travel conversation.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Continuous speech recognition with parse filtering.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1995
Multilingual spoken-language understanding in the MIT Voyager system.
Speech Commun., 1995

1994
An automatic voice dialing system developed on PC speech i/o platform.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

1993
A Bilingual VOYAGER System.
Proceedings of the Human Language Technology: Proceedings of a Workshop Held at Plainsboro, 1993

J-SUMMIT: Japanese spontaneous speech recognition.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

A bilingual Voyager system.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

1992
J-SUMMIT: a Japanese segment-based speech recognition system.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

1990
From interlingua to speech: generating prosodic information from conceptual representation.
Proceedings of the 1990 International Conference on Acoustics, 1990

