Daisuke Saito

According to our database1, Daisuke Saito authored at least 64 papers between 2008 and 2019.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2019
Many-to-Many and Completely Parallel-Data-Free Voice Conversion Based on Eigenspace DNN.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2019

Rubric to Evaluate Programming Learning of Elementary School Students.
Proceedings of the 50th ACM Technical Symposium on Computer Science Education, 2019

Cooking State Recognition based on Acoustic Event Detection.
Proceedings of the 11th Workshop on Multimedia for Cooking and Eating Activities, 2019

2018
Wasserstein GAN and Waveform Loss-Based Acoustic Model Training for Multi-Speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder.
IEEE Access, 2018

DNN-Based Scoring of Language Learners' Proficiency Using Learners' Shadowings and Native Listeners' Responsive Shadowings.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

A Comparative Study of Statistical Conversion of Face to Voice Based on Their Subjective Impressions.
Proceedings of the Interspeech 2018, 2018

A Study of Objective Measurement of Comprehensibility through Native Speakers' Shadowing of Learners' Utterances.
Proceedings of the Interspeech 2018, 2018

Analysis of Unintentional Signal Propagation in Intra-Body Communication.
Proceedings of the IEEE 7th Global Conference on Consumer Electronics, 2018

A Revisit to Feature Handling for High-quality Voice Conversion Based on Gaussian Mixture Model.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Development and Maintenance of Practical and In-service Systems for Recording Shadowing Utterances and Their Assessment.
Proceedings of the 7th ISCA International Workshop on Speech and Language Technology in Education, 2017

New Features and Effectiveness of Suzuki-kun, the First and Only Prosodic Reading Tutor of Tokyo Japanese.
Proceedings of the 7th ISCA International Workshop on Speech and Language Technology in Education, 2017

Automatic Scoring of Shadowing Speech Based on DNN Posteriors and Their DTW.
Proceedings of the Interspeech 2017, 2017

Acoustic-to-Articulatory Mapping Based on Mixture of Probabilistic Canonical Correlation Analysis.
Proceedings of the Interspeech 2017, 2017

Use of Global and Acoustic Features Associated with Contextual Factors to Adapt Language Models for Spontaneous Speech Recognition.
Proceedings of the Interspeech 2017, 2017

Parallel-Data-Free Many-to-Many Voice Conversion Based on DNN Integrated with Eigenspace Using a Non-Parallel Speech Corpus.
Proceedings of the Interspeech 2017, 2017

Voice conversion based on deep neural networks for time-variant linear transformations.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2016

Prosodic Reading Tutor of Japanese, Suzuki-kun: The first and only educational tool to teach the formal Japanese.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Improved prediction of the accent gap between speakers of English for individual-based clustering of World Englishes.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Influence of the Programming Environment on Programming Education.
Proceedings of the 2016 ACM Conference on Innovation and Technology in Computer Science Education, 2016

Speaker Representations for Speaker Adaptation in Multiple Speakers' BLSTM-RNN-Based Speech Synthesis.
Proceedings of the Interspeech 2016, 2016

Voice Conversion Based on Matrix Variate Gaussian Mixture Model Using Multiple Frame Features.
Proceedings of the Interspeech 2016, 2016

Prediction of the Articulatory Movements of Unseen Phonemes of a Speaker Using the Speech Structure of Another Speaker.
Proceedings of the Interspeech 2016, 2016

The Voice Conversion Challenge 2016.
Proceedings of the Interspeech 2016, 2016

Automatic Assessment and Error Detection of Shadowing Speech: Case of English Spoken by Japanese Learners.
Proceedings of the Interspeech 2016, 2016

Divergence estimation based on deep neural networks and its use for language identification.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Arbitrary speaker conversion based on speaker space bases constructed by deep neural networks.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015
Automatic prediction of intelligibility of English words spoken with Japanese accents - comparative study of features and models used for prediction.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2015

Development of a prosodic reading tutor of Japanese - effective use of TTS and F0 contour modeling techniques for CALL.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2015

Noise-robust and stress-free visualization of pronunciation diversity of World Englishes using a learner's self-centered viewpoint.
Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015

Statistical acoustic-to-articulatory mapping unified with speaker normalization based on voice conversion.
Proceedings of the INTERSPEECH 2015, 2015

A measure of phonetic similarity to quantify pronunciation variation by using ASR technology.
Proceedings of the 18th International Congress of Phonetic Sciences, 2015

SAS: A speaker verification spoofing database containing diverse attacks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Speaker-basis Accent Clustering Using Invariant Structure Analysis and the Speech Accent Archive.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

Visualization of pronunciation diversity of world Englishes from a speaker's self-centered viewpoint.
Proceedings of the 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014

Minecraft-based preparatory training for software development project.
Proceedings of the 2014 IEEE International Professional Communication Conference, 2014

Application of matrix variate Gaussian mixture model to statistical voice conversion.
Proceedings of the INTERSPEECH 2014, 2014

Semi-supervised noise dictionary adaptation for exemplar-based noise robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Improved and robust prediction of pronunciation distance for individual-basis clustering of World Englishes pronunciation.
Proceedings of the IEEE International Conference on Acoustics, 2014

A turning control of electric wheeled walker device by PSD camera information.
Proceedings of the IEEE 13th International Workshop on Advanced Motion Control, 2014

2013
A New Approach to Programming Language Education for Beginners with Top-Down Learning.
iJEP, 2013

Text-to-speech synthesizer based on combination of composite wavelet and hidden Markov models.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Probabilistic speech F0 contour model incorporating statistical vocabulary model of phrase-accent command sequence.
Proceedings of the INTERSPEECH 2013, 2013

Discriminative piecewise linear transformation based on deep learning for noise robust automatic speech recognition.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Statistical Voice Conversion Based on Noisy Channel Model.
IEEE Trans. Audio, Speech & Language Processing, 2012

Hidden Markov Convolutive Mixture Model for Pitch Contour Analysis of Speech.
Proceedings of the INTERSPEECH 2012, 2012

Effects of Speaker Adaptive Training on Tensor-based Arbitrary Speaker Conversion.
Proceedings of the INTERSPEECH 2012, 2012

Assistance for Novice Users on Creating Songs from Japanese Lyrics.
Proceedings of the Non-Cochlear Sound: Proceedings of the 38th International Computer Music Conference, 2012

A tandem connectionist model using combination of multi-scale spectro-temporal features for acoustic event detection.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Correcting for non-uniform illumination when photographing the mural in the royal tomb of Amenophis III (III) Correcting mural images.
Proceedings of the 6th European Conference on Colour in Graphics, Imaging, and Vision, 2012

2011
One-to-Many Voice Conversion Based on Tensor Representation of Speaker Space.
Proceedings of the INTERSPEECH 2011, 2011

Gesture Design of Hand-to-Speech Converter Derived from Speech-to-Hand Converter Based on Probabilistic Integration Model.
Proceedings of the INTERSPEECH 2011, 2011

Adaptation of Prosody in Speech Synthesis by Changing Command Values of the Generation Process Model of Fundamental Frequency.
Proceedings of the INTERSPEECH 2011, 2011

High accurate model-integration-based voice conversion using dynamic features and model structure optimization.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Improved generation of prosodic features in HMM-based Mandarin speech synthesis.
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

Probabilistic integration of joint density model and speaker model for voice conversion.
Proceedings of the INTERSPEECH 2010, 2010

HMM-based sequence-to-frame mapping for voice conversion.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
A numerical method for solving the Vlasov-Poisson equation based on the conservative IDO scheme.
J. Comput. Physics, 2009

Optimal event search using a structural cost function - improvement of structure to speech conversion.
Proceedings of the INTERSPEECH 2009, 2009

2008
Decomposition of rotational distortion caused by VTL difference using eigenvalues of its transformation matrix.
Proceedings of the INTERSPEECH 2008, 2008

Structure to speech conversion - speech generation based on infant-like vocal imitation.
Proceedings of the INTERSPEECH 2008, 2008

Directional dependency of cepstrum on vocal tract length.
Proceedings of the IEEE International Conference on Acoustics, 2008


  Loading...