Sunao Hara

CoRR, 2021

Phonetic and Prosodic Information Estimation from Texts for Genuine Japanese End-to-End Text-to-Speech.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020

Concept Drift Adaptation for Acoustic Scene Classifier Based on Gaussian Mixture Model.

[BibT_eX]

[DOI]

Ibnu Daqiqil Id

Proceedings of the 2020 IEEE Region 10 Conference, 2020

Controlling the Strength of Emotions in Speech-Like Emotional Sound Generated by WaveNet.

[BibT_eX]

[DOI]

Kento Matsumoto

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Semi-Supervised Speaker Adaptation for End-to-End Speech Synthesis with Pretrained Models.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Module Comparison Of Transformer-Tts For Speaker Adaptation Based On Fine-Tuning.

[BibT_eX]

[DOI]

Katsuki Inoue

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019

A Signal Processing Perspective on Human Gait: Decoupling Walking Oscillations and Gestures.

[BibT_eX]

[DOI]

Proceedings of the Interactive Collaborative Robotics - 4th International Conference, 2019

DNN-based Voice Conversion with Auxiliary Phonemic Information to Improve Intelligibility of Glossectomy Patients' Speech.

[BibT_eX]

[DOI]

Hiroki Murakami

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Speech-like Emotional Sound Generator by WaveNet.

[BibT_eX]

[DOI]

Kento Matsumoto

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018

Naturalness Improvement Algorithm for Reconstructed Glossectomy Patient's Speech Using Spectral Differential Modification in Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017

Speaker Dependent Approach for Enhancing a Glossectomy Patient's Speech via GMM-Based Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Prediction of subjective assessments for a noise map using deep neural networks.

[BibT_eX]

[DOI]

Shota Kobayashi

Proceedings of the Adjunct Proceedings of the 2017 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2017 ACM International Symposium on Wearable Computers, 2017

New monitoring scheme for persons with dementia through monitoring-area adaptation according to stage of disease.

[BibT_eX]

[DOI]

Proceedings of the 1st ACM SIGSPATIAL Workshop on Recommendations for Location-based Services and Social Networks, 2017

An investigation to transplant emotional expressions in DNN-based TTS synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Sound sensing using smartphones as a crowdsourcing approach.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016

Sound collection systems using a crowdsourcing approach to construct sound map based on subjective evaluation.

[BibT_eX]

[DOI]

Shota Kobayashi

Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016

LiBS: lifelog browsing system to support sharing of memories.

[BibT_eX]

[DOI]

Atsuya Namba

Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2016 ACM International Symposium on Wearable Computers, 2016

Safety vs. privacy: user preferences from the monitored and monitoring sides of a monitoring system.

[BibT_eX]

[DOI]

Shigeki Kamada

Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2016 ACM International Symposium on Wearable Computers, 2016

Enhancing a glossectomy patient's speech via GMM-based voice conversion.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015

Sound collection and visualization system enabled participatory and opportunistic sensing approaches.

[BibT_eX]

[DOI]

Noboru Sonehara

Proceedings of the 2015 IEEE International Conference on Pervasive Computing and Communication Workshops, 2015

Sub-band text-to-speech combining sample-based spectrum with statistically generated spectrum.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Extracting daily patterns of human activity using non-negative matrix factorization.

[BibT_eX]

[DOI]

Akihiko Hirayama

Proceedings of the IEEE International Conference on Consumer Electronics, 2015

Algorithm to Estimate a Living Area Based on Connectivity of Places with Home.

[BibT_eX]

[DOI]

Yuji Matsuo

Proceedings of the HCI International 2015 - Posters' Extended Abstracts, 2015

Extraction of Key Segments from Day-Long Sound Data.

[BibT_eX]

[DOI]

Akinori Kasai

Proceedings of the HCI International 2015 - Posters' Extended Abstracts, 2015

A spoken dialog system with redundant response to prevent user misunderstanding.

[BibT_eX]

[DOI]

Masaki Yamaoka

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014

A Graph-Based Spoken Dialog Strategy Utilizing Multiple Understanding Hypotheses.

[BibT_eX]

[DOI]

Inf. Media Technol., 2014

New approach to emotional information exchange: Experience metaphor based on life logs.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Pervasive Computing and Communication Workshops, 2014

A hybrid text-to-speech based on sub-band approach.

[BibT_eX]

[DOI]

Takuma Inoue

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2012

Acoustic Model Training Using Pseudo-Speaker Features Generated by MLLR Transformations for Robust Speaker-Independent Speech Recognition.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2012

Causal analysis of task completion errors in spoken music retrieval interactions.

[BibT_eX]

[DOI]

Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Evaluation of Invalid Input Discrimination Using Bag-of-Words for Speech-Oriented Guidance System.

[BibT_eX]

[DOI]

Proceedings of the Natural Interaction with Robots, 2012

Development of a Toolkit Handling Multiple Speech-Oriented Guidance Agents for Mobile Applications.

[BibT_eX]

[DOI]

Proceedings of the Natural Interaction with Robots, 2012

2011

On-line detection of task incompletion for spoken dialog systems using utterance and behavior tag N-gram vectors.

[BibT_eX]

[DOI]

Proceedings of the Paralinguistic Information and its Integration in Spoken Dialogue Systems, 2011

Detection of Task-Incomplete Dialogs Based on Utterance-and-Behavior Tag N-Gram for Spoken Dialog Systems.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Music Recommendation System Based on Human-to-human Conversation Recognition.

[BibT_eX]

[DOI]

Proceedings of the Workshop Proceedings of the 7th International Conference on Intelligent Environments, 2011

Robust seed model training for speaker adaptation using pseudo-speaker features generated by inverse CMLLR transformation.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010

Estimation Method of User Satisfaction Using N-gram-based Dialog History Model for Spoken Dialog System.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Language Resources and Evaluation, 2010

Automatic detection of task-incompleted dialog for spoken dialog system based on dialog act n-gram.

[BibT_eX]

[DOI]