Sunao Hara

According to our database, Sunao Hara authored at least 45 papers between 2008 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
Speech-Emotion Control for Text-to-Speech in Spoken Dialogue Systems Using Voice Conversion and x-vector Embedding.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Speech Synthesis Using Ambiguous Inputs From Wearable Keyboards.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Speech-Like Emotional Sound Generation Using WaveNet.
IEICE Trans. Inf. Syst., September, 2022

Prediction method of Soundscape Impressions using Environmental Sounds and Aerial Photographs.
CoRR, 2022

Concept drift adaptation for audio scene classification using high-level features.
Proceedings of the IEEE International Conference on Consumer Electronics, 2022

Incremental acoustic scene classification using rehearsal-based strategy.
Proceedings of the 11th IEEE Global Conference on Consumer Electronics, 2022

2021
Model architectures to extrapolate emotional expressions in DNN-based text-to-speech.
Speech Commun., 2021

Evaluation of concept drift adaptation for acoustic scene classifier based on Kernel Density Drift Detection and Combine Merge Gaussian Mixture Model.
CoRR, 2021

Phonetic and Prosodic Information Estimation from Texts for Genuine Japanese End-to-End Text-to-Speech.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

2020
Concept Drift Adaptation for Acoustic Scene Classifier Based on Gaussian Mixture Model.
Proceedings of the 2020 IEEE Region 10 Conference, 2020

Controlling the Strength of Emotions in Speech-Like Emotional Sound Generated by WaveNet.
Proceedings of the Interspeech 2020, 2020

Semi-Supervised Speaker Adaptation for End-to-End Speech Synthesis with Pretrained Models.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Module Comparison of Transformer-TTS for Speaker Adaptation Based on Fine-Tuning.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
A Signal Processing Perspective on Human Gait: Decoupling Walking Oscillations and Gestures.
Proceedings of the Interactive Collaborative Robotics - 4th International Conference, 2019

DNN-based Voice Conversion with Auxiliary Phonemic Information to Improve Intelligibility of Glossectomy Patients' Speech.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Speech-like Emotional Sound Generator by WaveNet.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Naturalness Improvement Algorithm for Reconstructed Glossectomy Patient's Speech Using Spectral Differential Modification in Voice Conversion.
Proceedings of the Interspeech 2018, 2018

2017
Speaker Dependent Approach for Enhancing a Glossectomy Patient's Speech via GMM-Based Voice Conversion.
Proceedings of the Interspeech 2017, 2017

Prediction of subjective assessments for a noise map using deep neural networks.
Proceedings of the Adjunct Proceedings of the 2017 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2017 ACM International Symposium on Wearable Computers, 2017

New monitoring scheme for persons with dementia through monitoring-area adaptation according to stage of disease.
Proceedings of the 1st ACM SIGSPATIAL Workshop on Recommendations for Location-based Services and Social Networks, 2017

An investigation to transplant emotional expressions in DNN-based TTS synthesis.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Sound sensing using smartphones as a crowdsourcing approach.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Sound collection systems using a crowdsourcing approach to construct sound map based on subjective evaluation.
Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016

LiBS: lifelog browsing system to support sharing of memories.
Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2016 ACM International Symposium on Wearable Computers, 2016

Safety vs. privacy: user preferences from the monitored and monitoring sides of a monitoring system.
Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2016 ACM International Symposium on Wearable Computers, 2016

Enhancing a glossectomy patient's speech via GMM-based voice conversion.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015
Sound collection and visualization system enabled participatory and opportunistic sensing approaches.
Proceedings of the 2015 IEEE International Conference on Pervasive Computing and Communication Workshops, 2015

Sub-band text-to-speech combining sample-based spectrum with statistically generated spectrum.
Proceedings of the INTERSPEECH 2015, 2015

Extracting daily patterns of human activity using non-negative matrix factorization.
Proceedings of the IEEE International Conference on Consumer Electronics, 2015

Algorithm to Estimate a Living Area Based on Connectivity of Places with Home.
Proceedings of the HCI International 2015 - Posters' Extended Abstracts, 2015

Extraction of Key Segments from Day-Long Sound Data.
Proceedings of the HCI International 2015 - Posters' Extended Abstracts, 2015

A spoken dialog system with redundant response to prevent user misunderstanding.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
New approach to emotional information exchange: Experience metaphor based on life logs.
Proceedings of the 2014 IEEE International Conference on Pervasive Computing and Communication Workshops, 2014

A hybrid text-to-speech based on sub-band approach.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2012
Acoustic Model Training Using Pseudo-Speaker Features Generated by MLLR Transformations for Robust Speaker-Independent Speech Recognition.
IEICE Trans. Inf. Syst., 2012

Causal analysis of task completion errors in spoken music retrieval interactions.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Evaluation of Invalid Input Discrimination Using Bag-of-Words for Speech-Oriented Guidance System.
Proceedings of the Natural Interaction with Robots, 2012

Development of a Toolkit Handling Multiple Speech-Oriented Guidance Agents for Mobile Applications.
Proceedings of the Natural Interaction with Robots, 2012

2011
On-line detection of task incompletion for spoken dialog systems using utterance and behavior tag N-gram vectors.
Proceedings of the Paralinguistic Information and its Integration in Spoken Dialogue Systems, 2011

Detection of Task-Incomplete Dialogs Based on Utterance-and-Behavior Tag N-Gram for Spoken Dialog Systems.
Proceedings of the INTERSPEECH 2011, 2011

Music Recommendation System Based on Human-to-human Conversation Recognition.
Proceedings of the Workshop Proceedings of the 7th International Conference on Intelligent Environments, 2011

Robust seed model training for speaker adaptation using pseudo-speaker features generated by inverse CMLLR transformation.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Estimation Method of User Satisfaction Using N-gram-based Dialog History Model for Spoken Dialog System.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Automatic detection of task-incompleted dialog for spoken dialog system based on dialog act n-gram.
Proceedings of the INTERSPEECH 2010, 2010

2008
In-car Speech Data Collection along with Various Multimodal Signals.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

