Nobukatsu Hojo

According to our database, Nobukatsu Hojo authored at least 36 papers between 2013 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of five.

Bibliography

2023
End-to-End Joint Target and Non-Target Speakers ASR.
CoRR, 2023

Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss.
CoRR, 2023

Next-Speaker Prediction Based on Non-Verbal Information in Multi-Party Video Conversation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Modeling Lead-Lag Structure in Facial Expression Synchrony for Social-Psychological Outcome Prediction from Negotiation Interaction.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Multimodal Negotiation Corpus with Various Subjective Assessments for Social-Psychological Outcome Prediction from Non-Verbal Cues.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

End-to-End Joint Modeling of Conversation History-Dependent and Independent ASR Systems with Multi-History Training.
Proceedings of the Interspeech 2022, 2022

2021
Many-to-Many Voice Transformer Network.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Model architectures to extrapolate emotional expressions in DNN-based text-to-speech.
Speech Commun., 2021

MaskCycleGAN-VC: Learning Non-Parallel Voice Conversion with Filling in Frames.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
ConvS2S-VC: Fully Convolutional Sequence-to-Sequence Voice Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Nonparallel Voice Conversion With Augmented Classifier Star Generative Adversarial Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics.
CoRR, 2020

CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-Spectrogram Conversion.
Proceedings of the Interspeech 2020, 2020

2019
ACVAE-VC: Non-Parallel Voice Conversion With Auxiliary Classifier Variational Autoencoder.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

WaveCycleGAN2: Time-domain Neural Post-filter for Speech Waveform Generation.
CoRR, 2019

StarGAN-VC2: Rethinking Conditional Methods for StarGAN-Based Voice Conversion.
Proceedings of the Interspeech 2019, 2019

Evaluating Intention Communication by TTS Using Explicit Definitions of Illocutionary Act Performance.
Proceedings of the Interspeech 2019, 2019

AttS2S-VC: Sequence-to-sequence Voice Conversion with Attention and Context Preservation Mechanisms.
Proceedings of the IEEE International Conference on Acoustics, 2019

CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
DNN-Based Speech Synthesis Using Speaker Codes.
IEICE Trans. Inf. Syst., 2018

ConvS2S-VC: Fully convolutional sequence-to-sequence voice conversion.
CoRR, 2018

WaveCycleGAN: Synthetic-to-natural speech waveform conversion using cycle-consistent adversarial networks.
CoRR, 2018

ACVAE-VC: Non-parallel many-to-many voice conversion with auxiliary classifier variational autoencoder.
CoRR, 2018

StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks.
CoRR, 2018

Generative adversarial network-based approach to signal reconstruction from magnitude spectrograms.
CoRR, 2018

Synthetic-to-Natural Speech Waveform Conversion Using Cycle-Consistent Adversarial Networks.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

StarGAN-VC: Non-Parallel Many-to-Many Voice Conversion Using Star Generative Adversarial Networks.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Generative adversarial network-based approach to signal reconstruction from magnitude spectrogram.
Proceedings of the 26th European Signal Processing Conference, 2018

Automatic Speech Pronunciation Correction with Dynamic Frequency Warping-Based Spectral Conversion.
Proceedings of the 26th European Signal Processing Conference, 2018

2017
Prosody Aware Word-Level Encoder Based on BLSTM-RNNs for DNN-Based Speech Synthesis.
Proceedings of the Interspeech 2017, 2017

DNN-SPACE: DNN-HMM-Based Generative Model of Voice F0 Contours for Statistical Phrase/Accent Command Estimation.
Proceedings of the Interspeech 2017, 2017

Generative adversarial network-based postfilter for statistical parametric speech synthesis.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

An investigation to transplant emotional expressions in DNN-based TTS synthesis.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
An Investigation of DNN-Based Speech Synthesis Using Speaker Codes.
Proceedings of the Interspeech 2016, 2016

2014
Speech prosody generation for text-to-speech synthesis based on generative model of F0 contours.
Proceedings of the INTERSPEECH 2014, 2014

2013
Text-to-speech synthesizer based on combination of composite wavelet and hidden Markov models.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
