Takuhiro Kaneko

Orcid: 0009-0000-8016-5144

According to our database1, Takuhiro Kaneko authored at least 58 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Training Generative Adversarial Network-Based Vocoder with Limited Data Using Augmentation-Conditional Discriminator.
CoRR, 2024

Unsupervised Intrinsic Image Decomposition with LiDAR Intensity Enhanced Training.
CoRR, 2024

2023
iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using 1D-2D CNN.
CoRR, 2023

Non-Parallel Whisper-to-Normal Speaking Style Conversion Using Auxiliary Classifier Variational Autoencoder.
IEEE Access, 2023

Frame-Level Event Representation Learning for Semantic-Level Generation and Editing of Avatar Motion.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

MIMO-NeRF: Fast Neural Rendering with Multi-input Multi-output Neural Radiance Fields.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

JSV-VC: Jointly Trained Speaker Verification and Voice Conversion Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2023

W2N-AVSC: Audiovisual Extension For Whisper-To-Normal Speech Conversion.
Proceedings of the 31st European Signal Processing Conference, 2023

Unsupervised Intrinsic Image Decomposition with LiDAR Intensity.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Distilling Sequence-to-Sequence Voice Conversion Models for Streaming Conversion Applications.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

MISRNet: Lightweight Neural Vocoder Using Multi-Input Single Shared Residual Blocks.
Proceedings of the Interspeech 2022, 2022

CAUSE: Crossmodal Action Unit Sequence Estimation from Speech.
Proceedings of the Interspeech 2022, 2022

ISTFTNET: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform.
Proceedings of the IEEE International Conference on Acoustics, 2022

AR-NeRF: Unsupervised Learning of Depth and Defocus Effects from Natural Images with Aperture Rendering Neural Radiance Fields.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Many-to-Many Voice Transformer Network.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

FastS2S-VC: Streaming Non-Autoregressive Sequence-to-Sequence Voice Conversion.
CoRR, 2021

Maskcyclegan-VC: Learning Non-Parallel Voice Conversion with Filling in Frames.
Proceedings of the IEEE International Conference on Acoustics, 2021

Blur, Noise, and Compression Robust Generative Adversarial Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Unsupervised Learning of Depth and Depth-of-Field Effect From Natural Images With Aperture Rendering Generative Adversarial Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
ConvS2S-VC: Fully Convolutional Sequence-to-Sequence Voice Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Nonparallel Voice Conversion With Augmented Classifier Star Generative Adversarial Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics.
CoRR, 2020

CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-Spectrogram Conversion.
Proceedings of the Interspeech 2020, 2020

Noise Robust Generative Adversarial Networks.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
ACVAE-VC: Non-Parallel Voice Conversion With Auxiliary Classifier Variational Autoencoder.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Label-Noise Robust Multi-Domain Image-to-Image Translation.
CoRR, 2019

Crossmodal Voice Conversion.
CoRR, 2019

WaveCycleGAN2: Time-domain Neural Post-filter for Speech Waveform Generation.
CoRR, 2019

StarGAN-VC2: Rethinking Conditional Methods for StarGAN-Based Voice Conversion.
Proceedings of the Interspeech 2019, 2019

ATTS2S-VC: Sequence-to-sequence Voice Conversion with Attention and Context Preservation Mechanisms.
Proceedings of the IEEE International Conference on Acoustics, 2019

Cyclegan-VC2: Improved Cyclegan-based Non-parallel Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2019

Label-Noise Robust Generative Adversarial Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Class-Distinct and Class-Mutual Image Generation with GANs.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
ConvS2S-VC: Fully convolutional sequence-to-sequence voice conversion.
CoRR, 2018

WaveCycleGAN: Synthetic-to-natural speech waveform conversion using cycle-consistent adversarial networks.
CoRR, 2018

ACVAE-VC: Non-parallel many-to-many voice conversion with auxiliary classifier variational autoencoder.
CoRR, 2018

StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks.
CoRR, 2018

Generative adversarial network-based approach to signal reconstruction from magnitude spectrograms.
CoRR, 2018

Synthetic-to-Natural Speech Waveform Conversion Using Cycle-Consistent Adversarial Networks.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

StarGAN-VC: non-parallel many-to-many Voice Conversion Using Star Generative Adversarial Networks.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Generative adversarial network-based approach to signal reconstruction from magnitude spectrogram.
Proceedings of the 26th European Signal Processing Conference, 2018

CycleGAN-VC: Non-parallel Voice Conversion Using Cycle-Consistent Adversarial Networks.
Proceedings of the 26th European Signal Processing Conference, 2018

Automatic Speech Pronunciation Correction with Dynamic Frequency Warping-Based Spectral Conversion.
Proceedings of the 26th European Signal Processing Conference, 2018

Generative Adversarial Image Synthesis With Decision Tree Latent Controller.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Parallel-Data-Free Voice Conversion Using Cycle-Consistent Adversarial Networks.
CoRR, 2017

Generative Adversarial Network-Based Postfilter for STFT Spectrograms.
Proceedings of the Interspeech 2017, 2017

Sequence-to-Sequence Voice Conversion with Similarity Metric Learned Using Generative Adversarial Networks.
Proceedings of the Interspeech 2017, 2017

Generative adversarial network-based postfilter for statistical parametric speech synthesis.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Generative Attribute Controller with Conditional Filtered Generative Adversarial Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Non-native speech conversion with consistency-aware recursive network and generative adversarial network.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Collective activity localization by spatiality preservation search.
Adv. Robotics, 2016

Adaptive Visual Feedback Generation for Facial Expression Improvement with Multi-task Deep Neural Networks.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

2014
A fully connected model for consistent collective activity recognition in videos.
Pattern Recognit. Lett., 2014

Modeling risk anticipation and defensive driving on residential roads with inverse reinforcement learning.
Proceedings of the 17th International IEEE Conference on Intelligent Transportation Systems, 2014

2012
Consistent collective activity recognition with fully connected CRFs.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Collective Activity Localization with Contextual Spatial Pyramid.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

Viewpoint Invariant Collective Activity Recognition with Relative Action Context.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012


  Loading...