Ryoichi Takashima

Orcid: 0000-0002-9808-0250

According to our database, Ryoichi Takashima authored at least 80 papers between 2009 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

Color-based Emotion Representation for Speech Emotion Recognition.

[DOI]

,

Ryoichi Takashima

,

Yoichi Yamashita

CoRR, February, 2026

2025

Prefix tuning with prompt augmentation for efficient financial news summarization.

[DOI]

,

,

,

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

J. Comput. Soc. Sci., February, 2025

Sequence-to-Sequence Voice Conversion With Weighted Guided Attention.

[DOI]

Haruki Yamashita

,

,

Ryoichi Takashima

,

,

Tetsuya Takiguchi

,

,

IEEE Access, 2025

Operatic Singing Voice Synthesis From Inexperienced Voice Considering Tempo and Vowel Change.

[DOI]

,

,

,

,

Ryoichi Takashima

,

Tetsuya Takiguchi

Proceedings of the MultiMedia Modeling, 2025

Zero-Shot Learning for Acoustic Event Classification Using an Attribute Vector and Conditional GAN.

[DOI]

,

Ryoichi Takashima

,

Tetsuya Takiguchi

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Revisiting WFST-based Hybrid Japanese Speech Recognition System for Individuals with Organic Speech Disorders.

[DOI]

,

Ryoichi Takashima

,

Chihiro Sugiyama

,

Nobukazu Tanaka

,

,

Kazunori Nozaki

,

Tetsuya Takiguchi

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Highly Intelligible Text-to-Speech System Based on Weighted Averaging of Parameters for Individuals with Spinal Muscular Atrophy.

[DOI]

,

Ryoichi Takashima

,

,

Tetsuya Takiguchi

Proceedings of the 27th International ACM SIGACCESS Conference on Computers and Accessibility, 2025

Speaker-dependent Continuous Speech Recognition for Individuals with Cerebral Palsy Using Weighted Finite-State Transducer and Text-to-Speech Synthesis.

[DOI]

,

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Tatsuhiko Saito

Proceedings of the 27th International ACM SIGACCESS Conference on Computers and Accessibility, 2025

GAN-Enhanced InpaintNet for Music Inpainting on Limited Data.

[DOI]

,

,

,

Ryoichi Takashima

,

Yoichi Yamashita

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

2024

Fast Neural Speech Waveform Generative Models With Fully-Connected Layer-Based Upsampling.

[DOI]

Haruki Yamashita

,

,

Ryoichi Takashima

,

,

Tetsuya Takiguchi

,

,

IEEE Access, 2024

Dysarthric Speech Recognition Using Pseudo-Labeling, Self-Supervised Feature Learning, and a Joint Multi-Task Learning Approach.

[DOI]

Ryoichi Takashima

,

,

,

Tetsuya Takiguchi

,

IEEE Access, 2024

Training of VITS Model Reflecting the Duration of a Physically Unimpaired Speaker for a Text-to-speech System for a Person with a Stutter.

[DOI]

,

Haruki Yamashita

,

Ryoichi Takashima

,

,

Tetsuya Takiguchi

Proceedings of the 13th IEEE Global Conference on Consumer Electronics, 2024

Speech Recognition for a Person With Cerebral Palsy Using Whisper Fine-Tuned on Japanese and English Dysarthric Speech.

[DOI]

,

Ryoichi Takashima

,

Tetsuya Takiguchi

Proceedings of the 13th IEEE Global Conference on Consumer Electronics, 2024

Representation Learning Based on Variational Autoencoders for Imagined Speech Classification.

[DOI]

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Proceedings of the 32nd European Signal Processing Conference, 2024

Generation of Colored Subtitle Images Based on Emotional Information of Speech Utterances.

[DOI]

Ryoichi Takashima

,

Fumiya Nakamura

,

,

Tetsuya Takiguchi

,

Proceedings of the 32nd European Signal Processing Conference, 2024

Self-supervised learning using unlabeled speech with multiple types of speech disorder for disordered speech recognition.

[DOI]

Ryoichi Takashima

,

,

,

Tetsuya Takiguchi

,

Proceedings of the 26th International ACM SIGACCESS Conference on Computers and Accessibility, 2024

Individuality-Preserving Speech Synthesis for Spinal Muscular Atrophy with a Tracheotomy.

[DOI]

,

Ryoichi Takashima

,

,

Tetsuya Takiguchi

Proceedings of the 26th International ACM SIGACCESS Conference on Computers and Accessibility, 2024

2023

Harmonic-Net: Fundamental Frequency and Speech Rate Controllable Fast Neural Vocoder.

[DOI]

Keisuke Matsubara

,

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Zero-Shot Sound Event Classification Using a Sound Attribute Vector with Global and Local Feature Learning.

[DOI]

,

,

Ryoichi Takashima

,

Tetsuya Takiguchi

Proceedings of the IEEE International Conference on Acoustics, 2023

EEG Source Estimation Using Deep Prior Without a Subject's Individual Lead Field.

[DOI]

,

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Proceedings of the IEEE International Conference on Acoustics, 2023

Operatic Singing Voice Synthesis Using Diff-SVC.

[DOI]

,

,

,

,

Ryoichi Takashima

,

Tetsuya Takiguchi

Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023

2022

Phoneme-guided Dysarthric speech conversion With non-parallel data by joint training.

[DOI]

,

,

,

Ryoichi Takashima

,

Tetsuya Takiguchi

Signal Image Video Process., 2022

Learn to See Faster: Pushing the Limits of High-Speed Camera with Deep Underexposed Image Denoising.

[DOI]

,

Tristan Hascoet

,

Ryoichi Takashima

,

Tetsuya Takiguchi

CoRR, 2022

Optical Flow Regularization of Implicit Neural Representations for Video Frame Interpolation.

[DOI]

,

Tristan Hascoet

,

Ryoichi Takashima

,

Tetsuya Takiguchi

CoRR, 2022

Current Source Localization Using Deep Prior with Depth Weighting.

[DOI]

,

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

CoRR, 2022

MEG Source Localization Using Deep Prior.

[DOI]

,

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Proceedings of the 4th IEEE Global Conference on Life Sciences and Technologies, 2022

Comparative Evaluation of Neural Vocoders for Speech Synthesis of Operatic Singing.

[DOI]

,

Keisuke Matsubara

,

,

,

Ryoichi Takashima

,

Tetsuya Takiguchi

Proceedings of the 4th IEEE Global Conference on Life Sciences and Technologies, 2022

Adaptation of a Pronunciation Dictionary for Dysarthric Speech Recognition.

[DOI]

,

Ryoichi Takashima

,

Tetsuya Takiguchi

Proceedings of the 4th IEEE Global Conference on Life Sciences and Technologies, 2022

Data Augmentation for Dysarthric Speech Recognition Based on Text-to-Speech Synthesis.

[DOI]

,

Ryoichi Takashima

,

,

Tetsuya Takiguchi

Proceedings of the 4th IEEE Global Conference on Life Sciences and Technologies, 2022

Speaker-Targeted Audio-Visual Speech Recognition Using a Hybrid CTC/Attention Model with Interference Loss.

[DOI]

,

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Proceedings of the IEEE International Conference on Acoustics, 2022

Binary Attribute Embeddings for Zero-Shot Sound Event Classification.

[DOI]

,

,

Ryoichi Takashima

,

Tetsuya Takiguchi

Proceedings of the 11th IEEE Global Conference on Consumer Electronics, 2022

2021

Multimodal fusion for indoor sound source localization.

[DOI]

,

Ryoichi Takashima

,

,

,

,

Tetsuya Takiguchi

,

Edwin R. Hancock

Pattern Recognit., 2021

Unsupervised domain adaptation for lip reading based on cross-modal knowledge distillation.

[DOI]

,

Ryoichi Takashima

,

,

,

Tetsuya Takiguchi

,

,

Nobuaki Motoyama

EURASIP J. Audio Speech Music. Process., 2021

Full-Band LPCNet: A Real-Time Neural Vocoder for 48 kHz Audio With a CPU.

[DOI]

Keisuke Matsubara

,

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

,

Yoshinori Shiga

,

IEEE Access, 2021

High-Intelligibility Speech Synthesis for Dysarthric Speakers with LPCNet-Based TTS and CycleVAE-Based VC.

[DOI]

Keisuke Matsubara

,

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

,

Yoshinori Shiga

,

Proceedings of the IEEE International Conference on Acoustics, 2021

Data Augmentation Based on Frequency Warping for Recognition of Cleft Palate Speech.

[DOI]

,

Ryoichi Takashima

,

Chihiro Sugiyama

,

Nobukazu Tanaka

,

,

Kazunori Nozaki

,

Tetsuya Takiguchi

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020

Dysarthric Speech Recognition Based on Deep Metric Learning.

[DOI]

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Two-Step Acoustic Model Adaptation for Dysarthric Speech Recognition.

[DOI]

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Convolutional neural networks Memory optimization Inference with Splitting Image.

[DOI]

,

Tristan Hascoet

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Proceedings of the 9th IEEE Global Conference on Consumer Electronics, 2020

An Investigation of End-to-End Speech Recognition Using Model Adaptation for Dysarthric Speakers.

[DOI]

,

Ryoichi Takashima

,

Tetsuya Takiguchi

Proceedings of the 9th IEEE Global Conference on Consumer Electronics, 2020

Opera Singing Voice Synthesis Considering Vowel Variations.

[DOI]

,

,

,

Ryoichi Takashima

,

Tetsuya Takiguchi

Proceedings of the 9th IEEE Global Conference on Consumer Electronics, 2020

FasterRCNN Monitoring of Road Damages: Competition and Deployment.

[DOI]

Tristan Hascoet

,

,

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

2019

Knowledge Transferability Between the Speech Data of Persons With Dysarthria Speaking Different Languages for Dysarthric Speech Recognition.

[DOI]

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

IEEE Access, 2019

Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition.

[DOI]

,

Shota Horiguchi

,

Ryoichi Takashima

,

,

Kenji Nagamatsu

,

Shinji Watanabe

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Investigation of Sequence-level Knowledge Distillation Methods for CTC Acoustic Models.

[DOI]

Ryoichi Takashima

,

,

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Improving Very Deep Time-Delay Neural Network With Vertical-Attention For Effectively Training CTC-Based ASR Systems.

[DOI]

,

,

Ryoichi Takashima

,

,

Tatsuya Kawahara

,

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Improving CTC-based Acoustic Model with Very Deep Residual Time-delay Neural Networks.

[DOI]

,

,

Ryoichi Takashima

,

,

Tatsuya Kawahara

,

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

CTC Loss Function with a Unit-Level Ambiguity Penalty.

[DOI]

Ryoichi Takashima

,

,

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

An Investigation of a Knowledge Distillation Method for CTC Acoustic Models.

[DOI]

Ryoichi Takashima

,

,

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Separation of vibration-derived sound signals based on fusion processing of vibration sensors and microphones.

[DOI]

Ryoichi Takashima

,

Yohei Kawaguchi

,

Masahito Togami

Proceedings of the 25th European Signal Processing Conference, 2017

ADMM-based audio reconstruction for low-cost-sound-monitoring.

[DOI]

Sandra Ramaswami

,

Yohei Kawaguchi

,

Ryoichi Takashima

,

,

Masahito Togami

Proceedings of the 25th European Signal Processing Conference, 2017

Incremental training and constructing the very deep convolutional residual network acoustic models.

[DOI]

,

,

,

Ryoichi Takashima

,

Tatsuya Kawahara

,

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

An application of noise-robust speech translation using asynchronous smart devices.

[DOI]

Ryoichi Takashima

,

Yohei Kawaguchi

,

,

Takashi Sumiyoshi

,

Masahito Togami

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Time-domain subsampling and reconstruction for microphone array.

[DOI]

Yohei Kawaguchi

,

Ryoichi Takashima

,

,

Masahito Togami

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Sub-Nyquist non-uniform sampling for low-cost sound monitoring.

[DOI]

Yohei Kawaguchi

,

Sandra Ramaswami

,

Ryoichi Takashima

,

,

Rintaro Ikeshita

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016

Solving permutation problem with a cascade combination of phase difference entropy and power spectral correlation.

[DOI]

Masahito Togami

,

Ryoichi Takashima

,

Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

Data Augmentation Using Multi-Input Multi-Output Source Separation for Deep Neural Network Based Acoustic Modeling.

[DOI]

,

Ryoichi Takashima

,

,

Masahito Togami

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015

Unified ASR system using LGM-based source separation, noise-robust feature extraction, and word hypothesis selection.

[DOI]

,

Ryoichi Takashima

,

,

Rintaro Ikeshita

,

Yohei Kawaguchi

,

Takashi Sumiyoshi

,

,

Masahito Togami

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

Noise-Robust Voice Conversion Based on Sparse Spectral Mapping Using Non-negative Matrix Factorization.

[DOI]

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

IEICE Trans. Inf. Syst., 2014

A preliminary demonstration of exemplar-based voice conversion for articulation disorders using an individuality-preserving dictionary.

[DOI]

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

EURASIP J. Audio Speech Music. Process., 2014

Frequency domain acoustic echo reduction based on Kalman smoother with time-varying noise covariance matrix.

[DOI]

Masahito Togami

,

Yohei Kawaguchi

,

Ryoichi Takashima

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Exemplar-Based Voice Conversion Using Sparse Representation in Noisy Environments.

[DOI]

Ryoichi Takashima

,

Tetsuya Takiguchi

,

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2013

Noise-robust voice conversion based on spectral mapping on sparse space.

[DOI]

Ryoichi Takashima

,

,

Tetsuya Takiguchi

,

Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Voice conversion based on Non-negative Matrix Factorization in noisy environments.

[DOI]

,

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Proceedings of the 2013 IEEE/SICE International Symposium on System Integration, 2013

Voice conversion in high-order eigen space using deep belief nets.

[DOI]

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Exemplar-based individuality-preserving voice conversion for articulation disorders in noisy environments.

[DOI]

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Prediction of unlearned position based on local regression for single-channel talker localization using acoustic transfer function.

[DOI]

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Proceedings of the IEEE International Conference on Acoustics, 2013

Individuality-preserving voice conversion for articulation disorders based on non-negative matrix factorization.

[DOI]

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Exemplar-based voice conversion in noisy environment.

[DOI]

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Estimation of Talker's Head Orientation Based on Discrimination of the Shape of Cross-power Spectrum Phase Coefficients.

[DOI]

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

A new multiple-kernel-learning weighting method for localizing human brain magnetic activity.

[DOI]

Tetsuya Takiguchi

,

,

Ryoichi Takashima

,

,

Jo-Fu Lotus Lin

,

Patricia K. Kuhl

,

Masaki Kawakatsu

,

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Robust feature extraction to utterance fluctuations due to articulation disorders based on sparse expression.

[DOI]

Toshiya Yoshioka

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

An adaboost-based weighting method for localizing human brain magnetic activity.

[DOI]

Tetsuya Takiguchi

,

Ryoichi Takashima

,

,

,

Jo-Fu Lotus Lin

,

Patricia K. Kuhl

,

Masaki Kawakatsu

,

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Consonant enhancement for articulation disorders based on non-negative matrix factorization.

[DOI]

,

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011

Single-Channel Head Orientation Estimation Based on Discrimination of Acoustic Transfer Function.

[DOI]

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Agglomerative Hierarchical Clustering of Emotions in Speech Based on Subjective Relative Similarity.

[DOI]

Ryoichi Takashima

,

,

Ryuki Tachibana

,

Masafumi Nishimura

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Feature selection based on Multiple Kernel Learning for single-channel sound source localization using the acoustic transfer function.

[DOI]

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

HMM-based separation of acoustic transfer function for single-channel sound source localization.

[DOI]

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Single-Channel Talker Localization Based on Discrimination of Acoustic Transfer Functions.

[DOI]

Tetsuya Takiguchi

,

,

Ryoichi Takashima

,

EURASIP J. Adv. Signal Process., 2009

Monaural sound-source-direction estimation using the acoustic transfer function of an active microphone.

[DOI]

Ryoichi Takashima

,

Tetsuya Takiguchi

,

Proceedings of the 12th International Conference on Information Fusion, 2009
