Yuki Takashima

Orcid: 0000-0001-8489-9487

According to our database1, Yuki Takashima authored at least 23 papers between 2015 and 2023.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Online Neural Diarization of Unlimited Numbers of Speakers Using Global and Local Attractors.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

2022
Online Neural Diarization of Unlimited Numbers of Speakers.
CoRR, 2022

Mutual Learning of Single- and Multi-Channel End-to-End Neural Diarization.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Updating Only Encoders Prevents Catastrophic Forgetting of End-to-End ASR Models.
Proceedings of the Interspeech 2022, 2022

Multi-Channel End-To-End Neural Diarization with Distributed Microphones.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Unsupervised domain adaptation for lip reading based on cross-modal knowledge distillation.
EURASIP J. Audio Speech Music. Process., 2021

The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural Diarization and X-Vector Clustering Systems Combined by DOVER-Lap.
CoRR, 2021

Online End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers.
CoRR, 2021

End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Online Streaming End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Semi-Supervised Training with Pseudo-Labeling for End-To-End Neural Diarization.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

A Study on Dual-Language Display Method Using the Law of Common Fate in Oscillatory Animation on Digital Signage.
Proceedings of the HCI International 2021 - Late Breaking Papers: Design and User Experience, 2021

Towards Neural Diarization for Unlimited Numbers of Speakers Using Global and Local Attractors.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Dysarthric Speech Recognition Based on Deep Metric Learning.
Proceedings of the Interspeech 2020, 2020

A Study on Bilingual Superimposed Display Method on Digital Signage.
Proceedings of the Social Computing and Social Media. Participation, User Experience, Consumer Experience, and Applications of Social Computing, 2020

2019
Non-parallel dictionary learning for voice conversion using non-negative Tucker decomposition.
EURASIP J. Audio Speech Music. Process., 2019

Knowledge Transferability Between the Speech Data of Persons With Dysarthria Speaking Different Languages for Dysarthric Speech Recognition.
IEEE Access, 2019

End-to-end Dysarthric Speech Recognition Using Multiple Databases.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Parallel-Data-Free Dictionary Learning for Voice Conversion Using Non-Negative Tucker Decomposition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2016
Audio-Visual Speech Recognition Using Bimodal-Trained Bottleneck Features for a Person with Severe Hearing Loss.
Proceedings of the Interspeech 2016, 2016

Lip reading using a dynamic feature of lip images and convolutional neural networks.
Proceedings of the 15th IEEE/ACIS International Conference on Computer and Information Science, 2016

2015
Audio-Visual Speech Recognition Using Convolutive Bottleneck Networks for a Person with Severe Hearing Loss.
IPSJ Trans. Comput. Vis. Appl., 2015

Feature extraction using pre-trained convolutive bottleneck nets for dysarthric speech recognition.
Proceedings of the 23rd European Signal Processing Conference, 2015


  Loading...