Yui Sudo

Orcid: 0000-0003-2094-6701

According to our database, Yui Sudo authored at least 17 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification.
CoRR, 2024

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer.
CoRR, 2024

Contextualized Automatic Speech Recognition with Attention-Based Bias Phrase Boosted Beam Search.
CoRR, 2024

2023
Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation.
CoRR, 2023

DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models.
CoRR, 2023

Online Adaptation of Fourier Series Based Acoustic Transfer Function Model to Improve Sound Source Localization and Separation.
Proceedings of the 32nd IEEE International Conference on Robot and Human Interactive Communication, 2023

Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Flexible Evidence Model to Reduce Uncertainty Mismatch Between Speech Enhancement and ASR Based on Encoder-Decoder Architecture.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
4D ASR: Joint modeling of CTC, Attention, Transducer, and Mask-Predict decoders.
CoRR, 2022

Empirical Sampling from Latent Utterance-wise Evidence Model for Missing Data ASR based on Neural Encoder-Decoder Model.
Proceedings of the Interspeech 2022, 2022

Streaming Automatic Speech Recognition with Re-blocking Processing Based on Integrated Voice Activity Detection.
Proceedings of the Interspeech 2022, 2022

2021
Multichannel environmental sound segmentation.
Appl. Intell., 2021

Multi-channel Environmental Sound Segmentation utilizing Sound Source Localization and Separation U-Net.
Proceedings of the IEEE/SICE International Symposium on System Integration, 2021

2020
Sound event aware environmental sound segmentation with Mask U-Net.
Adv. Robotics, 2020

Multi-channel Environmental sound segmentation.
Proceedings of the 2020 IEEE/SICE International Symposium on System Integration, 2020

2019
Environmental sound segmentation utilizing Mask U-Net.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Improvement of DOA Estimation by using Quaternion Output in Sound Event Localization and Detection.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

