Yuya Fujita

Orcid: 0000-0001-8429-8964

According to our database1, Yuya Fujita authored at least 31 papers between 2012 and 2023.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
HuBERTopic: Enhancing Semantic Representation of HuBERT through Self-supervision Utilizing Topic Model.
CoRR, 2023

Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing.
CoRR, 2023

Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study.
CoRR, 2023

Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning.
CoRR, 2023

Align, Write, Re-Order: Explainable End-to-End Speech Translation via Operation Sequence Generation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Fully Unsupervised Topic Clustering of Unlabelled Spoken Audio Using Self-Supervised Representation Learning and Topic Model.
Proceedings of the IEEE International Conference on Acoustics, 2023

LV-CTC: Non-Autoregressive ASR With CTC and Latent Variable Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Production planning method for <i>seru</i> production systems under demand uncertainty.
Comput. Ind. Eng., 2022

Attention Weight Smoothing Using Prior Distributions for Transformer-Based End-to-End ASR.
Proceedings of the Interspeech 2022, 2022

End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation.
Proceedings of the Interspeech 2022, 2022

Non-Autoregressive End-To-End Automatic Speech Recognition Incorporating Downstream Natural Language Processing.
Proceedings of the IEEE International Conference on Acoustics, 2022

An Exploration of Hubert with Large Number of Cluster Units and Model Assessment Using Bayesian Information Criterion.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
End-to-end ASR to jointly predict transcriptions and linguistic annotations.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Streaming End-to-End ASR Based on Blockwise Non-Autoregressive Models.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Toward Streaming ASR with Non-Autoregressive Insertion-Based Model.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Insertion-Based Modeling for End-to-End Automatic Speech Recognition.
Proceedings of the Interspeech 2020, 2020

End-to-End ASR with Adaptive Span Self-Attention.
Proceedings of the Interspeech 2020, 2020

Attention-Based ASR with Lightweight and Dynamic Convolutions.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Dry, Focus, and Transcribe: End-to-End Integration of Dereverberation, Beamforming, and ASR.
CoRR, 2019

Generalized Weighted-Prediction-Error Dereverberation with Varying Source Priors For Reverberant Speech Recognition.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Speech Enhancement Using End-to-End Speech Recognition Objectives.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

2018
PARHELIA: Particle Filter-Based Heart Rate Estimation From Photoplethysmographic Signals During Physical Exercise.
IEEE Trans. Biomed. Eng., 2018

Speaker Selective Beamformer with Keyword Mask Estimation.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Multi Scale Feedback Connection for Noise Robust Acoustic Modeling.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Fast And Robust Heart Rate Estimation From Videos Through Dynamic Region Selection.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

2016
Robust DNN-Based VAD Augmented with Phone Entropy Based Rejection of Background Speech.
Proceedings of the Interspeech 2016, 2016

2013
Lightly supervised training for risk-based discriminative language models.
Proceedings of the INTERSPEECH 2013, 2013

Progressive language model adaptation for disaster broadcasting with closed-captions.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Speaker adaptation intensively weighted on mis-recognized speech segments.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012


  Loading...