Yuya Fujita

ORCID: 0009-0003-9170-0335

According to our database¹, Yuya Fujita authored at least 32 papers between 2012 and 2024.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Bibliography

2024

MC-Whisper: Extending Speech Foundation Models to Multichannel Distant Speech Recognition.

IEEE Signal Process. Lett., 2024

Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing.

Brian Yan

Xuankai Chang

Antonios Anastasopoulos

Yuya Fujita

Shinji Watanabe

Proceedings of the IEEE International Conference on Acoustics, 2024

Hubertopic: Enhancing Semantic Representation of Hubert Through Self-Supervision Utilizing Topic Model.

Proceedings of the IEEE International Conference on Acoustics, 2024

Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study.

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Align, Write, Re-Order: Explainable End-to-End Speech Translation via Operation Sequence Generation.

Proceedings of the IEEE International Conference on Acoustics, 2023

Fully Unsupervised Topic Clustering of Unlabelled Spoken Audio Using Self-Supervised Representation Learning and Topic Model.

Proceedings of the IEEE International Conference on Acoustics, 2023

LV-CTC: Non-Autoregressive ASR With CTC and Latent Variable Models.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022

Production planning method for seru production systems under demand uncertainty.

Comput. Ind. Eng., 2022

Attention Weight Smoothing Using Prior Distributions for Transformer-Based End-to-End ASR.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Non-Autoregressive End-To-End Automatic Speech Recognition Incorporating Downstream Natural Language Processing.

Proceedings of the IEEE International Conference on Acoustics, 2022

An Exploration of Hubert with Large Number of Cluster Units and Model Assessment Using Bayesian Information Criterion.

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

End-to-end ASR to jointly predict transcriptions and linguistic annotations.

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Streaming End-to-End ASR Based on Blockwise Non-Autoregressive Models.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021.

Alexander I. Rudnicky

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Toward Streaming ASR with Non-Autoregressive Insertion-Based Model.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020

Insertion-Based Modeling for End-to-End Automatic Speech Recognition.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

End-to-End ASR with Adaptive Span Self-Attention.

Xuankai Chang

Aswin Shanmugam Subramanian

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Attention-Based ASR with Lightweight and Dynamic Convolutions.

Yuya Fujita

Aswin Shanmugam Subramanian

Motoi Omachi

Shinji Watanabe

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Dry, Focus, and Transcribe: End-to-End Integration of Dereverberation, Beamforming, and ASR.

Aswin Shanmugam Subramanian

CoRR, 2019

Generalized Weighted-Prediction-Error Dereverberation with Varying Source Priors For Reverberant Speech Recognition.

Toru Taniguchi

Aswin Shanmugam Subramanian

Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Speech Enhancement Using End-to-End Speech Recognition Objectives.

Aswin Shanmugam Subramanian

Xiaofei Wang

Murali Karthick Baskar

Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

2018

PARHELIA: Particle Filter-Based Heart Rate Estimation From Photoplethysmographic Signals During Physical Exercise.

Yuya Fujita

Masayuki Hiromoto

Takashi Sato

IEEE Trans. Biomed. Eng., 2018

Speaker Selective Beamformer with Keyword Mask Estimation.

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Multi Scale Feedback Connection for Noise Robust Acoustic Modeling.

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Fast And Robust Heart Rate Estimation From Videos Through Dynamic Region Selection.

Yuya Fujita

Masayuki Hiromoto

Takashi Sato

Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

2016

Robust DNN-Based VAD Augmented with Phone Entropy Based Rejection of Background Speech.

Yuya Fujita

Ken-ichi Iso

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2013

Lightly supervised training for risk-based discriminative language models.

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Progressive language model adaptation for disaster broadcasting with closed-captions.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012

Speaker adaptation intensively weighted on mis-recognized speech segments.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
