Yusuke Fujita

ORCID: 0000-0003-4211-8237

According to our database, Yusuke Fujita authored at least 72 papers between 2003 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech Recognisers.
CoRR, 2024

2023
Audio Difference Learning for Audio Captioning.
CoRR, 2023

Self-Conditioning via Intermediate Predictions for End-to-End Neural Speaker Diarization.
IEEE Access, 2023

Neural Diarization with Non-Autoregressive Intermediate Attractors.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Encoder-Decoder Based Attractors for End-to-End Neural Diarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Image Correction Methods for Regions of Interest in Liver Cirrhosis Classification on CNNs.
Sensors, 2022

Multi-sequence Intermediate Conditioning for CTC-based ASR.
CoRR, 2022

Interdecoder: using Attention Decoders as Intermediate Regularization for CTC-Based Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Alternate Intermediate Conditioning with Syllable-Level and Character-Level Targets for Japanese ASR.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR.
Proceedings of the Interspeech 2022, 2022

Better Intermediates Improve CTC Inference.
Proceedings of the Interspeech 2022, 2022

Sound Event Localization and Detection with Pre-Trained Audio Spectrogram Transformer and Multichannel Separation Network.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

Effects of transfer learning for handwritten digit classification in a small training sample size situation.
Proceedings of the 5th Artificial Intelligence and Cloud Computing Conference, 2022

2021
Encoder-Decoder Based Attractor Calculation for End-to-End Neural Diarization.
CoRR, 2021

The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural Diarization and X-Vector Clustering Systems Combined by DOVER-Lap.
CoRR, 2021

Online End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers.
CoRR, 2021

Online End-To-End Neural Diarization with Speaker-Tracing Buffer.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Block-Online Guided Source Separation.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Online Streaming End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Semi-Supervised Training with Pseudo-Labeling for End-To-End Neural Diarization.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

End-To-End Speaker Diarization as Post-Processing.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Online End-to-End Neural Diarization with Speaker-Tracing Buffer.
CoRR, 2020

Neural Speaker Diarization with Speaker-Wise Chain Rule.
CoRR, 2020

End-to-End Neural Diarization: Reformulating Speaker Diarization as Simple Multi-label Classification.
CoRR, 2020

Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Effect of an Augmentation on CNNs in Classifying a Cirrhosis Liver on B-Mode Ultrasound Images.
Proceedings of the 2nd IEEE Global Conference on Life Sciences and Technologies, 2020

Utterance-Wise Meeting Transcription System Using Asynchronous Distributed Microphones.
Proceedings of the Interspeech 2020, 2020

End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors.
Proceedings of the Interspeech 2020, 2020

Speaker-Conditional Chain Model for Speech Separation and Extraction.
Proceedings of the Interspeech 2020, 2020

Speaker Diarization with Region Proposal Network.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Analysis of Robustness of Deep Single-Channel Speech Separation Using Corpora Constructed From Multiple Domains.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition.
Proceedings of the Interspeech 2019, 2019

Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR.
Proceedings of the Interspeech 2019, 2019

End-to-End Neural Speaker Diarization with Permutation-Free Objectives.
Proceedings of the Interspeech 2019, 2019

Acoustic Modeling for Overlapping Speech Recognition: JHU CHiME-5 Challenge System.
Proceedings of the IEEE International Conference on Acoustics, 2019

Acoustic Modeling for Distant Multi-talker Speech Recognition with Single- and Multi-channel Branches.
Proceedings of the IEEE International Conference on Acoustics, 2019

Simultaneous Speech Recognition and Speaker Diarization for Monaural Dialogue Recordings with Target-Speaker Acoustic Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

End-to-End Neural Speaker Diarization with Self-Attention.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Lattice-free State-level Minimum Bayes Risk Training of Acoustic Models.
Proceedings of the Interspeech 2018, 2018

Sequence Distillation for Purely Sequence Trained Acoustic Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

A Network-Based Event Detection Module Using NTP for Cyber Attacks on IoT.
Proceedings of the Sixth International Symposium on Computing and Networking, 2018

2017
An Automated Self-Learning Quantification System to Identify Visible Areas in Capsule Endoscopy Images.
J. Medical Syst., 2017

A method based on machine learning using hand-crafted features for crack detection from asphalt pavement surface images.
Proceedings of the Thirteenth International Conference on Quality Control by Artificial Vision, 2017

Training sample selection based on self-training for liver cirrhosis classification using ultrasound images.
Proceedings of the Thirteenth International Conference on Quality Control by Artificial Vision, 2017

Local Gaussian model with source-set constraints in audio source separation.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

Independent vector analysis with frequency range division and prior switching.
Proceedings of the 25th European Signal Processing Conference, 2017

Investigation of lattice-free maximum mutual information-based acoustic models with sequence-level Kullback-Leibler divergence.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Development of a Gastric Cancer Diagnostic Support System with a Pattern Recognition Method Using a Hyperspectral Camera.
J. Sensors, 2016

Solving permutation problem with a cascade combination of phase difference entropy and power spectral correlation.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

Data Augmentation Using Multi-Input Multi-Output Source Separation for Deep Neural Network Based Acoustic Modeling.
Proceedings of the Interspeech 2016, 2016

Training ROI Selection Based on MILBoost for Liver Cirrhosis Classification Using Ultrasound Images.
Proceedings of the Trends in Applied Knowledge-Based Systems and Data Science, 2016

2015
Novel Architecture for Cellular Neural Network Suitable for High-Density Integration of Electron Devices-Learning of Multiple Logics.
Proceedings of the Neural Information Processing - 22nd International Conference, 2015

Transmission characteristics through the bent arm wearing magnetically-coupled coils for body area communication.
Proceedings of the IEEE 4th Global Conference on Consumer Electronics, 2015

Unified ASR system using LGM-based source separation, noise-robust feature extraction, and word hypothesis selection.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
A Method of Bubble Removal for Computer-Assisted Diagnosis of Capsule Endoscopic Images.
Proceedings of the Modern Advances in Applied Intelligence, 2014

Comparative Study of Classifiers for Prediction of Recurrence of Liver Cancer Using Binary Patterns.
Proceedings of the Modern Advances in Applied Intelligence, 2014

2013
A note of liver cirrhosis classification on M-mode ultrasound images by higher-order local auto-correlation features.
Proceedings of the 2013 International Conference on Soft Computing and Pattern Recognition, 2013

Classification Based on Boolean Algebra and Its Application to the Prediction of Recurrence of Liver Cancer.
Proceedings of the 2nd IAPR Asian Conference on Pattern Recognition, 2013

2012
The Use of a Local Histogram Feature Vector of Classifying Diffuse Lung Opacities in High-Resolution Computed Tomography.
Proceedings of the Advanced Research in Applied Artificial Intelligence, 2012

2011
A robust automatic crack detection method from noisy concrete surfaces.
Mach. Vis. Appl., 2011

2010
An Improved Method for Cirrhosis Detection Using Liver's Ultrasound Images.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

2009
A Robust Method for Automatically Detecting Cracks on Noisy Concrete Surfaces.
Proceedings of the Next-Generation Applied Intelligence, 2009

2008
Visualization of transitions of developing of hepatitis C virus-associated hepatocellular carcinoma.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Using a sentiment map for visualizing credibility of news sites on the web.
Proceedings of the 2nd ACM Workshop on Information Credibility on the Web, 2008

2006
A Method for Crack Detection on a Concrete Structure.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Frequency domain simultaneous equations method for active noise control systems.
Proceedings of the 14th European Signal Processing Conference, 2006

2005
Estimation of Physically and Physiologically Valid Somatosensory Information.
Proceedings of the 2005 IEEE International Conference on Robotics and Automation, 2005

2004
A Study on Nonparametric Classifiers for a CAD System of Diffuse Lung Opacities in Thin-Section Computed Tomography Images.
Proceedings of the Knowledge-Based Intelligent Information and Engineering Systems, 2004

Computing a Set of Local Optimal Paths through Cluttered Environments and over Open Terrain.
Proceedings of the 2004 IEEE International Conference on Robotics and Automation, 2004

CELP-based speaker verification: an evaluation under noisy conditions.
Proceedings of the 8th International Conference on Control, 2004

2003
Dual Dijkstra search for paths with different topologies.
Proceedings of the 2003 IEEE International Conference on Robotics and Automation, 2003

