Yuma Koizumi

Orcid: 0000-0003-3645-6213

According to our database1, Yuma Koizumi authored at least 60 papers between 2014 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus.
CoRR, 2023

Description and Discussion on DCASE 2023 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring.
CoRR, 2023

Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

2022
Description and Discussion on DCASE 2022 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Techniques.
CoRR, 2022

Mask scalar prediction for improving robust automatic speech recognition.
CoRR, 2022

Learning Mask Scalars for Improved Robust Automatic Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Wavefit: an Iterative and Non-Autoregressive Neural Vocoder Based on Fixed-Point Iteration.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping.
Proceedings of the Interspeech 2022, 2022

SNRi Target Training for Joint Speech Enhancement and Recognition.
Proceedings of the Interspeech 2022, 2022

Description and Discussion on DCASE 2022 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Techniques.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021
Deep Griffin-Lim Iteration: Trainable Iterative Phase Reconstruction Using Neural Network.
IEEE J. Sel. Top. Signal Process., 2021

Description and Discussion on DCASE 2021 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring under Domain Shifted Conditions.
CoRR, 2021

DF-Conformer: Integrated Architecture of Conv-Tasnet and Conformer Using Linear Complexity Self-Attention for Speech Enhancement.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method.
Proceedings of the 29th European Signal Processing Conference, 2021

Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech.
Proceedings of the 29th European Signal Processing Conference, 2021

Description and Discussion on DCASE 2021 Challenge Task 2: Unsupervised Anomalous Detection for Machine Condition Monitoring Under Domain Shifted Conditions.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

2020
Audio Captioning using Pre-Trained Large-Scale Language Model Guided by Audio-based Similar Caption Retrieval.
CoRR, 2020

The NTT DCASE2020 Challenge Task 6 system: Automated Audio Captioning with Keywords and Sentence Length Estimation.
CoRR, 2020

Crossmodal Sound Retrieval Based on Specific Target Co-Occurrence Denoted with Weak Labels.
Proceedings of the Interspeech 2020, 2020

Listen to What You Want: Neural Network-Based Universal Sound Selector.
Proceedings of the Interspeech 2020, 2020

A Transformer-Based Audio Captioning Model with Keyword Estimation.
Proceedings of the Interspeech 2020, 2020

Sound Event Localization Based on Sound Intensity Vector Refined by Dnn-Based Denoising and Source Separation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Invertible DNN-Based Nonlinear Time-Frequency Transform for Speech Enhancement.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Real-Time Speech Enhancement Using Equilibriated RNN.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Phase Reconstruction Based On Recurrent Phase Unwrapping With Deep Neural Networks.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

SPIDERnet: Attention Network For One-Shot Anomaly Detection In Sounds.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Speech Enhancement Using Self-Adaptation and Multi-Head Self-Attention.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Stable Training of Dnn for Speech Enhancement Based on Perceptually-Motivated Black-Box Cost Function.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Sound Event Detection by Multitask Learning of Sound Events and Scenes with Soft Scene Labels.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Effects of Word-Frequency Based Pre- and Post- Processings for Audio Captioning.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

Description and Discussion on DCASE2020 Challenge Task2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

2019
Unsupervised Detection of Anomalous Sound Based on Deep Learning and the Neyman-Pearson Lemma.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

DOA Estimation by DNN-based Denoising and Dereverberation from Sound Intensity Vector.
CoRR, 2019

Batch Uniformization for Minimizing Maximum Anomaly Score of Dnn-Based Anomaly Detection in Sounds.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

ToyADMOS: A Dataset of Miniature-Machine Operating Sounds for Anomalous Sound Detection.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Finding Low-Dimensional Dynamical Structure Through Variational Auto-Encoding Dynamic Mode Decomposition.
Proceedings of the 29th IEEE International Workshop on Machine Learning for Signal Processing, 2019

AdaFlow: Domain-adaptive Density Estimator with Application to Anomaly Detection and Unpaired Cross-domain Translation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Data-driven Design of Perfect Reconstruction Filterbank for DNN-based Sound Source Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2019

Deep Griffin-Lim Iteration.
Proceedings of the IEEE International Conference on Acoustics, 2019

SNIPER: Few-shot Learning for Anomaly Detection to Minimize False-negative Rate with Ensured True-positive Rate.
Proceedings of the IEEE International Conference on Acoustics, 2019

Trainable Adaptive Window Switching for Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2019

A Two-class Hyper-spherical Autoencoder for Supervised Anomaly Detection.
Proceedings of the IEEE International Conference on Acoustics, 2019

Context-Aware Neural Voice Activity Detection Using Auxiliary Networks for Phoneme Recognition, Speech Enhancement and Acoustic Scene Classification.
Proceedings of the 27th European Signal Processing Conference, 2019

First Order Ambisonics Domain Spatial Augmentation for DNN-based Direction of Arrival Estimation.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

2018
DNN-Based Source Enhancement to Increase Objective Sound Quality Assessment Score.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

DNN-Based Near- and Far-Field Source Separation Using Spherical-Harmonic-Analysis-Based Acoustic Features.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

End-to-End Sound Source Enhancement Using Deep Neural Network in the Modified Discrete Cosine Transform Domain.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Complementary Set Variational Autoencoder for Supervised Anomaly Detection.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Distant Noise Reduction Based on Multi-delay Noise Model Using Distributed Microphone Array.
Proceedings of the 26th European Signal Processing Conference, 2018

2017
Informative Acoustic Feature Selection to Maximize Mutual Information for Collecting Target Sources.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

On relationships between amplitude and phase of short-time Fourier transform.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Supervised source enhancement composed of nonnegative auto-encoders and complementarity subtraction.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

DNN-based source enhancement self-optimized by reinforcement learning using sound quality measurements.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Optimizing acoustic feature extractor for anomalous sound detection based on Neyman-Pearson lemma.
Proceedings of the 25th European Signal Processing Conference, 2017

2016
Binaural sound generation corresponding to omnidirectional video view using angular region-wise source enhancement.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Pinpoint extraction of distant sound source based on DNN mapping from multiple beamforming outputs to prior SNR.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Integrated approach of feature extraction and sound source enhancement based on maximization of mutual information.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Informative acoustic feature selection on microphone array wiener filtering for collecting target source on sports ground.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

Effective approach to character input for novice BCI users.
Proceedings of the 10th Asia-Pacific Symposium on Information and Telecommunication Technologies, 2015

2014
Intra-note segmentation via sticky HMM with DP emission.
Proceedings of the IEEE International Conference on Acoustics, 2014


  Loading...