Kazuki Shimada

Orcid: 0000-0001-5389-2346

According to our database1, Kazuki Shimada authored at least 19 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes.
CoRR, 2024

2023
Zero- and Few-shot Sound Event Localization and Detection.
CoRR, 2023

Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders.
CoRR, 2023

Diffusion-based Signal Refiner for Speech Separation.
CoRR, 2023

Extending Audio Masked Autoencoders toward Audio Restoration.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

An Attention-Based Approach to Hierarchical Multi-Label Music Instrument Classification.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Video Generation Unconsciously Evoking Pre-Motion to Passengers in Automated Vehicles.
Proceedings of the 2022 IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct), 2022

Multi-ACCDOA: Localizing And Detecting Overlapping Sounds From The Same Class With Auxiliary Duplicating Permutation Invariant Training.
Proceedings of the IEEE International Conference on Acoustics, 2022

Spatial Mixup: Directional Loudness Modification as Data Augmentation for Sound Event Localization and Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022

Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022

STARSS22: A Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021
Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection.
CoRR, 2021

Accdoa: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization And Detection.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Sound Event Localization and Detection Using Activity-Coupled Cartesian DOA Vector and RD3net.
CoRR, 2020

Metric Learning with Background Noise Class for Few-Shot Detection of Rare Sound Events.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

2018
Unsupervised Beamforming Based on Multichannel Nonnegative Matrix Factorization for Noisy Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Combined Multi-Channel NMF-Based Robust Beamforming for Noisy Speech Recognition.
Proceedings of the Interspeech 2017, 2017


  Loading...