Kazuki Shimada

Irán R. Román

CoRR, July, 2025

DCASE2025 Task3 Stereo SELD Dataset.

Irán R. Román

Dataset, June, 2025

Combining Deterministic Enhanced Conditions with Dual-Streaming Encoding for Diffusion-Based Speech Enhancement.

CoRR, May, 2025

DCASE2025 Task3 Stereo SELD Dataset.

Irán R. Román

Marco A. Martínez Ramírez

Dataset, April, 2025

Music Foundation Model as Generic Booster for Music Downstream Tasks.

Trans. Mach. Learn. Res., 2025

CCStereo: Audio-Visual Contextual and Contrastive Learning for Binaural Audio Generation.

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

StereoSync: Spatially-Aware Stereo Audio Generation from Video.

Christian Marinoni

Riccardo F. Gramaccioni

Proceedings of the International Joint Conference on Neural Networks, 2025

2024

HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes.

Trans. Mach. Learn. Res., 2024

SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation.

CoRR, 2024

Zero- and Few-Shot Sound Event Localization and Detection.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

On-Chip ESD Current Sensor for Nanosecond Oscillation Waveform Over Ampere Detecting.

Mototsugu Okushima

Proceedings of the 14th International Workshop on the Electromagnetic Compatibility of Integrated Circuits, 2024

2023

STARSS23: Sony-TAu Realistic Spatial Soundscapes 2023.

Aapo Hakala

Shusuke Takahashi

Dataset, March, 2023

Diffusion-based Signal Refiner for Speech Separation.

CoRR, 2023

Extending Audio Masked Autoencoders toward Audio Restoration.

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events.

Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

An Attention-Based Approach to Hierarchical Multi-Label Music Instrument Classification.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

2022

STARSS22: Sony-TAu Realistic Spatial Soundscapes 2022 dataset.

Yuki Mitsufuji

Sharath Adavanne

Yuichiro Koyama

Naoya Takahashi

Shusuke Takahashi

Tuomas Virtanen

Dataset, May, 2022

STARSS22: Sony-TAu Realistic Spatial Soundscapes 2022 dataset.

Archontis Politis

Yuki Mitsufuji

Dataset, March, 2022

Video Generation Unconsciously Evoking Pre-Motion to Passengers in Automated Vehicles.

Proceedings of the 2022 IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct), 2022

Multi-ACCDOA: Localizing And Detecting Overlapping Sounds From The Same Class With Auxiliary Duplicating Permutation Invariant Training.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

Spatial Mixup: Directional Loudness Modification as Data Augmentation for Sound Event Localization and Detection.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

STARSS22: A Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events.

Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021

Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection.

CoRR, 2021

ACCDOA: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization and Detection.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

2020

Sound Event Localization and Detection Using Activity-Coupled Cartesian DOA Vector and RD3Net.

CoRR, 2020

Metric Learning with Background Noise Class for Few-Shot Detection of Rare Sound Events.
