Kazuyoshi Yoshii

Proceedings of the 27th International Conference on Multimodal Interaction, 2025

SHAMaNS: Sound Localization with Hybrid Alpha-Stable Spatial Measure and Neural Steerer.

[BibT_eX]

[DOI]

Proceedings of the 33rd European Signal Processing Conference, 2025

TAPA-ICL: Taxonomy-Aware Prompt Augmentation for in-Context Learning in Music Understanding.

[BibT_eX]

[DOI]

Jiahao Zhao

Yunjia Li

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

Efficient Transformer-Based Piano Transcription with Sparse Attention Mechanisms.

[BibT_eX]

[DOI]

Weixing Wei

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

Narrativity-Aware Video Summarization Based on Vision and Language Foundation Models.

[BibT_eX]

[DOI]

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

Joint Separation and Tracking of Moving Sources with Distributed Microphone Arrays Based on Time-Varying Inertial Spatial Models.

[BibT_eX]

[DOI]

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

Visually-Informed Multichannel Sound Source Separation Based on 3D Gaussian Primitives.

[BibT_eX]

[DOI]

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

2024

Joint Audio Source Localization and Separation with Distributed Microphone Arrays Based on Spatially-Regularized Multichannel NMF.

[BibT_eX]

[DOI]

Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024

Streaming Piano Transcription Based on Consistent Onset and Offset Decoding With Sustain Pedal Detection.

[BibT_eX]

[DOI]

Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024

Learning Multifaceted Self-Similarity Over Time and Frequency for Music Structure Analysis.

[BibT_eX]

[DOI]

Tsung-Ping Chen

Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024

RIR-in-a-Box: Estimating Room Acoustics from 3D Mesh Data through Shoebox Approximation.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Neural Steerer: Novel Steering Vector Synthesis with a Causal Neural Field over Frequency and Direction.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

On the Importance of Time and Pitch Relativity for Transformer-Based Symbolic Music Generation.

[BibT_eX]

[DOI]

Tatsuro Inaba

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2024

Run-Time Adaptation of Neural Beamforming for Robust Speech Dereverberation and Denoising.

[BibT_eX]

[DOI]

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2024

2023

Neural Steerer: Novel Steering Vector Synthesis with a Causal Neural Field over Frequency and Source Positions.

[BibT_eX]

[DOI]

CoRR, 2023

Time-Domain Audio Source Separation Based on Gaussian Processes with Deep Kernel Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Neural Band-to-Piano Score Arrangement with Stepless Difficulty Control.

[BibT_eX]

[DOI]

Moyu Terao

Proceedings of the IEEE International Conference on Acoustics, 2023

Neural Fast Full-Rank Spatial Covariance Analysis for Blind Source Separation.

[BibT_eX]

[DOI]

Proceedings of the 31st European Signal Processing Conference, 2023

Multimodal Multifaceted Music Emotion Recognition Based on Self-Attentive Fusion of Psychology-Inspired Symbolic and Acoustic Features.

[BibT_eX]

[DOI]

Jiahao Zhao

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

CTC2: End-to-End Drum Transcription Based on Connectionist Temporal Classification With Constant Tempo Constraint.

[BibT_eX]

[DOI]

Daichi Kamakura

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Joint Drum Transcription and Metrical Analysis Based on Periodicity-Aware Multi-Task Learning.

[BibT_eX]

[DOI]

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

DOA-Aware Audio-Visual Self-Supervised Learning for Sound Event Localization and Detection.

[BibT_eX]

[DOI]

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Audio-to-Score Singing Transcription Based on Joint Estimation of Pitches, Onsets, and Metrical Positions With Tatum-Level CTC Loss.

[BibT_eX]

[DOI]

Tengyu Deng

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Learning Multifaceted Self-Similarity for Musical Structure Analysis.

[BibT_eX]

[DOI]

Tsung-Ping Chen

Li Su

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022

Autoregressive Moving Average Jointly-Diagonalizable Spatial Covariance Analysis for Joint Source Separation and Dereverberation.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2022

Generalized Fast Multichannel Nonnegative Matrix Factorization Based on Gaussian Scale Mixtures for Blind Source Separation.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2022

Computationally-Efficient Overdetermined Blind Source Separation Based on Iterative Source Steering.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2022

Joint Localization and Synchronization of Distributed Camera-Attached Microphone Arrays for Indoor Scene Analysis.

[BibT_eX]

[DOI]

Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022

DNN-free Low-Latency Adaptive Speech Enhancement Based on Frame-Online Beamforming Powered by Block-Online Fastmnmf.

[BibT_eX]

[DOI]

Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022

Tracking the Evolution of a Band's Live Performances over Decades.

[BibT_eX]

[DOI]

Florian Thalmann

Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

End-to-End Lyrics Transcription Informed by Pitch and Onset Estimation.

[BibT_eX]

[DOI]

Tengyu Deng

Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Direction-Aware Adaptive Online Neural Speech Enhancement with an Augmented Reality Headset in Real Noisy Conversational Environments.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational Environments.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Difficulty-Aware Neural Band-to-Piano Score Arrangement based on Note- and Statistic-Level Criteria.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Flow-Based Fast Multichannel Nonnegative Matrix Factorization for Blind Source Separation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Elliptically Contoured Alpha-Stable Representation for MUSIC-Based Sound Source Localization.

[BibT_eX]

[DOI]

Proceedings of the 30th European Signal Processing Conference, 2022

2021

Neural Full-Rank Spatial Covariance Analysis for Blind Source Separation.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2021

MirrorNet: A Deep Reflective Approach to 2D Pose Estimation for Single-Person Images.

[BibT_eX]

[DOI]

J. Inf. Process., 2021

Non-local musical statistics as guides for audio-to-score piano transcription.

[BibT_eX]

[DOI]

Kentaro Shibata

Inf. Sci., 2021

Musical rhythm transcription based on Bayesian piece-specific score models capturing repetitions.

[BibT_eX]

[DOI]

Inf. Sci., 2021

Global Structure-Aware Drum Transcription Based on Self-Attention Mechanisms.

[BibT_eX]

[DOI]

Ryoto Ishizuka

Ryo Nishikimi

CoRR, 2021

A Real-Time Drum-Wise Volume Visualization System for Learning Volume-Balanced Drum Performance.

[BibT_eX]

[DOI]

Proceedings of the Entertainment Computing - ICEC 2021, 2021

Phase-Aware Joint Beat and Downbeat Estimation Based on Periodicity of Metrical Structure.

[BibT_eX]

[DOI]

Takehisa Oyama

Ryoto Ishizuka

Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Joint Estimation of Note Values and Voices for Audio-to-Score Piano Transcription.

[BibT_eX]

[DOI]

Yuki Hiramatsu

Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Alpha-Stable Autoregressive Fast Multichannel Nonnegative Matrix Factorization for Joint Speech Enhancement and Dereverberation.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Pitch-Timbre Disentanglement Of Musical Instrument Sounds Based On Vae-Based Metric Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Autoregressive Fast Multichannel Nonnegative Matrix Factorization For Joint Blind Source Separation And Dereverberation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Statistical Correction of Transcribed Melody Notes Based on Probabilistic Integration of a Music Language Model and a Transcription Error Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Gamma Process FastMNMF for Separating an Unknown Number of Sound Sources.

[BibT_eX]

[DOI]

Yoshiaki Bando

Proceedings of the 29th European Signal Processing Conference, 2021

2020

Semi-Supervised Neural Chord Estimation Based on a Variational Autoencoder With Latent Chord Labels and Features.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Bayesian Melody Harmonization Based on a Tree-Structured Generative Model of Chord Sequences and Melodies.

[BibT_eX]

[DOI]

Hiroaki Tsushima

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Fast Multichannel Nonnegative Matrix Factorization With Directivity-Aware Jointly-Diagonalizable Spatial Covariance Matrices for Blind Source Separation.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

A Flow-Based Deep Latent Variable Model for Speech Spectrogram Modeling and Enhancement.

[BibT_eX]

[DOI]

Aditya Arie Nugraha

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Bayesian Singing Transcription Based on a Hierarchical Generative Model of Keys, Musical Notes, and F0 Trajectories.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Flow-Based Independent Vector Analysis for Blind Source Separation.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2020

Statistical learning and estimation of piano fingering.

[BibT_eX]

[DOI]

Yasuyuki Saito

Inf. Sci., 2020

Semi-supervised Neural Chord Estimation Based on a Variational Autoencoder with Discrete Labels and Continuous Textures of Chords.

[BibT_eX]

[DOI]

CoRR, 2020

MirrorNet: A Deep Bayesian Approach to Reflective 2D Pose Estimation from Human Images.

[BibT_eX]

[DOI]

CoRR, 2020

A Method for Analysis of Shared Structure in Large Music Collections using Techniques from Genetic Sequencing and Graph Theory.

[BibT_eX]

[DOI]

Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

Multi-Instrument Music Transcription Based on Deep Spherical Clustering of Spectrograms and Pitchgrams.

[BibT_eX]

[DOI]

Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

Music Structure Analysis Based on an LSTM-HSMM Hybrid Model.

[BibT_eX]

[DOI]

Go Shibata

Ryo Nishikimi

Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

The MIDI Degradation Toolkit: Symbolic Music Augmentation and Correction.

[BibT_eX]

[DOI]

Andrew McLeod

James Owers

Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

Adaptive Neural Speech Enhancement with a Denoising Variational Autoencoder.

[BibT_eX]

[DOI]

Yoshiaki Bando

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Unsupervised Robust Speech Enhancement Based on Alpha-Stable Fast Multichannel Nonnegative Matrix Factorization.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Fast Multichannel Correlated Tensor Factorization for Blind Source Separation.

[BibT_eX]

[DOI]

Proceedings of the 28th European Signal Processing Conference, 2020

Semi-supervised Multichannel Speech Separation Based on a Phone- and Speaker-Aware Deep Generative Model of Speech Spectrograms.

[BibT_eX]

[DOI]

Proceedings of the 28th European Signal Processing Conference, 2020

A Variational Autoencoder for Joint Chord and Key Estimation from Audio Chromagrams.

[BibT_eX]

[DOI]

Yiming Wu

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

End-to-end Music-mixed Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Integration of Semi-Blind Speech Source Separation and Voice Activity Detection for Flexible Spoken Dialogue.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Computer-Resource-Aware Deep Speech Separation with a Run-Time-Specified Number of BLSTM Layers.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Tatum-Level Drum Transcription Based on a Convolutional Recurrent Neural Network with Language Model-Based Regularized Training.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019

Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Semi-Supervised Multichannel Speech Enhancement With a Deep Speech Prior.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Multi-Step Chord Sequence Prediction Based on Aggregated Multi-Scale Encoder-Decoder Network.

[BibT_eX]

[DOI]

CoRR, 2019

Music Transcription Based on Bayesian Piece-Specific Score Models Capturing Repetitions.

[BibT_eX]

[DOI]

CoRR, 2019

End-To-End Melody Note Transcription Based on a Beat-Synchronous Attention Mechanism.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Joint Singing Pitch Estimation and Voice Separation Based on a Neural Harmonic Structure Renderer.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Audio-Visual SLAM towards Human Tracking and Human-Robot Interaction in Indoor Environments.

[BibT_eX]

[DOI]

Proceedings of the 28th IEEE International Conference on Robot and Human Interactive Communication, 2019

Multi-Step Chord Sequence Prediction Based On Aggregated Multi-Scale Encoder-Decoder Networks.

[BibT_eX]

[DOI]

Proceedings of the 29th IEEE International Workshop on Machine Learning for Signal Processing, 2019

Deep Bayesian Unsupervised Source Separation Based On A Complex Gaussian Mixture Model.

[BibT_eX]

[DOI]

Yoshiaki Bando

Yoko Sasaki

Proceedings of the 29th IEEE International Workshop on Machine Learning for Signal Processing, 2019

Blending Acoustic and Language Model Predictions for Automatic Music Transcription.

[BibT_eX]

[DOI]

Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Statistical Music Structure Analysis Based on a Homogeneity-, Repetitiveness-, and Regularity-Aware Hierarchical Hidden Semi-Markov Model.

[BibT_eX]

[DOI]

Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Bayesian Drum Transcription Based on Nonnegative Matrix Factor Decomposition with a Deep Score Prior.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Joint Transcription of Lead, Bass, and Rhythm Guitars Based on a Factorial Hidden Semi-Markov Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

A Deep Generative Model of Speech Complex Spectrograms.

[BibT_eX]

[DOI]

Aditya Arie Nugraha

Proceedings of the IEEE International Conference on Acoustics, 2019

Automatic Singing Transcription Based on Encoder-decoder Recurrent Neural Networks with a Weakly-supervised Attention Mechanism.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Unsupervised Melody Style Conversion.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Improved Metrical Alignment of Midi Performance Based on a Repetition-aware Online-adapted Grammar.

[BibT_eX]

[DOI]

Andrew McLeod

Proceedings of the IEEE International Conference on Acoustics, 2019

Automatic Chord Estimation Based on a Frame-wise Convolutional Recurrent Neural Network with Non-Aligned Annotations.

[BibT_eX]

[DOI]

Yiming Wu

Tristan Carsault

Proceedings of the 27th European Signal Processing Conference, 2019

Fast Multichannel Source Separation Based on Jointly Diagonalizable Spatial Covariance Matrices.

[BibT_eX]

[DOI]

Proceedings of the 27th European Signal Processing Conference, 2019

Cauchy Multichannel Speech Enhancement with a Deep Speech Prior.

[BibT_eX]

[DOI]

Proceedings of the 27th European Signal Processing Conference, 2019

2018

Bayesian Multichannel Audio Source Separation Based on Integrated Source and Spatial Models.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2018

Speech Enhancement Based on Bayesian Low-Rank and Sparse Decomposition of Multichannel Magnitude Spectrograms.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2018

Statistical Piano Reduction Controlling Performance Difficulty.

[BibT_eX]

[DOI]

CoRR, 2018

Interactive Arrangement of Chords and Melodies Based on a Tree-Structured Generative Model.

[BibT_eX]

[DOI]

Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

Correlated Tensor Factorization for Audio Source Separation.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Unsupervised Beamforming Based on Multichannel Nonnegative Matrix Factorization for Noisy Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Towards Complete Polyphonic Music Transcription: Integrating Multi-Pitch Detection and Rhythm Quantization.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

An End-to-End Approach to Joint Social Signal Detection and Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Independent Low-Rank Tensor Analysis for Audio Source Separation.

[BibT_eX]

[DOI]

Proceedings of the 26th European Signal Processing Conference, 2018

Sequential Generation of Singing F0 Contours from Musical Note Sequences Based on WaveNet.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Bayesian Multichannel Speech Enhancement with a Deep Speech Prior.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Probabilistic Sequential Patterns for Singing Transcription.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017

Rhythm Transcription of Polyphonic Piano Music Based on Merged-Output HMM for Multiple Voices.

[BibT_eX]

[DOI]

Shigeki Sagayama

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Note Value Recognition for Piano Transcription Using Markov Random Fields.

[BibT_eX]

[DOI]

Simon Dixon

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Simultaneous Identification and Localization of Still and Mobile Speakers Based on Binaural Robot Audition.

[BibT_eX]

[DOI]

Karim Youssef

J. Robotics Mechatronics, 2017

Layout Optimization of Cooperative Distributed Microphone Arrays Based on Estimation of Source Separation Performance.

[BibT_eX]

[DOI]

J. Robotics Mechatronics, 2017

Audio-Visual Beat Tracking Based on a State-Space Model for a Robot Dancer Performing with a Human Dancer.

[BibT_eX]

[DOI]

J. Robotics Mechatronics, 2017

Low Latency and High Quality Two-Stage Human-Voice-Enhancement System for a Hose-Shaped Rescue Robot.

[BibT_eX]

[DOI]

J. Robotics Mechatronics, 2017

Generative Statistical Models with Self-Emergent Grammar of Chord Sequences.

[BibT_eX]

[DOI]

CoRR, 2017

Note Value Recognition for Rhythm Transcription Using a Markov Random Field Model for Musical Scores and Performances of Piano Music.

[BibT_eX]

[DOI]

Simon Dixon

CoRR, 2017

Infinite probabilistic latent component analysis for audio source separation.

[BibT_eX]

[DOI]

Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

Semi-Blind speech enhancement basedon recurrent neural network for source separation and dereverberation.

[BibT_eX]

[DOI]

Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

A diagonal plus low-rank covariance model for computationally efficient source separation.

[BibT_eX]

[DOI]

Antoine Liutkus

Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

Function- and Rhythm-Aware Melody Harmonization Based on Tree-Structured Parsing and Split-Merge Sampling of Chord Sequences.

[BibT_eX]

[DOI]

Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Scale- and Rhythm-Aware Musical Note Estimation for Vocal F0 Trajectories Based on a Semi-Tatum-Synchronous Hierarchical Hidden Semi-Markov Model.

[BibT_eX]

[DOI]

Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Performance Error Detection and Post-Processing for Fast and Accurate Symbolic Music Alignment.

[BibT_eX]

[DOI]

Haruhiro Katayose

Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Combined Multi-Channel NMF-Based Robust Beamforming for Noisy Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Bayesian multichannel nonnegative matrix factorization for audio source separation and localization.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016

Singing Voice Separation and Vocal F0 Estimation Based on Mutual Combination of Robust Principal Component Analysis and Subharmonic Summation.

[BibT_eX]

[DOI]

Yukara Ikemiya

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Musical Similarity and Commonness Estimation Based on Probabilistic Generative Models of Musical Elements.

[BibT_eX]

[DOI]

Int. J. Semantic Comput., 2016

Sound-based online localization for an in-pipe snake robot.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Symposium on Safety, 2016

Student's t multichannel nonnegative matrix factorization for blind source separation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

A Hierarchical Bayesian Model of Chords, Pitches, and Spectrograms for Multipitch Analysis.

[BibT_eX]

[DOI]

Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

Musical Note Estimation for F0 Trajectories of Singing Voices Based on a Bayesian Semi-Beat-Synchronous HMM.

[BibT_eX]

[DOI]

Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

Musical Typicality: How Many Similar Songs Exist?.

[BibT_eX]

[DOI]

Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

Online simultaneous localization and mapping of multiple sound sources and asynchronous microphone arrays.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Student's T nonnegative matrix factorization and positive semidefinite tensor factorization for single-channel audio source separation.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Tree-structured probabilistic model of monophonic written music based on the generative theory of tonal music.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Rhythm transcription of MIDI performances based on hierarchical Bayesian modelling of repetition and modification of musical note patterns.

[BibT_eX]

[DOI]

Proceedings of the 24th European Signal Processing Conference, 2016

A unified Bayesian model of time-frequency clustering and low-rank approximation for multi-channel source separation.

[BibT_eX]

[DOI]

Proceedings of the 24th European Signal Processing Conference, 2016

Variational Bayesian multi-channel robust NMF for human-voice enhancement with a deformable and partially-occluded microphone array.

[BibT_eX]

[DOI]

Proceedings of the 24th European Signal Processing Conference, 2016

2015

Toward a quizmaster robot for speech-based multiparty interaction.

[BibT_eX]

[DOI]

Adv. Robotics, 2015

Unified inter- and intra-recording duration model for multiple music audio alignment.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

Human-voice enhancement based on online RPCA for a hose-shaped rescue robot with a microphone array.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Symposium on Safety, 2015

Identification and Localization of One or Two Concurrent Speakers in a Binaural Robotic Context.

[BibT_eX]

[DOI]

Karim Youssef

Proceedings of the 2015 IEEE International Conference on Systems, 2015

Infinite Superimposed Discrete All-Pole Modeling for Multipitch Analysis of Wavelet Spectrograms.

[BibT_eX]

[DOI]

Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015

Musical Similarity and Commonness Estimation Based on Probabilistic Generative Models.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015

Songle Widget: Making Animation and Physical Devices Synchronized with Music Videos on the Web.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015

Optimizing the layout of multiple mobile robots for cooperative sound source separation.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Audio-visual beat tracking based on a state-space model for a music robot dancing with humans.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Microphone-accelerometer based 3D posture estimation for a hose-shaped rescue robot.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Bayesian integration of sound source separation and speech recognition: a new approach to simultaneous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

A feedback framework for improved chord recognition based on NMF-based approximate note transcription.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Singing voice analysis and editing based on mutually dependent F0 estimation and source separation.

[BibT_eX]

[DOI]

Yukara Ikemiya

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Challenges in deploying a microphone array to localize and separate sound sources in real auditory scenes.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Recognition of In-Field Frog Chorusing Using Bayesian Nonparametric Microphone Array Processing.

[BibT_eX]

[DOI]

Hiroshi Gitchang Okuno

Proceedings of the Computational Sustainability, 2015

2014

Nonparametric Bayesian dereverberation of power spectrograms based on infinite-order autoregressive processes.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2014

AutoMashUpper: automatic creation of multi-song music mashups.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2014

A sound-based online method for estimating the time-varying posture of a hose-shaped robot.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Symposium on Safety, 2014

LyricsRadar: A Lyrics Retrieval System Based on Latent Topics of Lyrics.

[BibT_eX]

[DOI]

Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Spotting a Query Phrase from Polyphonic Music Audio Signals Based on Semi-supervised Nonnegative Matrix Factorization.

[BibT_eX]

[DOI]

Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Bayesian Audio Alignment based on a Unified Model of Music Composition and Performance.

[BibT_eX]

[DOI]

Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Cultivating vocal activity detection for music audio signals in a circulation-type crowdsourcing ecosystem.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Vocal timbre analysis using latent Dirichlet allocation and cross-gender vocal timbre similarity.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Timbre replacement of harmonic and drum components for music audio signals.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

A robot quizmaster that can localize, separate, and recognize simultaneous utterances for a fastest-voice-first quiz game.

[BibT_eX]

[DOI]

Proceedings of the 14th IEEE-RAS International Conference on Humanoid Robots, 2014

2013

A nested infinite Gaussian mixture model for identifying known and unknown audio events.

[BibT_eX]

[DOI]

Yoko Sasaki

Satoshi Kagami

Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013

Beyond NMF: Time-Domain Audio Source Separation without Phase Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Transfer Learning In Mir: Sharing Learned Latent Representations For Music Audio Classification And Similarity.

[BibT_eX]

[DOI]

Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Chord-Sequence-Factory: A Chord Arrangement System Modifying Factorized Chord Sequence Probabilities.

[BibT_eX]

[DOI]

Satoru Fukayama

Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

AutoMashUpper: An Automatic Multi-Song Mashup System.

[BibT_eX]

[DOI]

Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Nested iGMM recognition and multiple hypothesis tracking of moving sound sources for mobile robot audition.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2013

Infinite Positive Semidefinite Tensor Factorization for Source Separation of Mixture Signals.

[BibT_eX]

[DOI]

Proceedings of the 30th International Conference on Machine Learning, 2013

Infinite kernel linear prediction for joint estimation of spectral envelope and fundamental frequency.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

A Nonparametric Bayesian Multipitch Analyzer Based on Infinite Latent Harmonic Allocation.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2012

PodCastle and Songle: Crowdsourcing-Based Web Services for Retrieval and Browsing of Speech and Music Content.

[BibT_eX]

[DOI]

Proceedings of the First International Workshop on Crowdsourcing Web Search, 2012

PodCastle and songle: crowdsourcing-based web services for spoken content retrieval and active music listening.

[BibT_eX]

[DOI]

Proceedings of the ACM multimedia 2012 workshop on Crowdsourcing for multimedia, 2012

PodCastle and songle: Crowdsourcing-based web services for spoken document retrieval and active music listening.

[BibT_eX]

[DOI]

Proceedings of the 2012 Information Theory and Applications Workshop, 2012

Infinite Composite Autoregressive Models for Music Signal Analysis.

[BibT_eX]

[DOI]

Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012

Unsupervised music understanding based on nonparametric Bayesian models.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

A Vocabulary-Free Infinity-Gram Model for Nonparametric Bayesian Chord Progression Analysis.

[BibT_eX]

[DOI]

Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Timbre and Melody Features for the Recognition of Vocal Activity and Instrumental Solos in Polyphonic Music.

[BibT_eX]

[DOI]

Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Songle: A Web Service for Active Music Listening Improved by User Contributions.

[BibT_eX]

[DOI]

Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

2010

Infinite Latent Harmonic Allocation: A Nonparametric Bayesian Approach to Multipitch Analysis.

[BibT_eX]

[DOI]

Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

2009

MusicCommentator: Generating Comments Synchronized with Musical Audio Signals by a Joint Probabilistic Model of Acoustic and Textual Features.

[BibT_eX]

[DOI]

Proceedings of the Entertainment Computing, 2009

Continuous pLSI and Smoothing Techniques for Hybrid Music Recommendation.

[BibT_eX]

[DOI]

Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

2008

An Efficient Hybrid Music Recommender System Using an Incrementally Trainable Probabilistic Generative Model.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2008

Music Thumbnailer: Visualizing Musical Pieces in Thumbnail Images Based on Acoustic Features.

[BibT_eX]

[DOI]

Proceedings of the ISMIR 2008, 2008

Automatic Chord Recognition Based on Probabilistic Integration of Chord Transition and Bass Pitch Estimation.

[BibT_eX]

[DOI]

Proceedings of the ISMIR 2008, 2008

A Robot Singer with Music Recognition Based on Real-Time Beat Tracking.

[BibT_eX]

[DOI]

Proceedings of the ISMIR 2008, 2008

A robot uses its own microphone to synchronize its steps to musical beats while scatting and singing.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008

A robot listens to music and counts its beats aloud by separating music from counting voice.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008

2007

Drum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectrogram Templates With Harmonic Structure Suppression.

[BibT_eX]

[DOI]

Hiroshi G. Okuno

IEEE Trans. Speech Audio Process., 2007

Drumix: An Audio Player with Real-time Drum-part Rearrangement Functions for Active Music Listening.

[BibT_eX]

[DOI]

Inf. Media Technol., 2007

Improving Efficiency and Scalability of Model-Based Music Recommender System Based on Incremental Training.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Music Information Retrieval, 2007

A biped robot that keeps steps in time with musical beats while listening to music with its own ears.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007

2006

Hybrid Collaborative and Content-based Music Recommendation Using Probabilistic Model with Latent User Preferences.

[BibT_eX]

Proceedings of the ISMIR 2006, 2006

An Error Correction Framework Based on Drum Pattern Periodicity for Improving Drum Sound Detection.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2004

Automatic Drum Sound Description for Real-World Music Using Template Adaptation and Matching Methods.

[BibT_eX]

[DOI]

Hiroshi G. Okuno

Proceedings of the ISMIR 2004, 2004

Drum sound identification for polyphonic music using template adaptation and matching methods.

[BibT_eX]

[DOI]