Masataka Goto

Orcid: 0000-0003-1167-0977

Affiliations:
  • National Institute of Advanced Industrial Science and Technology, Ibaraki, Japan


According to our database, Masataka Goto authored at least 284 papers between 1994 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Rail DRAGON: Long-Reach Bendable Modularized Rail Structure for Constant Observation Inside PCV.
IEEE Robotics Autom. Lett., 2024

DanceUnisoner: A Parametric, Visual, and Interactive Simulation Interface for Choreographic Composition of Group Dance.
IEICE Trans. Inf. Syst., 2024

2023
Kiite Cafe: A Web Service Enabling Users to Listen to the Same Song at the Same Moment While Reacting to the Song.
IEICE Trans. Inf. Syst., November, 2023

A Method to Detect Chorus Sections in Lyrics Text.
IEICE Trans. Inf. Syst., September, 2023

Why and How People View Lyrics While Listening to Music on a Smartphone.
IEICE Trans. Inf. Syst., April, 2023

Content-Based Music-Image Retrieval Using Self- and Cross-Modal Feature Embedding Memory.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

IteraTTA: An Interface for Exploring Both Text Prompts and Audio Priors in Generating Music With Text-to-Audio Models.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Text-to-Lyrics Generation With Image-Based Semantics and Reduced Risk of Plagiarism.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Unveiling the Impact of Musical Factors in Judging a Song on First Listen: Insights From a User Survey.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Chorus-Playlist: Exploring the Impact of Listening to Only Choruses in a Playlist.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Music Source Separation With MLP Mixing of Time, Frequency, and Channel.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

A Computational Evaluation Framework for Singable Lyric Translation.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Decoding Drums, Instrumentals, Vocals, and Mixed Sources in Music Using Human Brain Activity With fMRI.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Transformer-Based Beat Tracking With Low-Resolution Encoder and High-Resolution Decoder.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

U-Beat: A Multi-Scale Beat Tracking Model Based on Wave-U-Net.
Proceedings of the IEEE International Conference on Acoustics, 2023

CatAlyst: Domain-Extensible Intervention for Preventing Task Procrastination Using Large Generative Models.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

Lyric App Framework: A Web-based Framework for Developing Interactive Lyric-driven Musical Applications.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

2022
An automated system recommending background music to listen to while working.
User Model. User Adapt. Interact., 2022

Self-Supervised Contrastive Learning for Singing Voices.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Singer Diarization for Polyphonic Music With Unison Singing.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Deep Learning Approaches in Topics of Singing Information Processing.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Fonts That Fit the Music: A Multimodal Design Trend Analysis of Lyric Videos.
IEEE Access, 2022

BO as Assistant: Using Bayesian Optimization for Asynchronously Generating Design Suggestions.
Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology, 2022

BeParrot: Efficient Interface for Transcribing Unclear Speech via Respeaking.
Proceedings of the IUI 2022: 27th International Conference on Intelligent User Interfaces, Helsinki, Finland, March 22, 2022

An Analysis of Using Fuzzy Annotations in CRNN-Based Joint Beat and Downbeat Tracking.
Proceedings of the 30th European Signal Processing Conference, 2022

2021
MirrorNet: A Deep Reflective Approach to 2D Pose Estimation for Single-Person Images.
J. Inf. Process., 2021

Vocal-Accompaniment Compatibility Estimation Using Self-Supervised and Joint-Embedding Techniques.
IEEE Access, 2021

Atypical Lyrics Completion Considering Musical Audio Signals.
Proceedings of the MultiMedia Modeling - 27th International Conference, 2021

Interactive Exploration-Exploitation Balancing for Generative Melody Composition.
Proceedings of the IUI '21: 26th International Conference on Intelligent User Interfaces, 2021

Kiite Cafe: A Web Service for Getting Together Virtually to Listen to Music.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Toward an Understanding of Lyrics-viewing Behavior While Listening to Music on a Smartphone.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Tool- and Domain-Agnostic Parameterization of Style Transfer Effects Leveraging Pretrained Perceptual Metrics.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

2020
Audio-visual object removal in 360-degree videos.
Vis. Comput., 2020

Sequential gallery for interactive visual design optimization.
ACM Trans. Graph., 2020

Intelligent User Interfaces for Music Discovery.
Trans. Int. Soc. Music. Inf. Retr., 2020

Bayesian Singing Transcription Based on a Hierarchical Generative Model of Keys, Musical Notes, and F0 Trajectories.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Modeling N-th Order Derivative Creation Based on Content Attractiveness and Time-Dependent Popularity.
IEICE Trans. Inf. Syst., 2020

Generative Melody Composition with Human-in-the-Loop Bayesian Optimization.
CoRR, 2020

MirrorNet: A Deep Bayesian Approach to Reflective 2D Pose Estimation from Human Images.
CoRR, 2020

Query/Task Satisfaction and Grid-based Evaluation Metrics Under Different Image Search Intents.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Explainable Recommendation for Repeat Consumption.
Proceedings of the RecSys 2020: Fourteenth ACM Conference on Recommender Systems, 2020

Drum Synthesis and Rhythmic Transformation with Adversarial Autoencoders.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Interactive deep singing-voice separation based on human-in-the-loop adaptation.
Proceedings of the IUI '20: 25th International Conference on Intelligent User Interfaces, 2020

A Chorus-Section Detection Method for Lyrics Text.
Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020

Analysis of Song/Artist Latent Features and Its Application for Song Search.
Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020

Dance Beat Tracking from Visual Information Alone.
Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020

Unsupervised Disentanglement of Pitch and Timbre for Isolated Musical Instrument Sounds.
Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020

Chord Jazzification: Learning Jazz Interpretations of Chord Symbols.
Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020

Enhancing Participation Experience in VR Live Concerts by Improving Motions of Virtual Audience Avatars.
Proceedings of the 2020 IEEE International Symposium on Mixed and Augmented Reality, 2020

Lyric Video Analysis Using Text Detection and Tracking.
Proceedings of the Document Analysis Systems - 14th IAPR International Workshop, 2020

2019
Precomputed optimal one-hop motion transition for responsive character animation.
Vis. Comput., 2019

Music Interfaces Based on Automatic Music Signal Analysis: New Ways to Create and Listen to Music.
IEEE Signal Process. Mag., 2019

End-To-End Melody Note Transcription Based on a Beat-Synchronous Attention Mechanism.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Joint Singing Pitch Estimation and Voice Separation Based on a Neural Harmonic Structure Renderer.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

ABCPRec: Adaptively Bridging Consumer and Producer Roles for User-Generated Content Recommendation.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

DualDiv: diversifying items and explanation styles in explainable hybrid recommendation.
Proceedings of the 13th ACM Conference on Recommender Systems, 2019

Query-by-Dancing: A Dance Music Retrieval System Based on Body-Motion Similarity.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

Audio-Based Automatic Generation of a Piano Reduction Score by Considering the Musical Structure.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

Autocomplete vocal-fo annotation of songs using musical repetitions.
Proceedings of the 24th International Conference on Intelligent User Interfaces: Companion, 2019

Query-by-Blending: A Music Exploration System Blending Latent Vector Representations of Lyric Word, Song Audio, and Artist.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

AIST Dance Video Database: Multi-Genre, Multi-Dancer, and Multi-Camera Database for Dance Information Processing.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Unmixer: An Interface for Extracting and Remixing Loops.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Intelligent User Interfaces for Music Discovery: The Past 20 Years and What's to Come.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Joint Transcription of Lead, Bass, and Rhythm Guitars Based on a Factorial Hidden Semi-Markov Model.
Proceedings of the IEEE International Conference on Acoustics, 2019

Transdrums: A Drum Pattern Transfer System Preserving Global Pattern Structure.
Proceedings of the IEEE International Conference on Acoustics, 2019

Automatic Singing Transcription Based on Encoder-decoder Recurrent Neural Networks with a Weakly-supervised Attention Mechanism.
Proceedings of the IEEE International Conference on Acoustics, 2019

Zero-mean Convolutional Network with Data Augmentation for Sound Level Invariant Singing Voice Separation.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Modeling Storylines in Lyrics.
IEICE Trans. Inf. Syst., 2018

Songrium Derivation Factor Analysis: A Web Service for Browsing Derivation Factors by Modeling N-th Order Derivative Creation.
IEICE Trans. Inf. Syst., 2018

Decomposing Images into Layers with Advanced Color Blending.
Comput. Graph. Forum, 2018

DeployGround: A Framework for Streamlined Programming from API playgrounds to Application Deployment.
Proceedings of the 2018 IEEE Symposium on Visual Languages and Human-Centric Computing, 2018

A Melody-Conditioned Lyrics Language Model.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Songle Sync: A Large-Scale Web-based Platform for Controlling Various Devices in Synchronization with Music.
Proceedings of the 2018 ACM Multimedia Conference, 2018

FocusMusicRecommender: A System for Recommending Music to Listen to While Working.
Proceedings of the 23rd International Conference on Intelligent User Interfaces, 2018

Intelligent Music Interfaces.
Proceedings of the 23rd International Conference on Intelligent User Interfaces, 2018

Listener Anonymizer: Camouflaging Play Logs to Preserve User's Demographic Anonymity.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

Instrudive: A Music Visualization System Based on Automatically Recognized Instrumentation.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

Comparing RNN Parameters for Melodic Similarity.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

Collaboration in N-th Order Derivative Creation.
Proceedings of the Twelfth International Conference on Web and Social Media, 2018

Chordscanner: Browsing Chord Progressions Based on Musical Typicality and Intra-composer Consistency.
Proceedings of the 2018 International Computer Music Conference, 2018

Nonnegative Tensor Factorization for Source Separation of Loops in Audio.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Instlistener: An Expressive Parameter Estimation System Imitating Human Performances of Monophonic Musical Instruments.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Retrieval of Song Lyrics from Sung Queries.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Music Structure Boundary Detection and Labelling by a Deconvolution of Path-Enhanced Self-Similarity Matrix.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Convolving Gaussian Kernels for RNN-Based Beat Tracking.
Proceedings of the 26th European Signal Processing Conference, 2018

OptiMo: Optimization-Guided Motion Editing for Keyframe Character Animation.
Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, 2018

Placing Music in Space: A Study on Music Appreciation with Spatial Mapping.
Proceedings of the Companion Publication of the 19th International ACM SIGACCESS Conference on Computers and Accessibility, 2018

2017
QueryShare: Working Together to Facilitate Exploratory Multimedia Searches without Skill in Creating.
Proceedings of the 13th International Symposium on Open Collaboration, 2017

User-Generated Variables: Streamlined Interaction Design for Feature Requests and Implementations.
Proceedings of the Companion to the first International Conference on the Art, 2017

Taste or Addiction?: Using Play Logs to Infer Song Selection Motivation.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2017

Infinite probabilistic latent component analysis for audio source separation.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

LyriSys: An Interactive Support System for Writing Lyrics Based on Topic Transition.
Proceedings of the 22nd International Conference on Intelligent User Interfaces, 2017

Lyric Jumper: A Lyrics-Based Music Exploratory Web Service by Modeling Lyrics Generative Process.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Multi-Part Pattern Analysis: Combining Structure Analysis and Source Separation to Discover Intra-Part Repeated Sequences.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Scale- and Rhythm-Aware Musical Note Estimation for Vocal F0 Trajectories Based on a Semi-Tatum-Synchronous Hierarchical Hidden Semi-Markov Model.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Song2Guitar: A Difficulty-Aware Arrangement System for Generating Guitar Solo Covers from Polyphonic Audio of Popular Music.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Classifying derivative works with search, text, audio and video features.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Strummer: An interactive guitar chord practice system.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Keyboard Interface with Shape-Distortion Expression for Interactive Performance.
Proceedings of the 2017 International Computer Music Conference, 2017

A Robotic Framework for Video Recording and Authoring.
Proceedings of the Companion of the 2017 ACM/IEEE International Conference on Human-Robot Interaction, 2017

f3.js: A Parametric Design Tool for Physical Computing Devices for Both Interaction Designers and End-users.
Proceedings of the 2017 Conference on Designing Interactive Systems, 2017

Automatic System for Editing Dance Videos Recorded Using Multiple Cameras.
Proceedings of the Advances in Computer Entertainment Technology, 2017

OngaCREST Project: Building a Similarity-Aware Information Environment for a Content-Symbiotic Society.
Proceedings of the Human-Harmonized Information Technology, Volume 2, 2017

2016
Musical Similarity and Commonness Estimation Based on Probabilistic Generative Models of Musical Elements.
Int. J. Semantic Comput., 2016

Improvements of Voice Timbre Control Based on Perceived Age in Singing Voice Conversion.
IEICE Trans. Inf. Syst., 2016

Programming with Examples to Develop Data-Intensive User Interfaces.
Computer, 2016

A choreographic authoring system for character dance animation reflecting a user's preference.
Proceedings of the Poster Proceedings of the ACM SIGGRAPH/Eurographics Symposium on Computer Animation, 2016

Plate: persistent memory management for nonvolatile main memory.
Proceedings of the 31st Annual ACM Symposium on Applied Computing, 2016

PlaylistPlayer: An Interface Using Multiple Criteria to Change the Playback Order of a Music Playlist.
Proceedings of the 21st International Conference on Intelligent User Interfaces, 2016

Using Priors to Improve Estimates of Music Structure.
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

Musical Typicality: How Many Similar Songs Exist?
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

A soundtrack generation system to synchronize the climax of a video clip with music.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

SmartVideoRanking: Video Search by Mining Emotions from Time-Synchronized Comments.
Proceedings of the IEEE International Conference on Data Mining Workshops, 2016

Student's T nonnegative matrix factorization and positive semidefinite tensor factorization for single-channel audio source separation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

An estimation method of voice timbre evaluation values using feature extraction with Gaussian mixture model based on reference singer.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Music emotion recognition with adaptive aggregation of Gaussian process regressors.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Modeling Discourse Segments in Lyrics Using Repeated Patterns.
Proceedings of the COLING 2016, 2016

Why Did You Cover That Song?: Modeling N-th Order Derivative Creation with Content Popularity.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

2015
AutoGuitarTab: Computer-Aided Composition of Rhythm and Lead Guitar Parts in the Tablature Space.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Expression Control in Singing Voice Synthesis: Features, approaches, evaluation, and challenges.
IEEE Signal Process. Mag., 2015

Frontiers of music technologies.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

Form Follows Function(): An IDE to Create Laser-cut Interfaces and Microcontroller Programs from Single Code Base.
Proceedings of the 28th Annual ACM Symposium on User Interface Software & Technology, 2015

A music video authoring system synchronizing climax of video clips and music via rearrangement of musical bars.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2015

Infinite Superimposed Discrete All-Pole Modeling for Multipitch Analysis of Wavelet Spectrograms.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015

Song2Quartet: A System for Generating String Quartet Cover Songs from Polyphonic Audio of Popular Music.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015

ExploratoryVideoSearch: A Music Video Search System Based on Coordinate Terms and Diversification.
Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015

Musical Similarity and Commonness Estimation Based on Probabilistic Generative Models.
Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015

Songle Widget: Making Animation and Physical Devices Synchronized with Music Videos on the Web.
Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015

A feedback framework for improved chord recognition based on NMF-based approximate note transcription.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Music Cultures Opened Up by Music Technologies.
Proceedings of the 2015 International Conference on Culture and Computing, 2015

TextAlive: Integrated Design Environment for Kinetic Typography.
Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, 2015

2014
AutoMashUpper: automatic creation of multi-song music mashups.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Voice Timbre Control Based on Perceived Age in Singing Voice Conversion.
IEICE Trans. Inf. Syst., 2014

Songrium: a music browsing assistance service with interactive visualization and exploration of a web of music.
Proceedings of the 23rd International World Wide Web Conference, 2014

Modeling Structural Topic Transitions for Automatic Lyrics Generation.
Proceedings of the 28th Pacific Asia Conference on Language, Information and Computation, 2014

Improvasher: A Real-Time Mashup System for Live Musical Input.
Proceedings of the 14th International Conference on New Interfaces for Musical Expression, 2014

A Multi-Touch DJ Interface with Remote Audience Feedback.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

LyricsRadar: A Lyrics Retrieval System Based on Latent Topics of Lyrics.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Spotting a Query Phrase from Polyphonic Music Audio Signals Based on Semi-supervised Nonnegative Matrix Factorization.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Unisoner: An Interactive Interface for Derivative Chorus Creation from Various Singing Voices on the Web.
Proceedings of the Music Technology meets Philosophy, 2014

AutoRhythmGuitar: Computer-aided Composition for Rhythm Guitar in the Tab Space.
Proceedings of the Music Technology meets Philosophy, 2014

An Automatic Singing Impression Estimation Method Using Factor Analysis and Multiple Regression.
Proceedings of the Music Technology meets Philosophy, 2014

HarmonyMixer: Mixing the Character of Chords among Polyphonic Audio.
Proceedings of the Music Technology meets Philosophy, 2014

AutoChorusCreator: Four-Part Chorus Generator with Musical Feature Control, Using Search Spaces Constructed from Rules of Music Theory.
Proceedings of the Music Technology meets Philosophy, 2014

Cultivating vocal activity detection for music audio signals in a circulation-type crowdsourcing ecosystem.
Proceedings of the IEEE International Conference on Acoustics, 2014

Vocal timbre analysis using latent Dirichlet allocation and cross-gender vocal timbre similarity.
Proceedings of the IEEE International Conference on Acoustics, 2014

Timbre replacement of harmonic and drum components for music audio signals.
Proceedings of the IEEE International Conference on Acoustics, 2014

Leveraging repetition for improved automatic lyric transcription in popular music.
Proceedings of the IEEE International Conference on Acoustics, 2014

Regression approaches to perceptual age control in singing voice conversion.
Proceedings of the IEEE International Conference on Acoustics, 2014

Sharedo: to-do list interface for human-agent task sharing.
Proceedings of the second international conference on Human-agent interaction, 2014

Two-level fast-forwarding using speech detection for rapidly perusing video.
Proceedings of the 5th Augmented Human International Conference, 2014

Gender-dependent spectrum differential models for perceived age control based on direct waveform modification in singing voice conversion.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Automated choreography synthesis using a Gaussian process leveraging consumer-generated dance motions.
Proceedings of the 11th Conference on Advances in Computer Entertainment Technology, 2014

2013
Songrium: a music browsing assistance service based on visualization of massive open collaboration within music content creation community.
Proceedings of the 9th International Symposium on Open Collaboration, Hong Kong, China, August 05, 2013

Multimedia information retrieval: music and audio.
Proceedings of the ACM Multimedia Conference, 2013

Beyond NMF: Time-Domain Audio Source Separation without Phase Reconstruction.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Transfer Learning in MIR: Sharing Learned Latent Representations for Music Audio Classification and Similarity.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Chord-Sequence-Factory: A Chord Arrangement System Modifying Factorized Chord Sequence Probabilities.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

AutoMashUpper: An Automatic Multi-Song Mashup System.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

An investigation of acoustic features for singing voice conversion based on perceptual age.
Proceedings of the INTERSPEECH 2013, 2013

Evaluation of a singing voice conversion method based on many-to-many eigenvoice conversion.
Proceedings of the INTERSPEECH 2013, 2013

Infinite Positive Semidefinite Tensor Factorization for Source Separation of Mixture Signals.
Proceedings of the 30th International Conference on Machine Learning, 2013

Infinite kernel linear prediction for joint estimation of spectral envelope and fundamental frequency.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
A Nonparametric Bayesian Multipitch Analyzer Based on Infinite Latent Harmonic Allocation.
IEEE Trans. Speech Audio Process., 2012

Integrating Additional Chord Information Into HMM-Based Lyrics-to-Audio Alignment.
IEEE Trans. Speech Audio Process., 2012

PodCastle and Songle: Crowdsourcing-Based Web Services for Retrieval and Browsing of Speech and Music Content.
Proceedings of the First International Workshop on Crowdsourcing Web Search, 2012

PodCastle and songle: crowdsourcing-based web services for spoken content retrieval and active music listening.
Proceedings of the ACM multimedia 2012 workshop on Crowdsourcing for multimedia, 2012

PodCastle and songle: Crowdsourcing-based web services for spoken document retrieval and active music listening.
Proceedings of the 2012 Information Theory and Applications Workshop, 2012

Infinite Composite Autoregressive Models for Music Signal Analysis.
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012

PodCastle: Collaborative Training of Language Models on the Basis of Wisdom of Crowds.
Proceedings of the INTERSPEECH 2012, 2012

A spectral envelope estimation method based on F0-adaptive multi-frame integration analysis.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012

Unsupervised music understanding based on nonparametric Bayesian models.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

VocaListener and VocaWatcher: Imitating a human singer by using signal processing.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Grand Challenges in Music Information Research.
Proceedings of the Multimodal Music Processing, 2012

Lyrics-to-Audio Alignment and its Application.
Proceedings of the Multimodal Music Processing, 2012

Singing voice conversion method based on many-to-many eigenvoice conversion and training data generation using a singing-to-singing synthesis system.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
LyricSynchronizer: Automatic Synchronization System Between Musical Audio Signals and Lyrics.
IEEE J. Sel. Top. Signal Process., 2011

Multimodal Music Processing (Dagstuhl Seminar 11041).
Dagstuhl Reports, 2011

Music Listening in the Future: Augmented Music-Understanding Interfaces and Crowd Music Listening.
Proceedings of the AES International Conference Semantic Audio 2011, 2011

A musical mood trajectory estimation method using lyrics and acoustic features.
Proceedings of the 1st international ACM workshop on Music information retrieval with user-centered and multimodal strategies, Scottsdale, AZ, USA, November 28, 2011

A Vocabulary-Free Infinity-Gram Model for Nonparametric Bayesian Chord Progression Analysis.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Timbre and Melody Features for the Recognition of Vocal Activity and Instrumental Solos in Polyphonic Music.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Songle: A Web Service for Active Music Listening Improved by User Contributions.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

VocaWatcher: Natural singing motion generator for a humanoid robot.
Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

PodCastle: Recent Advances of a Spoken Document Retrieval Service Improved by Anonymous User Contributions.
Proceedings of the INTERSPEECH 2011, 2011

Vocalistener2: A singing synthesis system able to mimic a user's singing in terms of voice timbre changes as well as pitch and dynamics.
Proceedings of the IEEE International Conference on Acoustics, 2011

Polyphonic audio-to-score alignment based on Bayesian Latent Harmonic Allocation Hidden Markov Model.
Proceedings of the IEEE International Conference on Acoustics, 2011

Simultaneous processing of sound source separation and musical instrument identification using Bayesian spectral modeling.
Proceedings of the IEEE International Conference on Acoustics, 2011

Concurrent estimation of singing voice F0 and phonemes by using spectral envelopes estimated from polyphonic music.
Proceedings of the IEEE International Conference on Acoustics, 2011

Gradient-based musical feature extraction based on scale-invariant feature transform.
Proceedings of the 19th European Signal Processing Conference, 2011

Social Infobox: collaborative knowledge construction by social property tagging.
Proceedings of the 2011 ACM Conference on Computer Supported Cooperative Work, 2011

2010
A Modeling of Singing Voice Robust to Accompaniment Sounds and Its Application to Singer Identification and Vocal-Timbre-Similarity-Based Music Information Retrieval.
IEEE Trans. Speech Audio Process., 2010

Editorial for the Special Issue on Signal Models and Representations of Musical and Environmental Sounds.
IEEE Trans. Speech Audio Process., 2010

PodCastle: A Spoken Document Retrieval Service Improved by Anonymous User Contributions.
Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation, 2010

Infinite Latent Harmonic Allocation: A Nonparametric Bayesian Approach to Multipitch Analysis.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Query-by-conducting: An Interface to Retrieve Classical-music Interpretations by Real-time Tempo Input.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Singing information processing based on singing voice modeling.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Parameter Estimation for Harmonic and Inharmonic Models by Using Timbre Feature Distributions.
J. Inf. Process., 2009

Musicream: Integrated Music-Listening Interface for Active, Flexible, and Unexpected Encounters with Musical Pieces.
J. Inf. Process., 2009

A novel framework for recognizing phonemes of singing voice in polyphonic music.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

PodCastle: a spoken document retrieval system for podcasts and its performance improvement by anonymous user contributions.
Proceedings of the third workshop on Searching spontaneous conversational speech, 2009

MusicCommentator: Generating Comments Synchronized with Musical Audio Signals by a Joint Probabilistic Model of Acoustic and Textual Features.
Proceedings of the Entertainment Computing, 2009

Continuous pLSI and Smoothing Techniques for Hybrid Music Recommendation.
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

Acoustic event detection for spotting "hot spots" in podcasts.
Proceedings of the INTERSPEECH 2009, 2009

Acoustic and perceptual effects of vocal training in amateur male singing.
Proceedings of the INTERSPEECH 2009, 2009

Podcastle: collaborative training of acoustic models on the basis of wisdom of crowds for podcast transcription.
Proceedings of the INTERSPEECH 2009, 2009

The use of acoustically detected filled and silent pauses in spontaneous speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
An Efficient Hybrid Music Recommender System Using an Incrementally Trainable Probabilistic Generative Model.
IEEE Trans. Speech Audio Process., 2008

Computational Models of Similarity for Drum Samples.
IEEE Trans. Speech Audio Process., 2008

Content-Based Music Information Retrieval: Current Directions and Future Challenges.
Proc. IEEE, 2008

A similar content retrieval method for podcast episodes.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Music Thumbnailer: Visualizing Musical Pieces in Thumbnail Images Based on Acoustic Features.
Proceedings of the ISMIR 2008, 2008

Instrument Equalizer for Query-by-Example Retrieval: Improving Sound Source Separation Based on Integrated Harmonic and Inharmonic Models.
Proceedings of the ISMIR 2008, 2008

Hyperlinking Lyrics: A Method for Creating Hyperlinks Between Phrases in Song Lyrics.
Proceedings of the ISMIR 2008, 2008

Three techniques for improving automatic synchronization between music and lyrics: Fricative detection, filler model, and novel feature vectors for vocal activity detection.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Drum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectrogram Templates With Harmonic Structure Suppression.
IEEE Trans. Speech Audio Process., 2007

Instrument Identification in Polyphonic Music: Feature Weighting to Minimize Influence of Sound Overlaps.
EURASIP J. Adv. Signal Process., 2007

Music Information Retrieval Based on Signal Processing.
EURASIP J. Adv. Signal Process., 2007

Improving Efficiency and Scalability of Model-Based Music Recommender System Based on Incremental Training.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

A Supervised Approach for Detecting Boundaries in Music Using Difference Features and Boosting.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

MusicSun: A New Approach to Artist Recommendation.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

A Stochastic Representation of the Dynamics of Sung Melody.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

A Music Information Retrieval System Based on Singing Voice Timbre.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

Vocal conversion from speaking voice to singing voice using STRAIGHT.
Proceedings of the INTERSPEECH 2007, 2007

Automatic transcription for a web 2.0 service to search podcasts.
Proceedings of the INTERSPEECH 2007, 2007

Podcastle: a web 2.0 approach to speech recognition research.
Proceedings of the INTERSPEECH 2007, 2007

Presentation sensei: a presentation training system using speech and image processing.
Proceedings of the 9th International Conference on Multimodal Interfaces, 2007

Integration and Adaptation of Harmonic and Inharmonic Models for Separating Polyphonic Musical Signals.
Proceedings of the IEEE International Conference on Acoustics, 2007

Active Music Listening Interfaces Based on Signal Processing.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
A chorus section detection method for musical audio signals and its application to a music listening station.
IEEE Trans. Speech Audio Process., 2006

Hybrid Collaborative and Content-based Music Recommendation Using Probabilistic Model with Latent User Preferences.
Proceedings of the ISMIR 2006, 2006

MusicRainbow: A New User Interface to Discover Artists Using Audio-based Similarity and Web-based Labeling.
Proceedings of the ISMIR 2006, 2006

AIST Annotation for the RWC Music Database.
Proceedings of the ISMIR 2006, 2006

Musical Instrument Recognizer "Instrogram" and Its Application to Music Retrieval Based on Instrumentation Similarity.
Proceedings of the Eighth IEEE International Symposium on Multimedia (ISM 2006), 2006

Automatic Synchronization between Lyrics and Music CD Recordings Based on Viterbi Alignment of Segregated Vocal Signals.
Proceedings of the Eighth IEEE International Symposium on Multimedia (ISM 2006), 2006

An automatic singing skill evaluation method for unknown melodies using pitch interval accuracy and vibrato features.
Proceedings of the INTERSPEECH 2006, 2006

Speaker identification under noisy environments by using harmonic structure extraction and reliable frame weighting.
Proceedings of the INTERSPEECH 2006, 2006

An Error Correction Framework Based on Drum Pattern Periodicity for Improving Drum Sound Detection.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Instrogram: A New Musical Instrument Recognition Technique Without Using Onset Detection NOR F0 Estimation.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

F0 Estimation Method for Singing Voice in Polyphonic Audio Signal Based on Statistical Vocal Model and Viterbi Search.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Speech pen: predictive handwriting based on ambient multimodal recognition.
Proceedings of the 2006 Conference on Human Factors in Computing Systems, 2006

2005
Pitch-Dependent Identification of Musical Instrument Sounds.
Appl. Intell., 2005

A wireless LAN architecture using PANA for secure network selection.
Proceedings of the 2005 IEEE International Conference on Wireless And Mobile Computing, 2005

Instrument Identification in Polyphonic Music: Feature Weighting with Mixed Sounds, Pitch-Dependent Timbre Modeling, and Use of Musical Context.
Proceedings of the ISMIR 2005, 2005

Musicream: New Music Playback Interface for Streaming, Sticking, Sorting, and Recalling Musical Pieces.
Proceedings of the ISMIR 2005, 2005

Singer Identification Based on Accompaniment Sound Reduction and Reliable Frame Selection.
Proceedings of the ISMIR 2005, 2005

Discrimination between singing and speaking voices.
Proceedings of the INTERSPEECH 2005, 2005

Speech repair: quick error correction just by using selection operation for speech input interfaces.
Proceedings of the INTERSPEECH 2005, 2005

An Auto-Regressive, Non-Stationary Excited Signal Parameter Estimation Method and an Evaluation of a Singing-Voice Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
A real-time music-scene-description system: predominant-F0 estimation for detecting melody and bass lines in real-world audio signals.
Speech Commun., 2004

Automatic Drum Sound Description for Real-World Music Using Template Adaptation and Matching Methods.
Proceedings of the ISMIR 2004, 2004

A Drum Pattern Retrieval Method by Voice Percussion.
Proceedings of the ISMIR 2004, 2004

Speech-Recognition Interfaces for Music Information Retrieval: 'Speech Completion' and 'Speech Spotter'.
Proceedings of the ISMIR 2004, 2004

Drum sound identification for polyphonic music using template adaptation and matching methods.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004

Speech spotter: on-demand speech recognition in human-human conversation on the telephone or in face-to-face situations.
Proceedings of the INTERSPEECH 2004, 2004

Category-level identification of non-registered musical instrument sounds.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
SmartMusicKIOSK: music listening station with chorus-search function.
Proceedings of the 16th Annual ACM Symposium on User Interface Software and Technology, 2003

RWC Music Database: Music genre database and musical instrument sound database.
Proceedings of the ISMIR 2003, 2003

Music scene description project: Toward audio-based real-time music understanding.
Proceedings of the ISMIR 2003, 2003

Speech starter: noise-robust endpoint detection by using filled pauses.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Speech shift: direct speech-input-mode switching through intentional control of voice pitch.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

A Learning-Based Jam Session System that Imitates a Player's Personality Model.
Proceedings of the IJCAI-03, 2003

Pitch-Dependent Musical Instrument Identification and Its Application to Musical Sound Ontology.
Proceedings of the Developments in Applied Artificial Intelligence, 2003

A Learning-Based Quantization: Unsupervised Estimation of the Model Parameters.
Proceedings of the 2003 International Computer Music Conference, 2003

Musical instrument identification based on F0-dependent multivariate normal distribution.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

A chorus-section detecting method for musical audio signals.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
RWC Music Database: Popular, Classical and Jazz Music Databases.
Proceedings of the ISMIR 2002, 2002

Speech completion: on-demand completion assistance using filled pauses for speech input interfaces.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001
Evaluation of exchanging time for mechanism of exchanging parts of programs during execution.
Syst. Comput. Jpn., 2001

Real-time sound source localization and separation system and its application to automatic speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Learning-Based Jam Session System for A Guitar Trio.
Proceedings of the 2001 International Computer Music Conference, 2001

A predominant-F0 estimation method for CD recordings: MAP estimation using EM algorithm for adaptive tone models.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
A robust predominant-F0 estimation method for real-time detection of melody and bass lines in CD recordings.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
Real-time beat tracking for drumless audio signals: Chord change detection for musical decisions.
Speech Commun., 1999

A real-time filled pause detection system for spontaneous speech recognition.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
A WWW-based Melody Retrieval System.
Proceedings of the 1998 International Computer Music Conference, 1998

An Audio-based Real-time Beat Tracking System and Its Applications.
Proceedings of the 1998 International Computer Music Conference, 1998

1997
RMCP: Remote Music Control Protocol - Design and Applications.
Proceedings of the 1997 International Computer Music Conference, 1997

1996
A Jazz Session System for Interplay Among All Players - VirJa Session (Virtual Jazz Session System).
Proceedings of the 1996 International Computer Music Conference, 1996

Localization by harmonic structure and its application to harmonic sound stream segregation.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
An Automatic Jazz Accompaniment System Reacting to Solo.
Proceedings of the 1995 International Computer Music Conference, 1995

A Real-Time Beat Tracking System for Audio Signals.
Proceedings of the 1995 International Computer Music Conference, 1995

1994
A Beat Tracking System for Acoustic Signals of Music.
Proceedings of the Second ACM International Conference on Multimedia '94, 1994

Rhythm Tracking Using Multiple Hypotheses.
Proceedings of the 1994 International Computer Music Conference, 1994

