Yi-Hsuan Yang

Orcid: 0000-0002-2724-6161

According to our database1, Yi-Hsuan Yang authored at least 218 papers between 2006 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Theme Transformer: Symbolic Music Generation With Theme-Conditioned Transformer.
IEEE Trans. Multim., 2023

MuseMorphose: Full-Song and Fine-Grained Piano Music Style Transfer With One Transformer VAE.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Local Periodicity-Based Beat Tracking for Expressive Classical Piano Music.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Compose & Embellish: Well-Structured Piano Performance Generation via A Two-Stage Approach.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
An Analysis Method for Metric-Level Switching in Beat Tracking.
IEEE Signal Process. Lett., 2022

Exploiting Pre-trained Feature Networks for Generative Adversarial Networks in Audio-domain Loop Generation.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

DDSP-based Singing Vocoders: A New Subtractive-based Synthesizer and A Comprehensive Evaluation.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Jukedrummer: Conditional Beat-aware Audio-domain Drum Accompaniment Generation via Transformer VQ-VAE.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Melody Infilling with User-Provided Structural Context.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

KaraSinger: Score-Free Singing Voice Synthesis with VQ-VAE Using Mel-Spectrograms.
Proceedings of the IEEE International Conference on Acoustics, 2022

Automatic DJ Transitions with Differentiable Audio Effects and Generative Adversarial Networks.
Proceedings of the IEEE International Conference on Acoustics, 2022

Towards Automatic Transcription of Polyphonic Electric Guitar Music: A New Dataset and a Multi-Loss Transformer Model.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Leveraging Affective Hashtags for Ranking Music Recommendations.
IEEE Trans. Affect. Comput., 2021

Music Emotion Recognition: Toward new, robust standards in personalized and context-sensitive applications.
IEEE Signal Process. Mag., 2021

Drum-Aware Ensemble Architecture for Improved Joint Musical Beat and Downbeat Tracking.
IEEE Signal Process. Lett., 2021

Music Score Expansion with Variable-Length Infilling.
CoRR, 2021

Learning To Generate Piano Music With Sustain Pedals.
CoRR, 2021

Deep Learning Based EDM Subgenre Classification using Mel-Spectrogram and Tempogram Features.
CoRR, 2021

MidiBERT-Piano: Large-scale Pre-training for Symbolic Music Understanding.
CoRR, 2021

MuseMorphose: Full-Song and Fine-Grained Music Style Transfer with Just One Transformer VAE.
CoRR, 2021

Reverse-Engineering The Transition Regions of Real-World DJ Mixes using Sub-band Analysis with Convex Optimization.
Proceedings of the 21th International Conference on New Interfaces for Musical Expression, 2021

Automatic Music Composition with Transformers.
Proceedings of the MMArt-ACM '21: Proceedings of the 2021 International Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia 2021, 2021

DadaGP: A Dataset of Tokenized GuitarPro Songs for Sequence Models.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

A Benchmarking Initiative for Audio-domain Music Generation using the FreeSound Loop Dataset.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

EMOPIA: A Multi-Modal Pop Piano Dataset For Emotion Recognition and Emotion-based Music Generation.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Variable-Length Music Score Infilling via XLNet and Musically Specialized Positional Encoding.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Let's agree to disagree: Consensus Entropy Active Learning for Personalized Music Emotion Recognition.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Relative Positional Encoding for Transformers with Linear Complexity.
Proceedings of the 38th International Conference on Machine Learning, 2021

Source Separation-based Data Augmentation for Improved Joint Beat and Downbeat Tracking.
Proceedings of the 29th European Signal Processing Conference, 2021

Mandarin Singing Voice Synthesis with a Phonology-based Duration Model.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Compound Word Transformer: Learning to Compose Full-Song Music over Dynamic Directed Hypergraphs.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Backpropagation With $N$ -D Vector-Valued Neurons Using Arbitrary Bilinear Products.
IEEE Trans. Neural Networks Learn. Syst., 2020

Fast Tensor Factorization for Large-Scale Context-Aware Recommendation from Implicit Feedback.
IEEE Trans. Big Data, 2020

Automatic Composition of Guitar Tabs by Transformers and Groove Modeling.
CoRR, 2020

Pop Music Transformer: Generating Music with Rhythm and Harmony.
CoRR, 2020

Automatic Melody Harmonization with Triad Chords: A Comparative Study.
CoRR, 2020

Mixing-Specific Data Augmentation Techniques for Improved Blind Violin/Piano Source Separation.
Proceedings of the 22nd IEEE International Workshop on Multimedia Signal Processing, 2020

Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

The Jazz Transformer on the Front Line: Exploring the Shortcomings of AI-composed Music through Quantitative Measures.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

The Freesound Loop Dataset and Annotation Tool.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

A Computational Analysis of Real-World DJ Mixes using Mix-To-Track Subsequence Alignment.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

Neural Loop Combiner: Neural Network Models for Assessing the Compatibility of Loops.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

Automatic Composition of Guitar Tabs by Transformers and Groove Modeling.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

Speech-to-Singing Conversion Based on Boundary Equilibrium GAN.
Proceedings of the Interspeech 2020, 2020

Unconditional Audio Generation with Generative Adversarial Networks and Cycle Regularization.
Proceedings of the Interspeech 2020, 2020

Score and Lyrics-Free Singing Voice Generation.
Proceedings of the Eleventh International Conference on Computational Creativity, 2020

Speech-To-Singing Conversion in an Encoder-Decoder Framework.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Addressing The Confounds Of Accompaniments In Singer Identification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Comparative Study of Western and Chinese Classical Music Based on Soundscape Models.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Weakly-Supervised Visual Instrument-Playing Action Detection in Videos.
IEEE Trans. Multim., 2019

TENT: Technique-Embedded Note Tracking for Real-World Guitar Solo Recordings.
Trans. Int. Soc. Music. Inf. Retr., 2019

Deep Learning for Audio-Based Music Classification and Tagging: Teaching Computers to Distinguish Rock from Bach.
IEEE Signal Process. Mag., 2019

Computational Methods for Melody and Voice Processing in Music Recordings (Dagstuhl Seminar 19052).
Dagstuhl Reports, 2019

Towards a Deeper Understanding of Adversarial Losses.
CoRR, 2019

Collaborative Similarity Embedding for Recommender Systems.
Proceedings of the World Wide Web Conference, 2019

Multi-label Few-shot Learning for Sound Event Recognition.
Proceedings of the 21st IEEE International Workshop on Multimedia Signal Processing, 2019

Recognizing Song Mood and Theme Using Convolutional Recurrent Neural Networks.
Proceedings of the Working Notes Proceedings of the MediaEval 2019 Workshop, 2019

MediaEval 2019 Emotion and Theme Recognition task: A VQ-VAE Based Approach.
Proceedings of the Working Notes Proceedings of the MediaEval 2019 Workshop, 2019

A Minimal Template for Interactive Web-based Demonstrations of Musical Machine Learning.
Proceedings of the Joint Proceedings of the ACM IUI 2019 Workshops co-located with the 24th ACM Conference on Intelligent User Interfaces (ACM IUI 2019), 2019

Hit Song Prediction: Leveraging Low- and High-Level Audio Features.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Deep Cyclic Group Networks.
Proceedings of the International Joint Conference on Neural Networks, 2019

Dilated Convolution with Dilated GRU for Music Source Separation.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Musical Composition Style Transfer via Disentangled Timbre Representations.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Demonstration of PerformanceNet: A Convolutional Neural Network Model for Score-to-Audio Music Generation.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Multitask Learning for Frame-level Instrument Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

A Streamlined Encoder/decoder Architecture for Melody Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2019

Learning to Match Transient Sound Events Using Attentional Similarity for Few-shot Sound Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Drum Fills Detection and Generation.
Proceedings of the Perception, Representations, Image, Sound, Music, 2019

Improving Automatic Jazz Melody Generation by Transfer Learning Techniques.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

PerformanceNet: Score-to-Audio Music Generation with Multi-Band Convolutional Residual Network.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Coherent Deep-Net Fusion To Classify Shots In Concert Videos.
IEEE Trans. Multim., 2018

Pop Music Highlighter: Marking the Emotion Keypoints.
Trans. Int. Soc. Music. Inf. Retr., 2018

Predicting the Probability Density Function of Music Emotion Using Emotion Space Mapping.
IEEE Trans. Affect. Comput., 2018

Learning Disentangled Representations for Timber and Pitch in Music Audio.
CoRR, 2018

A Streamlined Encoder/Decoder Architecture for Melody Extraction.
CoRR, 2018

Training Generative Adversarial Networks with Binary Neurons by End-to-end Backpropagation.
CoRR, 2018

Singing Style Transfer Using Cycle-Consistent Boundary Equilibrium Generative Adversarial Networks.
CoRR, 2018

Frame-level Instrument Recognition by Timbre and Pitch.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

Convolutional Generative Adversarial Networks with Binary Neurons for Polyphonic Music Generation.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

Learning to Recognize Transient Sound Events using Attentional Supervision.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Denoising Auto-Encoder with Recurrent Skip Connections and Residual Regression for Music Source Separation.
Proceedings of the 17th IEEE International Conference on Machine Learning and Applications, 2018

Lead Sheet Generation and Arrangement by Conditional Generative Adversarial Network.
Proceedings of the 17th IEEE International Conference on Machine Learning and Applications, 2018

Cross-Cultural Music Emotion Recognition by Adversarial Discriminative Domain Adaptation.
Proceedings of the 17th IEEE International Conference on Machine Learning and Applications, 2018

Seethevoice: Learning from Music to Visual Storytelling of Shots.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Modeling Multi-way Relations with Hypergraph Embedding.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Generating Music Medleys via Playing Music Puzzle Games.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

MuseGAN: Multi-track Sequential Generative Adversarial Networks for Symbolic Music Generation and Accompaniment.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Affective Music Information Retrieval.
Proceedings of the Emotions and Personality in Personalized Services, 2017

Introduction to Intelligent Music Systems and Applications.
ACM Trans. Intell. Syst. Technol., 2017

Component Tying for Mixture Model Adaptation in Personalization of Music Emotion Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Cross-Dataset and Cross-Cultural Music Mood Prediction: A Case on Western and Chinese Pop Songs.
IEEE Trans. Affect. Comput., 2017

Informed Group-Sparse Representation for Singing Voice Separation.
IEEE Signal Process. Lett., 2017

The mood of Chinese Pop music: Representation and recognition.
J. Assoc. Inf. Sci. Technol., 2017

Improving Cross-Day EEG-Based Emotion Classification Using Robust Principal Component Analysis.
Frontiers Comput. Neurosci., 2017

Vertex-Context Sampling for Weighted Network Embedding.
CoRR, 2017

Hit Song Prediction for Pop Music by Siamese CNN with Ranking Loss.
CoRR, 2017

MuseGAN: Symbolic-domain Music Generation and Accompaniment with Multi-track Sequential Generative Adversarial Networks.
CoRR, 2017

Similarity Embedding Network for Unsupervised Sequential Pattern Learning by Playing Music Puzzle Games.
CoRR, 2017

MidiNet: A Convolutional Generative Adversarial Network for Symbolic-domain Music Generation using 1D and 2D Conditions.
CoRR, 2017

Music Signal Processing Using Vector Product Neural Networks.
CoRR, 2017

MidiNet: A Convolutional Generative Adversarial Network for Symbolic-Domain Music Generation.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Conditional preference nets for user and item cold start problems in music recommendation.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Revisiting the problem of audio-based hit song prediction using convolutional neural networks.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Deep-net fusion to classify shots in concert videos.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Weakly-supervised audio event detection using event-specific Gaussian filters and fully convolutional networks.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Automatic conversion of Pop music into chiptunes for 8-bit pixel art.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Polyphonic piano note transcription with non-negative matrix factorization of differential spectrogram.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Low-Rank Matrix Completion over Finite Abelian Group Algebras for Context-Aware Recommendation.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Music thumbnailing via neural attention modeling of music emotion.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Polar $n$-Complex and $n$-Bicomplex Singular Value Decomposition and Principal Component Pursuit.
IEEE Trans. Signal Process., 2016

Monaural Music Source Separation Using Convolutional Sparse Coding.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Complex and Quaternionic Principal Component Pursuit and Its Application to Audio Separation.
IEEE Signal Process. Lett., 2016

Applying Topological Persistence in Convolutional Neural Network for Music Audio Signals.
CoRR, 2016

Neural Network Based Next-Song Recommendation.
CoRR, 2016

Addressing Cold Start for Next-song Recommendation.
Proceedings of the 10th ACM Conference on Recommender Systems, 2016

Query-based Music Recommendations via Preference Embedding.
Proceedings of the 10th ACM Conference on Recommender Systems, 2016

Event Localization in Music Auto-tagging.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Emotion in Music task: Lessons Learned.
Proceedings of the Working Notes Proceedings of the MediaEval 2016 Workshop, 2016

Exploiting Frequency, Periodicity and Harmonicity Using Advanced Time-Frequency Concentration Techniques for Multipitch Estimation of Choir and Symphony.
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

Highlighting root notes in chord recognition using cepstral features and multi-task learning.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Model Adaptation for Personalized Music Emotion Recognition.
Proceedings of the Handbook of Pattern Recognition and Computer Vision, 5th Ed., 2016

2015
Quantitative Study of Music Listening Behavior in a Smartphone Context.
ACM Trans. Interact. Intell. Syst., 2015

Combining Spectral and Temporal Representations for Multipitch Estimation of Polyphonic Music.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Modeling the Affective Content of Music with a Gaussian Mixture Model.
IEEE Trans. Affect. Comput., 2015

Guest Editorial: Challenges and Perspectives for Affective Analysis in Multimedia.
IEEE Trans. Affect. Comput., 2015

Musical Onset Detection Using Constrained Linear Reconstruction.
IEEE Signal Process. Lett., 2015

Music Annotation and Retrieval using Unlabeled Exemplars: Correlation and Sparse Codes.
IEEE Signal Process. Lett., 2015

Affective Music Information Retrieval.
CoRR, 2015

Do You Have a Pop Face? Here is a Pop Song. Using Profile Pictures to Mitigate the Cold-start Problem in Music Recommender Systems.
Proceedings of the Poster Proceedings of the 9th ACM Conference on Recommender Systems, 2015

ASM'15: The 1st International Workshop on Affect and Sentiment in Multimedia.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

eMosic: Mobile Media Pushing through Social Emotion Sensing.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Emotion in Music Task at MediaEval 2015.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

Detection of Common Mistakes in Novice Violin Playing.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015

Musical Offset Detection of Pitched Instruments: The Case of Violin.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015

Analysis of Expressive Musical Terms in Violin Using Score-Informed and Expression-Based Audio Features.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015

Electric Guitar Playing Technique Detection in Real-World Recording Based on F0 Sequence Pattern Recognition.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015

Evaluating music recommendation in a real-world setting: On data splitting and evaluation metrics.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Informed monaural source separation of music based on convolutional sparse coding.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

The AMG1608 dataset for music emotion recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Vocal activity informed singing voice separation with the iKala dataset.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Using robust principal component analysis to alleviate day-to-day variability in EEG based emotion classification.
Proceedings of the 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2015

Escaping from the Abyss of Manual Annotation: New Methodology of Building Polyphonic Datasets for Automatic Music Transcription.
Proceedings of the Music, Mind, and Embodiment - 11th International Symposium, 2015

2014
A Systematic Evaluation of the Bag-of-Frames Representation for Music Information Retrieval.
IEEE Trans. Multim., 2014

Sparse modeling of magnitude and phase-derived spectra for playing technique classification.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

AWtoolbox: Characterizing Audio Information Using Audio Words.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Emotional Analysis of Music: A Comparison of Methods.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Emotion in Music Task at MediaEval 2014.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Cross-cultural mood regression for music digital libraries.
Proceedings of the IEEE/ACM Joint Conference on Digital Libraries, 2014

Automatic Set List Identification and Song Segmentation for Full-Length Concert Videos.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Sparse Cepstral, Phase Codes for Guitar Playing Technique Classification.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Music Driven Human Motion Manipulation for Characters in a Video.
Proceedings of the 2014 IEEE International Symposium on Multimedia, 2014

Towards time-varying music auto-tagging based on CAL500 expansion.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

LJ2M dataset: Toward better understanding of music listening behavior and user mood.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Attaching-music: An interactive music delivery system for private listening as wherever you go.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Resolving Octave Ambiguities: A Cross-dataset Investigation.
Proceedings of the Music Technology meets Philosophy, 2014

Power-Scaled Spectral Flux and Peak-Valley Group-Delay Methods for Robust Musical Onset Detection.
Proceedings of the Music Technology meets Philosophy, 2014

A Study on Cross-cultural and Cross-dataset Generalizability of Music Mood Regression Models.
Proceedings of the Music Technology meets Philosophy, 2014

Leverage Item Popularity and Recommendation Quality via Cost-Sensitive Factorization Machines.
Proceedings of the 2014 IEEE International Conference on Data Mining Workshops, 2014

Sparse cepstral codes and power scale for instrument identification.
Proceedings of the IEEE International Conference on Acoustics, 2014

Improving music auto-tagging by intra-song instance bagging.
Proceedings of the IEEE International Conference on Acoustics, 2014

Modified lasso screening for audio word-based music classification using large-scale dictionary.
Proceedings of the IEEE International Conference on Acoustics, 2014

Linear regression-based adaptation of music emotion recognition models for personalization.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Quantitative Study of Music Listening Behavior in a Social and Affective Context.
IEEE Trans. Multim., 2013

Automatic highlights extraction for drama video using music emotion and human face features.
Neurocomputing, 2013

Music Recommendation Based on Multiple Contextual Similarity Information.
Proceedings of the 2013 IEEE/WIC/ACM International Conferences on Web Intelligence, 2013

1000 songs for emotional analysis of music.
Proceedings of the 2nd ACM international workshop on Crowdsourcing for multimedia, 2013

Using emotional context from article for contextual music recommendation.
Proceedings of the ACM Multimedia Conference, 2013

The MediaEval 2013 Brave New Task: Emotion in Music.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

Low-Rank Representation of Both Singing Voice and Music Accompaniment Via Learned Dictionaries.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Sparse Modeling for Artist Identification: Exploiting Phase Information and Vocal Separation.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Towards real-time music auto-tagging using sparse features.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

A large in-situ dataset for context-aware music recommendation on smartphones.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Dual-layer bag-of-frames model for music genre classification.
Proceedings of the IEEE International Conference on Acoustics, 2013

Singing voice timbre classification of Chinese popular music.
Proceedings of the IEEE International Conference on Acoustics, 2013

Towards a more efficient sparse coding based audio-word feature extraction system.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Analyzing the dictionary properties and sparsity constraints for a dictionary-based music genre classification system.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Multipitch Estimation of Piano Music by Exemplar-Based Sparse Representation.
IEEE Trans. Multim., 2012

Machine Recognition of Music Emotion: A Review.
ACM Trans. Intell. Syst. Technol., 2012

Music retagging using label propagation and robust principal component analysis.
Proceedings of the 21st World Wide Web Conference, 2012

On sparse and low-rank matrix decomposition for singing voice separation.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

The acoustic emotion gaussians model for emotion-based music annotation and retrieval.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

The acousticvisual emotion guassians model for automatic generation of music video.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Exploring the relationship between categorical and dimensional emotion semantics of music.
Proceedings of the second international ACM workshop on Music information retrieval with user-centered and multimodal strategies, 2012

Bilingual analysis of song lyrics and audio words.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Inferring personal traits from music listening history.
Proceedings of the second international ACM workshop on Music information retrieval with user-centered and multimodal strategies, 2012

Supervised dictionary learning for music genre classification.
Proceedings of the International Conference on Multimedia Retrieval, 2012

Cross-cultural Music Mood Classification: A Comparison on English and Chinese Songs.
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012

Personalized music emotion recognition via model adaptation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
Exploiting online music tags for music emotion classification.
ACM Trans. Multim. Comput. Commun. Appl., 2011

Prediction of the Distribution of Perceived Music Emotions Using Discrete Samples.
IEEE Trans. Speech Audio Process., 2011

Ranking-Based Emotion Recognition for Music Organization and Retrieval.
IEEE Trans. Speech Audio Process., 2011

Automatic transcription of piano music by sparse representation of magnitude spectra.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Unsupervised auxiliary visual words discovery for large-scale image object retrieval.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
A technical demonstration of large-scale image object retrieval by efficient query evaluation and effective auxiliary visual feature discovery.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

2009
Smooth Control of Adaptive Media Playout for Video Streaming.
IEEE Trans. Multim., 2009

Online Reranking via Ordinal Informative Concepts for Context Fusion in Concept Detection and Video Search.
IEEE Trans. Circuits Syst. Video Technol., 2009

Personalized music emotion recognition.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Canonical image selection and efficient image graph construction for large-scale flickr photos.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Improving Musical Concept Detection by Ordinal Regression and Context Fusion.
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

An Integrated Approach to Music Boundary Detection.
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

Multimodal Structure Segmentation and Analysis of Music using Audio and Textual Information.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2009), 2009

Clustering for music search results.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Exploiting genre for music emotion classification.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Music emotion ranking.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
A Regression Approach to Music Emotion Recognition.
IEEE Trans. Speech Audio Process., 2008

Toward Multi-modal Music Emotion Classification.
Proceedings of the Advances in Multimedia Information Processing, 2008

ContextSeer: context search and recommendation at query time for shared consumer photos.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Mr. Emo: music retrieval in the emotion plane.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Keyword-based concept search on consumer photos by web-based kernel function.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Interactive content presentation based on expressed emotion and physiological feedback.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Automatic chord recognition for music classification and retrieval.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Video search reranking via online ordinal reranking.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

2007
The NTU Toolkit and Framework for High-Level Feature Detection at TRECVID 2007.
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007

Music emotion recognition: the role of individuality.
Proceedings of the International Workshop on Human-Centered Multimedia, 2007

Music Emotion Classification: A Regression Approach.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

2006
Music emotion classification: a fuzzy approach.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Detecting and Classifying Emotion in Popular Music.
Proceedings of the 2006 Joint Conference on Information Sciences, 2006

Smooth Playout Control for Video Streaming over Error-Prone Channels.
Proceedings of the Eigth IEEE International Symposium on Multimedia (ISM 2006), 2006


  Loading...