Zhiyao Duan

According to our database1, Zhiyao Duan authored at least 64 papers between 2007 and 2019.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2019
Creating a Multitrack Classical Music Performance Dataset for Multimodal Music Analysis: Challenges, Insights, and Applications.
IEEE Trans. Multimedia, 2019

Siamese Style Convolutional Neural Networks for Sound Search by Vocal Imitation.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2019

Audio-Visual Deep Clustering for Speech Separation.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2019

Audiovisual Analysis of Music Performances: Overview of an Emerging Field.
IEEE Signal Process. Mag., 2019

Automatic Music Transcription: An Overview.
IEEE Signal Process. Mag., 2019

Adversarial Training for Speech Super-Resolution.
J. Sel. Topics Signal Processing, 2019

Sound Search by Text Description or Vocal Imitation?
CoRR, 2019

Hierarchical Cross-Modal Talking Face Generationwith Dynamic Pixel-Wise Loss.
CoRR, 2019

2018
Listen and Look: Audio-Visual Matching Assisted Speech Source Separation.
IEEE Signal Process. Lett., 2018

Front-end speech enhancement for commercial speaker verification systems.
Speech Communication, 2018

Lip Movements Generation at a Glance.
CoRR, 2018

Generating Talking Face Landmarks from Speech.
CoRR, 2018

Audio-Visual Event Localization in Unconstrained Videos.
CoRR, 2018

Part-invariant Model for Music Generation and Harmonization.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

Skeleton Plays Piano: Online Generation of Pianist Body Movements from MIDI Performance.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

Joint Speaker Diarization and Recognition Using Convolutional and Recurrent Neural Networks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Visualization and Interpretation of Siamese Style Convolutional Neural Networks for Sound Search by Vocal Imitation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Score-Aligned Polyphonic Microtiming Estimation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multi-Scale Recurrent Neural Network for Sound Event Detection.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Unsupervised Learning Approach to Feature Analysis for Automatic Speech Emotion Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Generating Talking Face Landmarks from Speech.
Proceedings of the Latent Variable Analysis and Signal Separation, 2018

Audio-Visual Event Localization in Unconstrained Videos.
Proceedings of the Computer Vision - ECCV 2018, 2018

Lip Movements Generation at a Glance.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
Piano Transcription With Convolutional Sparse Lateral Inhibition.
IEEE Signal Process. Lett., 2017

Enhanced multiclass SVM with thresholding fusion for speech-based emotion classification.
I. J. Speech Technology, 2017

Deep Cross-Modal Audio-Visual Generation.
CoRR, 2017

IMINET: Convolutional semi-siamese networks for sound search by vocal imitation.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Metric learning based data augmentation for environmental sound classification.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Deep Cross-Modal Audio-Visual Generation.
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017

Video-Based Vibrato Detection and Analysis for Polyphonic String Music.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

A Metric for Music Notation Transcription Accuracy.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Deep ranking: Triplet MatchNet for music metric learning.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

See and listen: Score-informed association of sound tracks to players in chamber music performance videos.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Visually informed multi-pitch analysis of string ensembles.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
An Approach to Score Following for Piano Performances With the Sustained Effect.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2016

Context-Dependent Piano Music Transcription With Convolutional Sparse Coding.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2016

Creating A Musical Performance Dataset for Multimodal Music Analysis: Challenges, Insights, and Applications.
CoRR, 2016

Transcribing Human Piano Performances into Music Notation.
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

WISE: Web-based Interactive Speech Emotion Classification.
Proceedings of the 4th Workshop on Sentiment Analysis where AI meets Psychology (SAAIP 2016) co-located with 25th International Joint Conference on Artificial Intelligence (IJCAI 2016), 2016

IMISOUND: An unsupervised system for sound query by vocal imitation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Emotion classification: How does an automated system compare to Naive human coders?
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Emotion Classification: How Does an Automated System Compare to Naive Human Coders?
CoRR, 2015

Rotational reset strategy for online semi-supervised NMF-based speech enhancement for long recordings.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

Retrieving sounds by vocal imitation recognition.
Proceedings of the 25th IEEE International Workshop on Machine Learning for Signal Processing, 2015

Piano music transcription with fast convolutional sparse coding.
Proceedings of the 25th IEEE International Workshop on Machine Learning for Signal Processing, 2015

Score Following for Piano Performances with Sustain-Pedal Effects.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015

Piano music transcription modeling note temporal evolution.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Combining rhythm-based and pitch-based methods for background and melody separation.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2014

Multi-pitch Streaming of Harmonic Sound Mixtures.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2014

Note-level Music Transcription by Maximum Likelihood Sampling.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

A novel cepstral representation for timbre modeling of sound sources in polyphonic mixtures.
Proceedings of the IEEE International Conference on Acoustics, 2014

2012
Speech Enhancement by Online Non-negative Spectrogram Decomposition in Non-stationary Noise Environments.
Proceedings of the INTERSPEECH 2012, 2012

Online PLCA for Real-Time Semi-supervised Source Separation.
Proceedings of the Latent Variable Analysis and Signal Separation, 2012

2011
Soundprism: An Online System for Score-Informed Source Separation of Music Audio.
J. Sel. Topics Signal Processing, 2011

Aligning Semi-Improvised Music Audio with Its Lead Sheet.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

A state space model for online polyphonic audio-score alignment.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Multiple Fundamental Frequency Estimation by Modeling Spectral Peaks and Non-Peak Regions.
IEEE Trans. Audio, Speech & Language Processing, 2010

Song-level multi-pitch tracking by heavily constrained clustering.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Harmonically Informed Multi-Pitch Tracking.
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

2008
Unsupervised Single-Channel Music Source Separation by Average Harmonic Structure Modeling.
IEEE Trans. Audio, Speech & Language Processing, 2008

Collective Annotation of Music from Multiple Semantic Categories.
Proceedings of the ISMIR 2008, 2008

Audio tonality mode classification without tonic annotations.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

2007
Multi-Pitch Estimation Based on Partial Event and Support Transfer.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Excitation signal Extraction for Guitar tones.
Proceedings of the 2007 International Computer Music Conference, 2007


  Loading...