Wei Li

Orcid: 0000-0002-4486-8341

Affiliations:
  • Fudan University, School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, China (PhD 2004)


According to our database1, Wei Li authored at least 62 papers between 2003 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Stripe-Transformer: deep stripe feature learning for music source separation.
EURASIP J. Audio Speech Music. Process., December, 2023

Multi-scale network with shared cross-attention for audio-visual correlation learning.
Neural Comput. Appl., September, 2023

Melody Generation from Lyrics with Local Interpretability.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Variational Autoencoder with CCA for Audio-Visual Cross-modal Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2023

A neural harmonic-aware network with gated attentive fusion for singing melody extraction.
Neurocomputing, 2023

MERTech: Instrument Playing Technique Detection Using Self-Supervised Pretrained Model With Multi-Task Finetuning.
CoRR, 2023

Phonetic and Prosody-aware Self-supervised Learning Approach for Non-native Fluency Scoring.
CoRR, 2023

Frame-Level Multi-Label Playing Technique Detection Using Multi-Scale Network and Self-Attention Mechanism.
CoRR, 2023

MFAE: Masked frame-level autoencoder with hybrid-supervision for low-resource music transcription.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

LC-Beating: An Online System for Beat and Downbeat Tracking using Latency-Controlled Mechanism.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

An ASR-Free Fluency Scoring Approach with Self-Supervised Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023

Leveraging Phone-Level Linguistic-Acoustic Similarity For Utterance-Level Pronunciation Scoring.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
SoccerNet 2022 Challenges Results.
CoRR, 2022

Improving Non-native Word-level Pronunciation Scoring with Phone-level Mixup Data Augmentation and Multi-source Information.
CoRR, 2022

Melody Generation from Lyrics Using Three Branch Conditional LSTM-GAN.
Proceedings of the MultiMedia Modeling - 28th International Conference, 2022

Singing Voice Detection via Similarity-Based Semi-Supervised Learning.
Proceedings of the 4th ACM International Conference on Multimedia in Asia, 2022


HPPNet: Modeling the Harmonic Structure and Pitch Invariance in Piano Transcription.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Automatic Chinese National Pentatonic Modes Recognition Using Convolutional Neural Network.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Playing Technique Detection by Fusing Note Onset Information in Guzheng Performance.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Using Fluency Representation Learned from Sequential Raw Features for Improving Non-native Fluency Scoring.
Proceedings of the Interspeech 2022, 2022

Multimodal Music Emotion Recognition with Hierarchical Cross-Modal Attention Network.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

HarmoF0: Logarithmic Scale Dilated Convolution for Pitch Estimation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Deepchorus: A Hybrid Model of Multi-Scale Convolution And Self-Attention for Chorus Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022

Robust Capacity of Wireless Networks Under Cascading Failures.
Proceedings of the IEEE Global Communications Conference, 2022

2021
HANME: Hierarchical Attention Network for Singing Melody Extraction.
IEEE Signal Process. Lett., 2021

Musical Tempo Estimation Using a Multi-scale Network.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Singer Identification Using Deep Timbre Feature Learning with KNN-NET.
Proceedings of the IEEE International Conference on Acoustics, 2021

Frequency-Temporal Attention Network for Singing Melody Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2021

An Hrnet-Blstm Model With Two-Stage Training For Singing Melody Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Music Artist Classification with WaveNet Classifier for Raw Waveform Audio Data.
CoRR, 2020

Comparison for Improvements of Singing Voice Detection System Based on Vocal Separation.
CoRR, 2020

Residual Attention Based Network for Automatic Classification of Phonation Modes.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

2019
Automatic Audio Chord Recognition With MIDI-Trained Deep Feature and BLSTM-CRF Sequence Decoding Model.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Vocal Melody Extraction via DNN-based Pitch Estimation and Salience-based Pitch Refinement.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Music Chord Recognition Based on Midi-Trained Deep Feature and BLSTM-CRF Hybird Decoding.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
流行音乐主旋律提取技术综述 (Review on Main Melody Extraction from Pop Music).
计算机科学, 2017

2015
SIFT-based local spectrogram image descriptor: a novel feature for robust music identification.
EURASIP J. Audio Speech Music. Process., 2015

Towards Solving the Bottleneck of Pitch-based Singing Voice Separation.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Latent time-frequency component analysis: A novel pitch-based approach for singing voice separation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2013
Multi-Stage Non-Negative Matrix Factorization for Monaural Singing Voice Separation.
IEEE Trans. Speech Audio Process., 2013

Low-order auditory Zernike moment: a novel approach for robust music identification in the compressed domain.
EURASIP J. Adv. Signal Process., 2013

Music content authentication based on beat segmentation and fuzzy classification.
EURASIP J. Audio Speech Music. Process., 2013

2012
A Double-Ranking Strategy for Long-Tail Product Recommendation.
Proceedings of the 2012 IEEE/WIC/ACM International Conferences on Web Intelligence, 2012

On the music content authentication.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

2011
Towards content-based audio fragment authentication.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

2010
Robust music identification based on low-order zernike moment in the compressed domain.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Robust audio identification for MP3 popular music.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

A novel audio fingerprinting method robust to time scale modification and pitch shifting.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Robust hashing for music copyright protection by combining beat segmentation and chroma.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

2009
A Robust Mesh Watermarking Scheme Based on PCA.
Proceedings of the Fifth International Conference on Image and Graphics, 2009

2008
Audio Quality-Based Authentication Using Wavelet Packet Decomposition and Best Tree Selection.
Proceedings of the 4th International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2008), 2008

2006
Localized audio watermarking technique robust against time-scale modification.
IEEE Trans. Multim., 2006

2004
Multilingual Collection Retrieving Via Ontology Alignment.
Proceedings of the Digital Libraries: International Collaboration and Cross-Fertilization, 2004

2003
An Audio Watermarking Technique That Is Robust Against Random Cropping.
Comput. Music. J., 2003

Audio Watermarking Based on Statistical Feature in Wavelet Domain.
Proceedings of the Twelfth International World Wide Web Conference - Posters, 2003

Content Based Localized Robust Audio Watermarking.
Proceedings of the Interactive Multimedia on Next Generation Networks, 2003

Audio Watermarking Based on Music Content Analysis: Robust against Time Scale Modification.
Proceedings of the Digital Watermarking, Second International Workshop, 2003

A Novel Feature-Based Robust Audio Watermarking for Copyright Protection.
Proceedings of the 2003 International Symposium on Information Technology (ITCC 2003), 2003

Multi-channel Data Hiding Scheme for Color Images.
Proceedings of the 2003 International Symposium on Information Technology (ITCC 2003), 2003

An Optimized Multi-bits Blind Watermarking Scheme.
Proceedings of the Information and Communications Security, 5th International Conference, 2003

Robust Spatial Data Hiding for Color Images.
Proceedings of the Communications and Multimedia Security, 2003


  Loading...