Bilei Zhu

According to our database¹, Bilei Zhu authored at least 29 papers between 2010 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

GaMMA: Towards Joint Global-Temporal Music Understanding in Large Multimodal Models.

[BibT_eX]

[DOI]

CoRR, May, 2026

2024

ByteComposer: a Human-like Melody Composition Method based on Language Model Agent.

[BibT_eX]

[DOI]

CoRR, 2024

X-Cover: Better Music Version Identification System by Integrating Pretrained ASR Model.

[BibT_eX]

[DOI]

Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024

MINT: Boosting Audio-Language Model via Multi-Target Pre-Training and Instruction Tuning.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

ByteHum: Fast and Accurate Query-by-Humming in the Wild.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Joint Music and Language Attention Models for Zero-Shot Music Tagging.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Graph contrastive learning with implicit augmentations.

[BibT_eX]

[DOI]

Neural Networks, 2023

Bytecover3: Accurate Cover Song Identification On Short Queries.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

GIO: A Timbre-informed Approach for Pitch Tracking in Highly Noisy Environments.

[BibT_eX]

[DOI]

Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

Latent feature augmentation for chorus detection.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

S3T: Self-Supervised Pre-Training with Swin Transformer For Music Classification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Bytecover2: Towards Dimensionality Reduction of Latent Embedding for Efficient Cover Song Identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection.

[BibT_eX]

[DOI]

Taylor Berg-Kirkpatrick

Shlomo Dubnov

Proceedings of the IEEE International Conference on Acoustics, 2022

Zero-Shot Audio Source Separation through Query-Based Learning from Weakly-Labeled Data.

[BibT_eX]

[DOI]

Taylor Berg-Kirkpatrick

Shlomo Dubnov

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Attention-Based Cross-Modal Fusion for Audio-Visual Voice Activity Detection in Musical Video Streams.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Rule-Embedded Network for Audio-Visual Voice Activity Detection in Live Musical Video Streams.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

An Hrnet-Blstm Model With Two-Stage Training For Singing Melody Extraction.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Singing Melody Extraction from Polyphonic Music based on Spectral Correlation Modeling.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Bytecover: Cover Song Identification Via Multi-Loss Training.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Contrastive Unsupervised Learning for Audio Fingerprinting.

[BibT_eX]

[DOI]

CoRR, 2020

2019

Vocal Melody Extraction via DNN-based Pitch Estimation and Salience-based Pitch Refinement.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

2017

Fusing transcription results from polyphonic and monophonic audio for singing melody transcription in polyphonic music.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2015

SIFT-based local spectrogram image descriptor: a novel feature for robust music identification.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2015

Towards Solving the Bottleneck of Pitch-based Singing Voice Separation.

[BibT_eX]

[DOI]

Bilei Zhu

Wei Li

Linwei Li

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Latent time-frequency component analysis: A novel pitch-based approach for singing voice separation.

[BibT_eX]

[DOI]

Xiu Zhang

Wei Li

Bilei Zhu

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2013

Multi-Stage Non-Negative Matrix Factorization for Monaural Singing Voice Separation.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2013

2012

On the music content authentication.

[BibT_eX]

[DOI]

Wei Li

Bilei Zhu

Zhurong Wang

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

2010

A novel audio fingerprinting method robust to time scale modification and pitch shifting.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Multimedia 2010, 2010

Robust hashing for music copyright protection by combining beat segmentation and chroma.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Multimedia 2010, 2010

Bilei Zhu

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...