Qiuqiang Kong

Orcid: 0000-0003-2864-0475

According to our database¹, Qiuqiang Kong authored at least 95 papers between 2016 and 2024.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

End-to-End Paired Ambisonic-Binaural Audio Rendering.

[BibT_eX]

[DOI]

IEEE CAA J. Autom. Sinica, February, 2024

Learning Temporal Resolution in Spectrogram for Audio Classification.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Transformer-based autoencoder with ID constraint for unsupervised anomalous sound detection.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., December, 2023

Selective-Memory Meta-Learning with Environment Representations for Sound Event Localization and Detection.

[BibT_eX]

[DOI]

CoRR, 2023

Joint Music and Language Attention Models for Zero-shot Music Tagging.

[BibT_eX]

[DOI]

CoRR, 2023

MERTech: Instrument Playing Technique Detection Using Self-Supervised Pretrained Model With Multi-Task Finetuning.

[BibT_eX]

[DOI]

CoRR, 2023

Music Source Separation with Band-Split RoPE Transformer.

[BibT_eX]

[DOI]

CoRR, 2023

AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining.

[BibT_eX]

[DOI]

CoRR, 2023

Separate Anything You Describe.

[BibT_eX]

[DOI]

CoRR, 2023

WavJourney: Compositional Audio Creation with Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

a unified front-end framework for english text-to-speech synthesis.

[BibT_eX]

[DOI]

CoRR, 2023

Universal Source Separation with Weakly Labelled Data.

[BibT_eX]

[DOI]

Taylor Berg-Kirkpatrick

Shlomo Dubnov

Mark D. Plumbley

CoRR, 2023

Multi-level Temporal-channel Speaker Retrieval for Robust Zero-shot Voice Conversion.

[BibT_eX]

[DOI]

CoRR, 2023

WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research.

[BibT_eX]

[DOI]

CoRR, 2023

Jointist: Simultaneous Improvement of Multi-instrument Transcription and Music Source Separation via Joint Training.

[BibT_eX]

[DOI]

CoRR, 2023

Simple Pooling Front-Ends for Efficient Audio Classification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Streaming Voice Conversion via Intermediate Bottleneck Features and Non-Streaming Teacher Guidance.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

GiantMIDI-Piano: A Large-Scale MIDI Dataset for Classical Piano Music.

[BibT_eX]

[DOI]

Trans. Int. Soc. Music. Inf. Retr., 2022

Feature Alignment for Robust Acoustic Scene Classification Across Devices.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2022

Ontology-aware Learning and Evaluation for Audio Tagging.

[BibT_eX]

[DOI]

CoRR, 2022

Binaural Rendering of Ambisonic Signals by Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2022

Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention.

[BibT_eX]

[DOI]

CoRR, 2022

Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance.

[BibT_eX]

[DOI]

CoRR, 2022

Neural Sound Field Decomposition with Super-resolution of Sound Direction.

[BibT_eX]

[DOI]

CoRR, 2022

Learning the Spectrogram Temporal Resolution for Audio Classification.

[BibT_eX]

[DOI]

CoRR, 2022

Surrey System for DCASE 2022 Task 5: Few-shot Bioacoustic Event Detection with Segment-level Metric Learning.

[BibT_eX]

[DOI]

CoRR, 2022

Jointist: Joint Learning for Multi-instrument Transcription and Its Applications.

[BibT_eX]

[DOI]

CoRR, 2022

Performance MIDI-to-score conversion by neural beat tracking.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2022, 2022

Separate What You Describe: Language-Queried Audio Source Separation.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2022, 2022

Neural Vocoder is All You Need for Speech Super-resolution.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2022, 2022

A Track-Wise Ensemble Event Independent Network for Polyphonic Sound Event Localization and Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Segment-Level Metric Learning for Few-Shot Bioacoustic Event Detection.

[BibT_eX]

[DOI]

Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

Sound Event Localization and Detection for Real Spatial Sound Scenes: Event-Independent Network and Data Augmentation Chains.

[BibT_eX]

[DOI]

Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021

CAA-Net: Conditional Atrous CNNs With Attention for Explainable Device-Robust Acoustic Scene Classification.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2021

Deep Learning-Based Energy Disaggregation and On/Off Detection of Household Appliances.

[BibT_eX]

[DOI]

ACM Trans. Knowl. Discov. Data, 2021

High-Resolution Piano Transcription With Pedals by Regressing Onset and Offset Times.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2021

CWS-PResUNet: Music Source Separation with Channel-wise Subband Phase-aware ResUNet.

[BibT_eX]

[DOI]

Haohe Liu

Qiuqiang Kong

Jiafeng Liu

CoRR, 2021

VoiceFixer: Toward General Speech Restoration With Neural Vocoder.

[BibT_eX]

[DOI]

CoRR, 2021

An Audio-Based Deep Learning Framework ForBBC Television Programme Classification.

[BibT_eX]

[DOI]

CoRR, 2021

Time-domain Speech Enhancement with Generative Adversarial Learning.

[BibT_eX]

[DOI]

CoRR, 2021

CatNet: music source separation system with mix-audio augmentation.

[BibT_eX]

[DOI]

CoRR, 2021

A unified model for zero-shot music source separation, transcription and synthesis.

[BibT_eX]

[DOI]

Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation.

[BibT_eX]

[DOI]

Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Speech Enhancement with Weakly Labelled Data from AudioSet.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Singing Melody Extraction from Polyphonic Music based on Spectral Correlation Modeling.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

An Audio-Based Deep Learning Framework For BBC Television Programme Classification.

[BibT_eX]

[DOI]

Proceedings of the 29th European Signal Processing Conference, 2021

2020

Audio Tagging by Cross Filtering Noisy Labels.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Sound Event Detection of Weakly Labelled Data With CNN-Transformer and Automatic Threshold Optimization.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Large-Scale MIDI-based Composer Classification.

[BibT_eX]

[DOI]

Qiuqiang Kong

Keunwoo Choi

Yuxuan Wang

CoRR, 2020

High-resolution Piano Transcription with Pedals by Regressing Onsets and Offsets Times.

[BibT_eX]

[DOI]

CoRR, 2020

DD-CNN: Depthwise Disout Convolutional Neural Network for Low-complexity Acoustic Scene Classification.

[BibT_eX]

[DOI]

CoRR, 2020

Source Separation with Weakly Labelled Data: an Approach to Computational Auditory Scene Analysis.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Learning With Out-of-Distribution Data for Audio Classification.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Audio for Audio is Better? An Investigation on Transfer Learning Models for Heart Sound Classification.

[BibT_eX]

[DOI]

Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020

Event-Independent Network for Polyphonic Sound Event Localization and Detection.

[BibT_eX]

[DOI]

Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

2019

Weakly Labelled AudioSet Tagging With Attention Neural Networks.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Sound Event Detection and Time-Frequency Segmentation from Weakly Labelled Data.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Multi Model-Based Distillation for Sound Event Detection.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2019

Deep Learning Based Energy Disaggregation and On/Off Detection of Household Appliances.

[BibT_eX]

[DOI]

CoRR, 2019

Cross-task learning for audio tagging, sound event detection and spatial localization: DCASE 2019 baseline systems.

[BibT_eX]

[DOI]

CoRR, 2019

Weakly labelled AudioSet Classification with Attention Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2019

Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Divergence Based Weighting for Information Channels in Deep Convolutional Neural Networks for Bird Audio Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acoustic Scenes.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Acoustic Scene Generation with Conditional Samplernn.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Sound Event Detection with Sequentially Labelled Data Based on Connectionist Temporal Classification and Unsupervised Clustering.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Multi-instance Learning for Bipolar Disorder Diagnosis using Weakly Labelled Speech Data.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Digital Public Health, 2019

Polyphonic Sound Event Detection and Localization using a Two-Stage Strategy.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

2018

Weakly supervised CRNN system for sound event detection with large-scale unlabeled in-domain data.

[BibT_eX]

[DOI]

CoRR, 2018

General audio tagging with ensembling convolutional neural network and statistical features.

[BibT_eX]

[DOI]

CoRR, 2018

Audio Tagging With Connectionist Temporal Classification Model Using Sequential Labelled Data.

[BibT_eX]

[DOI]

Yuanbo Hou

Qiuqiang Kong

Shengchen Li

CoRR, 2018

DCASE 2018 Challenge baseline with convolutional neural networks.

[BibT_eX]

[DOI]

CoRR, 2018

Predicting Appliance Usage Status In Home Like Environments.

[BibT_eX]

[DOI]

Proceedings of the 23rd IEEE International Conference on Digital Signal Processing, 2018

A Joint Separation-Classification Model for Sound Event Detection of Weakly Labelled Data.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Audio Set Classification with Attention Model: A Probabilistic Perspective.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Large-Scale Weakly Supervised Audio Classification Using Gated Convolutional Neural Network.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Improving Reverberant Speech Separation with Binaural Cues Using Temporal Context and Convolutional Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Latent Variable Analysis and Signal Separation, 2018

Capsule Routing for Sound Event Detection.

[BibT_eX]

[DOI]

Proceedings of the 26th European Signal Processing Conference, 2018

Multi-level attention model for weakly supervised audio classification.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

Sample mixed-based data augmentation for domestic audio tagging.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

Attention-based convolutional neural networks for acoustic scene classification.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

DCASE 2018 Challenge Surrey cross-task convolutional neural network baseline.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

General-purpose audio tagging from noisy labels using convolutional neural networks.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

Polyphonic audio tagging with sequentially labelled data using CRNN with learnable gated linear units.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

Audio Tagging With Connectionist Temporal Classification Model Using Sequentially Labelled Data.

[BibT_eX]

[DOI]

Yuanbo Hou

Qiuqiang Kong

Shengchen Li

Proceedings of the Communications, Signal Processing, and Systems, 2018

2017

Surrey-cvssp system for DCASE2017 challenge task4.

[BibT_eX]

[DOI]

CoRR, 2017

Attention and Localization Based on a Deep Convolutional Recurrent Model for Weakly Supervised Audio Tagging.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2017, 2017

Convolutional gated recurrent neural network incorporating spatial features for audio tagging.

[BibT_eX]

[DOI]

Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

A joint detection-classification model for audio tagging of weakly labelled data.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Masked non-negative matrix factorization for eire detection using weakly labeled data.

[BibT_eX]

[DOI]

Iwona Sobieraj

Qiuqiang Kong

Mark D. Plumbley

Proceedings of the 25th European Signal Processing Conference, 2017

Joint detection and classification convolutional neural network on weakly labelled bird audio detection.

[BibT_eX]

[DOI]

Qiuqiang Kong

Yong Xu

Mark D. Plumbley

Proceedings of the 25th European Signal Processing Conference, 2017

2016

Deep Neural Network Baseline for DCASE Challenge 2016.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

Qiuqiang Kong

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...