Qiuqiang Kong

Orcid: 0000-0003-2864-0475

According to our database1, Qiuqiang Kong authored at least 95 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
End-to-End Paired Ambisonic-Binaural Audio Rendering.
IEEE CAA J. Autom. Sinica, February, 2024

Learning Temporal Resolution in Spectrogram for Audio Classification.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Transformer-based autoencoder with ID constraint for unsupervised anomalous sound detection.
EURASIP J. Audio Speech Music. Process., December, 2023

Selective-Memory Meta-Learning with Environment Representations for Sound Event Localization and Detection.
CoRR, 2023

Joint Music and Language Attention Models for Zero-shot Music Tagging.
CoRR, 2023

MERTech: Instrument Playing Technique Detection Using Self-Supervised Pretrained Model With Multi-Task Finetuning.
CoRR, 2023

Music Source Separation with Band-Split RoPE Transformer.
CoRR, 2023

AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining.
CoRR, 2023

Separate Anything You Describe.
CoRR, 2023

WavJourney: Compositional Audio Creation with Large Language Models.
CoRR, 2023

a unified front-end framework for english text-to-speech synthesis.
CoRR, 2023

Universal Source Separation with Weakly Labelled Data.
CoRR, 2023

Multi-level Temporal-channel Speaker Retrieval for Robust Zero-shot Voice Conversion.
CoRR, 2023

WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research.
CoRR, 2023

Jointist: Simultaneous Improvement of Multi-instrument Transcription and Music Source Separation via Joint Training.
CoRR, 2023

Simple Pooling Front-Ends for Efficient Audio Classification.
Proceedings of the IEEE International Conference on Acoustics, 2023

Streaming Voice Conversion via Intermediate Bottleneck Features and Non-Streaming Teacher Guidance.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
GiantMIDI-Piano: A Large-Scale MIDI Dataset for Classical Piano Music.
Trans. Int. Soc. Music. Inf. Retr., 2022

Feature Alignment for Robust Acoustic Scene Classification Across Devices.
IEEE Signal Process. Lett., 2022

Ontology-aware Learning and Evaluation for Audio Tagging.
CoRR, 2022

Binaural Rendering of Ambisonic Signals by Neural Networks.
CoRR, 2022

Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention.
CoRR, 2022

Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance.
CoRR, 2022

Neural Sound Field Decomposition with Super-resolution of Sound Direction.
CoRR, 2022

Learning the Spectrogram Temporal Resolution for Audio Classification.
CoRR, 2022

Surrey System for DCASE 2022 Task 5: Few-shot Bioacoustic Event Detection with Segment-level Metric Learning.
CoRR, 2022

Jointist: Joint Learning for Multi-instrument Transcription and Its Applications.
CoRR, 2022

Performance MIDI-to-score conversion by neural beat tracking.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration.
Proceedings of the Interspeech 2022, 2022

Separate What You Describe: Language-Queried Audio Source Separation.
Proceedings of the Interspeech 2022, 2022

Neural Vocoder is All You Need for Speech Super-resolution.
Proceedings of the Interspeech 2022, 2022

A Track-Wise Ensemble Event Independent Network for Polyphonic Sound Event Localization and Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022

Segment-Level Metric Learning for Few-Shot Bioacoustic Event Detection.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

Sound Event Localization and Detection for Real Spatial Sound Scenes: Event-Independent Network and Data Augmentation Chains.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021
CAA-Net: Conditional Atrous CNNs With Attention for Explainable Device-Robust Acoustic Scene Classification.
IEEE Trans. Multim., 2021

Deep Learning-Based Energy Disaggregation and On/Off Detection of Household Appliances.
ACM Trans. Knowl. Discov. Data, 2021

High-Resolution Piano Transcription With Pedals by Regressing Onset and Offset Times.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

CWS-PResUNet: Music Source Separation with Channel-wise Subband Phase-aware ResUNet.
CoRR, 2021

VoiceFixer: Toward General Speech Restoration With Neural Vocoder.
CoRR, 2021

An Audio-Based Deep Learning Framework ForBBC Television Programme Classification.
CoRR, 2021

Time-domain Speech Enhancement with Generative Adversarial Learning.
CoRR, 2021

CatNet: music source separation system with mix-audio augmentation.
CoRR, 2021

A unified model for zero-shot music source separation, transcription and synthesis.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Speech Enhancement with Weakly Labelled Data from AudioSet.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Singing Melody Extraction from Polyphonic Music based on Spectral Correlation Modeling.
Proceedings of the IEEE International Conference on Acoustics, 2021

An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection.
Proceedings of the IEEE International Conference on Acoustics, 2021

An Audio-Based Deep Learning Framework For BBC Television Programme Classification.
Proceedings of the 29th European Signal Processing Conference, 2021

2020
Audio Tagging by Cross Filtering Noisy Labels.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Sound Event Detection of Weakly Labelled Data With CNN-Transformer and Automatic Threshold Optimization.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Large-Scale MIDI-based Composer Classification.
CoRR, 2020

High-resolution Piano Transcription with Pedals by Regressing Onsets and Offsets Times.
CoRR, 2020

DD-CNN: Depthwise Disout Convolutional Neural Network for Low-complexity Acoustic Scene Classification.
CoRR, 2020

Source Separation with Weakly Labelled Data: an Approach to Computational Auditory Scene Analysis.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Learning With Out-of-Distribution Data for Audio Classification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Audio for Audio is Better? An Investigation on Transfer Learning Models for Heart Sound Classification.
Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020

Event-Independent Network for Polyphonic Sound Event Localization and Detection.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

2019
Weakly Labelled AudioSet Tagging With Attention Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Sound Event Detection and Time-Frequency Segmentation from Weakly Labelled Data.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Multi Model-Based Distillation for Sound Event Detection.
IEICE Trans. Inf. Syst., 2019

Deep Learning Based Energy Disaggregation and On/Off Detection of Household Appliances.
CoRR, 2019

Cross-task learning for audio tagging, sound event detection and spatial localization: DCASE 2019 baseline systems.
CoRR, 2019

Weakly labelled AudioSet Classification with Attention Neural Networks.
CoRR, 2019

Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Divergence Based Weighting for Information Channels in Deep Convolutional Neural Networks for Bird Audio Detection.
Proceedings of the IEEE International Conference on Acoustics, 2019

Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acoustic Scenes.
Proceedings of the IEEE International Conference on Acoustics, 2019

Acoustic Scene Generation with Conditional Samplernn.
Proceedings of the IEEE International Conference on Acoustics, 2019

Sound Event Detection with Sequentially Labelled Data Based on Connectionist Temporal Classification and Unsupervised Clustering.
Proceedings of the IEEE International Conference on Acoustics, 2019

Multi-instance Learning for Bipolar Disorder Diagnosis using Weakly Labelled Speech Data.
Proceedings of the 9th International Conference on Digital Public Health, 2019

Polyphonic Sound Event Detection and Localization using a Two-Stage Strategy.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

2018
Weakly supervised CRNN system for sound event detection with large-scale unlabeled in-domain data.
CoRR, 2018

General audio tagging with ensembling convolutional neural network and statistical features.
CoRR, 2018

Audio Tagging With Connectionist Temporal Classification Model Using Sequential Labelled Data.
CoRR, 2018

DCASE 2018 Challenge baseline with convolutional neural networks.
CoRR, 2018

Predicting Appliance Usage Status In Home Like Environments.
Proceedings of the 23rd IEEE International Conference on Digital Signal Processing, 2018

A Joint Separation-Classification Model for Sound Event Detection of Weakly Labelled Data.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Audio Set Classification with Attention Model: A Probabilistic Perspective.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Large-Scale Weakly Supervised Audio Classification Using Gated Convolutional Neural Network.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Improving Reverberant Speech Separation with Binaural Cues Using Temporal Context and Convolutional Neural Networks.
Proceedings of the Latent Variable Analysis and Signal Separation, 2018

Capsule Routing for Sound Event Detection.
Proceedings of the 26th European Signal Processing Conference, 2018

Multi-level attention model for weakly supervised audio classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

Sample mixed-based data augmentation for domestic audio tagging.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

Attention-based convolutional neural networks for acoustic scene classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

DCASE 2018 Challenge Surrey cross-task convolutional neural network baseline.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

General-purpose audio tagging from noisy labels using convolutional neural networks.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

Polyphonic audio tagging with sequentially labelled data using CRNN with learnable gated linear units.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

Audio Tagging With Connectionist Temporal Classification Model Using Sequentially Labelled Data.
Proceedings of the Communications, Signal Processing, and Systems, 2018

2017
Surrey-cvssp system for DCASE2017 challenge task4.
CoRR, 2017

Attention and Localization Based on a Deep Convolutional Recurrent Model for Weakly Supervised Audio Tagging.
Proceedings of the Interspeech 2017, 2017

Convolutional gated recurrent neural network incorporating spatial features for audio tagging.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

A joint detection-classification model for audio tagging of weakly labelled data.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Masked non-negative matrix factorization for eire detection using weakly labeled data.
Proceedings of the 25th European Signal Processing Conference, 2017

Joint detection and classification convolutional neural network on weakly labelled bird audio detection.
Proceedings of the 25th European Signal Processing Conference, 2017

2016
Deep Neural Network Baseline for DCASE Challenge 2016.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016


  Loading...