Wenwu Wang

Orcid: 0000-0002-8393-5703

Affiliations:
  • University of Surrey, Guildford, UK


According to our database1, Wenwu Wang authored at least 340 papers between 2003 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Three-Direction Fusion for Accurate Volumetric Liver and Tumor Segmentation.
IEEE J. Biomed. Health Informatics, April, 2024

Labelled Non-Zero Diffusion Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking.
IEEE Trans. Multim., 2024

Cooperative Scene-Event Modelling for Acoustic Scene Classification.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Learning Temporal Resolution in Spectrogram for Audio Classification.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Transformer-based autoencoder with ID constraint for unsupervised anomalous sound detection.
EURASIP J. Audio Speech Music. Process., December, 2023

UAV-Enabled Mobile Edge Computing for Resource Allocation Using Cooperative Evolutionary Computation.
IEEE Trans. Aerosp. Electron. Syst., October, 2023

A long-range aerial acoustic communication scheme.
Phys. Commun., October, 2023

End-to-end translation of human neural activity to speech with a dual-dual generative adversarial network.
Knowl. Based Syst., October, 2023

Discriminative analysis dictionary learning with adaptively ordinal locality preserving.
Neural Networks, August, 2023

Audio-Visual Event Localization by Learning Spatial and Semantic Co-Attention.
IEEE Trans. Multim., 2023

ACTUAL: Audio Captioning With Caption Feature Space Regularization.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Unsupervised Deep Unfolded Representation Learning for Singing Voice Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

U-Shaped Transformer With Frequency-Band Aware Attention for Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Graph Attention for Automated Audio Captioning.
IEEE Signal Process. Lett., 2023

Audio Event-Relational Graph Representation Learning for Acoustic Scene Classification.
IEEE Signal Process. Lett., 2023

Multi-level graph learning for audio event classification and human-perceived annoyance rating prediction.
CoRR, 2023

Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection.
CoRR, 2023

Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions.
CoRR, 2023

First-Shot Unsupervised Anomalous Sound Detection With Unknown Anomalies Estimated by Metadata-Assisted Audio Generation.
CoRR, 2023

CM-PIE: Cross-modal perception for interactive-enhanced audio-visual video parsing.
CoRR, 2023

Audio Visual Speaker Localization from EgoCentric Views.
CoRR, 2023

Synth-AC: Enhancing Audio Captioning with Synthetic Supervision.
CoRR, 2023

Retrieval-Augmented Text-to-Audio Generation.
CoRR, 2023

Hierarchical Metadata Information Constrained Self-Supervised Learning for Anomalous Sound Detection Under Domain Shift.
CoRR, 2023

AudioSR: Versatile Audio Super-resolution at Scale.
CoRR, 2023

Multimodal Fish Feeding Intensity Assessment in Aquaculture.
CoRR, 2023

Joint Prediction of Audio Event and Annoyance Rating in an Urban Soundscape by Hierarchical Graph Representation Learning.
CoRR, 2023

META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection.
CoRR, 2023

AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining.
CoRR, 2023

Separate Anything You Describe.
CoRR, 2023

WavJourney: Compositional Audio Creation with Large Language Models.
CoRR, 2023

Text-Driven Foley Sound Generation With Latent Diffusion Model.
CoRR, 2023

Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning.
CoRR, 2023

Adapting Language-Audio Models as Few-Shot Audio Learners.
CoRR, 2023

Latent Diffusion Model Based Foley Sound Generation System For DCASE Challenge 2023 Task 7.
CoRR, 2023

WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research.
CoRR, 2023

Leveraging Pre-trained AudioLDM for Text to Sound Generation: A Benchmark Study.
CoRR, 2023

TransAMR: An Interpretable Transformer Model for Accurate Prediction of Antimicrobial Resistance Using Antibiotic Administration Data.
IEEE Access, 2023

Differentiable Bootstrap Particle Filters for Regime-Switching Models.
Proceedings of the IEEE Statistical Signal Processing Workshop, 2023

AudioLDM: Text-to-Audio Generation with Latent Diffusion Models.
Proceedings of the International Conference on Machine Learning, 2023

A Semantics-Aware Normalizing Flow Model for Anomaly Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

DLAHSD: Dynamic Label Adopted In Auxiliary Head for SAR Detection.
Proceedings of the IEEE International Conference on Image Processing, 2023

An Improved Optimal Transport Kernel Embedding Method with Gating Mechanism for Singing Voice Separation and Speaker Identification.
Proceedings of the IEEE International Conference on Acoustics, 2023

Simple Pooling Front-Ends for Efficient Audio Classification.
Proceedings of the IEEE International Conference on Acoustics, 2023

Gct: Gated Contextual Transformer for Sequential Audio Tagging.
Proceedings of the IEEE International Conference on Acoustics, 2023

Anomalous Sound Detection Using Audio Representation with Machine ID Based Contrastive Learning Pretraining.
Proceedings of the IEEE International Conference on Acoustics, 2023

Time-Weighted Frequency Domain Audio Representation with GMM Estimator for Anomalous Sound Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023

Visual-Haptic-Kinesthetic Object Recognition with Multimodal Transformer.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2023, 2023

Leveraging Pre-Trained AudioLDM for Sound Generation: A Benchmark Study.
Proceedings of the 31st European Signal Processing Conference, 2023

Enhancing Audio Retrieval with Attention-based Encoder for Audio Feature Representation.
Proceedings of the 31st European Signal Processing Conference, 2023

Knowledge Distillation for Efficient Audio-Visual Video Captioning.
Proceedings of the 31st European Signal Processing Conference, 2023

Learning Retrieval Augmentation for Personalized Dialogue Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Personalized Dialogue Generation with Persona-Adaptive Attention.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Acoustic Source Localization in the Circular Harmonic Domain Using Deep Learning Architecture.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Local Information Assisted Attention-Free Decoder for Audio Captioning.
IEEE Signal Process. Lett., 2022

Speech emotion recognition using data augmentation method by cycle-generative adversarial networks.
Signal Image Video Process., 2022

Reconstruction of Subsurface Salinity Structure in the South China Sea Using Satellite Observations: A LightGBM-Based Deep Forest Method.
Remote. Sens., 2022

Support vector machine embedding discriminative dictionary pair learning for pattern classification.
Neural Networks, 2022

Multiscale Deep Neural Network With Two-Stage Loss for SAR Target Recognition With Small Training Set.
IEEE Geosci. Remote. Sens. Lett., 2022

Automated audio captioning: an overview of recent progress and new challenges.
EURASIP J. Audio Speech Music. Process., 2022

Improved capsule routing for weakly labeled sound event detection.
EURASIP J. Audio Speech Music. Process., 2022

One to multiple mapping dual learning: Learning multiple signals from one mixture.
Digit. Signal Process., 2022

Unpaired Overwater Image Defogging Using Prior Map Guided CycleGAN.
CoRR, 2022

Towards Generating Diverse Audio Captions via Adversarial Training.
CoRR, 2022

ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation.
CoRR, 2022

Ontology-aware Learning and Evaluation for Audio Tagging.
CoRR, 2022

Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention.
CoRR, 2022

Multi-dimensional Edge-based Audio Event Relational Graph Representation Learning for Acoustic Scene Classification.
CoRR, 2022

Automated Audio Captioning via Fusion of Low- and High- Dimensional Features.
CoRR, 2022

Learning the Spectrogram Temporal Resolution for Audio Classification.
CoRR, 2022

Low-complexity CNNs for Acoustic Scene Classification.
CoRR, 2022

Surrey System for DCASE 2022 Task 5: Few-shot Bioacoustic Event Detection with Segment-level Metric Learning.
CoRR, 2022

Continual Learning For On-Device Environmental Sound Classification.
CoRR, 2022

A Two-student Learning Framework for Mixed Supervised Target Sound Detection.
CoRR, 2022

A Hybrid Approach to Blind Video Quality Prediction of User Generated Content.
Proceedings of the Picture Coding Symposium, 2022

Fish Feeding Intensity Assessment in Aquaculture: A New Audio Dataset AFFIA3K and a Deep Learning Algorithm.
Proceedings of the 32nd IEEE International Workshop on Machine Learning for Signal Processing, 2022

Audio Visual Multi-Speaker Tracking with Improved GCF and PMBM Filter.
Proceedings of the Interspeech 2022, 2022

RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection.
Proceedings of the Interspeech 2022, 2022

On Metric Learning for Audio-Text Cross-Modal Retrieval.
Proceedings of the Interspeech 2022, 2022

Separate What You Describe: Language-Queried Audio Source Separation.
Proceedings of the Interspeech 2022, 2022

DSTAGNN: Dynamic Spatial-Temporal Aware Graph Neural Network for Traffic Flow Forecasting.
Proceedings of the International Conference on Machine Learning, 2022

Audio-Visual Tracking of Multiple Speakers Via a PMBM Filter.
Proceedings of the IEEE International Conference on Acoustics, 2022

A Mutual Learning Framework for Few-Shot Sound Event Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022

Partial Arithmetic Consensus based Distributed Intensity Particle Flow SMC-PHD Filter for Multi-Target Tracking.
Proceedings of the IEEE International Conference on Acoustics, 2022

Diverse Audio Captioning Via Adversarial Training.
Proceedings of the IEEE International Conference on Acoustics, 2022

Anomalous Sound Detection Using Spectral-Temporal Information Fusion.
Proceedings of the IEEE International Conference on Acoustics, 2022

A Transformer-Based GAN for Anomaly Detection.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2022, 2022

Face Super-Resolution with Spatial Attention Guided by Multiscale Receptive-Field Features.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2022, 2022

Deep Learning for Audio Visual Emotion Recognition.
Proceedings of the 25th International Conference on Information Fusion, 2022

UAV-enabled Edge Computing for Optimal Task Distribution in Target Tracking.
Proceedings of the 25th International Conference on Information Fusion, 2022

Visually Assisted Self-supervised Audio Speaker Localization and Tracking.
Proceedings of the 30th European Signal Processing Conference, 2022

MS-MLP: Multi-scale Sampling MLP for ECG Classification.
Proceedings of the 30th European Signal Processing Conference, 2022

Deep Neural Decision Forest for Acoustic Scene Classification.
Proceedings of the 30th European Signal Processing Conference, 2022

Automated Image Captioning with Multi-layer Gated Recurrent Unit.
Proceedings of the 30th European Signal Processing Conference, 2022

Leveraging Pre-trained BERT for Audio Captioning.
Proceedings of the 30th European Signal Processing Conference, 2022

Auxiliary Classifier based Residual RNN for Image Captioning.
Proceedings of the 30th European Signal Processing Conference, 2022

A Mixed Supervised Learning Framework For Target Sound Detection.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

Continual Learning for On-Ddevice Environmental Sound Classification.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

Segment-Level Metric Learning for Few-Shot Bioacoustic Event Detection.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021
Short-Term Traffic Flow Prediction With Wavelet and Multi-Dimensional Taylor Network Model.
IEEE Trans. Intell. Transp. Syst., 2021

Adaptive Recursive Decentralized Cooperative Localization for Multirobot Systems With Time-Varying Measurement Accuracy.
IEEE Trans. Instrum. Meas., 2021

Evolving Multi-Resolution Pooling CNN for Monaural Singing Voice Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Multiple Acoustic Source Localization in Microphone Array Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Indoor Multi-Speaker Localization Based on Bayesian Nonparametrics in the Circular Harmonic Domain.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Convolutional fusion network for monaural speech enhancement.
Neural Networks, 2021

A Multi-Scale Feature Recalibration Network for End-to-End Single Channel Speech Enhancement.
IEEE J. Sel. Top. Signal Process., 2021

Sparse Analysis Model Based Dictionary Learning for Signal Declipping.
IEEE J. Sel. Top. Signal Process., 2021

End-to-end translation of human neural activity to speech with a dual-dual generative adversarial network.
CoRR, 2021

One to Multiple Mapping Dual Learning: Learning Multiple Sources from One Mixed Signal.
CoRR, 2021

An Audio-Based Deep Learning Framework ForBBC Television Programme Classification.
CoRR, 2021

Time-domain Speech Enhancement with Generative Adversarial Learning.
CoRR, 2021

Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning.
Proceedings of the 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP), 2021

Crossfire Conditional Generative Adversarial Networks for Singing Voice Extraction.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Enhancing Alzheimer's Disease Diagnosis via Hierarchical 3D-FCN with Multi-Modal Features.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

SAGAN: Skip-Attention GAN For Anomaly Detection.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Robust Visual Object Tracking with Spatiotemporal Regularisation and Discriminative Occlusion Deformation.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Weighted Magnitude-Phase Loss for Speech Dereverberation.
Proceedings of the IEEE International Conference on Acoustics, 2021

A Global-Local Attention Framework for Weakly Labelled Audio Tagging.
Proceedings of the IEEE International Conference on Acoustics, 2021

Dimension Selected Subspace Clustering.
Proceedings of the IEEE International Conference on Acoustics, 2021

Enhancing Audio Augmentation Methods with Consistency Learning.
Proceedings of the IEEE International Conference on Acoustics, 2021

Low-Dimensional Denoising Embedding Transformer for ECG Classification.
Proceedings of the IEEE International Conference on Acoustics, 2021

An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection.
Proceedings of the IEEE International Conference on Acoustics, 2021

An Audio-Based Deep Learning Framework For BBC Television Programme Classification.
Proceedings of the 29th European Signal Processing Conference, 2021

Audio Captioning Transformer.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

An Encoder-Decoder Based Audio Captioning System with Transfer and Reinforcement Learning.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

CL4AC: A Contrastive Loss for Audio Captioning.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

ARCA23K: An Audio Dataset for Investigating Open-Set Label Noise.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

2020
Audio-Visual Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking.
IEEE Trans. Multim., 2020

Joint Raindrop and Haze Removal From a Single Image.
IEEE Trans. Image Process., 2020

Blind Multiple-Input Multiple-Output Image Phase Retrieval.
IEEE Trans. Ind. Electron., 2020

Sound Event Detection of Weakly Labelled Data With CNN-Transformer and Automatic Threshold Optimization.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Association Loss for Visual Object Detection.
IEEE Signal Process. Lett., 2020

Modeling Label Dependencies for Audio Tagging With Graph Convolutional Network.
IEEE Signal Process. Lett., 2020

Adaptive Separation of Respiratory and Heartbeat Signals among Multiple People Based on Empirical Wavelet Transform Using UWB Radar.
Sensors, 2020

Optimal feasible step-size based working set selection for large scale SVMs training.
Neurocomputing, 2020

Robust PCA Using Nonconvex Rank Approximation and Sparse Regularizer.
Circuits Syst. Signal Process., 2020

Environmental Sound Classification with Parallel Temporal-Spectral Attention.
Proceedings of the Interspeech 2020, 2020

Gated Multi-Head Attention Pooling for Weakly Labelled Audio Tagging.
Proceedings of the Interspeech 2020, 2020

An Analytical Solution to Jacobsen Estimator for Windowed Signals.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Source Separation with Weakly Labelled Data: an Approach to Computational Auditory Scene Analysis.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Learning With Out-of-Distribution Data for Audio Classification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Weakly Labelled Audio Tagging Via Convolutional Networks with Spatial and Channel-Wise Attention.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Meta Metric Learning for Highly Imbalanced Aerial Scene Classification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Multi-Scale Residual Convolutional Encoder Decoder with Bidirectional Long Short-Term Memory for Single Channel Speech Enhancement.
Proceedings of the 28th European Signal Processing Conference, 2020

Open-Window: A Sound Event Dataset for Window State Detection and Recognition.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

Event-Independent Network for Polyphonic Sound Event Localization and Detection.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

A Covert Ultrasonic Phone-to-Phone Communication Scheme.
Proceedings of the Collaborative Computing: Networking, Applications and Worksharing, 2020

2019
Sparse Recovery and Dictionary Learning From Nonlinear Compressive Measurements.
IEEE Trans. Signal Process., 2019

Two-Stage Monaural Source Separation in Reverberant Room Environments Using Deep Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Modeling the Comb Filter Effect and Interaural Coherence for Binaural Source Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Weakly Labelled AudioSet Tagging With Attention Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Sound Event Detection and Time-Frequency Segmentation from Weakly Labelled Data.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

A Skip Attention Mechanism for Monaural Singing Voice Separation.
IEEE Signal Process. Lett., 2019

A Speech Synthesis Approach for High Quality Speech Separation and Generation.
IEEE Signal Process. Lett., 2019

A Source Counting Method Using Acoustic Vector Sensor Based on Sparse Modeling of DOA Histogram.
IEEE Signal Process. Lett., 2019

Monaural Source Separation in Complex Domain With Long Short-Term Memory Neural Network.
IEEE J. Sel. Top. Signal Process., 2019

Multiple Input Single Output Phase Retrieval.
Circuits Syst. Signal Process., 2019

Learning discriminative and robust time-frequency representations for environmental sound classification.
CoRR, 2019

Cross-task learning for audio tagging, sound event detection and spatial localization: DCASE 2019 baseline systems.
CoRR, 2019

Weakly labelled AudioSet Classification with Attention Neural Networks.
CoRR, 2019

Multi-channel Convolutional Neural Networks with Multi-level Feature Fusion for Environmental Sound Classification.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Monaural Speech Enhancement Based On Two Stage Long Short-Term Memory Networks.
Proceedings of the 13th International Conference on Signal Processing and Communication Systems, 2019

Single-Channel Speech Enhancement with Sequentially Trained DNN System.
Proceedings of the 13th International Conference on Signal Processing and Communication Systems, 2019

SVD-Based Channel Pruning for Convolutional Neural Network in Acoustic Scene Classification Model.
Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019

Semantic Super-resolution for Extremely Low-resolution Vehicle License Plate.
Proceedings of the IEEE International Conference on Acoustics, 2019

Proximal Deep Recurrent Neural Network for Monaural Singing Voice Separation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Background Adaptation for Improved Listening Experience in Broadcasting.
Proceedings of the IEEE International Conference on Acoustics, 2019

Labelled Non-zero Particle Flow for SMC-PHD Filtering.
Proceedings of the IEEE International Conference on Acoustics, 2019

Enhanced Streaming Based Subspace Clustering Applied to Acoustic Scene Data Clustering.
Proceedings of the IEEE International Conference on Acoustics, 2019

Generalisation in Environmental Sound Classification: The 'Making Sense of Sounds' Data Set and Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2019

Acoustic Scene Generation with Conditional Samplernn.
Proceedings of the IEEE International Conference on Acoustics, 2019

Polyphonic Sound Event Detection and Localization using a Two-Stage Strategy.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

Speaker-discriminative Embedding Learning via Affinity Matrix for Short Utterance Speaker Verification.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Multiple Speaker Tracking in Spatial Audio via PHD Filtering and Depth-Audio Fusion.
IEEE Trans. Multim., 2018

Manifold-Based Visual Object Counting.
IEEE Trans. Image Process., 2018

Bootstrap Averaging for Model-Based Source Separation in Reverberant Conditions.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

An RIP-Based Performance Guarantee of Covariance-Assisted Matching Pursuit.
IEEE Signal Process. Lett., 2018

Low rank matrix completion using truncated nuclear norm and sparse regularizer.
Signal Process. Image Commun., 2018

A non-intrusive method for estimating binaural speech intelligibility from noise-corrupted signals captured by a pair of microphones.
Speech Commun., 2018

Momentum fractional LMS for power signal parameter estimation.
Signal Process., 2018

Multi-source phase retrieval from multi-channel phaseless STFT measurements.
Signal Process., 2018

Polynomial dictionary learning algorithms in sparse representations.
Signal Process., 2018

Efficient multi-modal geometric mean metric learning.
Pattern Recognit., 2018

Error sensitivity analysis of Delta divergence - a novel measure for classifier incongruence detection.
Pattern Recognit., 2018

Standard-independent I/Q imbalance estimation and compensation scheme inOFDM.
Frontiers Inf. Technol. Electron. Eng., 2018

Learning soft mask with DNN and DNN-SVM for multi-speaker DOA estimation using an acoustic vector sensor.
J. Frankl. Inst., 2018

Intensity Particle Flow SMC-PHD Filter For Audio Speaker Tracking.
CoRR, 2018

Bayesian inference for PCA and MUSIC algorithms with unknown number of sources.
CoRR, 2018

DCASE 2018 Challenge baseline with convolutional neural networks.
CoRR, 2018

Approximate Message Passing for Underdetermined Audio Source Separation.
CoRR, 2018

LD-CNN: A Lightweight Dilated Convolutional Neural Network for Environmental Sound Classification.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Cascade Deep Networks for Sparse Linear Inverse Problems.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Spatially Regularized Low Rank Tensor Optimization for Visual Data Completion.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Bayesian Inference for Multi-Line Spectra in Linear Sensor Array.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Non-Zero Diffusion Particle Flow SMC-PHD Filter for Audio-Visual Multi-Speaker Tracking.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Iterative Deep Neural Networks for Speaker-Independent Binaural Blind Speech Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

A Joint Separation-Classification Model for Sound Event Detection of Weakly Labelled Data.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Audio Set Classification with Attention Model: A Probabilistic Perspective.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Intelligent Signal Processing Mechanisms for Nuanced Anomaly Detection in Action Audio-Visual Data Streams.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Synthesis of Images by Two-Stage Generative Adversarial Networks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Large-Scale Weakly Supervised Audio Classification Using Gated Convolutional Neural Network.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Improving Reverberant Speech Separation with Binaural Cues Using Temporal Context and Convolutional Neural Networks.
Proceedings of the Latent Variable Analysis and Signal Separation, 2018

Consistent Dictionary Learning for Signal Declipping.
Proceedings of the Latent Variable Analysis and Signal Separation, 2018

Enhanced Time-Frequency Masking by Using Neural Networks for Monaural Source Separation in Reverberant Room Environments.
Proceedings of the 26th European Signal Processing Conference, 2018

Randomly Sketched Sparse Subspace Clustering for Acoustic Scene Clustering.
Proceedings of the 26th European Signal Processing Conference, 2018

Capsule Routing for Sound Event Detection.
Proceedings of the 26th European Signal Processing Conference, 2018

DCASE 2018 Challenge Surrey cross-task convolutional neural network baseline.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

General-purpose audio tagging from noisy labels using convolutional neural networks.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

Predicting the perceived level of reverberation using machine learning.
Proceedings of the 52nd Asilomar Conference on Signals, Systems, and Computers, 2018

A Performance Evaluation of Several Deep Neural Networks for Reverberant Speech Separation.
Proceedings of the 52nd Asilomar Conference on Signals, Systems, and Computers, 2018

2017
Semisupervised Online Multikernel Similarity Learning for Image Retrieval.
IEEE Trans. Multim., 2017

Social Force Model-Based MCMC-OCSVM Particle PHD Filter for Multiple Human Tracking.
IEEE Trans. Multim., 2017

Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Acoustic Reflector Localization: Novel Image Source Reversion and Direct Localization Methods.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Sparse ℓ<sub>1</sub>-Optimal Multiloudspeaker Panning and Its Relation to Vector Base Amplitude Panning.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Sparse analysis model based multiplicative noise removal with enhanced regularization.
Signal Process., 2017

Sparse Blind Speech Deconvolution with Dynamic Range Regularization and Indicator Function.
Circuits Syst. Signal Process., 2017

An Expectation-Maximization Algorithm for Blind Separation of Noisy Mixtures Using Gaussian Mixture Model.
Circuits Syst. Signal Process., 2017

Surrey-cvssp system for DCASE2017 challenge task4.
CoRR, 2017

Joint L1-L2 Regularisation for Blind Speech Deconvolution.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Blind Speech Deconvolution via Pretrained Polynomial Dictionary and Sparse Representation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Binaural and log-power spectra features with deep neural networks for speech-noise separation.
Proceedings of the 19th IEEE International Workshop on Multimedia Signal Processing, 2017

Attention and Localization Based on a Deep Convolutional Recurrent Model for Weakly Supervised Audio Tagging.
Proceedings of the Interspeech 2017, 2017

Matrix of Polynomials Model Based Polynomial Dictionary Learning Method for Acoustic Impulse Response Modeling.
Proceedings of the Interspeech 2017, 2017

Convolutional gated recurrent neural network incorporating spatial features for audio tagging.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

A greedy algorithm with learned statistics for sparse signal reconstruction.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Particle flow for sequential Monte Carlo implementation of probability hypothesis density.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

A joint detection-classification model for audio tagging of weakly labelled data.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Fast tagging of natural sounds using marginal co-regularization.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Assessment of musical noise using localization of isolated peaks in time-frequency domain.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Particle Flow SMC-PHD Filter for Audio-Visual Multi-speaker Tracking.
Proceedings of the Latent Variable Analysis and Signal Separation, 2017

Multivariate iterative hard thresholding for sparse decomposition with flexible sparsity patterns.
Proceedings of the 25th European Signal Processing Conference, 2017

A perceptually-weighted deep neural network for monaural speech enhancement in various background noise conditions.
Proceedings of the 25th European Signal Processing Conference, 2017

Wideband DoA estimation based on joint optimisation of array and spatial sparsity.
Proceedings of the 25th European Signal Processing Conference, 2017

2016
Analysis SimCO Algorithms for Sparse Analysis Model Based Dictionary Learning.
IEEE Trans. Signal Process., 2016

Mean-Shift and Sparse Sampling-Based SMC-PHD Filtering for Audio Informed Visual Speaker Tracking.
IEEE Trans. Multim., 2016

Adaptive Retrodiction Particle PHD Filter for Multiple Human Tracking.
IEEE Signal Process. Lett., 2016

Audio head pose estimation using the direct to reverberant speech ratio.
Speech Commun., 2016

Localization based stereo speech source separation using probabilistic time-frequency masking and deep neural networks.
EURASIP J. Audio Speech Music. Process., 2016

A Promising Technique for Blind Identification: The Generic Statistics.
Circuits Syst. Signal Process., 2016

Hierachical learning for DNN-based acoustic scene classification.
CoRR, 2016

Fully Deep Neural Networks Incorporating Unsupervised Feature Learning for Audio Tagging.
CoRR, 2016

Higher-Order Circularity Based I/Q Imbalance Compensation in Direct-Conversion Receivers.
Proceedings of the IEEE 84th Vehicular Technology Conference, 2016

Predicting Binaural Speech Intelligibility from Signals Estimated by a Blind Source Separation Algorithm.
Proceedings of the Interspeech 2016, 2016

Identity association using PHD filters in multiple head tracking with depth sensors.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Social force model aided robust particle PHD filter for multiple human tracking.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Hierarchical Learning for DNN-Based Acoustic Scene Classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

Fully DNN-Based Multi-Label Regression for Audio Tagging.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

Deep Neural Network Baseline for DCASE Challenge 2016.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

2015
Heterogeneous Feature Selection With Multi-Modal Deep Neural Networks and Sparse Group LASSO.
IEEE Trans. Multim., 2015

Audio Assisted Robust Visual Tracking With Adaptive Particle Filtering.
IEEE Trans. Multim., 2015

Reverberant speech separation with probabilistic time-frequency masking for B-format recordings.
Speech Commun., 2015

Cosparsity-based Stagewise Matching Pursuit algorithm for reconstruction of the cosparse signals.
EURASIP J. Adv. Signal Process., 2015

A novel approach for detection of medial temporal discharges using blind source separation incorporating dictionary look up.
Proceedings of the 7th International IEEE/EMBS Conference on Neural Engineering, 2015

Blind source separation of medial temporal discharges via partial dictionary learning.
Proceedings of the 25th IEEE International Workshop on Machine Learning for Signal Processing, 2015

Separation of vocals from monaural music recordings using diagonal median filters and practical time-frequency parameters.
Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2015

Audio informed visual speaker tracking with SMC-PHD filter.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Localization based stereo speech separation using deep networks.
Proceedings of the 2015 IEEE International Conference on Digital Signal Processing, 2015

A robust PHD filter with deep learning updating for multiple human tracking.
Proceedings of the 2015 IEEE International Conference on Digital Signal Processing, 2015

Audio super-resolution using analysis dictionary learning.
Proceedings of the 2015 IEEE International Conference on Digital Signal Processing, 2015

Person Tracking Using Audio and Depth Cues.
Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop, 2015

A 3D model for room boundary estimation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A Polynomial Dictionary Learning Method for Acoustic Impulse Response Modeling.
Proceedings of the Latent Variable Analysis and Signal Separation, 2015

A local discontinuity based approach for monaural singing voice separation from accompanying music with multi-stage non-negative matrix factorization.
Proceedings of the 2015 IEEE Global Conference on Signal and Information Processing, 2015

A source separation evaluation method in object-based spatial audio.
Proceedings of the 23rd European Signal Processing Conference, 2015

A Robust student's-t distribution PHD filter with OCSVM updating for multiple human tracking.
Proceedings of the 23rd European Signal Processing Conference, 2015

2014
Interference Reduction in Reverberant Speech Separation With Visual Voice Activity Detection.
IEEE Trans. Multim., 2014

Robust Multi-Speaker Tracking via Dictionary Learning and Identity Modeling.
IEEE Trans. Multim., 2014

Joint Mixing Vector and Binaural Model Based Stereo Source Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Audiovisual Speech Source Separation: An overview of key methodologies.
IEEE Signal Process. Mag., 2014

PARAFAC-Based Blind Identification of Underdetermined Mixtures Using Gaussian Mixture Model.
Circuits Syst. Signal Process., 2014

Fourth-order cumulant based sources number estimation from mixtures of unknown number of sources.
Proceedings of the Sixth International Conference on Wireless Communications and Signal Processing, 2014

Improving model-based convolutive blind source separation techniques via bootstrap.
Proceedings of the IEEE Workshop on Statistical Signal Processing, 2014

Analysis dictionary learning based on Nesterov's gradient with application to SAR image despeckling.
Proceedings of the 6th International Symposium on Communications, 2014

Signal classification based on block-sparse tensor representation.
Proceedings of the 19th International Conference on Digital Signal Processing, 2014

Analysis SimCO: A new algorithm for analysis dictionary learning.
Proceedings of the IEEE International Conference on Acoustics, 2014

Discriminativetensor dictionaries and sparsity for speaker identification.
Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014

A Bayesian performance bound for time-delay of arrival based acoustic source tracking in a reverberant environment.
Proceedings of the 17th International Conference on Information Fusion, 2014

Audio-visual tracking of a variable number of speakers with a random finite set approach.
Proceedings of the 17th International Conference on Information Fusion, 2014

2013
Source Separation of Convolutive and Noisy Mixtures Using Audio-Visual Dictionary Learning and Probabilistic Time-Frequency Masking.
IEEE Trans. Signal Process., 2013

Video-Aided Model-Based Source Separation in Real Reverberant Rooms.
IEEE Trans. Speech Audio Process., 2013

Sparse coding with adaptive dictionary learning for underdetermined blind speech separation.
Speech Commun., 2013

Generalized generating function with tucker decomposition and alternating least squares for underdetermined blind identification.
EURASIP J. Adv. Signal Process., 2013

Dictionary learning based sparse coefficients for audio classification with max and average pooling.
Digit. Signal Process., 2013

K-plane clustering algorithm for analysis dictionary learning.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013

A constrained approach for extraction of pre-ictal discharges from scalp EEG.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013

Tensor dictionary learning with sparse TUCKER decomposition.
Proceedings of the 18th International Conference on Digital Signal Processing, 2013

Joint image separation and dictionary learning.
Proceedings of the 18th International Conference on Digital Signal Processing, 2013

Acoustic vector sensor based speech source separation with mixed Gaussian-Laplacian distributions.
Proceedings of the 18th International Conference on Digital Signal Processing, 2013

Audio constrained particle filter based visual tracking.
Proceedings of the IEEE International Conference on Acoustics, 2013

Audio head pose estimation using the direct to reverberant speech ratio.
Proceedings of the IEEE International Conference on Acoustics, 2013

Spatial and coherence cues based time-frequency masking for binaural reverberant speech separation.
Proceedings of the IEEE International Conference on Acoustics, 2013

Acoustic source tracking in a reverberant environment using a pairwise synchronous microphone network.
Proceedings of the 16th International Conference on Information Fusion, 2013

Audio-visual face detection for tracking in a meeting room environment.
Proceedings of the 16th International Conference on Information Fusion, 2013

Acoustic vector sensor based reverberant speech separation with probabilistic time-frequency masking.
Proceedings of the 21st European Signal Processing Conference, 2013

Subset pursuit for analysis dictionary learning.
Proceedings of the 21st European Signal Processing Conference, 2013

Adaptive particle filtering approach to audio-visual tracking.
Proceedings of the 21st European Signal Processing Conference, 2013

Show-through removal for scanned images using non-linear NMF with adaptive smoothing.
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

2012
Simultaneous Codeword Optimization (SimCO) for Dictionary Update and Learning.
IEEE Trans. Signal Process., 2012

Use of bimodal coherence to resolve the permutation problem in convolutive BSS.
Signal Process., 2012

Multimodal (audio-visual) source separation exploiting multi-speaker tracking, robust beamforming and time-frequency masking.
IET Signal Process., 2012

Reverberant speech separation based on audio-visual dictionary learning and binaural cues.
Proceedings of the IEEE Statistical Signal Processing Workshop, 2012

Dictionary learning and update based on simultaneous codeword optimization (SimCO).
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A dictionary learning approach to tracking.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Joint blind dereverberation and separation of speech mixtures.
Proceedings of the 20th European Signal Processing Conference, 2012

Blind reverberation time estimation based on Laplace distribution.
Proceedings of the 20th European Signal Processing Conference, 2012

2011
A multistage approach to blind separation of convolutive speech mixtures.
Speech Commun., 2011

Methods for learning adaptive dictionary in underdetermined speech separation.
Proceedings of the 2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011

Blind source separation and visual voice activity detection for target speech extraction.
Proceedings of the 3rd International Conference on Awareness Science and Technology, 2011

Integrating binaural cues and blind source separation method for separating reverberant speech mixtures.
Proceedings of the IEEE International Conference on Acoustics, 2011

Multimodal blind source separation with a circular microphone array and robust beamforming.
Proceedings of the 19th European Signal Processing Conference, 2011

Robust feature selection for scaling ambiguity reduction in audio-visual convolutive BSS.
Proceedings of the 19th European Signal Processing Conference, 2011

Empirical mode decomposition for joint denoising and dereverberation.
Proceedings of the 19th European Signal Processing Conference, 2011

Simultaneous codeword optimization (SimCO) for dictionary learning.
Proceedings of the 49th Annual Allerton Conference on Communication, 2011

2010
Bimodal coherence based scale ambiguity cancellation for target speech extraction and enhancement.
Proceedings of the INTERSPEECH 2010, 2010

A block-based compressed sensing method for underdetermined blind speech separation incorporating binary mask.
Proceedings of the IEEE International Conference on Acoustics, 2010

Use of Bimodal Coherence to Resolve Spectral Indeterminacy in Convolutive BSS.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010

Single Channel Music Sound Separation Based on Spectrogram Decomposition and Note Classification.
Proceedings of the Exploring Music Contents - 7th International Symposium, 2010

2009
A multiplicative algorithm for convolutive non-negative matrix factorization based on squared Euclidean distance.
IEEE Trans. Signal Process., 2009

A multistage approach for blind separation of convolutive speech mixtures.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Note Onset Detection via Nonnegative Factorization of Magnitude Spectrum.
EURASIP J. Adv. Signal Process., 2008

Advances in Nonnegative Matrix and Tensor Factorization.
Comput. Intell. Neurosci., 2008

Convolutive non-negative sparse coding.
Proceedings of the International Joint Conference on Neural Networks, 2008

2007
A New Variable Step-Size LMS Algorithm with Robustness to Nonstationary Noise.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Exploitation of source nonstationarity in underdetermined blind source separation with advanced clustering techniques.
IEEE Trans. Signal Process., 2006

Sequential blind source separation based exclusively on second-order statistics developed for a class of periodic signals.
IEEE Trans. Signal Process., 2006

2005
Penalty function-based joint diagonalization approach for convolutive blind separation of nonstationary sources.
IEEE Trans. Signal Process., 2005

Variable step-size sign natural gradient algorithm for sequential blind source separation.
IEEE Signal Process. Lett., 2005

Removal of eye blinking artifact from the electro-encephalogram, incorporating a new constrained blind source separation algorithm.
Medical Biol. Eng. Comput., 2005

An Effective Method to Improve Convergence for Sequential Blind Source Separation.
Proceedings of the Advances in Natural Computation, First International Conference, 2005

Video assisted speech source separation.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
A coupled HMM for solving the permutation problem in frequency domain BSS.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Penalty Function Approach for Constrained Convolutive Blind Source Separation.
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2004

A Novel Hybrid Approach to the Permutation Problem of Frequency Domain Blind Source Separation.
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2004

Localization of P300 Sources in Schizophrenia Patients Using Constrained BSS.
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2004

Penalty function based joint diagonalization approach for convolutive constrained BSS of nonstationary signals.
Proceedings of the 2004 12th European Signal Processing Conference, 2004

2003
Blind separation of convolutive mixtures of cyclostationary sources using an extended natural gradient method.
Proceedings of the Seventh International Symposium on Signal Processing and Its Applications, 2003


  Loading...