Tuomas Virtanen

Shayan Gharib

Dataset, November, 2019

Clotho dataset.

[BibT_eX]

[DOI]

Samuel Lipping

Dataset, October, 2019

Sound event localization and detection (SELDnet) results.

[BibT_eX]

[DOI]

Dataset, July, 2019

Analysis of an efficient parallel implementation of active-set Newton algorithm.

[BibT_eX]

[DOI]

Pablo San Juan Sebastián

Víctor M. García-Molla

Antonio M. Vidal

J. Supercomput., 2019

Sound Event Detection in the DCASE 2017 Challenge.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Complex ISNMF: A Phase-Aware Model for Monaural Audio Source Separation.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Deep Learning for Audio Signal Processing.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., 2019

Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., 2019

Generalization of the K-SVD algorithm for minimization of <i>β</i>-divergence.

[BibT_eX]

[DOI]

Víctor M. García-Molla

Pablo San Juan Sebastián

Antonio M. Vidal

Pedro Alonso

Digit. Signal Process., 2019

VOICe: A Sound Event Detection Dataset For Generalizable Domain Adaptation.

[BibT_eX]

[DOI]

CoRR, 2019

Memory Requirement Reduction of Deep Neural Networks Using Low-bit Quantization of Parameters.

[BibT_eX]

[DOI]

CoRR, 2019

Zero-Shot Audio Classification Based On Class Label Embeddings.

[BibT_eX]

[DOI]

Huang Xie

Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Joint Measurement of Localization and Detection of Sound Events.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Acoustic Scene Classification Using Higher-Order Ambisonic Features.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Unsupervised Adversarial Domain Adaptation Based on The Wasserstein Distance For Acoustic Scene Classification.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

City Classification from Multiple Real-World Sound Scenes.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Detection of Typical Pronunciation Errors in Non-native English Speech Using Convolutional Recurrent Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2019

Low-latency Deep Clustering for Speech Separation.

[BibT_eX]

[DOI]

Shanshan Wang

Gaurav Naithani

Proceedings of the IEEE International Conference on Acoustics, 2019

Sound Event Envelope Estimation in Polyphonic Mixtures.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Audio-Based Epileptic Seizure Detection.

[BibT_eX]

[DOI]

Proceedings of the 27th European Signal Processing Conference, 2019

Acoustic Scene Classification in DCASE 2019 Challenge: Closed and Open Set Classification and Data Mismatch Setups.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

Crowdsourcing a Dataset of Audio Captions.

[BibT_eX]

[DOI]

Samuel Lipping

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

A Multi-room Reverberant Dataset for Sound Event Localization and Detection.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

2018

Separation of Moving Sound Sources Using Multichannel NMF and Acoustic Tracking.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2018

Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2018

Multichannel Blind Sound Source Separation Using Spatial Covariance Model With Level and Time Differences and Nonnegative Matrix Factorization.

[BibT_eX]

[DOI]

Julio J. Carabias-Orti

Pedro Vera-Candeas

IEEE ACM Trans. Audio Speech Lang. Process., 2018

Cascade of Boolean detector combinations.

[BibT_eX]

[DOI]

Katariina Mahkonen

Joni-Kristian Kämäräinen

EURASIP J. Image Video Process., 2018

Automatic segmentation of infant cry signals using hidden Markov models.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2018

Close Miking Empirical Practice Verification: A Source Separation Approach.

[BibT_eX]

[DOI]

Andreas Floros

Gerald Schuller

CoRR, 2018

Acoustic Scene Classification: a Competition Review.

[BibT_eX]

[DOI]

Proceedings of the 28th IEEE International Workshop on Machine Learning for Signal Processing, 2018

An Active Learning Method Using Clustering and Committee-Based Sample Selection for Sound Event Classification.

[BibT_eX]

[DOI]

Shuyang Zhao

Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Time-Frequency Masking Strategies for Single-Channel Low-Latency Speech Enhancement Using Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Deep Neural Network Based Speech Separation Optimizing an Objective Estimator of Intelligibility for Low Latency Applications.

[BibT_eX]

[DOI]

Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Acoustic Scene Classification: An Overview of Dcase 2017 Challenge Entries.

[BibT_eX]

[DOI]

Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

On Modeling the STFT Phase of Audio Signals with the Von Mises Distribution.

[BibT_eX]

[DOI]

Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Towards Complex Nonnegative Matrix Factorization with the Beta-Divergence.

[BibT_eX]

[DOI]

Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Using Sequential Information in Polyphonic Sound Event Detection.

[BibT_eX]

[DOI]

Guangpu Huang

Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Harmonic-Percussive Source Separation with Deep Neural Networks and Phase Recovery.

[BibT_eX]

[DOI]

Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Expectation-Maximization Algorithms for Itakura-Saito Nonnegative Matrix Factorization.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Reducing Interference with Phase Recovery in DNN-based Monaural Singing Voice Separation.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

MaD TwinNet: Masker-Denoiser Architecture with Twin Networks for Monaural Sound Source Separation.

[BibT_eX]

[DOI]

Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

End-to-End Polyphonic Sound Event Detection Using Convolutional Recurrent Neural Networks with Learned Time-Frequency Representation Input.

[BibT_eX]

[DOI]

Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Multichannel Sound Event Detection Using 3D Convolutional Neural Networks for Learning Inter-channel Features.

[BibT_eX]

[DOI]

Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Estimation of Time-Varying Room Impulse Responses of Multiple Sound Sources from Observed Mixture and Isolated Source Signals.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Monaural Singing Voice Separation with Skip-Filtering Connections and Recurrent Inference of Time-Frequency Mask.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Bayesian Anisotropic Gaussian Model for Audio Source Separation.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Direction of Arrival Estimation for Multiple Sound Sources Using Convolutional Recurrent Neural Network.

[BibT_eX]

[DOI]

Proceedings of the 26th European Signal Processing Conference, 2018

A multi-device dataset for urban acoustic scene classification.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

Unsupervised adversarial domain adaptation for acoustic scene classification.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

2017

DCASE2016 Challenge Submissions Package.

[BibT_eX]

[DOI]

Dataset, September, 2017

Introduction to the Special Section on Sound Scene and Event Analysis.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Binary Non-Negative Matrix Deconvolution for Audio Dictionary Learning.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection.

[BibT_eX]

[DOI]

Emre Çakir

Heikki Huttunen

IEEE ACM Trans. Audio Speech Lang. Process., 2017

A report on sound event detection with different binaural features.

[BibT_eX]

[DOI]

CoRR, 2017

Stacked Convolutional and Recurrent Neural Networks for Music Emotion Recognition.

[BibT_eX]

[DOI]

CoRR, 2017

Learning vocal mode classifiers from heterogeneous data sources.

[BibT_eX]

[DOI]

Shuyang Zhao

Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Low latency sound source separation using convolutional recurrent neural networks.

[BibT_eX]

[DOI]

Gaurav Naithani

Lars Bramslow

Niels Henrik Pontoppidan

Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Assessment of human and machine performance in acoustic scene classification: Dcase 2016 case study.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Consistent anisotropic Wiener filtering for audio source separation.

[BibT_eX]

[DOI]

Jonathan Le Roux

Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Automated audio captioning with recurrent neural networks.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Transfer learning of weakly labelled audio.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

A recurrent encoder-decoder approach with skip-filtering connections for monaural singing voice separation.

[BibT_eX]

[DOI]

Gerald Schuller

Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

A convolutional neural network approach for acoustic scene classification.

[BibT_eX]

[DOI]

Michele Valenti

Stefano Squartini

Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Active learning for sound event classification by clustering unlabeled data.

[BibT_eX]

[DOI]

Shuyang Zhao

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Sound event detection using spatial features and convolutional recurrent neural network.

[BibT_eX]

[DOI]

Pasi Pertilä

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Time-difference of arrival model for spherical microphone arrays and application to direction of arrival estimation.

[BibT_eX]

[DOI]

Proceedings of the 25th European Signal Processing Conference, 2017

Convolutional recurrent neural networks for bird audio detection.

[BibT_eX]

[DOI]

Proceedings of the 25th European Signal Processing Conference, 2017

Stacked convolutional and recurrent neural networks for bird audio detection.

[BibT_eX]

[DOI]

Proceedings of the 25th European Signal Processing Conference, 2017

ASR in Classroom Today: Automatic Visualization of Conceptual Network in Science Classrooms.

[BibT_eX]

[DOI]

Proceedings of the Data Driven Approaches in Digital Education, 2017

DCASE2017 Challenge Setup: Tasks, Datasets and Baseline System.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

Convolutional Recurrent Neural Networks for Rare Sound Event Detection.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

Sound Event Detection Using Weakly Labeled Dataset with Stacked Convolutional and Recurrent Neural Network.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

2016

Blind Separation of Audio Mixtures Through Nonnegative Tensor Factorization of Modulation Spectrograms.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Binaural rendering of microphone array captures based on source separation.

[BibT_eX]

[DOI]

Speech Commun., 2016

Filterbank learning for deep neural network based polyphonic sound event detection.

[BibT_eX]

[DOI]

Ezgi Can Ozan

Proceedings of the 2016 International Joint Conference on Neural Networks, 2016

Recurrent neural networks for polyphonic sound event detection in real life recordings.

[BibT_eX]

[DOI]

Heikki Huttunen

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Low-latency sound source separation using deep neural networks.

[BibT_eX]

[DOI]

Gaurav Naithani

Niels Henrik Pontoppidan

Proceedings of the 2016 IEEE Global Conference on Signal and Information Processing, 2016

TUT database for acoustic scene classification and sound event detection.

[BibT_eX]

[DOI]

Proceedings of the 24th European Signal Processing Conference, 2016

Cascade processing for speeding up sliding window sparse classification.

[BibT_eX]

[DOI]

Katariina Mahkonen

Joni-Kristian Kamarainen

Proceedings of the 24th European Signal Processing Conference, 2016

Noise-robust detection of whispering in telephone calls using deep neural networks.

[BibT_eX]

[DOI]

Proceedings of the 24th European Signal Processing Conference, 2016

DCASE 2016 Acoustic Scene Classification Using Convolutional Neural Networks.

[BibT_eX]

[DOI]

Michele Valenti

Stefano Squartini

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

Sound Event Detection in Multichannel Audio Using Spatial and Harmonic Features.

[BibT_eX]

[DOI]

Pasi Pertilä

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

2015

Coupled Dictionaries for Exemplar-Based Speech Enhancement and Automatic Speech Recognition.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2015

Compositional Models for Audio Processing: Uncovering the structure of sound mixtures.

[BibT_eX]

[DOI]

IEEE Signal Process. Mag., 2015

Non-negative tensor factorization models for Bayesian audio processing.

[BibT_eX]

[DOI]

Umut Simsekli

Ali Taylan Cemgil

Digit. Signal Process., 2015

Archetypal analysis for audio dictionary learning.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

Noise robust speaker recognition with convolutive sparse coding.

[BibT_eX]

[DOI]

Rahim Saeidi

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Polyphonic sound event detection using multi label deep neural networks.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Joint Conference on Neural Networks, 2015

Sound event detection in real life recordings using coupled matrix factorization of spectral representations and class activity annotations.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Similarity induced group sparsity for non-negative matrix factorisation.

[BibT_eX]

[DOI]

Rahim Saeidi

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Low-latency sound-source-separation using non-negative matrix factorisation with coupled analysis and synthesis dictionaries.

[BibT_eX]

[DOI]

Niels Henrik Pontoppidan

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Exemplar-based speech enhancement for deep neural network based automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Speaker Verification Using Adaptive Dictionaries in Non-negative Spectrogram Deconvolution.

[BibT_eX]

[DOI]

Szymon Drgas

Proceedings of the Latent Variable Analysis and Signal Separation, 2015

Automatic recognition of environmental sound events using all-pole group delay features.

[BibT_eX]

[DOI]

Proceedings of the 23rd European Signal Processing Conference, 2015

Multi-label vs. combined single-label sound event detection with deep neural networks.

[BibT_eX]

[DOI]

Proceedings of the 23rd European Signal Processing Conference, 2015

2014

Exemplar-Based Sparse Representation With Residual Compensation for Voice Conversion.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2014

Direction of Arrival Based Spatial Covariance Model for Blind Sound Source Separation.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2014

Method for creating location-specific audio textures.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2014

Exemplar-based noise robust automatic speech recognition using modulation spectrogram features.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Modelling primitive streaming of simple tone sequences through factorisation of modulation pattern tensors.

[BibT_eX]

[DOI]

Hugo Van hamme

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Semi-supervised non-negative tensor factorisation of modulation spectrograms for monaural speech separation.

[BibT_eX]

[DOI]

Proceedings of the 2014 International Joint Conference on Neural Networks, 2014

Active-set newton algorithm for non-negative sparse coding of audio.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Multichannel audio separation by direction of arrival based spatial covariance model and non-negative matrix factorization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Ultrasound-coupled semi-supervised nonnegative matrix factorisation for speech enhancement.

[BibT_eX]

[DOI]

Olivier Delhomme

Proceedings of the IEEE International Conference on Acoustics, 2014

Coupled dictionary training for exemplar-based speech enhancement.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Recognition of acoustic events using deep neural networks.

[BibT_eX]

[DOI]

Oguzhan Gencoglu

Heikki Huttunen

Proceedings of the 22nd European Signal Processing Conference, 2014

Lifelog Scene Change Detection Using Cascades of Audio and Video Detectors.

[BibT_eX]

[DOI]

Katariina Mahkonen

Joni-Kristian Kämäräinen

Proceedings of the Computer Vision - ACCV 2014 Workshops, 2014

2013

Active-Set Newton Algorithm for Overcomplete Non-Negative Representations of Audio.

[BibT_eX]

[DOI]

Jort Florent Gemmeke

IEEE Trans. Speech Audio Process., 2013

On the human ability to discriminate audio ambiances from similar locations of an urban environment.

[BibT_eX]

[DOI]

Pers. Ubiquitous Comput., 2013

Context-dependent sound event detection.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2013

Modelling non-stationary noise with spectral factorisation in automatic speech recognition.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2013

Music self-similarity modeling using augmented nonnegative matrix factorization of block and stripe patterns.

[BibT_eX]

[DOI]

Joonas Kauppinen

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Exemplar-based voice conversion using non-negative spectrogram deconvolution.

[BibT_eX]

[DOI]

Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

The 9th annual MLSP competition: New methods for acoustic classification of multiple simultaneous bird species in a noisy environment.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013

Exemplar-based unit selection for voice conversion utilizing temporal information.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Non-negative tensor factorisation of modulation spectrograms for monaural sound source separation.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Supervised model training for overlapping sound events based on unsupervised source separation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Exemplar-based joint channel and noise compensation.

[BibT_eX]

[DOI]

Kris Demuynck

Proceedings of the IEEE International Conference on Acoustics, 2013

Acquiring variable length speech bases for factorisation-based noise robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 21st European Signal Processing Conference, 2013

Semi-supervised learning for musical instrument recognition.

[BibT_eX]

[DOI]

Proceedings of the 21st European Signal Processing Conference, 2013

Group Delay Function from All-Pole Models for Musical Instrument Recognition.

[BibT_eX]

[DOI]

Proceedings of the Sound, Music, and Motion - 10th International Symposium, 2013

Learning state labels for sparse classification of speech with matrix deconvolution.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012

Voice Conversion Using Dynamic Kernel Partial Least Squares Regression.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2012

Exemplar-based sparse representation and sparse discrimination for noise robust speaker identification.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Phase spectrum prediction of audio signals.

[BibT_eX]

[DOI]

Ali Bahrami Rad

Proceedings of the 5th International Symposium on Communications, 2012

Human sound perception - what can we learn from it when developing audio analysis algorithms?

[BibT_eX]

[DOI]

Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012

Group Sparsity for Speaker Identity Discrimination in Factorisation-based Speech Recognition.

[BibT_eX]

[DOI]

Rahim Saeidi

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Non-negative matrix factorization for highly noise-robust ASR: To enhance or to recognize?

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Modelling spectro-temporal dynamics in factorisation-based noise-robust automatic speech recognition.

[BibT_eX]

[DOI]

Francisco J. Rodríguez-Serrano

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Multiple Instrument Mixtures Source Separation Evaluation Using Instrument-Dependent NMF Models.

[BibT_eX]

[DOI]

Julio J. Carabias-Orti

Pedro Vera-Candeas

Nicolás Ruiz-Reyes

Proceedings of the Latent Variable Analysis and Signal Separation, 2012

Permutation alignment of frequency-domain ICA by the maximization of intra-source envelope correlations.

[BibT_eX]

[DOI]

Proceedings of the 20th European Signal Processing Conference, 2012

Detection, separation and recognition of speech from continuous signals using spectral factorisation.

[BibT_eX]

[DOI]

Proceedings of the 20th European Signal Processing Conference, 2012

Introduction.

[BibT_eX]

[DOI]

Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012

The Basics of Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012

The Problem of Robustness in Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012

2011

Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2011

Musical Instrument Sound Multi-Excitation Model for Non-Negative Spectrogram Factorization.

[BibT_eX]

[DOI]

Julio J. Carabias-Orti

Francisco J. Cañadas-Quesada

Pedro Vera-Candeas

Nicolás Ruiz-Reyes

IEEE J. Sel. Top. Signal Process., 2011

Multichannel audio upmixing based on non-negative tensor factorization representation.

[BibT_eX]

[DOI]

Miikka Vilermo

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

Phoneme-Dependent NMF for Speech Enhancement in Monaural Mixtures.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Mapping Sparse Representation to State Likelihoods in Noise-Robust Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Uncertainty Measures for Improving Exemplar-Based Source Separation.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Non-negative matrix deconvolution in noise robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Toward a practical implementation of exemplar-based noise robust ASR.

[BibT_eX]

[DOI]

Proceedings of the 19th European Signal Processing Conference, 2011

2010

Representing Musical Sounds With an Interpolating State Model.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2010

Voice Conversion Using Partial Least Squares Regression.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2010

Automatic Recognition of Lyrics in Singing.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2010

Audio Query by Example Using Similarity Measures between Probability Density Functions of Features.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2010

State-based labelling for a sparse representation of speech and its application to robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Non-negative matrix factorization based compensation of music for automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Artificial and online acquired noise dictionaries for noise robust ASR.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Noise-to-mask ratio minimization by weighted non-negative matrix factorization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Recognition of phonemes and words in singing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Sound source separation in monaural music signals using excitation-filter model and em algorithm.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Noise robust exemplar-based connected digit recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Acoustic event detection in real life recordings.

[BibT_eX]

[DOI]

Proceedings of the 18th European Signal Processing Conference, 2010

Comparison of noise robust methods in large vocabulary speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 18th European Signal Processing Conference, 2010

Audio context recognition using audio event histograms.

[BibT_eX]

[DOI]

Proceedings of the 18th European Signal Processing Conference, 2010

2009

Musical Instrument Recognition in Polyphonic Audio Using Source-Filter Model for Sound Separation.

[BibT_eX]

[DOI]

Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

Mixtures of Gamma Priors for Non-negative Matrix Factorization Based Speech Separation.

[BibT_eX]

[DOI]

Ali Taylan Cemgil

Proceedings of the Independent Component Analysis and Signal Separation, 2009

Interpolating hidden Markov model and its application to automatic instrument recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

Spectral covariance in prior distributions of non-negative matrix factorization based speech separation.

[BibT_eX]

[DOI]

Proceedings of the 17th European Signal Processing Conference, 2009

Non-stationary noise model compensation in voice activity detection.

[BibT_eX]

[DOI]

Mikko Myllymäki

Proceedings of the 17th European Signal Processing Conference, 2009

Adaptation of a speech recognizer for singing voice.

[BibT_eX]

[DOI]

Proceedings of the 17th European Signal Processing Conference, 2009

2008

Combining pitch-based inference and non-negative spectrogram factorization in separating vocals from polyphonic music.

[BibT_eX]

[DOI]

Matti Ryynänen

Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2008

Accompaniment separation and karaoke application based on automatic melody transcription.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Bayesian extensions to non-negative matrix factorisation for audio signal modelling.

[BibT_eX]

[DOI]

Ali Taylan Cemgil

Simon J. Godsill

Proceedings of the IEEE International Conference on Acoustics, 2008

Voice activity detection in the presence of breathing noise using neural network and hidden Markov model.

[BibT_eX]

[DOI]

Mikko Myllymäki

Proceedings of the 2008 16th European Signal Processing Conference, 2008

2007

Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2007

Singer Identification in Polyphonic Music Using Vocal Separation and Pattern Recognition Methods.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Music Information Retrieval, 2007

Query by Example of Audio Signals using Euclidean Distance Between Gaussian Mixture Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

2006

Speech recognition using factorial hidden Markov models for separation in the feature space.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

2005

Drum transcription with non-negative spectrogram factorisation.

[BibT_eX]

[DOI]

Jouni Paulus

Proceedings of the 13th European Signal Processing Conference, 2005

Modeling musical sounds with an interpolating state model.

[BibT_eX]

[DOI]

Proceedings of the 13th European Signal Processing Conference, 2005

Separation of drums from polyphonic music using non-negative matrix factorization and support vector machine.

[BibT_eX]

[DOI]

Proceedings of the 13th European Signal Processing Conference, 2005

2004

Separation of sound sources by convolutive sparse coding.

[BibT_eX]

[DOI]

Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004

2003

Sound Source Separation Using Sparse Coding with Temporal Continuity Objective.

[BibT_eX]

[DOI]

Proceedings of the 2003 International Computer Music Conference, 2003

2002

Separation of harmonic sounds using linear models for the overtone series.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2002

2000

Detection of artifacts in monitored trends in intensive care.

[BibT_eX]

[DOI]

Comput. Methods Programs Biomed., 2000

Separation of harmonic sound sources using sinusoidal modeling.

[BibT_eX]

[DOI]