Tuomas Virtanen

Orcid: 0000-0002-4604-9729

Affiliations:
  • Tampere University of Technology, Finland


According to our database1, Tuomas Virtanen authored at least 236 papers between 2000 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Dynamic Processing Neural Network Architecture for Hearing Loss Compensation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Neural Ambisonics encoding for compact irregular microphone arrays.
CoRR, 2024

2023
Attention-Driven Multichannel Speech Enhancement in Moving Sound Source Scenarios.
CoRR, 2023

Crowdsourcing and Evaluating Text-Based Audio Retrieval Relevances.
CoRR, 2023

Permutation Invariant Recurrent Neural Networks for Sound Source Tracking Applications.
CoRR, 2023

Few-shot Class-incremental Audio Classification Using Adaptively-refined Prototypes.
CoRR, 2023

Adversarial Representation Learning for Robust Privacy Preservation in Audio.
CoRR, 2023

Multi-Channel Masking with Learnable Filterbank for Sound Source Separation.
CoRR, 2023

Single-Channel Speaker Distance Estimation in Reverberant Environments.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Representation Learning for Audio Privacy Preservation Using Source Separation and Robust Adversarial Learning.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On Negative Sampling for Contrastive Audio-Text Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2023

Attention-Based Methods For Audio Question Answering.
Proceedings of the 31st European Signal Processing Conference, 2023

Spectrogram Inversion for Audio Source Separation via Consistency, Mixing, and Magnitude Constraints.
Proceedings of the 31st European Signal Processing Conference, 2023

Simultaneous or Sequential Training? How Speech Representations Cooperate in a Multi-Task Self-Supervised Learning System.
Proceedings of the 31st European Signal Processing Conference, 2023

Position Tracking of a Varying Number of Sound Sources with Sliding Permutation Invariant Training.
Proceedings of the 31st European Signal Processing Conference, 2023

2022
Self-Supervised Learning of Audio Representations From Audio-Visual Data Using Spatial Alignment.
IEEE J. Sel. Top. Signal Process., 2022

Editorial: Intelligent Signal Analysis for Contagious Virus Diseases.
IEEE J. Sel. Top. Signal Process., 2022

Subjective Evaluation of Deep Neural Network Based Speech Enhancement Systems in Real-World Conditions.
Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022

Domestic Activity Clustering from Audio via Depthwise Separable Convolutional Autoencoder Network.
Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022

Unsupervised Audio-Caption Aligning Learns Correspondences Between Individual Sound Events and Textual Phrases.
Proceedings of the IEEE International Conference on Acoustics, 2022

Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering.
Proceedings of the 30th European Signal Processing Conference, 2022

Noise, Device and Room Robustness Methods for Pronunciation Error Detection.
Proceedings of the 30th European Signal Processing Conference, 2022

Zero-Shot Audio Classification using Image Embeddings.
Proceedings of the 30th European Signal Processing Conference, 2022

Language-Based Audio Retrieval Task in DCASE 2022 Challenge.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

STARSS22: A Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

Low-Complexity Acoustic Scene Classification in DCASE 2022 Challenge.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021
Zero-Shot Audio Classification Via Semantic Embeddings.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Overview and Evaluation of Sound Event Localization and Detection in DCASE 2019.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Sound Event Detection: A tutorial.
IEEE Signal Process. Mag., 2021

Joint speaker separation and recognition using non-negative matrix deconvolution with adaptive dictionary.
Comput. Speech Lang., 2021

Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Towards Sonification in Multimodal and User-friendlyExplainable Artificial Intelligence.
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021

Zero-Shot Audio Classification with Factored Linear and Nonlinear Acoustic-Semantic Projections.
Proceedings of the IEEE International Conference on Acoustics, 2021

A Curated Dataset of Urban Scenes for Audio-Visual Scene Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2021

Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags.
Proceedings of the IEEE International Conference on Acoustics, 2021

Deep Neural Network Based Low-Latency Speech Separation with Asymmetric Analysis-Synthesis Window Pair.
Proceedings of the 29th European Signal Processing Conference, 2021

WaveTransformer: An Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information.
Proceedings of the 29th European Signal Processing Conference, 2021

Mobile Microphone Array Speech Detection and Localization in Diverse Everyday Environments.
Proceedings of the 29th European Signal Processing Conference, 2021

Neural network-based acoustic vehicle counting.
Proceedings of the 29th European Signal Processing Conference, 2021

Audio-Visual Scene Classification: Analysis of DCASE 2021 Challenge Submissions.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

Low-Complexity Acoustic Scene Classification for Multi-Device Audio: Analysis of DCASE 2021 Challenge Systems.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

2020
Active Learning for Sound Event Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Online Spectrogram Inversion for Low-Latency Audio Source Separation.
IEEE Signal Process. Lett., 2020

WaveTransformer: A Novel Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information.
CoRR, 2020

Conditioned Time-Dilated Convolutions for Sound Event Detection.
CoRR, 2020

COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations.
CoRR, 2020

Depthwise Separable Convolutions Versus Recurrent Neural Networks for Monaural Singing Voice Separation.
Proceedings of the 22nd IEEE International Workshop on Multimedia Signal Processing, 2020

Robust Audio-Based Vehicle Counting in Low-to-Moderate Traffic Flow.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2020

Sound Event Detection with Depthwise Separable and Dilated Convolutions.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

Sound Event Detection Via Dilated Convolutional Recurrent Neural Networks.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Clotho: an Audio Captioning Dataset.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Memory Requirement Reduction of Deep Neural Networks for Field Programmable Gate Arrays Using Low-Bit Quantization of Parameters.
Proceedings of the 28th European Signal Processing Conference, 2020

A Dataset of Reverberant Spatial Sound Scenes with Moving Sources for Sound Event Localization and Detection.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

Temporal Sub-Sampling of Audio Feature Sequences for Automated Audio Captioning.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

Acoustic Scene Classification in DCASE 2020 Challenge: Generalization Across Devices and Low Complexity Solutions.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

Multi-Task Regularization Based on Infrequent Classes for Audio Captioning.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

2019
Analysis of an efficient parallel implementation of active-set Newton algorithm.
J. Supercomput., 2019

Sound Event Detection in the DCASE 2017 Challenge.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Complex ISNMF: A Phase-Aware Model for Monaural Audio Source Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Deep Learning for Audio Signal Processing.
IEEE J. Sel. Top. Signal Process., 2019

Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks.
IEEE J. Sel. Top. Signal Process., 2019

Generalization of the K-SVD algorithm for minimization of <i>β</i>-divergence.
Digit. Signal Process., 2019

VOICe: A Sound Event Detection Dataset For Generalizable Domain Adaptation.
CoRR, 2019

Memory Requirement Reduction of Deep Neural Networks Using Low-bit Quantization of Parameters.
CoRR, 2019

Zero-Shot Audio Classification Based On Class Label Embeddings.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Joint Measurement of Localization and Detection of Sound Events.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Acoustic Scene Classification Using Higher-Order Ambisonic Features.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Unsupervised Adversarial Domain Adaptation Based on The Wasserstein Distance For Acoustic Scene Classification.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

City Classification from Multiple Real-World Sound Scenes.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Detection of Typical Pronunciation Errors in Non-native English Speech Using Convolutional Recurrent Neural Networks.
Proceedings of the International Joint Conference on Neural Networks, 2019

Low-latency Deep Clustering for Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Sound Event Envelope Estimation in Polyphonic Mixtures.
Proceedings of the IEEE International Conference on Acoustics, 2019

Audio-Based Epileptic Seizure Detection.
Proceedings of the 27th European Signal Processing Conference, 2019

Acoustic Scene Classification in DCASE 2019 Challenge: Closed and Open Set Classification and Data Mismatch Setups.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

Crowdsourcing a Dataset of Audio Captions.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

A Multi-room Reverberant Dataset for Sound Event Localization and Detection.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

2018
Separation of Moving Sound Sources Using Multichannel NMF and Acoustic Tracking.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Multichannel Blind Sound Source Separation Using Spatial Covariance Model With Level and Time Differences and Nonnegative Matrix Factorization.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Cascade of Boolean detector combinations.
EURASIP J. Image Video Process., 2018

Automatic segmentation of infant cry signals using hidden Markov models.
EURASIP J. Audio Speech Music. Process., 2018

Close Miking Empirical Practice Verification: A Source Separation Approach.
CoRR, 2018

Acoustic Scene Classification: a Competition Review.
Proceedings of the 28th IEEE International Workshop on Machine Learning for Signal Processing, 2018

An Active Learning Method Using Clustering and Committee-Based Sample Selection for Sound Event Classification.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Time-Frequency Masking Strategies for Single-Channel Low-Latency Speech Enhancement Using Neural Networks.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Deep Neural Network Based Speech Separation Optimizing an Objective Estimator of Intelligibility for Low Latency Applications.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Acoustic Scene Classification: An Overview of Dcase 2017 Challenge Entries.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

On Modeling the STFT Phase of Audio Signals with the Von Mises Distribution.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Towards Complex Nonnegative Matrix Factorization with the Beta-Divergence.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Using Sequential Information in Polyphonic Sound Event Detection.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Harmonic-Percussive Source Separation with Deep Neural Networks and Phase Recovery.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Expectation-Maximization Algorithms for Itakura-Saito Nonnegative Matrix Factorization.
Proceedings of the Interspeech 2018, 2018

Reducing Interference with Phase Recovery in DNN-based Monaural Singing Voice Separation.
Proceedings of the Interspeech 2018, 2018

MaD TwinNet: Masker-Denoiser Architecture with Twin Networks for Monaural Sound Source Separation.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

End-to-End Polyphonic Sound Event Detection Using Convolutional Recurrent Neural Networks with Learned Time-Frequency Representation Input.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Multichannel Sound Event Detection Using 3D Convolutional Neural Networks for Learning Inter-channel Features.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Estimation of Time-Varying Room Impulse Responses of Multiple Sound Sources from Observed Mixture and Isolated Source Signals.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Monaural Singing Voice Separation with Skip-Filtering Connections and Recurrent Inference of Time-Frequency Mask.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Bayesian Anisotropic Gaussian Model for Audio Source Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Direction of Arrival Estimation for Multiple Sound Sources Using Convolutional Recurrent Neural Network.
Proceedings of the 26th European Signal Processing Conference, 2018

A multi-device dataset for urban acoustic scene classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

Unsupervised adversarial domain adaptation for acoustic scene classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

2017
Introduction to the Special Section on Sound Scene and Event Analysis.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Binary Non-Negative Matrix Deconvolution for Audio Dictionary Learning.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

A report on sound event detection with different binaural features.
CoRR, 2017

Stacked Convolutional and Recurrent Neural Networks for Music Emotion Recognition.
CoRR, 2017

Learning vocal mode classifiers from heterogeneous data sources.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Low latency sound source separation using convolutional recurrent neural networks.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Assessment of human and machine performance in acoustic scene classification: Dcase 2016 case study.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Consistent anisotropic Wiener filtering for audio source separation.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Automated audio captioning with recurrent neural networks.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Transfer learning of weakly labelled audio.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

A recurrent encoder-decoder approach with skip-filtering connections for monaural singing voice separation.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

A convolutional neural network approach for acoustic scene classification.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Active learning for sound event classification by clustering unlabeled data.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Sound event detection using spatial features and convolutional recurrent neural network.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Time-difference of arrival model for spherical microphone arrays and application to direction of arrival estimation.
Proceedings of the 25th European Signal Processing Conference, 2017

Convolutional recurrent neural networks for bird audio detection.
Proceedings of the 25th European Signal Processing Conference, 2017

Stacked convolutional and recurrent neural networks for bird audio detection.
Proceedings of the 25th European Signal Processing Conference, 2017

ASR in Classroom Today: Automatic Visualization of Conceptual Network in Science Classrooms.
Proceedings of the Data Driven Approaches in Digital Education, 2017

DCASE2017 Challenge Setup: Tasks, Datasets and Baseline System.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

Convolutional Recurrent Neural Networks for Rare Sound Event Detection.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

Sound Event Detection Using Weakly Labeled Dataset with Stacked Convolutional and Recurrent Neural Network.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

2016
Blind Separation of Audio Mixtures Through Nonnegative Tensor Factorization of Modulation Spectrograms.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Binaural rendering of microphone array captures based on source separation.
Speech Commun., 2016

Filterbank learning for deep neural network based polyphonic sound event detection.
Proceedings of the 2016 International Joint Conference on Neural Networks, 2016

Recurrent neural networks for polyphonic sound event detection in real life recordings.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Low-latency sound source separation using deep neural networks.
Proceedings of the 2016 IEEE Global Conference on Signal and Information Processing, 2016

TUT database for acoustic scene classification and sound event detection.
Proceedings of the 24th European Signal Processing Conference, 2016

Cascade processing for speeding up sliding window sparse classification.
Proceedings of the 24th European Signal Processing Conference, 2016

Noise-robust detection of whispering in telephone calls using deep neural networks.
Proceedings of the 24th European Signal Processing Conference, 2016

DCASE 2016 Acoustic Scene Classification Using Convolutional Neural Networks.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

Sound Event Detection in Multichannel Audio Using Spatial and Harmonic Features.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

2015
Coupled Dictionaries for Exemplar-Based Speech Enhancement and Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Compositional Models for Audio Processing: Uncovering the structure of sound mixtures.
IEEE Signal Process. Mag., 2015

Non-negative tensor factorization models for Bayesian audio processing.
Digit. Signal Process., 2015

Archetypal analysis for audio dictionary learning.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

Noise robust speaker recognition with convolutive sparse coding.
Proceedings of the INTERSPEECH 2015, 2015

Polyphonic sound event detection using multi label deep neural networks.
Proceedings of the 2015 International Joint Conference on Neural Networks, 2015

Sound event detection in real life recordings using coupled matrix factorization of spectral representations and class activity annotations.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Similarity induced group sparsity for non-negative matrix factorisation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Low-latency sound-source-separation using non-negative matrix factorisation with coupled analysis and synthesis dictionaries.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Exemplar-based speech enhancement for deep neural network based automatic speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Speaker Verification Using Adaptive Dictionaries in Non-negative Spectrogram Deconvolution.
Proceedings of the Latent Variable Analysis and Signal Separation, 2015

Automatic recognition of environmental sound events using all-pole group delay features.
Proceedings of the 23rd European Signal Processing Conference, 2015

Multi-label vs. combined single-label sound event detection with deep neural networks.
Proceedings of the 23rd European Signal Processing Conference, 2015

2014
Exemplar-Based Sparse Representation With Residual Compensation for Voice Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Direction of Arrival Based Spatial Covariance Model for Blind Sound Source Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Method for creating location-specific audio textures.
EURASIP J. Audio Speech Music. Process., 2014

Exemplar-based noise robust automatic speech recognition using modulation spectrogram features.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Modelling primitive streaming of simple tone sequences through factorisation of modulation pattern tensors.
Proceedings of the INTERSPEECH 2014, 2014

Semi-supervised non-negative tensor factorisation of modulation spectrograms for monaural speech separation.
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014

Active-set newton algorithm for non-negative sparse coding of audio.
Proceedings of the IEEE International Conference on Acoustics, 2014

Multichannel audio separation by direction of arrival based spatial covariance model and non-negative matrix factorization.
Proceedings of the IEEE International Conference on Acoustics, 2014

Ultrasound-coupled semi-supervised nonnegative matrix factorisation for speech enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2014

Coupled dictionary training for exemplar-based speech enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2014

Recognition of acoustic events using deep neural networks.
Proceedings of the 22nd European Signal Processing Conference, 2014

Lifelog Scene Change Detection Using Cascades of Audio and Video Detectors.
Proceedings of the Computer Vision - ACCV 2014 Workshops, 2014

2013
Active-Set Newton Algorithm for Overcomplete Non-Negative Representations of Audio.
IEEE Trans. Speech Audio Process., 2013

On the human ability to discriminate audio ambiances from similar locations of an urban environment.
Pers. Ubiquitous Comput., 2013

Context-dependent sound event detection.
EURASIP J. Audio Speech Music. Process., 2013

Modelling non-stationary noise with spectral factorisation in automatic speech recognition.
Comput. Speech Lang., 2013

Music self-similarity modeling using augmented nonnegative matrix factorization of block and stripe patterns.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Exemplar-based voice conversion using non-negative spectrogram deconvolution.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

The 9th annual MLSP competition: New methods for acoustic classification of multiple simultaneous bird species in a noisy environment.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013

Exemplar-based unit selection for voice conversion utilizing temporal information.
Proceedings of the INTERSPEECH 2013, 2013

Non-negative tensor factorisation of modulation spectrograms for monaural sound source separation.
Proceedings of the INTERSPEECH 2013, 2013

Supervised model training for overlapping sound events based on unsupervised source separation.
Proceedings of the IEEE International Conference on Acoustics, 2013

Exemplar-based joint channel and noise compensation.
Proceedings of the IEEE International Conference on Acoustics, 2013

Acquiring variable length speech bases for factorisation-based noise robust speech recognition.
Proceedings of the 21st European Signal Processing Conference, 2013

Semi-supervised learning for musical instrument recognition.
Proceedings of the 21st European Signal Processing Conference, 2013

Group Delay Function from All-Pole Models for Musical Instrument Recognition.
Proceedings of the Sound, Music, and Motion - 10th International Symposium, 2013

Learning state labels for sparse classification of speech with matrix deconvolution.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Voice Conversion Using Dynamic Kernel Partial Least Squares Regression.
IEEE Trans. Speech Audio Process., 2012

Exemplar-based sparse representation and sparse discrimination for noise robust speaker identification.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Phase spectrum prediction of audio signals.
Proceedings of the 5th International Symposium on Communications, 2012

Human sound perception - what can we learn from it when developing audio analysis algorithms?
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012

Group Sparsity for Speaker Identity Discrimination in Factorisation-based Speech Recognition.
Proceedings of the INTERSPEECH 2012, 2012

Non-negative matrix factorization for highly noise-robust ASR: To enhance or to recognize?
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Modelling spectro-temporal dynamics in factorisation-based noise-robust automatic speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Multiple Instrument Mixtures Source Separation Evaluation Using Instrument-Dependent NMF Models.
Proceedings of the Latent Variable Analysis and Signal Separation, 2012

Permutation alignment of frequency-domain ICA by the maximization of intra-source envelope correlations.
Proceedings of the 20th European Signal Processing Conference, 2012

Detection, separation and recognition of speech from continuous signals using spectral factorisation.
Proceedings of the 20th European Signal Processing Conference, 2012

Introduction.
Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012

The Basics of Automatic Speech Recognition.
Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012

The Problem of Robustness in Automatic Speech Recognition.
Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012

2011
Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition.
IEEE Trans. Speech Audio Process., 2011

Musical Instrument Sound Multi-Excitation Model for Non-Negative Spectrogram Factorization.
IEEE J. Sel. Top. Signal Process., 2011

Multichannel audio upmixing based on non-negative tensor factorization representation.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

Phoneme-Dependent NMF for Speech Enhancement in Monaural Mixtures.
Proceedings of the INTERSPEECH 2011, 2011

Mapping Sparse Representation to State Likelihoods in Noise-Robust Automatic Speech Recognition.
Proceedings of the INTERSPEECH 2011, 2011

Uncertainty Measures for Improving Exemplar-Based Source Separation.
Proceedings of the INTERSPEECH 2011, 2011

Non-negative matrix deconvolution in noise robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Toward a practical implementation of exemplar-based noise robust ASR.
Proceedings of the 19th European Signal Processing Conference, 2011

2010
Representing Musical Sounds With an Interpolating State Model.
IEEE Trans. Speech Audio Process., 2010

Voice Conversion Using Partial Least Squares Regression.
IEEE Trans. Speech Audio Process., 2010

Automatic Recognition of Lyrics in Singing.
EURASIP J. Audio Speech Music. Process., 2010

Audio Query by Example Using Similarity Measures between Probability Density Functions of Features.
EURASIP J. Audio Speech Music. Process., 2010

State-based labelling for a sparse representation of speech and its application to robust speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

Non-negative matrix factorization based compensation of music for automatic speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

Artificial and online acquired noise dictionaries for noise robust ASR.
Proceedings of the INTERSPEECH 2010, 2010

Noise-to-mask ratio minimization by weighted non-negative matrix factorization.
Proceedings of the IEEE International Conference on Acoustics, 2010

Recognition of phonemes and words in singing.
Proceedings of the IEEE International Conference on Acoustics, 2010

Sound source separation in monaural music signals using excitation-filter model and em algorithm.
Proceedings of the IEEE International Conference on Acoustics, 2010

Noise robust exemplar-based connected digit recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

Acoustic event detection in real life recordings.
Proceedings of the 18th European Signal Processing Conference, 2010

Comparison of noise robust methods in large vocabulary speech recognition.
Proceedings of the 18th European Signal Processing Conference, 2010

Audio context recognition using audio event histograms.
Proceedings of the 18th European Signal Processing Conference, 2010

2009
Musical Instrument Recognition in Polyphonic Audio Using Source-Filter Model for Sound Separation.
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

Mixtures of Gamma Priors for Non-negative Matrix Factorization Based Speech Separation.
Proceedings of the Independent Component Analysis and Signal Separation, 2009

Interpolating hidden Markov model and its application to automatic instrument recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

Spectral covariance in prior distributions of non-negative matrix factorization based speech separation.
Proceedings of the 17th European Signal Processing Conference, 2009

Non-stationary noise model compensation in voice activity detection.
Proceedings of the 17th European Signal Processing Conference, 2009

Adaptation of a speech recognizer for singing voice.
Proceedings of the 17th European Signal Processing Conference, 2009

2008
Combining pitch-based inference and non-negative spectrogram factorization in separating vocals from polyphonic music.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2008

Accompaniment separation and karaoke application based on automatic melody transcription.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Bayesian extensions to non-negative matrix factorisation for audio signal modelling.
Proceedings of the IEEE International Conference on Acoustics, 2008

Voice activity detection in the presence of breathing noise using neural network and hidden Markov model.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

2007
Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria.
IEEE Trans. Speech Audio Process., 2007

Singer Identification in Polyphonic Music Using Vocal Separation and Pattern Recognition Methods.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

Query by Example of Audio Signals using Euclidean Distance Between Gaussian Mixture Models.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Speech recognition using factorial hidden Markov models for separation in the feature space.
Proceedings of the INTERSPEECH 2006, 2006

2005
Drum transcription with non-negative spectrogram factorisation.
Proceedings of the 13th European Signal Processing Conference, 2005

Modeling musical sounds with an interpolating state model.
Proceedings of the 13th European Signal Processing Conference, 2005

Separation of drums from polyphonic music using non-negative matrix factorization and support vector machine.
Proceedings of the 13th European Signal Processing Conference, 2005

2004
Separation of sound sources by convolutive sparse coding.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004

2003
Sound Source Separation Using Sparse Coding with Temporal Continuity Objective.
Proceedings of the 2003 International Computer Music Conference, 2003

2002
Separation of harmonic sounds using linear models for the overtone series.
Proceedings of the IEEE International Conference on Acoustics, 2002

2000
Detection of artifacts in monitored trends in intensive care.
Comput. Methods Programs Biomed., 2000

Separation of harmonic sound sources using sinusoidal modeling.
Proceedings of the IEEE International Conference on Acoustics, 2000

Recognition of acoustic noise mixtures by combined bottom-up and top-down processing.
Proceedings of the 10th European Signal Processing Conference, 2000


  Loading...