Tuomas Virtanen

According to our database1, Tuomas Virtanen authored at least 165 papers between 2000 and 2018.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepage:

On csauthors.net:

Bibliography

2018
Separation of Moving Sound Sources Using Multichannel NMF and Acoustic Tracking.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2018

Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2018

Multichannel Blind Sound Source Separation Using Spatial Covariance Model With Level and Time Differences and Nonnegative Matrix Factorization.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2018

Unsupervised adversarial domain adaptation for acoustic scene classification.
CoRR, 2018

Acoustic Scene Classification: A Competition Review.
CoRR, 2018

Harmonic-Percussive Source Separation with Deep Neural Networks and Phase Recovery.
CoRR, 2018

A multi-device dataset for urban acoustic scene classification.
CoRR, 2018

Deep neural network based speech separation optimizing an objective estimator of intelligibility for low latency applications.
CoRR, 2018

Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks.
CoRR, 2018

End-to-End Polyphonic Sound Event Detection Using Convolutional Recurrent Neural Networks with Learned Time-Frequency Representation Input.
CoRR, 2018

Close Miking Empirical Practice Verification: A Source Separation Approach.
CoRR, 2018

Complex ISNMF: a Phase-Aware Model for Monaural Audio Source Separation.
CoRR, 2018

MaD TwinNet: Masker-Denoiser Architecture with Twin Networks for Monaural Sound Source Separation.
CoRR, 2018

Multichannel Sound Event Detection Using 3D Convolutional Neural Networks for Learning Inter-channel Features.
CoRR, 2018

Acoustic Scene Classification: a Competition Review.
Proceedings of the 28th IEEE International Workshop on Machine Learning for Signal Processing, 2018

An Active Learning Method Using Clustering and Committee-Based Sample Selection for Sound Event Classification.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Time-Frequency Masking Strategies for Single-Channel Low-Latency Speech Enhancement Using Neural Networks.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Deep Neural Network Based Speech Separation Optimizing an Objective Estimator of Intelligibility for Low Latency Applications.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Acoustic Scene Classification: An Overview of Dcase 2017 Challenge Entries.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

On Modeling the STFT Phase of Audio Signals with the Von Mises Distribution.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Towards Complex Nonnegative Matrix Factorization with the Beta-Divergence.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Using Sequential Information in Polyphonic Sound Event Detection.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Harmonic-Percussive Source Separation with Deep Neural Networks and Phase Recovery.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Expectation-Maximization Algorithms for Itakura-Saito Nonnegative Matrix Factorization.
Proceedings of the Interspeech 2018, 2018

Reducing Interference with Phase Recovery in DNN-based Monaural Singing Voice Separation.
Proceedings of the Interspeech 2018, 2018

MaD TwinNet: Masker-Denoiser Architecture with Twin Networks for Monaural Sound Source Separation.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

End-to-End Polyphonic Sound Event Detection Using Convolutional Recurrent Neural Networks with Learned Time-Frequency Representation Input.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Multichannel Sound Event Detection Using 3D Convolutional Neural Networks for Learning Inter-channel Features.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Estimation of Time-Varying Room Impulse Responses of Multiple Sound Sources from Observed Mixture and Isolated Source Signals.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Monaural Singing Voice Separation with Skip-Filtering Connections and Recurrent Inference of Time-Frequency Mask.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Bayesian Anisotropic Gaussian Model for Audio Source Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Introduction to the Special Section on Sound Scene and Event Analysis.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2017

Binary Non-Negative Matrix Deconvolution for Audio Dictionary Learning.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2017

Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2017

Monaural Singing Voice Separation with Skip-Filtering Connections and Recurrent Inference of Time-Frequency Mask.
CoRR, 2017

Direction of arrival estimation for multiple sound sources using convolutional recurrent neural network.
CoRR, 2017

Separation of Moving Sound Sources Using Multichannel NMF and Acoustic Tracking.
CoRR, 2017

Sound event detection using weakly labeled dataset with stacked convolutional and recurrent neural network.
CoRR, 2017

A report on sound event detection with different binaural features.
CoRR, 2017

A Recurrent Encoder-Decoder Approach with Skip-filtering Connections for Monaural Singing Voice Separation.
CoRR, 2017

Stacked Convolutional and Recurrent Neural Networks for Music Emotion Recognition.
CoRR, 2017

Automated Audio Captioning with Recurrent Neural Networks.
CoRR, 2017

Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection.
CoRR, 2017

Convolutional Recurrent Neural Networks for Bird Audio Detection.
CoRR, 2017

Sound Event Detection Using Spatial Features and Convolutional Recurrent Neural Network.
CoRR, 2017

Sound Event Detection in Multichannel Audio Using Spatial and Harmonic Features.
CoRR, 2017

Stacked Convolutional and Recurrent Neural Networks for Bird Audio Detection.
CoRR, 2017

Learning vocal mode classifiers from heterogeneous data sources.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Low latency sound source separation using convolutional recurrent neural networks.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Assessment of human and machine performance in acoustic scene classification: Dcase 2016 case study.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Consistent anisotropic Wiener filtering for audio source separation.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Automated audio captioning with recurrent neural networks.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Transfer learning of weakly labelled audio.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

A recurrent encoder-decoder approach with skip-filtering connections for monaural singing voice separation.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

A convolutional neural network approach for acoustic scene classification.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Active learning for sound event classification by clustering unlabeled data.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Sound event detection using spatial features and convolutional recurrent neural network.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Time-difference of arrival model for spherical microphone arrays and application to direction of arrival estimation.
Proceedings of the 25th European Signal Processing Conference, 2017

Convolutional recurrent neural networks for bird audio detection.
Proceedings of the 25th European Signal Processing Conference, 2017

Stacked convolutional and recurrent neural networks for bird audio detection.
Proceedings of the 25th European Signal Processing Conference, 2017

ASR in Classroom Today: Automatic Visualization of Conceptual Network in Science Classrooms.
Proceedings of the Data Driven Approaches in Digital Education, 2017

2016
Blind Separation of Audio Mixtures Through Nonnegative Tensor Factorization of Modulation Spectrograms.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2016

Binaural rendering of microphone array captures based on source separation.
Speech Communication, 2016

Recurrent Neural Networks for Polyphonic Sound Event Detection in Real Life Recordings.
CoRR, 2016

Filterbank learning for deep neural network based polyphonic sound event detection.
Proceedings of the 2016 International Joint Conference on Neural Networks, 2016

Recurrent neural networks for polyphonic sound event detection in real life recordings.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Low-latency sound source separation using deep neural networks.
Proceedings of the 2016 IEEE Global Conference on Signal and Information Processing, 2016

TUT database for acoustic scene classification and sound event detection.
Proceedings of the 24th European Signal Processing Conference, 2016

Cascade processing for speeding up sliding window sparse classification.
Proceedings of the 24th European Signal Processing Conference, 2016

Noise-robust detection of whispering in telephone calls using deep neural networks.
Proceedings of the 24th European Signal Processing Conference, 2016

2015
Coupled Dictionaries for Exemplar-Based Speech Enhancement and Automatic Speech Recognition.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2015

Compositional Models for Audio Processing: Uncovering the structure of sound mixtures.
IEEE Signal Process. Mag., 2015

Non-negative tensor factorization models for Bayesian audio processing.
Digital Signal Processing, 2015

Archetypal analysis for audio dictionary learning.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

Noise robust speaker recognition with convolutive sparse coding.
Proceedings of the INTERSPEECH 2015, 2015

Polyphonic sound event detection using multi label deep neural networks.
Proceedings of the 2015 International Joint Conference on Neural Networks, 2015

Sound event detection in real life recordings using coupled matrix factorization of spectral representations and class activity annotations.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Similarity induced group sparsity for non-negative matrix factorisation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Low-latency sound-source-separation using non-negative matrix factorisation with coupled analysis and synthesis dictionaries.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Exemplar-based speech enhancement for deep neural network based automatic speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Speaker Verification Using Adaptive Dictionaries in Non-negative Spectrogram Deconvolution.
Proceedings of the Latent Variable Analysis and Signal Separation, 2015

Automatic recognition of environmental sound events using all-pole group delay features.
Proceedings of the 23rd European Signal Processing Conference, 2015

Multi-label vs. combined single-label sound event detection with deep neural networks.
Proceedings of the 23rd European Signal Processing Conference, 2015

2014
Exemplar-Based Sparse Representation With Residual Compensation for Voice Conversion.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2014

Direction of Arrival Based Spatial Covariance Model for Blind Sound Source Separation.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2014

Method for creating location-specific audio textures.
EURASIP J. Audio, Speech and Music Processing, 2014

Exemplar-based noise robust automatic speech recognition using modulation spectrogram features.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Modelling primitive streaming of simple tone sequences through factorisation of modulation pattern tensors.
Proceedings of the INTERSPEECH 2014, 2014

Semi-supervised non-negative tensor factorisation of modulation spectrograms for monaural speech separation.
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014

Active-set newton algorithm for non-negative sparse coding of audio.
Proceedings of the IEEE International Conference on Acoustics, 2014

Multichannel audio separation by direction of arrival based spatial covariance model and non-negative matrix factorization.
Proceedings of the IEEE International Conference on Acoustics, 2014

Ultrasound-coupled semi-supervised nonnegative matrix factorisation for speech enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2014

Coupled dictionary training for exemplar-based speech enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2014

Recognition of acoustic events using deep neural networks.
Proceedings of the 22nd European Signal Processing Conference, 2014

Lifelog Scene Change Detection Using Cascades of Audio and Video Detectors.
Proceedings of the Computer Vision - ACCV 2014 Workshops, 2014

2013
Active-Set Newton Algorithm for Overcomplete Non-Negative Representations of Audio.
IEEE Trans. Audio, Speech & Language Processing, 2013

On the human ability to discriminate audio ambiances from similar locations of an urban environment.
Personal and Ubiquitous Computing, 2013

Context-dependent sound event detection.
EURASIP J. Audio, Speech and Music Processing, 2013

Modelling non-stationary noise with spectral factorisation in automatic speech recognition.
Computer Speech & Language, 2013

Music self-similarity modeling using augmented nonnegative matrix factorization of block and stripe patterns.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Exemplar-based voice conversion using non-negative spectrogram deconvolution.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

The 9th annual MLSP competition: New methods for acoustic classification of multiple simultaneous bird species in a noisy environment.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013

Exemplar-based unit selection for voice conversion utilizing temporal information.
Proceedings of the INTERSPEECH 2013, 2013

Non-negative tensor factorisation of modulation spectrograms for monaural sound source separation.
Proceedings of the INTERSPEECH 2013, 2013

Supervised model training for overlapping sound events based on unsupervised source separation.
Proceedings of the IEEE International Conference on Acoustics, 2013

Exemplar-based joint channel and noise compensation.
Proceedings of the IEEE International Conference on Acoustics, 2013

Acquiring variable length speech bases for factorisation-based noise robust speech recognition.
Proceedings of the 21st European Signal Processing Conference, 2013

Semi-supervised learning for musical instrument recognition.
Proceedings of the 21st European Signal Processing Conference, 2013

Group Delay Function from All-Pole Models for Musical Instrument Recognition.
Proceedings of the Sound, Music, and Motion - 10th International Symposium, 2013

Learning state labels for sparse classification of speech with matrix deconvolution.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Voice Conversion Using Dynamic Kernel Partial Least Squares Regression.
IEEE Trans. Audio, Speech & Language Processing, 2012

Exemplar-based sparse representation and sparse discrimination for noise robust speaker identification.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Phase spectrum prediction of audio signals.
Proceedings of the 5th International Symposium on Communications, 2012

Human sound perception - what can we learn from it when developing audio analysis algorithms?
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012

Group Sparsity for Speaker Identity Discrimination in Factorisation-based Speech Recognition.
Proceedings of the INTERSPEECH 2012, 2012

Non-negative matrix factorization for highly noise-robust ASR: To enhance or to recognize?
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Modelling spectro-temporal dynamics in factorisation-based noise-robust automatic speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Multiple Instrument Mixtures Source Separation Evaluation Using Instrument-Dependent NMF Models.
Proceedings of the Latent Variable Analysis and Signal Separation, 2012

Permutation alignment of frequency-domain ICA by the maximization of intra-source envelope correlations.
Proceedings of the 20th European Signal Processing Conference, 2012

Detection, separation and recognition of speech from continuous signals using spectral factorisation.
Proceedings of the 20th European Signal Processing Conference, 2012

2011
Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition.
IEEE Trans. Audio, Speech & Language Processing, 2011

Musical Instrument Sound Multi-Excitation Model for Non-Negative Spectrogram Factorization.
J. Sel. Topics Signal Processing, 2011

Multichannel audio upmixing based on non-negative tensor factorization representation.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

Phoneme-Dependent NMF for Speech Enhancement in Monaural Mixtures.
Proceedings of the INTERSPEECH 2011, 2011

Mapping Sparse Representation to State Likelihoods in Noise-Robust Automatic Speech Recognition.
Proceedings of the INTERSPEECH 2011, 2011

Uncertainty Measures for Improving Exemplar-Based Source Separation.
Proceedings of the INTERSPEECH 2011, 2011

Non-negative matrix deconvolution in noise robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Toward a practical implementation of exemplar-based noise robust ASR.
Proceedings of the 19th European Signal Processing Conference, 2011

2010
Representing Musical Sounds With an Interpolating State Model.
IEEE Trans. Audio, Speech & Language Processing, 2010

Voice Conversion Using Partial Least Squares Regression.
IEEE Trans. Audio, Speech & Language Processing, 2010

Automatic Recognition of Lyrics in Singing.
EURASIP J. Audio, Speech and Music Processing, 2010

Audio Query by Example Using Similarity Measures between Probability Density Functions of Features.
EURASIP J. Audio, Speech and Music Processing, 2010

State-based labelling for a sparse representation of speech and its application to robust speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

Non-negative matrix factorization based compensation of music for automatic speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

Artificial and online acquired noise dictionaries for noise robust ASR.
Proceedings of the INTERSPEECH 2010, 2010

Noise-to-mask ratio minimization by weighted non-negative matrix factorization.
Proceedings of the IEEE International Conference on Acoustics, 2010

Recognition of phonemes and words in singing.
Proceedings of the IEEE International Conference on Acoustics, 2010

Sound source separation in monaural music signals using excitation-filter model and em algorithm.
Proceedings of the IEEE International Conference on Acoustics, 2010

Noise robust exemplar-based connected digit recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

Acoustic event detection in real life recordings.
Proceedings of the 18th European Signal Processing Conference, 2010

Comparison of noise robust methods in large vocabulary speech recognition.
Proceedings of the 18th European Signal Processing Conference, 2010

Audio context recognition using audio event histograms.
Proceedings of the 18th European Signal Processing Conference, 2010

2009
Musical Instrument Recognition in Polyphonic Audio Using Source-Filter Model for Sound Separation.
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

Mixtures of Gamma Priors for Non-negative Matrix Factorization Based Speech Separation.
Proceedings of the Independent Component Analysis and Signal Separation, 2009

Interpolating hidden Markov model and its application to automatic instrument recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

Spectral covariance in prior distributions of non-negative matrix factorization based speech separation.
Proceedings of the 17th European Signal Processing Conference, 2009

Non-stationary noise model compensation in voice activity detection.
Proceedings of the 17th European Signal Processing Conference, 2009

Adaptation of a speech recognizer for singing voice.
Proceedings of the 17th European Signal Processing Conference, 2009

2008
Combining pitch-based inference and non-negative spectrogram factorization in separating vocals from polyphonic music.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2008

Accompaniment separation and karaoke application based on automatic melody transcription.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Bayesian extensions to non-negative matrix factorisation for audio signal modelling.
Proceedings of the IEEE International Conference on Acoustics, 2008

Voice activity detection in the presence of breathing noise using neural network and hidden Markov model.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

2007
Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria.
IEEE Trans. Audio, Speech & Language Processing, 2007

Singer Identification in Polyphonic Music Using Vocal Separation and Pattern Recognition Methods.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

Query by Example of Audio Signals using Euclidean Distance Between Gaussian Mixture Models.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Speech recognition using factorial hidden Markov models for separation in the feature space.
Proceedings of the INTERSPEECH 2006, 2006

2005
Drum transcription with non-negative spectrogram factorisation.
Proceedings of the 13th European Signal Processing Conference, 2005

Modeling musical sounds with an interpolating state model.
Proceedings of the 13th European Signal Processing Conference, 2005

Separation of drums from polyphonic music using non-negative matrix factorization and support vector machine.
Proceedings of the 13th European Signal Processing Conference, 2005

2004
Separation of sound sources by convolutive sparse coding.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004

2003
Sound Source Separation Using Sparse Coding with Temporal Continuity Objective.
Proceedings of the 2003 International Computer Music Conference, 2003

2002
Separation of harmonic sounds using linear models for the overtone series.
Proceedings of the IEEE International Conference on Acoustics, 2002

2000
Detection of artifacts in monitored trends in intensive care.
Computer Methods and Programs in Biomedicine, 2000

Separation of harmonic sound sources using sinusoidal modeling.
Proceedings of the IEEE International Conference on Acoustics, 2000

Recognition of acoustic noise mixtures by combined bottom-up and top-down processing.
Proceedings of the 10th European Signal Processing Conference, 2000


  Loading...