Slim Essid

Orcid: 0000-0002-0028-327X

According to our database, Slim Essid authored at least 119 papers between 2002 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Online speaker diarization of meetings guided by speech separation.
CoRR, 2024

2023
On the choice of the optimal temporal support for audio classification with Pre-trained embeddings.
CoRR, 2023

Collaborating Foundation models for Domain Generalized Semantic Segmentation.
CoRR, 2023

Speech Self-Supervised Representations Benchmarking: a Case for Larger Probing Heads.
CoRR, 2023

SAMbA: Speech enhancement with Asynchronous ad-hoc Microphone Arrays.
CoRR, 2023

Automatic Data Augmentation for Domain Adapted Fine-Tuning of Self-Supervised Speech Representations.
CoRR, 2023

Speech Self-Supervised Representation Benchmarking: Are We Doing it Right?
CoRR, 2023

Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysis.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Repetition-Based Triplet Mining Approach for Music Segmentation.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Fine-Tuning Strategies for Faster Inference Using Speech Self-Supervised Models: A Comparative Study.
Proceedings of the IEEE International Conference on Acoustics, 2023

Cosmopolite Sound Monitoring (CoSMo): A Study of Urban Sound Event Detection Systems Generalizing to Multiple Cities.
Proceedings of the IEEE International Conference on Acoustics, 2023

One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Pretext Tasks Selection for Multitask Self-Supervised Audio Representation Learning.
IEEE J. Sel. Top. Signal Process., 2022

Opinions in Interactions : New Annotations of the SEMAINE Database.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Learning Multi-Level Representations for Hierarchical Music Structure Analysis.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation Learning.
Proceedings of the Interspeech 2022, 2022

Latent and Adversarial Data Augmentations for Sound Event Detection and Classification.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021
DNN-Based Mask Estimation for Distributed Speech Enhancement in Spatially Unconstrained Microphone Arrays.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Early Detection of User Engagement Breakdown in Spontaneous Human-Humanoid Interaction.
IEEE Trans. Affect. Comput., 2021

Pretext Tasks selection for multitask self-supervised speech representation learning.
CoRR, 2021

User-Guided One-Shot Deep Model Adaptation for Music Source Separation.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Conditional Independence for Pretext Task Selection in Self-Supervised Speech Representation Learning.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Distributed Speech Separation in Spatially Unconstrained Microphone Arrays.
Proceedings of the IEEE International Conference on Acoustics, 2021

Neuro-Steered Music Source Separation With EEG-Based Auditory Attention Decoding And Contrastive-NMF.
Proceedings of the IEEE International Conference on Acoustics, 2021

Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes.
Proceedings of the 29th European Signal Processing Conference, 2021

2020
Weakly Supervised Representation Learning for Audio-Visual Scene Analysis.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

On-the-fly Detection of User Engagement Decrease in Spontaneous Human-Robot Interaction, International Journal of Social Robotics, 2019.
CoRR, 2020

DNN-based Distributed Multichannel Mask Estimation for Speech Enhancement in Microphone Arrays.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Audiovisual Analysis of Music Performances: Overview of an Emerging Field.
IEEE Signal Process. Mag., 2019

On-the-Fly Detection of User Engagement Decrease in Spontaneous Human-Robot Interaction Using Recurrent and Deep Neural Networks.
Int. J. Soc. Robotics, 2019

A multimodal movie review corpus for fine-grained opinion mining.
CoRR, 2019

Identify, Locate and Separate: Audio-Visual Object Extraction in Large Video Collections Using Weak Supervision.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

EEG-Based Decoding of Auditory Attention to a Target Instrument in Polyphonic Music.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

SAMBASET: A Dataset of Historical Samba de Enredo Recordings for Computational Music Analysis.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Tracking Beats and Microtiming in Afro-Latin American Music Using Conditional Random Fields and Deep Learning.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

A Music Structure Informed Downbeat Tracking System Using Skip-chain Conditional Random Fields and Deep Learning.
Proceedings of the IEEE International Conference on Acoustics, 2019

From the Token to the Review: A Hierarchical Multimodal approach to Opinion Mining.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

2018
A robust audio classification system for detecting pulmonary edema.
Biomed. Signal Process. Control., 2018

Analysis of Common Design Choices in Deep Learning Systems for Downbeat Tracking.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

Main Melody Estimation with Source-Filter NMF and CRNN.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

Structured Output Learning with Abstention: Application to Accurate Opinion Prediction.
Proceedings of the 35th International Conference on Machine Learning, 2018

An Ensemble Learning Approach to Detect Epileptic Seizures from Long Intracranial EEG Recordings.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Attitude Classification in Adjacency Pairs of a Human-Agent Interaction with Hidden Conditional Random Fields.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multi-task Feature Learning for EEG-based Emotion Recognition Using Group Nonnegative Matrix Factorization.
Proceedings of the 26th European Signal Processing Conference, 2018

Weakly Supervised Representation Learning for Unsynchronized Audio-Visual Events.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

2017
Feature Learning With Matrix Factorization Applied to Acoustic Scene Classification.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Guiding audio source separation by video object information.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Leveraging deep neural networks with nonnegative representations for improved environmental sound classification.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

Opinion Dynamics Modeling for Movie Review Transcripts Classification with Hidden Conditional Random Fields.
Proceedings of the Interspeech 2017, 2017

UE-HRI: a new dataset for the study of user engagement in spontaneous human-robot interactions.
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017

Supervised group nonnegative matrix factorisation with similarity constraints and applications to speaker identification.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Motion informed audio source separation.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Overlapping sound event detection with supervised Nonnegative Matrix Factorization.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

EMOEEG: A new multimodal dataset for dynamic EEG-based emotion recognition with audiovisual elicitation.
Proceedings of the 25th European Signal Processing Conference, 2017

Nonnegative Feature Learning Methods for Acoustic Scene Classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

2016
Mini-batch stochastic approaches for accelerated multiplicative updates in nonnegative matrix factorisation with beta-divergence.
Proceedings of the 26th IEEE International Workshop on Machine Learning for Signal Processing, 2016

Downbeat Detection with Conditional Random Fields and Deep Learned Features.
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

Machine listening techniques as a complement to video image analysis in forensics.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Group nonnegative matrix factorisation with speaker and session variability compensation for speaker identification.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Acoustic scene classification with matrix factorization for unsupervised feature learning.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
TPT-Dance&Actions : un corpus multimodal d'activités humaines.
Traitement du Signal, 2015

Melody Extraction by Contour Classification.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015

A Conditional Random Field system for beat tracking.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

HOG and subband power distribution image features for acoustic scene classification.
Proceedings of the 23rd European Signal Processing Conference, 2015

2014
Soft Nonnegative Matrix Co-Factorization.
IEEE Trans. Signal Process., 2014

Piecewise constant nonnegative matrix factorization.
Proceedings of the IEEE International Conference on Acoustics, 2014

Gesture recognition using a NMF-based representation of motion-traces extracted from depth silhouettes.
Proceedings of the IEEE International Conference on Acoustics, 2014

Assessment of new spectral features for EEG-based emotion recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
A Multimodal Approach to Speaker Diarization on TV Talk-Shows.
IEEE Trans. Multim., 2013

Smooth Nonnegative Matrix Factorization for Unsupervised Audiovisual Document Structuring.
IEEE Trans. Multim., 2013

Learning Optimal Features for Polyphonic Audio-to-Score Alignment.
IEEE Trans. Speech Audio Process., 2013

A multi-modal dance corpus for research into interaction between humans in virtual environments.
J. Multimodal User Interfaces, 2013

Multimodal classification of dance movements using body joint trajectories and step sounds.
Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013

Exploring new features for music classification.
Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013

Non-negative Tensor Factorization for single-channel EEG artifact rejection.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013

Soft nonnegative matrix co-factorization with application to multimodal speaker diarization.
Proceedings of the IEEE International Conference on Acoustics, 2013

Probabilistic dance performance alignment by fusion of multimodal features.
Proceedings of the IEEE International Conference on Acoustics, 2013

Non-negative matrix factorization for single-channel EEG artifact rejection.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Analysis of dance movements using Gaussian processes: extended abstract.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Decomposing the video editing structure of a talk-show using nonnegative matrix factorization.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

A regressive boosting approach to automatic audio tagging based on soft annotator fusion.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

An advanced virtual dance performance evaluator.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A single-class SVM based algorithm for computing an identifiable NMF.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Fusion of Multimodal Information in Music Content Analysis.
Proceedings of the Multimodal Music Processing, 2012

2011
A Conditional Random Field Framework for Robust and Scalable Audio-to-Score Matching.
IEEE ACM Trans. Audio Speech Lang. Process., 2011

Optimizing the mapping from a symbolic to an audio representation for music-to-score alignment.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

Interactive Classification of Sound Objects for Polyphonic Electro-Acoustic Music Annotation.
Proceedings of the AES International Conference Semantic Audio 2011, 2011

Enhanced visualisation of dance performance from automatically synchronised multimodal recordings.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

An audio-driven virtual dance-teaching assistant.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

An Interactive System for Electro-Acoustic Music Analysis.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Multi-scale temporal fusion by boosting for music classification.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Hidden Discrete Tempo Model: A tempo-aware timing model for audio-to-score alignment.
Proceedings of the IEEE International Conference on Acoustics, 2011

Machine Learning Techniques for Multimedia Analysis.
Proceedings of the Multimedia Semantics: Metadata, Analysis and Interaction, 2011

Feature Extraction for Multimedia Analysis.
Proceedings of the Multimedia Semantics: Metadata, Analysis and Interaction, 2011

2010
A conditional random field viewpoint of symbolic audio-to-score matching.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

YAAFE, an Easy to Use and Efficient Audio Feature Extraction Software.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

An Improved Hierarchical Approach for Music-to-symbolic Score Alignment.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Robust visual features for the multimodal identification of unregistered speakers in TV talk-shows.
Proceedings of the International Conference on Image Processing, 2010

A comparative study of tonal acoustic features for a symbolic level music-to-score alignment.
Proceedings of the IEEE International Conference on Acoustics, 2010

A multimodal approach to initialisation for top-down speaker diarization of television shows.
Proceedings of the 18th European Signal Processing Conference, 2010

2009
Temporal Integration for Audio Classification With Application to Musical Instrument Classification.
IEEE Trans. Speech Audio Process., 2009

Incorporating prior knowledge on the digital media creation process into audio classifiers.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Rushes video summarization using a collaborative approach.
Proceedings of the 2nd ACM Workshop on Video Summarization, 2008

A collaborative approach to automatic rushes video summarization.
Proceedings of the International Conference on Image Processing, 2008

On the robustness of audio features for musical instrument classification.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

Alignment kernels for audio classification with application to music instrument recognition.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

2007
On the Correlation of Automatic Audio and Visual Segmentations of Music Videos.
IEEE Trans. Circuits Syst. Video Technol., 2007

Combined Supervised and Unsupervised Approaches for Automatic Segmentation of Radiophonic Audio Streams.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Musical instrument recognition by pairwise classification strategies.
IEEE Trans. Speech Audio Process., 2006

Instrument recognition in polyphonic music based on automatic taxonomies.
IEEE Trans. Speech Audio Process., 2006

Hierarchical Classification of Musical Instruments on Solo Recordings.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Classification automatique des signaux audio-fréquences : reconnaissance des instruments de musique. (Automatic Classification of Audio Signals: Machine Recognition of Musical Instruments).
PhD thesis, 2005

Inferring Efficient Hierarchical Taxonomies for MIR Tasks: Application to Musical Instruments.
Proceedings of the ISMIR 2005, 2005

Instrument recognition in polyphonic music.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Musical instrument recognition based on class pairwise feature selection.
Proceedings of the ISMIR 2004, 2004

Musical instrument recognition on solo performances.
Proceedings of the 2004 12th European Signal Processing Conference, 2004

2002
Dynamic temporal segmentation in parametric non-stationary modeling for percussive musical signals.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Transient modeling with a frequency-transform subspace algorithm and "transient+sinusoidal" scheme.
Proceedings of the 14th International Conference on Digital Signal Processing, 2002

