Konstantinos Drossos

Orcid: 0000-0002-3605-7127

According to our database1, Konstantinos Drossos authored at least 60 papers between 2010 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Development of a speech emotion recognizer for large-scale child-centered audio recordings from a hospital environment.
Speech Commun., March, 2023

Adversarial Representation Learning for Robust Privacy Preservation in Audio.
CoRR, 2023

Representation Learning for Audio Privacy Preservation Using Source Separation and Robust Adversarial Learning.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

2022
Domestic Activity Clustering from Audio via Depthwise Separable Convolutional Autoencoder Network.
Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022

Unsupervised Audio-Caption Aligning Learns Correspondences Between Individual Sound Events and Textual Phrases.
Proceedings of the IEEE International Conference on Acoustics, 2022

Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering.
Proceedings of the 30th European Signal Processing Conference, 2022

2021
Enriched Music Representations With Multiple Cross-Modal Contrastive Learning.
IEEE Signal Process. Lett., 2021

Towards Citizen Science for Smart Cities: A Framework for a Collaborative Game of Bird Call Recognition Based on Internet of Sound Practices.
CoRR, 2021

Automatic Analysis of the Emotional Content of Speech in Daylong Child-Centered Recordings from a Neonatal Intensive Care Unit.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Towards Sonification in Multimodal and User-friendlyExplainable Artificial Intelligence.
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021

Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags.
Proceedings of the IEEE International Conference on Acoustics, 2021

WaveTransformer: An Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information.
Proceedings of the 29th European Signal Processing Conference, 2021

Evaluating Off-the-Shelf Machine Listening and Natural Language Models for Automated Audio Captioning.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

Fairness and Underspecification in Acoustic Scene Classification: The Case for Disaggregated Evaluations.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

Assessment of Self-Attention on Learned Features For Sound Event Localization and Detection.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

Continual Learning for Automated Audio Captioning Using the Learning without Forgetting Approach.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

2020
Examining the Mapping Functions of Denoising Autoencoders in Singing Voice Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

WaveTransformer: A Novel Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information.
CoRR, 2020

Conditioned Time-Dilated Convolutions for Sound Event Detection.
CoRR, 2020

Revisiting Representation Learning for Singing Voice Separation with Sinkhorn Distances.
CoRR, 2020

COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations.
CoRR, 2020

Multichannel Singing Voice Separation by Deep Neural Network Informed DOA Constrained CNMF.
CoRR, 2020

Depthwise Separable Convolutions Versus Recurrent Neural Networks for Monaural Singing Voice Separation.
Proceedings of the 22nd IEEE International Workshop on Multimedia Signal Processing, 2020

Multichannel Singing Voice Separation by Deep Neural Network Informed DOA Constrained CMNMF.
Proceedings of the 22nd IEEE International Workshop on Multimedia Signal Processing, 2020

Sound Event Detection with Depthwise Separable and Dilated Convolutions.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

Sound Event Detection Via Dilated Convolutional Recurrent Neural Networks.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Clotho: an Audio Captioning Dataset.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Memory Requirement Reduction of Deep Neural Networks for Field Programmable Gate Arrays Using Low-Bit Quantization of Parameters.
Proceedings of the 28th European Signal Processing Conference, 2020

Unsupervised Interpretable Representation Learning for Singing Voice Separation.
Proceedings of the 28th European Signal Processing Conference, 2020

Temporal Sub-Sampling of Audio Feature Sequences for Automated Audio Captioning.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

Multi-Task Regularization Based on Infrequent Classes for Audio Captioning.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

2019
VOICe: A Sound Event Detection Dataset For Generalizable Domain Adaptation.
CoRR, 2019

Memory Requirement Reduction of Deep Neural Networks Using Low-bit Quantization of Parameters.
CoRR, 2019

Examining the Mapping Functions of Denoising Autoencoders in Music Source Separation.
CoRR, 2019

Unsupervised Adversarial Domain Adaptation Based on The Wasserstein Distance For Acoustic Scene Classification.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Crowdsourcing a Dataset of Audio Captions.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

2018
Close Miking Empirical Practice Verification: A Source Separation Approach.
CoRR, 2018

Harmonic-Percussive Source Separation with Deep Neural Networks and Phase Recovery.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Reducing Interference with Phase Recovery in DNN-based Monaural Singing Voice Separation.
Proceedings of the Interspeech 2018, 2018

MaD TwinNet: Masker-Denoiser Architecture with Twin Networks for Monaural Sound Source Separation.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Monaural Singing Voice Separation with Skip-Filtering Connections and Recurrent Inference of Time-Frequency Mask.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Unsupervised adversarial domain adaptation for acoustic scene classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

Examining the Perceptual Effect of Alternative Objective Functions for Deep Learning Based Music Source Separation.
Proceedings of the 52nd Asilomar Conference on Signals, Systems, and Computers, 2018

2017
Stacked Convolutional and Recurrent Neural Networks for Music Emotion Recognition.
CoRR, 2017

Automated audio captioning with recurrent neural networks.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

A recurrent encoder-decoder approach with skip-filtering connections for monaural singing voice separation.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

Convolutional recurrent neural networks for bird audio detection.
Proceedings of the 25th European Signal Processing Conference, 2017

Stacked convolutional and recurrent neural networks for bird audio detection.
Proceedings of the 25th European Signal Processing Conference, 2017

2015
Investigating the Impact of Sound Angular Position on the Listener Affective State.
IEEE Trans. Affect. Comput., 2015

Accessible games for blind children, empowered by binaural sound.
Proceedings of the 8th ACM International Conference on PErvasive Technologies Related to Assistive Environments, 2015

2014
A socially-intelligent multi-robot service team for in-home monitoring.
Proceedings of the 5th International Conference on Information, 2014

BEADS: A dataset of Binaural Emotionally Annotated Digital Sounds.
Proceedings of the 5th International Conference on Information, 2014

Swarm Lake: A Game of Swarm Intelligence, Human Interaction and Collaborative Music Composition.
Proceedings of the Music Technology meets Philosophy, 2014

2013
Sound events and emotions: Investigating the relation of rhythmic characteristics and arousal.
Proceedings of the 4th International Conference on Information, 2013

Gestural user interface for audio multitrack real-time stereo mixing.
Proceedings of the Audio Mostly 2013, 2013

2012
Stereo Goes Mobile: Spatial Enhancement for Short-distance Loudspeaker Setups.
Proceedings of the Eighth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2012

Affective acoustic ecology: towards emotionally enhanced sound events.
Proceedings of the Audio Mostly 2012, 2012

2011
Emotional control and visual representation using advanced audiovisual interaction.
Int. J. Arts Technol., 2011

2010
Binaural mixing using gestural control interaction.
Proceedings of the AM '10, 2010


  Loading...