Konstantinos Drossos
Orcid: 0000-0002-3605-7127
  According to our database1,
  Konstantinos Drossos
  authored at least 73 papers
  between 2010 and 2025.
  
  
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
- 
    on orcid.org
On csauthors.net:
Bibliography
  2025
Lightweight DNN for Full-Band Speech Denoising on Mobile Devices: Exploiting Long and Short Temporal Patterns.
    
  
    CoRR, September, 2025
    
  
    CoRR, July, 2025
    
  
Attractor-Based Speech Separation of Multiple Utterances by Unknown Number of Speakers.
    
  
    CoRR, May, 2025
    
  
Knowledge Distillation for Speech Denoising by Latent Representation Alignment with Cosine Distance.
    
  
    CoRR, May, 2025
    
  
    Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
    
  
  2023
Development of a speech emotion recognizer for large-scale child-centered audio recordings from a hospital environment.
    
  
    Speech Commun., March, 2023
    
  
    CoRR, 2023
    
  
Representation Learning for Audio Privacy Preservation Using Source Separation and Robust Adversarial Learning.
    
  
    Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023
    
  
  2022
Domestic Activity Clustering from Audio via Depthwise Separable Convolutional Autoencoder Network.
    
  
    Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022
    
  
Unsupervised Audio-Caption Aligning Learns Correspondences Between Individual Sound Events and Textual Phrases.
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2022
    
  
    Proceedings of the 30th European Signal Processing Conference, 2022
    
  
  2021
    IEEE Signal Process. Lett., 2021
    
  
Towards Citizen Science for Smart Cities: A Framework for a Collaborative Game of Bird Call Recognition Based on Internet of Sound Practices.
    
  
    CoRR, 2021
    
  
Automatic Analysis of the Emotional Content of Speech in Daylong Child-Centered Recordings from a Neonatal Intensive Care Unit.
    
  
    Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
    
  
Towards Sonification in Multimodal and User-friendlyExplainable Artificial Intelligence.
    
  
    Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2021
    
  
WaveTransformer: An Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information.
    
  
    Proceedings of the 29th European Signal Processing Conference, 2021
    
  
Evaluating Off-the-Shelf Machine Listening and Natural Language Models for Automated Audio Captioning.
    
  
    Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021
    
  
Fairness and Underspecification in Acoustic Scene Classification: The Case for Disaggregated Evaluations.
    
  
    Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021
    
  
Assessment of Self-Attention on Learned Features For Sound Event Localization and Detection.
    
  
    Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021
    
  
Continual Learning for Automated Audio Captioning Using the Learning without Forgetting Approach.
    
  
    Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021
    
  
  2020
Dataset used in COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations.
    
  
    Dataset, June, 2020
    
  
Examining the Mapping Functions of Denoising Autoencoders in Singing Voice Separation.
    
  
    IEEE ACM Trans. Audio Speech Lang. Process., 2020
    
  
WaveTransformer: A Novel Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information.
    
  
    CoRR, 2020
    
  
Revisiting Representation Learning for Singing Voice Separation with Sinkhorn Distances.
    
  
    CoRR, 2020
    
  
COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations.
    
  
    CoRR, 2020
    
  
Multichannel Singing Voice Separation by Deep Neural Network Informed DOA Constrained CNMF.
    
  
    CoRR, 2020
    
  
Depthwise Separable Convolutions Versus Recurrent Neural Networks for Monaural Singing Voice Separation.
    
  
    Proceedings of the 22nd IEEE International Workshop on Multimedia Signal Processing, 2020
    
  
Multichannel Singing Voice Separation by Deep Neural Network Informed DOA Constrained CMNMF.
    
  
    Proceedings of the 22nd IEEE International Workshop on Multimedia Signal Processing, 2020
    
  
    Proceedings of the 2020 International Joint Conference on Neural Networks, 2020
    
  
    Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
    
  
    Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
    
  
Memory Requirement Reduction of Deep Neural Networks for Field Programmable Gate Arrays Using Low-Bit Quantization of Parameters.
    
  
    Proceedings of the 28th European Signal Processing Conference, 2020
    
  
    Proceedings of the 28th European Signal Processing Conference, 2020
    
  
    Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020
    
  
    Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020
    
  
  2019
Code of the method presented in the paper: Drossos et al, "Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling," in proceedings of DCASE 2019.
    
  
    Dataset, November, 2019
    
  
    CoRR, 2019
    
  
Memory Requirement Reduction of Deep Neural Networks Using Low-bit Quantization of Parameters.
    
  
    CoRR, 2019
    
  
Examining the Mapping Functions of Denoising Autoencoders in Music Source Separation.
    
  
    CoRR, 2019
    
  
Unsupervised Adversarial Domain Adaptation Based on The Wasserstein Distance For Acoustic Scene Classification.
    
  
    Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
    
  
    Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019
    
  
Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling.
    
  
    Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019
    
  
  2018
    CoRR, 2018
    
  
    Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
    
  
Reducing Interference with Phase Recovery in DNN-based Monaural Singing Voice Separation.
    
  
    Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
    
  
MaD TwinNet: Masker-Denoiser Architecture with Twin Networks for Monaural Sound Source Separation.
    
  
    Proceedings of the 2018 International Joint Conference on Neural Networks, 2018
    
  
Monaural Singing Voice Separation with Skip-Filtering Connections and Recurrent Inference of Time-Frequency Mask.
    
  
    Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
    
  
    Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018
    
  
Examining the Perceptual Effect of Alternative Objective Functions for Deep Learning Based Music Source Separation.
    
  
    Proceedings of the 52nd Asilomar Conference on Signals, Systems, and Computers, 2018
    
  
  2017
    CoRR, 2017
    
  
    Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017
    
  
A recurrent encoder-decoder approach with skip-filtering connections for monaural singing voice separation.
    
  
    Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017
    
  
    Proceedings of the 25th European Signal Processing Conference, 2017
    
  
    Proceedings of the 25th European Signal Processing Conference, 2017
    
  
  2015
    IEEE Trans. Affect. Comput., 2015
    
  
    Proceedings of the 8th ACM International Conference on PErvasive Technologies Related to Assistive Environments, 2015
    
  
  2014
    Proceedings of the 5th International Conference on Information, 2014
    
  
    Proceedings of the 5th International Conference on Information, 2014
    
  
Swarm Lake: A Game of Swarm Intelligence, Human Interaction and Collaborative Music Composition.
    
  
    Proceedings of the Music Technology meets Philosophy, 2014
    
  
  2013
Sound events and emotions: Investigating the relation of rhythmic characteristics and arousal.
    
  
    Proceedings of the 4th International Conference on Information, 2013
    
  
    Proceedings of the Audio Mostly 2013, 2013
    
  
  2012
    Proceedings of the Eighth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2012
    
  
    Proceedings of the Audio Mostly 2012, 2012
    
  
  2011
    Int. J. Arts Technol., 2011
    
  
  2010