Cem Subakan

Orcid: 0000-0002-7593-6589

According to our database1, Cem Subakan authored at least 39 papers between 2020 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Audio Prototypical Network For Controllable Music Recommendation.
CoRR, August, 2025

Autoregressive Speech Enhancement via Acoustic Tokens.
CoRR, July, 2025

Discrete Audio Tokens: More Than a Survey!
CoRR, June, 2025

ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs.
CoRR, May, 2025

LiSTEN: Learning Soft Token Embeddings for Neural Audio LLMs.
CoRR, May, 2025

Sample Compression for Continual Learning.
CoRR, March, 2025

ReTreever: Tree-based Coarse-to-Fine Representations for Retrieval.
CoRR, February, 2025

FocalCodec: Low-Bitrate Speech Coding via Focal Modulation Networks.
CoRR, February, 2025

Investigating the Effectiveness of Explainability Methods in Parkinson's Detection from Speech.
Proceedings of the IEEE International Conference on Acoustics, 2025

LMAC-TD: Producing Time Domain Explanations for Audio Classifiers.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Planing It by Ear: Convolutional Neural Networks for Acoustic Anomaly Detection in Industrial Wood Planers.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
CL-MASR: A Continual Learning Benchmark for Multilingual ASR.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Open-Source Conversational AI with SpeechBrain 1.0.
CoRR, 2024

DASB - Discrete Audio and Speech Benchmark.
CoRR, 2024

Listenable Maps for Zero-Shot Audio Classifiers.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Convolutional Neural Network-Based Reconstruction of Binarized Ultrasound Data for Non-Destructive Testing.
Proceedings of the 34th IEEE International Workshop on Machine Learning for Signal Processing, 2024

Audio Editing with Non-Rigid Text Prompts.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

How Should We Extract Discrete Audio Tokens from Self-Supervised Models?
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Phoneme Discretized Saliency Maps for Explainable Detection of AI-Generated Voice.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Listenable Maps for Audio Classifiers.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Resource-Efficient Separation Transformer.
Proceedings of the IEEE International Conference on Acoustics, 2024

Focal Modulation Networks for Interpretable Sound Classification.
Proceedings of the IEEE International Conference on Acoustics, 2024

CryCeleb: A Speaker Verification Dataset Based on Infant Cry Sounds.
Proceedings of the IEEE International Conference on Acoustics, 2024

Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve?
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Dynamic HumTrans: Humming Transcription Using CNNs and Dynamic Programming.
Proceedings of the Artificial Neural Networks in Pattern Recognition, 2024

2023
Exploring Self-Attention Mechanisms for Speech Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Audio Editing with Non-Rigid Text Prompts.
CoRR, 2023

CryCeleb: A Speaker Verification Dataset Based on Infant Cry Sounds.
CoRR, 2023

Posthoc Interpretation via Quantization.
CoRR, 2023

Unsupervised Improvement of Audio-Text Cross-Modal Representations.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Self-Supervised Learning for Infant Cry Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Learning Representations for New Sound Classes With Continual Self-Supervised Learning.
IEEE Signal Process. Lett., 2022

Resource-Efficient Separation Transformer.
CoRR, 2022

On Using Transformers for Speech-Separation.
CoRR, 2022

Real-M: Towards Speech Separation on Real Mixtures.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
SpeechBrain: A General-Purpose Speech Toolkit.
CoRR, 2021

Attention Is All You Need In Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
A Generative Modeling Approach for Interpreting Population-Level Variability in Brain Structure.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020


  Loading...