Cem Subakan

Orcid: 0000-0002-7593-6589

According to our database¹, Cem Subakan authored at least 43 papers between 2020 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Investigating Faithfulness in Large Audio Language Models.

[BibT_eX]

[DOI]

CoRR, September, 2025

Virtual Consistency for Audio Editing.

[BibT_eX]

[DOI]

CoRR, September, 2025

FocalCodec-Stream: Streaming Low-Bitrate Speech Coding via Causal Distillation.

[BibT_eX]

[DOI]

Luca Della Libera

Cem Subakan

Mirco Ravanelli

CoRR, September, 2025

Autoregressive Speech Enhancement via Acoustic Tokens.

[BibT_eX]

[DOI]

Luca Della Libera

Cem Subakan

Mirco Ravanelli

CoRR, July, 2025

ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs.

[BibT_eX]

[DOI]

CoRR, May, 2025

Sample Compression for Continual Learning.

[BibT_eX]

[DOI]

CoRR, March, 2025

ReTreever: Tree-based Coarse-to-Fine Representations for Retrieval.

[BibT_eX]

[DOI]

Valentina Zantedeschi

CoRR, February, 2025

FocalCodec: Low-Bitrate Speech Coding via Focal Modulation Networks.

[BibT_eX]

[DOI]

CoRR, February, 2025

Discrete Audio Tokens: More Than a Survey!

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2025

Towards Generalizable Learning Models for EEG-Based Identification of Pain Perception.

[BibT_eX]

[DOI]

Proceedings of the 35th IEEE International Workshop on Machine Learning for Signal Processing, 2025

Audio Prototypical Network for Controllable Music Recommendation.

[BibT_eX]

[DOI]

Proceedings of the 35th IEEE International Workshop on Machine Learning for Signal Processing, 2025

LiSTEN: Learning Soft Token Embeddings for Neural Audio LLMs.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Investigating the Effectiveness of Explainability Methods in Parkinson's Detection from Speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2025

LMAC-TD: Producing Time Domain Explanations for Audio Classifiers.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Planing It by Ear: Convolutional Neural Networks for Acoustic Anomaly Detection in Industrial Wood Planers.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024

CL-MASR: A Continual Learning Benchmark for Multilingual ASR.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Open-Source Conversational AI with SpeechBrain 1.0.

[BibT_eX]

[DOI]

CoRR, 2024

DASB - Discrete Audio and Speech Benchmark.

[BibT_eX]

[DOI]

CoRR, 2024

Listenable Maps for Zero-Shot Audio Classifiers.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Convolutional Neural Network-Based Reconstruction of Binarized Ultrasound Data for Non-Destructive Testing.

[BibT_eX]

[DOI]

Alexandre Moreau

Angélique Bouchard

Cem Subakan

Guillaume Painchaud-April

Alain Le Duff

Proceedings of the 34th IEEE International Workshop on Machine Learning for Signal Processing, 2024

Audio Editing with Non-Rigid Text Prompts.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

How Should We Extract Discrete Audio Tokens from Self-Supervised Models?

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Phoneme Discretized Saliency Maps for Explainable Detection of AI-Generated Voice.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Listenable Maps for Audio Classifiers.

[BibT_eX]

[DOI]

Francesco Paissan

Mirco Ravanelli

Cem Subakan

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Resource-Efficient Separation Transformer.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Focal Modulation Networks for Interpretable Sound Classification.

[BibT_eX]

[DOI]

Luca Della Libera

Cem Subakan

Mirco Ravanelli

Proceedings of the IEEE International Conference on Acoustics, 2024

CryCeleb: A Speaker Verification Dataset Based on Infant Cry Sounds.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve?

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Dynamic HumTrans: Humming Transcription Using CNNs and Dynamic Programming.

[BibT_eX]

[DOI]

Shubham Gupta

Isaac Neri Gomez-Sarmiento

Faez Amjed Mezdari

Mirco Ravanelli

Cem Subakan

Proceedings of the Artificial Neural Networks in Pattern Recognition, 2024

2023

Exploring Self-Attention Mechanisms for Speech Separation.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Audio Editing with Non-Rigid Text Prompts.

[BibT_eX]

[DOI]

CoRR, 2023

CryCeleb: A Speaker Verification Dataset Based on Infant Cry Sounds.

[BibT_eX]

[DOI]

CoRR, 2023

Posthoc Interpretation via Quantization.

[BibT_eX]

[DOI]

Cem Subakan

Francesco Paissan

Mirco Ravanelli

CoRR, 2023

Unsupervised Improvement of Audio-Text Cross-Modal Representations.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Self-Supervised Learning for Infant Cry Analysis.

[BibT_eX]

[DOI]

Samantha Latremouille

Charles C. Onu

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Learning Representations for New Sound Classes With Continual Self-Supervised Learning.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2022

Resource-Efficient Separation Transformer.

[BibT_eX]

[DOI]

CoRR, 2022

On Using Transformers for Speech-Separation.

[BibT_eX]

[DOI]

CoRR, 2022

Real-M: Towards Speech Separation on Real Mixtures.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

SpeechBrain: A General-Purpose Speech Toolkit.

[BibT_eX]

[DOI]

CoRR, 2021

Attention Is All You Need In Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

A Generative Modeling Approach for Interpreting Population-Level Variability in Brain Structure.

[BibT_eX]

[DOI]

Ran Liu

Cem Subakan

Aishwarya H. Balwani

Jennifer D. Whitesell

Julie Harris

Sanmi Koyejo

Eva L. Dyer

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Cem Subakan

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...