Ismail Rasim Ülgen

Orcid: 0000-0003-4593-9057

According to our database, Ismail Rasim Ülgen authored at least 15 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
DiffAnon: Diffusion-based Prosody Control for Voice Anonymization.
CoRR, April, 2026

Neuron-Level Emotion Control in Speech-Generative Large Audio-Language Models.
CoRR, March, 2026

2025
NaturalVoices: A Large-Scale, Spontaneous and Emotional Podcast Dataset for Voice Conversion.
CoRR, November, 2025

HuLA: Prosody-Aware Anti-Spoofing with Multi-Task Learning for Expressive and Emotional Synthetic Speech.
CoRR, September, 2025

Objective Evaluation of Prosody and Intelligibility in Speech Synthesis via Conditional Prediction of Discrete Tokens.
CoRR, September, 2025

The Interspeech 2025 Challenge on Speech Emotion Recognition in Naturalistic Conditions.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Can Emotion Fool Anti-spoofing?
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

2024
SelectTTS: Synthesizing Anyone's Voice via Discrete Unit-Based Frame Selection.
CoRR, 2024

We Need Variations in Speech Synthesis: Sub-center Modelling for Speaker Embeddings.
CoRR, 2024

Discrete Unit Based Masking For Improving Disentanglement in Voice Conversion.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Towards Naturalistic Voice Conversion: NaturalVoices Dataset with an Automatic Processing Pipeline.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Revealing Emotional Clusters in Speaker Embeddings: A Contrastive Learning Strategy for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

2022
Unsupervised Domain Adaptation of Neural PLDA Using Segment Pairs for Speaker Verification.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

2021
Predicting Biometric Error Behaviour from Speaker Embeddings and a Fast Score Normalization Scheme.
Proceedings of the Speech and Computer - 23rd International Conference, 2021

2020
Speech Activity Detection Under Adverse Conditions Using Neural Networks and Speaker Diarization.
Proceedings of the 28th Signal Processing and Communications Applications Conference, 2020

