Alkis Koudounas

Orcid: 0000-0003-4386-0409

According to our database1, Alkis Koudounas authored at least 33 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
A Concept-based approach to Voice Disorder Detection.
CoRR, July, 2025

"KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken Language Understanding.
CoRR, May, 2025

Exploring Generative Error Correction for Dysarthric Speech Recognition.
CoRR, May, 2025

MVP: Multi-source Voice Pathology detection.
CoRR, May, 2025

DeepDialogue: A Multi-Turn Emotionally-Rich Spoken Dialogue Dataset.
CoRR, May, 2025

"Alexa, can you forget me?" Machine Unlearning Benchmark in Spoken Language Understanding.
CoRR, May, 2025

voc2vec: A Foundation Model for Non-Verbal Vocalization.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Multimodal Fusion Techniques to Enhance Voice Disorder Diagnoses.
Proceedings of the Workshops of the EDBT/ICDT 2025 Joint Conference co-located with the EDBT/ICDT 2025 Joint Conference, 2025

Detecting and Mitigating Challenges in Zero-Shot Video Summarization with Video LLMs.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Towards Comprehensive Subgroup Performance Analysis in Speech Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Ex(o)plain: Subgroup-Level Analysis of Exoplanet Atmospheric Parameters.
IEEE Access, 2024

MALTO at SemEval-2024 Task 6: Leveraging Synthetic Data for LLM Hallucination Detection.
Proceedings of the 18th International Workshop on Semantic Evaluation, 2024

MAINDZ at SemEval-2024 Task 5: CLUEDO - Choosing Legal oUtcome by Explaining Decision through Oversight.
Proceedings of the 18th International Workshop on Semantic Evaluation, 2024

Assessing Speech Model Performance: A Subgroup Perspective.
Proceedings of the 32nd Symposium of Advanced Database Systems, 2024

A Contrastive Learning Approach to Mitigate Bias in Speech Models.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Voice Disorder Analysis: a Transformer-based Approach.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Benchmarking Representations for Speech, Music, and Acoustic Events.
Proceedings of the IEEE International Conference on Acoustics, 2024

Leveraging Confidence Models for Identifying Challenging Data Subgroups in Speech Models.
Proceedings of the IEEE International Conference on Acoustics, 2024

Prioritizing Data Acquisition for end-to-end Speech Model Improvement.
Proceedings of the IEEE International Conference on Acoustics, 2024

Houston we have a Divergence: A Subgroup Performance Analysis of ASR Models.
Proceedings of the IEEE International Conference on Acoustics, 2024

Ainur: Harmonizing Speed and Quality in Deep Music Generation Through Lyrics-Audio Embeddings.
Proceedings of the IEEE International Conference on Acoustics, 2024

Non-invasive AI-powered Diagnostics: The case of Voice-Disorder Detection.
Proceedings of the Workshops of the EDBT/ICDT 2024 Joint Conference co-located with the EDBT/ICDT 2024 Joint Conference, 2024

Explaining Speech Classification Models via Word-Level Audio Segments and Paralinguistic Features.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Speech Analysis of Language Varieties in Italy.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Reconstructing Atmospheric Parameters of Exoplanets Using Deep Learning.
CoRR, 2023

PoliToHFI at SemEval-2023 Task 6: Leveraging Entity-Aware and Hierarchical Transformers For Legal Entity Recognition and Court Judgment Prediction.
Proceedings of the The 17th International Workshop on Semantic Evaluation, 2023

Designing Logic Tensor Networks for Visual Sudoku Puzzle Classification.
Proceedings of the 17th International Workshop on Neural-Symbolic Learning and Reasoning, 2023

ITALIC: An Italian Intent Classification Dataset.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Exploring Subgroup Performance in End-to-End Speech Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

ba휌tti at GeoLingIt: Beyond Boundaries, Enhancing Geolocation Prediction and Dialect Classification on Social Media in Italy.
Proceedings of the Eighth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2023), 2023

2022
Transformer-based Non-Verbal Emotion Recognition: Exploring Model Portability across Speakers' Genders.
Proceedings of the MuSe@MM 2022: Proceedings of the 3rd International on Multimodal Sentiment Analysis Workshop and Challenge, 2022

How Much Attention Should we Pay to Mosquitoes?
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

2020
Gradient-based Learning Methods Extended to Smooth Manifolds Applied to Automated Clustering.
J. Artif. Intell. Res., 2020


  Loading...