Edresson Casanova

Orcid: 0000-0003-0160-7173

According to our database1, Edresson Casanova authored at least 40 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Tagarela - A Portuguese speech dataset from podcasts.
CoRR, March, 2026

Certas Palavras: A 1980s-90s Brazilian Radio Corpus to Test TTS Models in Noisy Multi-Speaker Dialogues.
Proceedings of the 17th International Conference on Computational Processing of Portuguese, 2026

2025
The Impact of Prosodic Segmentation on Speech Synthesis of Spontaneous Speech.
CoRR, November, 2025

Align2Speak: Improving TTS for Low Resource Languages via ASR-Guided Online Preference Optimization.
CoRR, September, 2025

Frame-Stacked Local Transformers For Efficient Multi-Codebook Speech Generation.
CoRR, September, 2025

HiFiTTS-2: A Large-Scale High Bandwidth Speech Dataset.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Efficient and Direct Duplex Modeling for Speech-to-Speech Language Model.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

NanoCodec: Towards High-Quality Ultra Fast Speech LLM Inference.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Low Frame-rate Speech Codec: a Codec Designed for Fast High-quality Speech LLM Training and Inference.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Koel-TTS: Enhancing LLM based Speech Generation with Preference Alignment and Classifier Free Guidance.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

MuPe Life Stories Dataset: Spontaneous Speech in Brazilian Portuguese with a Case Study Evaluation on ASR Bias against Speakers Groups and Topic Modeling.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

The Impact of Prosodic Segmentation on Speech Synthesis of Spontaneous Speech.
Proceedings of the Intelligent Systems - 35th Brazilian Conference, 2025

Open Full-duplex Voice Agent with Speech-to-Speech Language Model.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

2024
TTS applied to the generation of datasets for automatic speech recognition.
Proceedings of the 16th International Conference on Computational Processing of Portuguese, 2024

XTTS: a Massively Multilingual Zero-Shot Text-to-Speech Model.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

MLAAD: The Multi-Language Audio Anti-Spoofing Dataset.
Proceedings of the International Joint Conference on Neural Networks, 2024

2023
CORAA ASR: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese.
Lang. Resour. Evaluation, September, 2023

Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person.
CoRR, 2023

CML-TTS: A Multilingual Dataset for Speech Synthesis in Low-Resource Languages.
Proceedings of the Text, Speech, and Dialogue - 26th International Conference, 2023

Evaluation of Speech Representations for MOS Prediction.
Proceedings of the Text, Speech, and Dialogue - 26th International Conference, 2023

ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
TTS-Portuguese Corpus: a corpus for speech synthesis in Brazilian Portuguese.
Lang. Resour. Evaluation, 2022

Interpretability Analysis of Deep Models for COVID-19 Detection.
CoRR, 2022

A single speaker is almost all you need for automatic speech recognition.
CoRR, 2022

Overview of the Automatic Speech Recognition for Spontaneous and Prepared Speech & Speech Emotion Recognition in Portuguese (S&ER) Shared-tasks at PROPOR 2022.
Proceedings of the Workshop on Automatic Speech Recognition for Spontaneous and Prepared Speech & Speech Emotion Recognition in Portuguese co-located with 15th edition of the International Conference on the Computational Processing of Portuguese (PROPOR 2022), 2022

Brazilian Portuguese Speech Recognition Using Wav2vec 2.0.
Proceedings of the Computational Processing of the Portuguese Language, 2022

BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for Everyone.
Proceedings of the International Conference on Machine Learning, 2022

2021
CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese.
CoRR, 2021

Evaluating Semantic Similarity Methods to Build Semantic Predictability Norms of Reading Data.
Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021

SC-GlowTTS: An Efficient Zero-Shot Multi-Speaker Text-To-Speech Model.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Transfer Learning and Data Augmentation Techniques to the COVID-19 Identification Tasks in ComParE 2021.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Speech2Phone: A Novel and Efficient Method for Training Speaker Recognition Models.
Proceedings of the Intelligent Systems - 10th Brazilian Conference, 2021

Deep Learning against COVID-19: Respiratory Insufficiency Detection in Brazilian Portuguese Speech.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
End-To-End Speech Synthesis Applied to Brazilian Portuguese.
CoRR, 2020

Speech2Phone: A Multilingual and Text Independent Speaker Identification Model.
CoRR, 2020

Natural Language Inference for Portuguese Using BERT and Multilingual Information.
Proceedings of the Computational Processing of the Portuguese Language, 2020

Evaluating Sentence Segmentation in Different Datasets of Neuropsychological Language Tests in Brazilian Portuguese.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

2019
NILC at ASSIN 2: Exploring Multilingual Approaches.
Proceedings of the ASSIN 2 Shared Task: Evaluating Semantic Textual Similarity and Textual Entailment in Portuguese co-located with XII Symposium in Information and Human Language Technology (STIL 2019), 2019


  Loading...