Sri Karlapati

According to our database¹, Sri Karlapati authored at least 16 papers between 2020 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2024

BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data.

Álvaro Martín-Cortinas

Soledad López Gambino

Kayeon Yoo

Elena Sokolova

Thomas Drugman

CoRR, 2024

Energy-conserving equivariant GNN for elasticity of lattice architected metamaterials.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

A Comparative Analysis of Pretrained Language Models for Text-to-Speech.

Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022

Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody.

CoRR, 2022

Expressive, Variable, and Controllable Duration Modelling in TTS.

CoRR, 2022

CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer.

CoRR, 2022

Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Expressive, Variable, and Controllable Duration Modelling in TTS.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021

Voicy: Zero-Shot Non-Parallel Voice Conversion in Noisy Reverberant Environments.

Alejandro Mottini

Jaime Lorenzo-Trueba

Sri Vishnu Kumar Karlapati

Thomas Drugman

Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021

Multi-Scale Spectrogram Modelling for Neural Text-to-Speech.

Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021

A Learned Conditional Prior for the VAE Acoustic Space of a TTS System.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

Camp: A Two-Stage Approach to Modelling Prosody in Context.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

2020

CopyCat: Many-to-Many Fine-Grained Prosody Transfer for Neural Text-to-Speech.

Daniel Sáez-Trigueros

Thomas Drugman

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
