Adriana Cornelia Stan

Elena Irimia

Verginica Barbu Mititelu

Nat. Lang. Eng., May, 2023

Towards generalisable and calibrated synthetic speech detection with self-supervised representations.

[BibT_eX]

[DOI]

CoRR, 2023

An analysis on the effects of speaker embedding choice in non auto-regressive TTS.

[BibT_eX]

[DOI]

Johannah O'Mahony

CoRR, 2023

Residual Information in Deep Speaker Embedding Architectures.

[BibT_eX]

[DOI]

CoRR, 2023

An analysis of large speech models-based representations for speech emotion recognition.

[BibT_eX]

[DOI]

Adrian Bogdan Stânea

Vlad Striletchi

Cosmin Striletchi

Adriana Cornelia Stan

Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2023

Kinyarwanda TTS: Using a multi-speaker dataset to build a Kinyarwanda TTS model.

[BibT_eX]

[DOI]

Samuel Rutunda

Kleber Kabanda

Proceedings of the 4th Workshop on African Natural Language Processing, 2023

2022

FlexLip: A Controllable Text-to-Lip System.

[BibT_eX]

[DOI]

Sensors, 2022

Gamification-Based Tools Embedded in the Helios Educational Platform.

[BibT_eX]

[DOI]

Cosmin Striletchi

Adriana Cornelia Stan

Eusebiu Jecan

Proceedings of the Information Systems and Technologies, 2022

The ZevoMOS entry to VoiceMOS Challenge 2022.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2022, 2022

2021

The MARA corpus: Expressivity in end-to-end TTS systems using synthesised speech data.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2021

An analysis of the data efficiency in Tacotron2 speech synthesis system.

[BibT_eX]

[DOI]

Georgiana Saracu

Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2021

An Evaluation of Word-Level Confidence Estimation for End-to-End Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

LiRo: Benchmark and leaderboard for Romanian language tasks.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the Knowledge-Based and Intelligent Information & Engineering Systems: Proceedings of the 25th International Conference KES-2021, 2021

Speaker disentanglement in video-to-speech conversion.

[BibT_eX]

[DOI]

Dan Oneata

Horia Cucu

Proceedings of the 29th European Signal Processing Conference, 2021

Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 29th European Signal Processing Conference, 2021

2020

Designing a Synthesized Content Feed System for Community Radio.

[BibT_eX]

[DOI]

Kristen M. Scott

Simone Ashby

Proceedings of the NordiCHI '20: Shaping Experiences, 2020

An Evaluation of Postfiltering for Deep Learning Based Speech Synthesis with Limited Data.

[BibT_eX]

[DOI]

Proceedings of the 10th IEEE International Conference on Intelligent Systems, 2020

RECOApy: Data Recording, Pre-Processing and Phonetic Transcription for End-to-End Speech-Based Applications.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2020, 2020

2019

Input Encoding for Sequence-to-Sequence Learning of Romanian Grapheme-to-Phoneme Conversion.

[BibT_eX]

[DOI]

Proceedings of the 2019 International Conference on Speech Technology and Human-Computer Dialogue, 2019

All Together Now: The Living Audio Dataset.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2019, 2019

Deep Learning for Automatic Diacritics Restoration in Romanian.

[BibT_eX]

[DOI]

Maria Nutu

Proceedings of the 15th IEEE International Conference on Intelligent Computer Communication and Processing, 2019

Romanian Part of Speech Tagging using LSTM Networks.

[BibT_eX]

[DOI]

Maria Nutu

Proceedings of the 15th IEEE International Conference on Intelligent Computer Communication and Processing, 2019

2017

MaRePhoR - An open access machine-readable phonetic dictionary for Romanian.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2017

The SWARA speech corpus: A large parallel Romanian read speech dataset.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2017

2016

ALISA: An automatic lightly supervised speech segmentation and alignment tool.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2016

Blind speech segmentation using spectrogram image-based features and Mel cepstral coefficients.

[BibT_eX]

[DOI]

Cassia Valentini-Botinhao

Bogdan Orza

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Improving sentence-level alignment of speech with imperfect transcripts using utterance concatenation and VAD.

[BibT_eX]

[DOI]

Alexandru Moldovan

Proceedings of the IEEE 12th International Conference on Intelligent Computer Communication and Processing, 2016

2015

Phonetic segmentation of speech using STEP and t-SNE.

[BibT_eX]

[DOI]

Cassia Valentini-Botinhao

Simon King

Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2015

2014

RSS-TOBI - A Prosodically Enhanced Romanian Speech Corpus.

[BibT_eX]

[DOI]

Tiberiu Boros

Oliver Watts

Stefan Daniel Dumitrescu

Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Neural net word representations for phrase-break prediction without a part of speech tagger.

[BibT_eX]

[DOI]

Oliver Watts

Siva Reddy Gangireddy

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Unsupervised and lightly-supervised learning for rapid construction of TTS systems in multiple languages from 'found' data: evaluation and analysis.

[BibT_eX]

[DOI]

Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Using adaptation to improve speech transcription alignment in noisy and reverberant environments.

[BibT_eX]

[DOI]

Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Evaluation of sentiment polarity prediction using a dimensional and a categorical approach.

[BibT_eX]

[DOI]

Proceedings of the 7th Conference on Speech Technology and Human-Computer Dialogue, 2013

TUNDRA: a multilingual corpus of found data for TTS research created with light supervision.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2013, 2013

Lightly supervised discriminative training of grapheme models for improved sentence-level alignment of speech and text data.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2013, 2013

Lightly supervised GMM VAD to use audiobook for speech synthesiser.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

A grapheme-based method for automatic alignment of speech and text data.

[BibT_eX]

[DOI]

Peter Bell

Simon King

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

2011

The Romanian speech synthesis (RSS) corpus: Building a high quality HMM-based speech synthesis system using a high sampling rate.

[BibT_eX]

[DOI]

Speech Commun., 2011

A superpositional model applied to F0 parameterization using DCT for text-to-speech synthesis.

[BibT_eX]

[DOI]