Adriana Cornelia Stan

Orcid: 0000-0003-2894-5770

Affiliations:
  • Technical University of Cluj-Napoca, Romania


According to our database1, Adriana Cornelia Stan authored at least 41 papers between 2011 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
RoLEX: The development of an extended Romanian lexical dataset and its evaluation at predicting concurrent lexical information.
Nat. Lang. Eng., May, 2023

Towards generalisable and calibrated synthetic speech detection with self-supervised representations.
CoRR, 2023

An analysis on the effects of speaker embedding choice in non auto-regressive TTS.
CoRR, 2023

Residual Information in Deep Speaker Embedding Architectures.
CoRR, 2023

An analysis of large speech models-based representations for speech emotion recognition.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2023

Kinyarwanda TTS: Using a multi-speaker dataset to build a Kinyarwanda TTS model.
Proceedings of the 4th Workshop on African Natural Language Processing, 2023

2022
FlexLip: A Controllable Text-to-Lip System.
Sensors, 2022

Gamification-Based Tools Embedded in the Helios Educational Platform.
Proceedings of the Information Systems and Technologies, 2022

The ZevoMOS entry to VoiceMOS Challenge 2022.
Proceedings of the Interspeech 2022, 2022

2021
The MARA corpus: Expressivity in end-to-end TTS systems using synthesised speech data.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2021

An analysis of the data efficiency in Tacotron2 speech synthesis system.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2021

An Evaluation of Word-Level Confidence Estimation for End-to-End Automatic Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021


An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis.
Proceedings of the Knowledge-Based and Intelligent Information & Engineering Systems: Proceedings of the 25th International Conference KES-2021, 2021

Speaker disentanglement in video-to-speech conversion.
Proceedings of the 29th European Signal Processing Conference, 2021

Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis.
Proceedings of the 29th European Signal Processing Conference, 2021

2020
Designing a Synthesized Content Feed System for Community Radio.
Proceedings of the NordiCHI '20: Shaping Experiences, 2020

An Evaluation of Postfiltering for Deep Learning Based Speech Synthesis with Limited Data.
Proceedings of the 10th IEEE International Conference on Intelligent Systems, 2020

RECOApy: Data Recording, Pre-Processing and Phonetic Transcription for End-to-End Speech-Based Applications.
Proceedings of the Interspeech 2020, 2020

2019
Input Encoding for Sequence-to-Sequence Learning of Romanian Grapheme-to-Phoneme Conversion.
Proceedings of the 2019 International Conference on Speech Technology and Human-Computer Dialogue, 2019

All Together Now: The Living Audio Dataset.
Proceedings of the Interspeech 2019, 2019

Deep Learning for Automatic Diacritics Restoration in Romanian.
Proceedings of the 15th IEEE International Conference on Intelligent Computer Communication and Processing, 2019

Romanian Part of Speech Tagging using LSTM Networks.
Proceedings of the 15th IEEE International Conference on Intelligent Computer Communication and Processing, 2019

2017
MaRePhoR - An open access machine-readable phonetic dictionary for Romanian.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2017

The SWARA speech corpus: A large parallel Romanian read speech dataset.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2017

2016
ALISA: An automatic lightly supervised speech segmentation and alignment tool.
Comput. Speech Lang., 2016

Blind speech segmentation using spectrogram image-based features and Mel cepstral coefficients.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Improving sentence-level alignment of speech with imperfect transcripts using utterance concatenation and VAD.
Proceedings of the IEEE 12th International Conference on Intelligent Computer Communication and Processing, 2016

2015
Phonetic segmentation of speech using STEP and t-SNE.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2015

2014
RSS-TOBI - A Prosodically Enhanced Romanian Speech Corpus.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Neural net word representations for phrase-break prediction without a part of speech tagger.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Unsupervised and lightly-supervised learning for rapid construction of TTS systems in multiple languages from 'found' data: evaluation and analysis.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Using adaptation to improve speech transcription alignment in noisy and reverberant environments.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Evaluation of sentiment polarity prediction using a dimensional and a categorical approach.
Proceedings of the 7th Conference on Speech Technology and Human-Computer Dialogue, 2013

TUNDRA: a multilingual corpus of found data for TTS research created with light supervision.
Proceedings of the INTERSPEECH 2013, 2013

Lightly supervised discriminative training of grapheme models for improved sentence-level alignment of speech and text data.
Proceedings of the INTERSPEECH 2013, 2013

Lightly supervised GMM VAD to use audiobook for speech synthesiser.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
A grapheme-based method for automatic alignment of speech and text data.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

2011
The Romanian speech synthesis (RSS) corpus: Building a high quality HMM-based speech synthesis system using a high sampling rate.
Speech Commun., 2011

A superpositional model applied to F0 parameterization using DCT for text-to-speech synthesis.
Proceedings of the 6th International Conference Speech Technology and Human-Computer Dialogue, 2011

Interactive Intonation Optimisation Using CMA-ES and DCT Parameterisation of the F0 Contour for Speech Synthesis.
Proceedings of the Nature Inspired Cooperative Strategies for Optimization, 2011


  Loading...