Erica Cooper

Orcid: 0000-0002-2978-2793

According to our database, Erica Cooper authored at least 49 papers between 2009 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Speaker Anonymization Using Orthogonal Householder Neural Network.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Uncertainty as a Predictor: Leveraging Self-Supervised Learning for Zero-Shot MOS Prediction.
CoRR, 2023

ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations.
CoRR, 2023

Speaker-Text Retrieval via Contrastive Learning.
CoRR, 2023

DDSP-based Neural Waveform Synthesis of Polyphonic Guitar Performance from String-wise MIDI Input.
CoRR, 2023

SynVox2: Towards a privacy-friendly VoxCeleb2 dataset.
CoRR, 2023

Language-independent speaker anonymization using orthogonal Householder neural network.
CoRR, 2023

Range-Based Equal Error Rate for Spoof Localization.
CoRR, 2023

Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?
Proceedings of the IEEE International Conference on Acoustics, 2023

Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-Supervised Setting.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

The VoiceMOS Challenge 2023: Zero-Shot Subjective Speech Quality Prediction for Multiple Domains.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Use of Speaker Recognition Approaches for Learning and Evaluating Embedding Representations of Musical Instrument Sounds.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances.
CoRR, 2022

The PartialSpoof Database and Countermeasures for the Detection of Short Generated Audio Segments Embedded in a Speech Utterance.
CoRR, 2022

Language-Independent Speaker Anonymization Approach Using Self-Supervised Pre-Trained Models.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions.
Proceedings of the Interspeech 2022, 2022

The VoiceMOS Challenge 2022.
Proceedings of the Interspeech 2022, 2022

Attention Back-End for Automatic Speaker Verification with Multiple Enrollment Utterances.
Proceedings of the IEEE International Conference on Acoustics, 2022

On the Interplay between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2022

LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech.
Proceedings of the IEEE International Conference on Acoustics, 2022

Generalization Ability of MOS Prediction Networks.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Multi-Task Learning in Utterance-Level and Segmental-Level Spoof Detection.
CoRR, 2021

Use of speaker recognition approaches for learning timbre representations of musical instrument sounds from raw waveforms.
CoRR, 2021

How do Voices from Past Speech Synthesis Challenges Compare Today?
CoRR, 2021

Exploring Disentanglement with Multilingual and Monolingual VQ-VAE.
CoRR, 2021

Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis.
CoRR, 2021

Attention Back-end for Automatic Speaker Verification with Multiple Enrollment Utterances.
CoRR, 2021

An Initial Investigation for Detecting Partially Spoofed Audio.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm.
Proceedings of the IEEE International Conference on Acoustics, 2021

How Similar or Different is Rakugo Speech Synthesizer to Professional Performers?
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Pretraining Strategies, Waveform Model Choice, and Acoustic Configurations for Multi-Speaker End-to-End Speech Synthesis.
CoRR, 2020

Grapheme or phoneme? An Analysis of Tacotron's Embedded Representations.
CoRR, 2020

Modeling of Rakugo Speech and Its Limitations: Toward Speech Synthesis That Entertains Audiences.
IEEE Access, 2020

Can Speaker Augmentation Improve Multi-Speaker End-to-End TTS?
Proceedings of the Interspeech 2020, 2020

Improved Prosody from Learned F0 Codebook Representations for VQ-VAE Speech Waveform Reconstruction.
Proceedings of the Interspeech 2020, 2020

Zero-Shot Multi-Speaker Text-To-Speech with State-Of-The-Art Neural Speaker Embeddings.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Text-to-Speech Synthesis Using Found Data for Low-Resource Languages.
PhD thesis, 2019

2018
A Comparison of Speaker-based and Utterance-based Data Selection for Text-to-Speech Synthesis.
Proceedings of the Interspeech 2018, 2018

2017
Utterance Selection for Optimizing Intelligibility of TTS Voices Trained on ASR Data.
Proceedings of the Interspeech 2017, 2017

2016
Data Selection and Adaptation for Naturalness in HMM-Based Speech Synthesis.
Proceedings of the Interspeech 2016, 2016

Babler - Data Collection from the Web to Support Speech Recognition and Keyword Search.
Proceedings of the 10th Web as Corpus Workshop, 2016

2015
Improving speech recognition and keyword search for low resource languages using web data.
Proceedings of the INTERSPEECH 2015, 2015

2014
Rescoring Confusion Networks for Keyword Search.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Cross-language phrase boundary detection.
Proceedings of the IEEE International Conference on Acoustics, 2013

2009
Web derived pronunciations for spoken term detection.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Unsupervised pronunciation validation.
Proceedings of the IEEE International Conference on Acoustics, 2009

Effect of pronounciations on OOV queries in spoken term detection.
Proceedings of the IEEE International Conference on Acoustics, 2009

