Erica Cooper
Orcid: 0000-0002-2978-2793
  According to our database1,
  Erica Cooper
  authored at least 69 papers
  between 2009 and 2025.
  
  
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
  2025
    CoRR, September, 2025
    
  
    CoRR, May, 2025
    
  
Towards An Integrated Approach for Expressive Piano Performance Synthesis from Music Scores.
    
  
    CoRR, January, 2025
    
  
Phoneme-Level Duration Controllable Neural Text-to-Speech With Phoneme Embedding Skip Connection and Modified Gaussian Duration Modeling.
    
  
    IEEE Access, 2025
    
  
Towards An Integrated Approach for Expressive Piano Performance Synthesis from Music Scores.
    
  
    Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
    
  
Mora-Level Prosody Prediction for Text-to-Speech Using Japanese BERT Without Accentual Labels.
    
  
    Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
    
  
  2024
ZMM-TTS: Zero-Shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-Supervised Discrete Speech Representations.
    
  
    IEEE ACM Trans. Audio Speech Lang. Process., 2024
    
  
Joint speaker encoder and neural back-end model for fully end-to-end automatic speaker verification with multiple enrollment utterances.
    
  
    Comput. Speech Lang., 2024
    
  
MOS-Bench: Benchmarking Generalization Abilities of Subjective Speech Quality Assessment Models.
    
  
    CoRR, 2024
    
  
    Proceedings of the IEEE Spoken Language Technology Workshop, 2024
    
  
    Proceedings of the IEEE Spoken Language Technology Workshop, 2024
    
  
    Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
    
  
    Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
    
  
An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios.
    
  
    Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
    
  
Generating Speakers by Prompting Listener Impressions for Pre-trained Multi-Speaker Text-to-Speech Systems.
    
  
    Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
    
  
Uncertainty as a Predictor: Leveraging Self-Supervised Learning for Zero-Shot MOS Prediction.
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2024
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2024
    
  
  2023
The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance.
    
  
    IEEE ACM Trans. Audio Speech Lang. Process., 2023
    
  
    IEEE ACM Trans. Audio Speech Lang. Process., 2023
    
  
DDSP-based Neural Waveform Synthesis of Polyphonic Guitar Performance from String-wise MIDI Input.
    
  
    CoRR, 2023
    
  
Language-independent speaker anonymization using orthogonal Householder neural network.
    
  
    CoRR, 2023
    
  
    Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
    
  
Improving Generalization Ability of Countermeasures for New Mismatch Scenario by Combining Multiple Advanced Regularization Terms.
    
  
    Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
    
  
    Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
    
  
Investigating Range-Equalizing Bias in Mean Opinion Score Ratings of Synthesized Speech.
    
  
    Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
    
  
Can Knowledge of End-to-End Text-to-Speech Models Improve Neural Midi-to-Audio Synthesis Systems?
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2023
    
  
Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-Supervised Setting.
    
  
    Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
    
  
The Voicemos Challenge 2023: Zero-Shot Subjective Speech Quality Prediction for Multiple Domains.
    
  
    Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
    
  
Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music.
    
  
    Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
    
  
  2022
Use of Speaker Recognition Approaches for Learning and Evaluating Embedding Representations of Musical Instrument Sounds.
    
  
    IEEE ACM Trans. Audio Speech Lang. Process., 2022
    
  
The PartialSpoof Database and Countermeasures for the Detection of Short Generated Audio Segments Embedded in a Speech Utterance.
    
  
    CoRR, 2022
    
  
Language-Independent Speaker Anonymization Approach Using Self-Supervised Pre-Trained Models.
    
  
    Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022
    
  
Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions.
    
  
    Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
    
  
    Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
    
  
Attention Back-End for Automatic Speaker Verification with Multiple Enrollment Utterances.
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2022
    
  
On the Interplay between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis.
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2022
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2022
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2022
    
  
  2021
    CoRR, 2021
    
  
Use of speaker recognition approaches for learning timbre representations of musical instrument sounds from raw waveforms.
    
  
    CoRR, 2021
    
  
Attention Back-end for Automatic Speaker Verification with Multiple Enrollment Utterances.
    
  
    CoRR, 2021
    
  
    Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021
    
  
    Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021
    
  
    Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021
    
  
    Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
    
  
Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm.
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2021
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2021
    
  
  2020
Pretraining Strategies, Waveform Model Choice, and Acoustic Configurations for Multi-Speaker End-to-End Speech Synthesis.
    
  
    CoRR, 2020
    
  
Modeling of Rakugo Speech and Its Limitations: Toward Speech Synthesis That Entertains Audiences.
    
  
    IEEE Access, 2020
    
  
    Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
    
  
Improved Prosody from Learned F0 Codebook Representations for VQ-VAE Speech Waveform Reconstruction.
    
  
    Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
    
  
Zero-Shot Multi-Speaker Text-To-Speech with State-Of-The-Art Neural Speaker Embeddings.
    
  
    Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
    
  
  2019
    PhD thesis, 2019
    
  
Rakugo speech synthesis using segment-to-segment neural transduction and style tokens - toward speech synthesis for entertaining audiences.
    
  
    Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019
    
  
Subset Selection, Adaptation, Gemination and Prosody Prediction for Amharic Text-to-Speech Synthesis.
    
  
    Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019
    
  
  2018
A Comparison of Speaker-based and Utterance-based Data Selection for Text-to-Speech Synthesis.
    
  
    Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
    
  
  2017
Utterance Selection for Optimizing Intelligibility of TTS Voices Trained on ASR Data.
    
  
    Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
    
  
  2016
    Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
    
  
Babler - Data Collection from the Web to Support Speech Recognition and Keyword Search.
    
  
    Proceedings of the 10th Web as Corpus Workshop, 2016
    
  
  2015
Improving speech recognition and keyword search for low resource languages using web data.
    
  
    Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
    
  
  2014
    Proceedings of the IEEE International Conference on Acoustics, 2014
    
  
  2013
    Proceedings of the IEEE International Conference on Acoustics, 2013
    
  
  2009
    Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2009
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2009