Yuma Shirahata
  According to our database1,
  Yuma Shirahata
  authored at least 14 papers
  between 2019 and 2025.
  
  
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
  2025
Grapheme-Coherent Phonemic and Prosodic Annotation of Speech by Implicit and Explicit Grapheme Conditioning.
    
  
    CoRR, June, 2025
    
  
BitTTS: Highly Compact Text-to-Speech Using 1.58-bit Quantization and Weight Indexing.
    
  
    CoRR, June, 2025
    
  
    Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
    
  
  2024
Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data.
    
  
    Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
    
  
    Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
    
  
LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning.
    
  
    Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
    
  
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions.
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2024
    
  
  2023
Period VITS: Variational Inference with Explicit Pitch Modeling for End-To-End Emotional Speech Synthesis.
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2023
    
  
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform.
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2023
    
  
  2022
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation.
    
  
    Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
    
  
  2021
How Do Speakers Pause and Hesitate in English and Japanese? - A Comparison Using Parallel Corpora of English and Japanese Presentation Speeches -.
    
  
    Proceedings of the 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2021
    
  
  2020
Discriminative Method to Extract Coarse Prosodic Structure and its Application for Statistical Phrase/Accent Command Estimation.
    
  
    Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
    
  
  2019
Generative Modeling of F0 Contours Leveraged by Phrase Structure and Its Application to Statistical Focus Control.
    
  
    Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019
    
  
    Proceedings of the Blizzard Challenge 2019, Vienna, Austria, September 23, 2019, 2019