Jun Ma

Orcid: 0009-0003-8713-0667

Affiliations:
  • Ping An Technology, China


According to our database, Jun Ma authored at least 15 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
EfficientTTS 2: Variational End-to-End Text-to-Speech Synthesis and Voice Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

2022
Towards Efficiently Learning Monotonic Alignments for Attention-based End-to-End Speech Recognition.
Proceedings of the Interspeech 2022, 2022

A compact transformer-based GAN vocoder.
Proceedings of the Interspeech 2022, 2022

2021
EfficientSing: A Chinese Singing Voice Synthesis System Using Duration-Free Acoustic Model and HiFi-GAN Vocoder.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Improving Polyphone Disambiguation for Mandarin Chinese by Combining Mix-Pooling Strategy and Window-Based Attention.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture.
Proceedings of the 38th International Conference on Machine Learning, 2021

Unsupervised Learning for Multi-Style Speech Synthesis with Limited Data.
Proceedings of the IEEE International Conference on Acoustics, 2021

Improving Neural Text Normalization with Partial Parameter Generator and Pointer-Generator Network.
Proceedings of the IEEE International Conference on Acoustics, 2021

SEQ-CPC: Sequential Contrastive Predictive Coding for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Contextualized Emotion Recognition in Conversation as Sequence Tagging.
Proceedings of the 21th Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2020

Improving Replay Detection System with Channel Consistency DenseNeXt for the ASVspoof 2019 Challenge.
Proceedings of the Interspeech 2020, 2020

Non-Parallel Voice Conversion with Fewer Labeled Data by Conditional Generative Adversarial Networks.
Proceedings of the Interspeech 2020, 2020

Nonparallel Emotional Speech Conversion Using VAE-GAN.
Proceedings of the Interspeech 2020, 2020

Flow-TTS: A Non-Autoregressive Network for Text to Speech Based on Flow.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Cross-Lingual, Multi-Speaker Text-To-Speech Synthesis Using Neural Speaker Embedding.
Proceedings of the Interspeech 2019, 2019

