Ye Jia

ORCID: 0000-0002-0457-8083

According to our database, Ye Jia authored at least 32 papers between 2018 and 2023.

Bibliography

2023
Toward an Edu-Metaverse of Knowledge: Immersive Exploration of University Courses.
IEEE Trans. Learn. Technol., December, 2023

Large Scale Foundation Models for Intelligent Manufacturing Applications: A Survey.
CoRR, 2023

From Classroom to Metaverse: A Study on Gamified Constructivist Teaching in Higher Education.
Proceedings of the Advances in Web-Based Learning - ICWL 2023, 2023

Textless Direct Speech-to-Speech Translation with Discrete Speech Representation.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Speech Aware Dialog System Technology Challenge (DSTC11).
CoRR, 2022

mSLAM: Massively multilingual joint pre-training for speech and text.
CoRR, 2022

CVSS Corpus and Massively Multilingual Speech-to-Speech Translation.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation.
Proceedings of the Interspeech 2022, 2022

Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks.
Proceedings of the Interspeech 2022, 2022


Translatotron 2: High-quality direct speech-to-speech translation with voice preservation.
Proceedings of the International Conference on Machine Learning, 2022

More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training.
CoRR, 2021

Translatotron 2: Robust direct speech-to-speech translation.
CoRR, 2021

PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Parallel Tacotron: Non-Autoregressive and Controllable TTS.
Proceedings of the IEEE International Conference on Acoustics, 2021

Textual Echo Cancellation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling.
CoRR, 2020

Improved Noisy Student Training for Automatic Speech Recognition.
Proceedings of the Interspeech 2020, 2020

2019
The ASVspoof 2019 database.
CoRR, 2019

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling.
CoRR, 2019

Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning.
Proceedings of the Interspeech 2019, 2019

LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech.
Proceedings of the Interspeech 2019, 2019

VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking.
Proceedings of the Interspeech 2019, 2019

Direct Speech-to-Speech Translation with a Sequence-to-Sequence Model.
Proceedings of the Interspeech 2019, 2019

Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation.
Proceedings of the Interspeech 2019, 2019

Hierarchical Generative Modeling for Controllable Speech Synthesis.
Proceedings of the 7th International Conference on Learning Representations, 2019

Leveraging Weakly Supervised Data to Improve End-to-end Speech-to-text Translation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Speech Recognition with Augmented Synthesized Speech.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis.
Proceedings of the 35th International Conference on Machine Learning, 2018

