Rui Liu

Orcid: 0000-0003-4524-7413

Affiliations:
  • Inner Mongolia University, College of Computer Science, Hohhot, China


According to our database, Rui Liu authored at least 41 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Multi-space channel representation learning for mono-to-binaural conversion based audio deepfake detection.
Inf. Fusion, May, 2024

Modified suppressed relative entropy fuzzy c-means clustering algorithm.
J. Intell. Fuzzy Syst., March, 2024

Text-to-Speech for Low-Resource Agglutinative Language With Morphology-Aware Language Model Pre-Training.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

2023
Decoupling Speaker-Independent Emotions for Voice Conversion via Source-Filter Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Distributed Sensor Selection for Speech Enhancement With Acoustic Sensor Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling.
CoRR, 2023

Learning Noise-Robust Joint Representation for Multimodal Emotion Recognition under Realistic Incomplete Data Scenarios.
CoRR, 2023

FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency.
CoRR, 2023

Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech.
CoRR, 2023

Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion.
CoRR, 2023

MnTTS2: An Open-Source Multi-Speaker Mongolian Text-to-Speech Synthesis Dataset.
CoRR, 2023

Exploiting Modality-Invariant Feature for Robust Multimodal Emotion Recognition with Missing Modalities.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Decoding Knowledge Transfer for Neural Text-to-Speech Training.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Emotional voice conversion: Theory, databases and ESD.
Speech Commun., 2022

Multistage Deep Transfer Learning for EmIoT-Enabled Human-Computer Interaction.
IEEE Internet Things J., 2022

Explicit Intensity Control for Accented Text-to-speech.
CoRR, 2022

FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis.
CoRR, 2022

A Deep Investigation of RNN and Self-attention for the Cyrillic-Traditional Mongolian Bidirectional Conversion.
CoRR, 2022

Controllable Accented Text-to-Speech Synthesis.
CoRR, 2022

Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning.
Proceedings of the Interspeech 2022, 2022

A Deep Investigation of RNN and Self-attention for the Cyrillic-Traditional Mongolian Bidirectional Conversion.
Proceedings of the Neural Information Processing - 29th International Conference, 2022

Alignment-Learning Based Single-Step Decoding for Accurate and Fast Non-Autoregressive Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

VisualTTS: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over.
Proceedings of the IEEE International Conference on Acoustics, 2022

MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline.
Proceedings of the International Conference on Asian Language Processing, 2022

2021
Expressive TTS Training With Frame and Style Reconstruction Loss.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Exploiting Morphological and Phonological Features to Improve Prosodic Phrasing for Mongolian Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

FastTalker: A neural text-to-speech architecture with shallow and group autoregression.
Neural Networks, 2021

StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis.
CoRR, 2021

Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Seen and Unseen Emotional Style Transfer for Voice Conversion with A New Emotional Speech Dataset.
Proceedings of the IEEE International Conference on Acoustics, 2021

GraphSpeech: Syntax-Aware Graph Attention Network for Neural Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2021

Mongolian emotional speech synthesis based on transfer learning and emotional embedding.
Proceedings of the International Conference on Asian Language Processing, 2021

2020
Modeling Prosodic Phrasing With Multi-Task Learning in Tacotron-Based TTS.
IEEE Signal Process. Lett., 2020

WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Teacher-Student Training For Robust Tacotron-Based TTS.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Building Mongolian TTS Front-End with Encoder-Decoder Model by Using Bridge Method and Multi-view Features.
Proceedings of the Neural Information Processing - 26th International Conference, 2019

2018
Phonologically Aware BiLSTM Model for Mongolian Phrase Break Prediction with Attention Mechanism.
Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018

End-to-End Mongolian Text-to-Speech System.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model.
Proceedings of the Interspeech 2018, 2018

A LSTM Approach with Sub-Word Embeddings for Mongolian Phrase Break Prediction.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

2016
Mongolian prosodic phrase prediction using suffix segmentation.
Proceedings of the 2016 International Conference on Asian Language Processing, 2016

