Heng Lu

Orcid: 0009-0009-9236-8825

Affiliations:
  • Tencent AI Lab, China


According to our database, Heng Lu authored at least 28 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
METTS: Multilingual Emotional Text-to-Speech by Cross-Speaker and Cross-Lingual Emotion Transfer.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

GMP-ATL: Gender-augmented Multi-scale Pseudo-label Enhanced Adaptive Transfer Learning for Speech Emotion Recognition via HuBERT.
CoRR, 2024

Promptvc: Flexible Stylistic Voice Conversion in Latent Space Driven by Natural Language Prompts.
Proceedings of the IEEE International Conference on Acoustics, 2024

GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Accurate Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Vec-Tok Speech: speech vectorization and tokenization for neural speech generation.
CoRR, 2023

GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Speech Emotion Recognition.
CoRR, 2023

Exploring the Power of Cross-Contextual Large Language Model in Mimic Emotion Prediction.
Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, 2023

Multimodal Cross-Lingual Features and Weight Fusion for Cross-Cultural Humor Detection.
Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, 2023

Hybridformer: Improving Squeezeformer with Hybrid Attention and NSR Mechanism.
Proceedings of the IEEE International Conference on Acoustics, 2023

PP-MET: A Real-World Personalized Prompt Based Meeting Transcription System.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Salt: Distinguishable Speaker Anonymization Through Latent Space Transformation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
LMEC: Learnable Multiplicative Absolute Position Embedding Based Conformer for Speech Recognition.
CoRR, 2022

Efficient Text Analysis with Pre-Trained Neural Network Models.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Improving Cross-Lingual Speech Synthesis with Triplet Training Scheme.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
FeatherTTS: Robust and Efficient attention based Neural TTS.
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021

2020
On the localness modeling for the self-attention based end-to-end speech synthesis.
Neural Networks, 2020

Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training.
CoRR, 2020

AdaDurIAN: Few-shot Adaptation for Neural Text-to-Speech with DurIAN.
CoRR, 2020

DurIAN-SC: Duration Informed Attention Network Based Singing Voice Conversion System.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

DurIAN: Duration Informed Attention Network for Speech Synthesis.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Peking Opera Synthesis via Duration Informed Attention Network.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

FeatherWave: An Efficient High-Fidelity Neural Vocoder with Multi-Band Linear Prediction.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Pitchnet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

The Tencent speech synthesis system for Blizzard Challenge 2020.
Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

2019
Synthesising Expressiveness in Peking Opera via Duration Informed Attention Network.
CoRR, 2019

Learning Singing From Speech.
CoRR, 2019

DurIAN: Duration Informed Attention Network For Multimodal Synthesis.
CoRR, 2019

Enhancing Hybrid Self-attention Structure with Relative-position-aware Bias for Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2019

