Xulong Zhang
ORCID: 0000-0001-7005-992X
Affiliations:
- LLAM | Lab of Large Audio Model, Shanghai, China
- Ping An Technology, Shenzhen, China
- Fudan University, Shanghai, China (PhD 2021)
According to our database, Xulong Zhang authored at least 56 papers between 2013 and 2024.
Links
Online presence:
- on orcid.org
- on gitlab.com
- on github.com
Bibliography
2024
ED-TTS: Multi-Scale Emotion Modeling using Cross-Domain Emotion Diarization for Emotional Speech Synthesis.
CoRR, 2024
Learning Disentangled Speech Representations with Contrastive Learning and Time-Invariant Retrieval.
CoRR, 2024
CoRR, 2024
2023
ACM Trans. Multim. Comput. Commun. Appl., 2023
Research on the Impact of Executive Shareholding on New Investment in Enterprises Based on Multivariable Linear Regression Model.
CoRR, 2023
A Hierarchy-based Analysis Approach for Blended Learning: A Case Study with Chinese Students.
CoRR, 2023
DiffTalker: Co-driven audio-image diffusion for talking faces via intermediate landmarks.
CoRR, 2023
CoRR, 2023
CoRR, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the International Joint Conference on Neural Networks, 2023
Proceedings of the 35th IEEE International Conference on Tools with Artificial Intelligence, 2023
Proceedings of the 35th IEEE International Conference on Tools with Artificial Intelligence, 2023
Proceedings of the 35th IEEE International Conference on Tools with Artificial Intelligence, 2023
Improving EEG-based Emotion Recognition by Fusing Time-Frequency and Spatial Representations.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
VQ-CL: Learning Disentangled Speech Representations with Contrastive Learning and Vector Quantization.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Improving Music Genre Classification from multi-modal Properties of Music and Genre Correlations Perspective.
Proceedings of the IEEE International Conference on Acoustics, 2023
DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation.
Proceedings of the IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2023
Proceedings of the IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2023
CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control and Contrastive Learning with Negative Samples Augmentation.
Proceedings of the IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2023
Proceedings of the Advanced Data Mining and Applications - 19th International Conference, 2023
Proceedings of the Advanced Data Mining and Applications - 19th International Conference, 2023
Proceedings of the Advanced Data Mining and Applications - 19th International Conference, 2023
2022
Proceedings of the 18th International Conference on Mobility, Sensing and Networking, 2022
Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach.
Proceedings of the 18th International Conference on Mobility, Sensing and Networking, 2022
Proceedings of the 18th International Conference on Mobility, Sensing and Networking, 2022
Proceedings of the 18th International Conference on Mobility, Sensing and Networking, 2022
Proceedings of the 18th International Conference on Mobility, Sensing and Networking, 2022
Adapitch: Adaption Multi-Speaker Text-to-Speech Conditioned on Pitch Disentangling with Untranscribed Data.
Proceedings of the 18th International Conference on Mobility, Sensing and Networking, 2022
Proceedings of the Interspeech 2022, 2022
Proceedings of the International Joint Conference on Neural Networks, 2022
Singer Identification for Metaverse with Timbral and Middle-Level Perceptual Features.
Proceedings of the International Joint Conference on Neural Networks, 2022
TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTS.
Proceedings of the International Joint Conference on Neural Networks, 2022
Proceedings of the International Joint Conference on Neural Networks, 2022
Proceedings of the International Joint Conference on Neural Networks, 2022
Pre-Avatar: An Automatic Presentation Generation Framework Leveraging Talking Avatar.
Proceedings of the 34th IEEE International Conference on Tools with Artificial Intelligence, 2022
Proceedings of the Neural Information Processing - 29th International Conference, 2022
nnSpeech: Speaker-Guided Conditional Variational Autoencoder for Zero-Shot Multi-speaker text-to-speech.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Avqvc: One-Shot Voice Conversion By Vector Quantization With Applying Contrastive Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the Computer Supported Cooperative Work and Social Computing, 2022
Proceedings of the Web and Big Data - 6th International Joint Conference, 2022
2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
TGAVC: Improving Autoencoder Voice Conversion with Text-Guided and Adversarial Training.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
2020
CoRR, 2020
Comparison for Improvements of Singing Voice Detection System Based on Vocal Separation.
CoRR, 2020
2017
2013
Probability-Symmetric Storage Allocation for Distributed Storage Systems Based on Network Coding.
Int. J. Online Biomed. Eng., 2013