Ye-Xin Lu

This page is a disambiguation page, it actually contains mutiple papers from persons of the same or a similar name.

Bibliography

2025
Improving Noise Robustness of LLM-based Zero-shot TTS via Discrete Acoustic Token Denoising.
CoRR, May, 2025

Explicit estimation of magnitude and phase spectra in parallel for high-quality speech enhancement.
Neural Networks, 2025

Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Can Automated Speech Recognition Errors Provide Valuable Clues for Alzheimer's Disease Detection?
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
APCodec: A Neural Audio Codec With Parallel Amplitude and Phase Spectrum Encoding and Decoding.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

ESTVocoder: An Excitation-Spectral-Transformed Neural Vocoder Conditioned on Mel Spectrogram.
CoRR, 2024

Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate Control.
CoRR, 2024

Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction.
CoRR, 2024

Pitch-and-Spectrum-Aware Singing Quality Assessment with Bias Correction and Model Fusion.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Stage-Wise and Prior-Aware Neural Speech Phase Prediction.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

MDCTCodec: A Lightweight MDCT-Based Neural Audio Codec Towards High Sampling Rate and Low Bitrate Scenarios.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

SAMOS: A Neural MOS Prediction Model Leveraging Semantic Representations and Acoustic Features.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

MultiStage Speech Bandwidth Extension with Flexible Sampling Rate Control.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

BiVocoder: A Bidirectional Neural Vocoder Integrating Feature Extraction and Waveform Generation.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

A Low-Bitrate Neural Audio Codec Framework with Bandwidth Reduction and Recovery for High-Sampling-Rate Waveforms.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

2023
Long-Frame-Shift Neural Speech Phase Prediction With Spectral Continuity Enhancement and Interpolation Error Compensation.
IEEE Signal Process. Lett., 2023

Source-Filter-Based Generative Adversarial Neural Vocoder for High Fidelity Speech Synthesis.
CoRR, 2023

Nurturing Eco-Consciousness: The Journey of the EcoMorph Guardian in Shaping Tomorrow's Stewards.
Proceedings of the 2023 Symposium on Learning, Design and Technology, 2023

MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

The USTC-NERCSLIP System for the Track 1.2 of Audio Deepfake Detection (ADD 2023) Challenge.
Proceedings of the Workshop on Deepfake Audio Detection and Analysis co-located with 32th International Joint Conference on Artificial Intelligence (IJCAI 2023), 2023


  Loading...