Tianrui Wang
Orcid: 0000-0002-2765-5889
According to our database1,
Tianrui Wang
authored at least 41 papers
between 2020 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
Perturbation Self-Supervised Representations for Cross-Lingual Emotion TTS: Stage-Wise Modeling of Emotion and Speaker.
CoRR, October, 2025
Evaluating Bias in Spoken Dialogue LLMs for Real-World Decisions and Recommendations.
CoRR, October, 2025
CoRR, September, 2025
Pay More Attention To Audio: Mitigating Imbalance of Cross-Modal Attention in Large Audio Language Models.
CoRR, September, 2025
MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix.
CoRR, May, 2025
CoRR, April, 2025
Characteristic-Specific Partial Fine-Tuning for Efficient Emotion and Speaker Adaptation in Codec Language Text-to-Speech Models.
CoRR, January, 2025
IEEE Signal Process. Lett., 2025
LORT: Locally refined convolution and Taylor transformer for monaural speech enhancement.
Speech Commun., 2025
IACR Cryptol. ePrint Arch., 2025
ASDA: Audio Spectrogram Differential Attention Mechanism for Self-Supervised Representation Learning.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025
Augment Mandarin to Cantonese Speech Databases via Retrieval-Augmented Generation and Speech Synthesis.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025
A Three-Stage Beamforming with Harmonic Guidance for Multi-Channel Speech Enhancement.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025
A Progressive Generation Framework with Speech Pre-trained Model for Expressive Voice Conversion.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025
Adapting Whisper for Code-Switching through Encoding Refining and Language-Aware Decoding.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
Time-Graph Frequency Representation with Singular Value Decomposition for Neural Speech Enhancement.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
Discrete Unit-based Low-latency Multi-lingual Speech Synthesis for LIMMITS'25 Challenge.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
Reducing the Gap Between Pretrained Speech Enhancement and Recognition Models Using a Real Speech-Trained Bridging Module.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
2024
VioLA: Conditional Language Models for Speech Recognition, Synthesis, and Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
EmoPro: A Prompt Selection Strategy for Emotional Expression in LM-based Speech Synthesis.
CoRR, 2024
Progressive Residual Extraction based Pre-training for Speech Representation Learning.
CoRR, 2024
VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing.
CoRR, 2024
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024
VoiCor: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
2023
IEEE ACM Trans. Audio Speech Lang. Process., 2023
CoRR, 2023
VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation.
CoRR, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Black-box Word-level Textual Adversarial Attack Based On Discrete Harris Hawks Optimization.
Proceedings of the 26th International Conference on Computer Supported Cooperative Work in Design, 2023
Exploring Decryption Failures of BIKE: New Class of Weak Keys and Key Recovery Attacks.
Proceedings of the Advances in Cryptology - CRYPTO 2023, 2023
Reimagining Public Utilities through AI-Driven User-Centric Multimodal Interaction: A Case Study on the Lighthouse System.
Proceedings of the Eleventh International Symposium of Chinese CHI, 2023
On Decoder-Only Architecture For Speech-to-Text and Large Language Model Integration.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
2022
FusionNet: A Convolution-Transformer Fusion Network for Hyperspectral Image Classification.
Remote. Sens., 2022
IEEE Internet Things J., 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
2020
Is Image Encoding Beneficial for Deep Learning in Finance? An Analysis of Image Encoding Methods for the Application of Convolutional Neural Networks in Finance.
CoRR, 2020