We stand with Ukraine

Tianrui Wang

ORCID: 0000-0002-2765-5889

According to our database, Tianrui Wang authored at least 41 papers between 2020 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2025

Perturbation Self-Supervised Representations for Cross-Lingual Emotion TTS: Stage-Wise Modeling of Emotion and Speaker.

CoRR, October, 2025

Evaluating Bias in Spoken Dialogue LLMs for Real-World Decisions and Recommendations.

CoRR, October, 2025

Word-Level Emotional Expression Control in Zero-Shot Text-to-Speech Synthesis.

CoRR, September, 2025

Pay More Attention To Audio: Mitigating Imbalance of Cross-Modal Attention in Large Audio Language Models.

CoRR, September, 2025

MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix.

Emmanouil Benetos

CoRR, May, 2025

EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting.

CoRR, April, 2025

Characteristic-Specific Partial Fine-Tuning for Efficient Emotion and Speaker Adaptation in Codec Language Text-to-Speech Models.

CoRR, January, 2025

Emotional Style Transfer With Intensity Control in Zero-Shot TTS.

IEEE Signal Process. Lett., 2025

LORT: Locally refined convolution and Taylor transformer for monaural speech enhancement.

Speech Commun., 2025

A Hybrid Algorithm for the Regular Syndrome Decoding Problem.

IACR Cryptol. ePrint Arch., 2025

ASDA: Audio Spectrogram Differential Attention Mechanism for Self-Supervised Representation Learning.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Augment Mandarin to Cantonese Speech Databases via Retrieval-Augmented Generation and Speech Synthesis.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

A Three-Stage Beamforming with Harmonic Guidance for Multi-Channel Speech Enhancement.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

A Progressive Generation Framework with Speech Pre-trained Model for Expressive Voice Conversion.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Adapting Whisper for Code-Switching through Encoding Refining and Language-Aware Decoding.

Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

Time-Graph Frequency Representation with Singular Value Decomposition for Neural Speech Enhancement.

Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement.

Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

A Chinese Expressive Long-dialogue Speech Dataset with Scripts.

Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

Discrete Unit-based Low-latency Multi-lingual Speech Synthesis for LIMMITS'25 Challenge.

Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

Reducing the Gap Between Pretrained Speech Enhancement and Recognition Models Using a Real Speech-Trained Bridging Module.

Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

2024

VioLA: Conditional Language Models for Speech Recognition, Synthesis, and Translation.

IEEE ACM Trans. Audio Speech Lang. Process., 2024

EmoPro: A Prompt Selection Strategy for Emotional Expression in LM-based Speech Synthesis.

CoRR, 2024

Progressive Residual Extraction based Pre-training for Speech Representation Learning.

CoRR, 2024

VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing.

CoRR, 2024

Expressive Speech Synthesis with Theme-Oriented Few-Shot Learning in ICAGC 2024.

Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

Expressive Text-to-Speech with Contextual Background for ICAGC 2024.

Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

VoiCor: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

2023

Harmonic Attention for Monaural Speech Enhancement.

IEEE ACM Trans. Audio Speech Lang. Process., 2023

A Refining Underlying Information Framework for Monaural Speech Enhancement.

CoRR, 2023

VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation.

CoRR, 2023

An Adapter Based Multi-Label Pre-Training for Speech Separation and Enhancement.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Black-box Word-level Textual Adversarial Attack Based On Discrete Harris Hawks Optimization.

Proceedings of the 26th International Conference on Computer Supported Cooperative Work in Design, 2023

Exploring Decryption Failures of BIKE: New Class of Weak Keys and Key Recovery Attacks.

Proceedings of the Advances in Cryptology - CRYPTO 2023, 2023

Reimagining Public Utilities through AI-Driven User-Centric Multimodal Interaction: A Case Study on the Lighthouse System.

Proceedings of the Eleventh International Symposium of Chinese CHI, 2023

On Decoder-Only Architecture For Speech-to-Text and Large Language Model Integration.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Noise-robust Pitch Detection Based on Super-Resolution Harmonics.

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022

FusionNet: A Convolution-Transformer Fusion Network for Hyperspectral Image Classification.

Remote. Sens., 2022

Is Image Encoding Beneficial for Deep Learning in Finance?

IEEE Internet Things J., 2022

HGCN: Harmonic Gated Compensation Network for Speech Enhancement.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

Harmonic Gated Compensation Network Plus for ICASSP 2022 DNS Challenge.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

2020

Is Image Encoding Beneficial for Deep Learning in Finance? An Analysis of Image Encoding Methods for the Application of Convolutional Neural Networks in Finance.

CoRR, 2020
