Wenrui Liu

Orcid: 0009-0000-5940-5369

Affiliations:

Zhejiang University, Hangzhou, China

According to our database, Wenrui Liu authored at least 16 papers between 2023 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

WavCube: Unifying Speech Representation for Understanding and Generation via Semantic-Acoustic Joint Modeling.

CoRR, May, 2026

Say More with Less: Variable-Frame-Rate Speech Tokenization via Adaptive Clustering and Implicit Duration Coding.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting.

CoRR, April, 2025

EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting.

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Speech Token Prediction via Compressed-to-fine Language Modeling for Speech Generation.

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

MelRe: Vision-Based Mel-Spectrogram Restoration.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

GTA: Towards Generative Text-To-Audio Retrieval via Multi-Scale Tokenizer.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Enhancing Expressive Voice Conversion with Discrete Pitch-Conditioned Flow Matching Model.

Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

VoxpopuliTTS: a large-scale multilingual TTS corpus for zero-shot speech generation.

Proceedings of the 31st International Conference on Computational Linguistics, 2025

Analyzing and Mitigating Inconsistency in Discrete Speech Tokens for Neural Codec Language Models.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

CART: A Generative Cross-Modal Retrieval Framework With Coarse-To-Fine Semantic Modeling.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Analyzing and Mitigating Inconsistency in Discrete Audio Tokens for Neural Codec Language Models.

CoRR, 2024

ACE: A Generative Cross-Modal Retrieval Framework with Coarse-To-Fine Semantic Modeling.

CoRR, 2024

Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt.

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Sequential Style Consistency Learning for Domain-Generalizable Text Recognition.

Pengcheng Zhang

Proceedings of the Artificial Intelligence - Third CAAI International Conference, 2023
