Dingdong Wang

According to our database, Dingdong Wang authored at least 10 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning Benchmark.
CoRR, June, 2025

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SocialCC: Interactive Evaluation for Cultural Competence in Language Agents.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

InSerter: Speech Instruction Following with Unsupervised Interleaved Pre-training.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Language-Codec: Bridging Discrete Codec Representations and Speech Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
A Comparative Study of Discrete Speech Tokens for Semantic-Related Tasks with Large Language Models.
CoRR, 2024

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions.
CoRR, 2024

Exploring SSL Discrete Tokens for Multilingual ASR.
CoRR, 2024

SimpleSpeech: Towards Simple and Efficient Text-to-Speech with Scalar Latent Transformer Diffusion Models.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

CoLM-DSR: Leveraging Neural Codec Language Modeling for Multi-Modal Dysarthric Speech Reconstruction.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

