Dingdong Wang

Orcid: 0009-0001-5091-3452

According to our database¹, Dingdong Wang authored at least 17 papers between 2024 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Smaller Models are Natural Explorers for Policy-Level Diversity in GRPO.

[BibT_eX]

[DOI]

CoRR, May, 2026

AVBench: Human-Aligned and Automated Evaluation Benchmark for Audio-Video Generative Models.

[BibT_eX]

[DOI]

CoRR, May, 2026

V2A-DPO: Omni-Preference Optimization for Video-to-Audio Generation.

[BibT_eX]

[DOI]

CoRR, March, 2026

EmotionThinker: Prosody-Aware Reinforcement Learning for Explainable Speech Emotion Reasoning.

[BibT_eX]

[DOI]

CoRR, January, 2026

A Physics-Informed Deep Learning Method for Quantitative Evaluation of Artificial Precipitation Enhancement Effects.

[BibT_eX]

[DOI]

Int. J. Intell. Syst., 2026

2025

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance.

[BibT_eX]

[DOI]

CoRR, December, 2025

MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning Benchmark.

[BibT_eX]

[DOI]

CoRR, June, 2025

Speech Discrete Tokens or Continuous Features? A Comparative Analysis for Spoken Language Understanding in SpeechLLMs.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SocialCC: Interactive Evaluation for Cultural Competence in Language Agents.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

InSerter: Speech Instruction Following with Unsupervised Interleaved Pre-training.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Language-Codec: Bridging Discrete Codec Representations and Speech Language Models.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

A Comparative Study of Discrete Speech Tokens for Semantic-Related Tasks with Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions.

[BibT_eX]

[DOI]

CoRR, 2024

Exploring SSL Discrete Tokens for Multilingual ASR.

[BibT_eX]

[DOI]

CoRR, 2024

SimpleSpeech: Towards Simple and Efficient Text-to-Speech with Scalar Latent Transformer Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

CoLM-DSR: Leveraging Neural Codec Language Modeling for Multi-Modal Dysarthric Speech Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Dingdong Wang

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...