Shuiyuan Wang

According to our database¹, Shuiyuan Wang authored at least 16 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2026

HumDial-EIBench: A Human-Recorded Multi-Turn Emotional Intelligence Benchmark for Audio Language Models.

CoRR, April, 2026

FastTurn: Unifying Acoustic and Streaming Semantic Cues for Low-Latency and Robust Turn Detection.

CoRR, April, 2026

OSUM-Pangu: An Open-Source Multidimension Speech Understanding Foundation Model Built upon OpenPangu on Ascend NPUs.

CoRR, March, 2026

The ICASSP 2026 HumDial Challenge: Benchmarking Human-like Spoken Dialogue Systems in the LLM Era.

CoRR, January, 2026

WenetSpeech-Yue: A Large-Scale Cantonese Speech Corpus with Multi-dimensional Annotation.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Serial-Parallel Dual-Path Architecture for Speaking Style Recognition.

CoRR, October, 2025

Easy Turn: Integrating Acoustic and Linguistic Modalities for Robust Turn-Taking in Full-Duplex Spoken Dialogue Systems.

CoRR, September, 2025

WenetSpeech-Chuan: A Large-Scale Sichuanese Corpus with Rich Annotation for Dialectal Speech Processing.

CoRR, September, 2025

OSUM-EChat: Enhancing End-to-End Empathetic Spoken Chatbot via Understanding-Driven Spoken Dialogue.

CoRR, August, 2025

AISHELL-5: The First Open-Source In-Car Multi-Channel Multi-Speaker Speech Dataset for Automatic Speech Diarization and Recognition.

CoRR, May, 2025

Steering Language Model to Stable Speech Emotion Recognition via Contextual Perception and Chain of Thought.

CoRR, February, 2025

OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia.

CoRR, January, 2025

AISHELL-5: The First Open-Source In-Car Multi-Channel Multi-Speaker Speech Dataset for Automatic Speech Diarization and Recognition.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

2023

Video object segmentation based on temporal frame context information fusion and feature enhancement.

Appl. Intell., March, 2023

2022

Multioperation Mode Ferroelectric Channel Devices for Memory and Computation.

Adv. Intell. Syst., 2022

2020

Neuromorphic Engineering for Hardware Computational Acceleration and Biomimetic Perception Motion Integration.

Adv. Intell. Syst., 2020
