Shuiyuan Wang

According to our database1, Shuiyuan Wang authored at least 16 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
HumDial-EIBench: A Human-Recorded Multi-Turn Emotional Intelligence Benchmark for Audio Language Models.
CoRR, April, 2026

FastTurn: Unifying Acoustic and Streaming Semantic Cues for Low-Latency and Robust Turn Detection.
CoRR, April, 2026

OSUM-Pangu: An Open-Source Multidimension Speech Understanding Foundation Model Built upon OpenPangu on Ascend NPUs.
CoRR, March, 2026

The ICASSP 2026 HumDial Challenge: Benchmarking Human-like Spoken Dialogue Systems in the LLM Era.
CoRR, January, 2026

WenetSpeech-Yue: A Large-Scale Cantonese Speech Corpus with Multi-dimensional Annotation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Serial-Parallel Dual-Path Architecture for Speaking Style Recognition.
CoRR, October, 2025

Easy Turn: Integrating Acoustic and Linguistic Modalities for Robust Turn-Taking in Full-Duplex Spoken Dialogue Systems.
CoRR, September, 2025

WenetSpeech-Chuan: A Large-Scale Sichuanese Corpus with Rich Annotation for Dialectal Speech Processing.
CoRR, September, 2025

OSUM-EChat: Enhancing End-to-End Empathetic Spoken Chatbot via Understanding-Driven Spoken Dialogue.
CoRR, August, 2025

AISHELL-5: The First Open-Source In-Car Multi-Channel Multi-Speaker Speech Dataset for Automatic Speech Diarization and Recognition.
CoRR, May, 2025

Steering Language Model to Stable Speech Emotion Recognition via Contextual Perception and Chain of Thought.
CoRR, February, 2025

OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia.
CoRR, January, 2025

AISHELL-5: The First Open-Source In-Car Multi-Channel Multi-Speaker Speech Dataset for Automatic Speech Diarization and Recognition.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

2023
Video object segmentation based on temporal frame context information fusion and feature enhancement.
Appl. Intell., March, 2023

2022
Multioperation Mode Ferroelectric Channel Devices for Memory and Computation.
Adv. Intell. Syst., 2022

2020
Neuromorphic Engineering for Hardware Computational Acceleration and Biomimetic Perception Motion Integration.
Adv. Intell. Syst., 2020


  Loading...