Ruoyu Wang

Affiliations:
  • University of Science and Technology of China, Hefei, China


According to our database, Ruoyu Wang authored at least 20 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Three-stage modular speaker diarization collaborating with front-end techniques in the CHiME-8 NOTSOFAR-1 challenge.
Comput. Speech Lang., 2026

2025
Exploring Speaker Diarization with Mixture of Experts.
CoRR, June, 2025

Latent Swap Joint Diffusion for Long-Form Audio Generation.
CoRR, February, 2025

QA-MDT: Quality-aware Masked Diffusion Transformer for Enhanced Music Generation.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Incorporating Spatial Cues in Modular Speaker Diarization for Multi-channel Multi-party Meetings.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
DCF-DS: Deep Cascade Fusion of Diarization and Separation for Speech Recognition under Realistic Single-Channel Conditions.
CoRR, 2024

The USTC-NERCSLIP Systems for the CHiME-8 NOTSOFAR-1 Challenge.
CoRR, 2024

Quality-aware Masked Diffusion Transformer for Enhanced Music Generation.
CoRR, 2024

Multitask frame-level learning for few-shot sound event detection.
CoRR, 2024

A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition.
CoRR, 2024

Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture.
Proceedings of the IEEE International Conference on Acoustics, 2024

The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2024

Implicit Enhancement of Target Speaker in Speaker-Adaptive ASR through Efficient Joint Optimization.
Proceedings of the IEEE International Conference on Acoustics, 2024

A Spatial Long-Term Iterative Mask Estimation Approach for Multi-Channel Speaker Diarization and Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction.
CoRR, 2023

The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge.
CoRR, 2023

Quantum Transfer Learning Using the Large-Scale Unsupervised Pre-Trained Model Wavlm-Large for Synthetic Speech Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023

Enhancing Privacy Preservation with Quantum Computing for Word-Level Audio-Visual Speech Recognition.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
External Text Based Data Augmentation for Low-Resource Speech Recognition in the Constrained Condition of OpenASR21 Challenge.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

