Jiarui Hai

Orcid: 0000-0001-9968-7372

According to our database1, Jiarui Hai authored at least 20 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
AVMeme Exam: A Multimodal Multilingual Multicultural Benchmark for LLMs' Contextual and Cultural Knowledge and Thinking.
CoRR, January, 2026

Summary of The Inaugural Music Source Restoration Challenge.
CoRR, January, 2026

2025
Adapting Speech Language Model to Singing Voice Synthesis.
CoRR, December, 2025

MSRBench: A Benchmarking Dataset for Music Source Restoration.
CoRR, October, 2025

CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech.
CoRR, June, 2025

DeepSeek in Healthcare: A Survey of Capabilities, Risks, and Clinical Applications of Open-Source Large Language Models.
CoRR, June, 2025

SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline.
CoRR, May, 2025

FlexSED: Towards Open-Vocabulary Sound Event Detection.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2025

SynSonic: Augmenting Sound Event Detection through Text-to-Audio Diffusion ControlNet and Effective Sample Filtering.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2025

EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
Noise-robust Speech Separation with Fast Generative Correction.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

DreamVoice: Text-Guided Voice Conversion.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Investigating Self-Supervised Deep Representations for EEG-Based Auditory Attention Decoding.
Proceedings of the IEEE International Conference on Acoustics, 2024

DPM-TSE: A Diffusion Probabilistic Model for Target Sound Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Diff-Pitcher: Diffusion-Based Singing Voice Pitch Correction.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Boosting Modality Representation With Pre-Trained Models and Multi-Task Training for Multimodal Sentiment Analysis.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Progressive Teacher-Student Training Framework for Music Tagging.
Proceedings of the IEEE International Conference on Acoustics, 2022

Leveraging Natural Language Processing and Time Series Models to Analyze COVID-19 Vaccination Sentiment Dynamics from Tweets.
Proceedings of the AMIA 2022, 2022


  Loading...