Jiarui Hai

Orcid: 0000-0001-9968-7372

According to our database¹, Jiarui Hai authored at least 20 papers between 2022 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

AVMeme Exam: A Multimodal Multilingual Multicultural Benchmark for LLMs' Contextual and Cultural Knowledge and Thinking.

[BibT_eX]

[DOI]

CoRR, January, 2026

Summary of The Inaugural Music Source Restoration Challenge.

[BibT_eX]

[DOI]

CoRR, January, 2026

2025

Adapting Speech Language Model to Singing Voice Synthesis.

[BibT_eX]

[DOI]

CoRR, December, 2025

MSRBench: A Benchmarking Dataset for Music Source Restoration.

[BibT_eX]

[DOI]

CoRR, October, 2025

CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech.

[BibT_eX]

[DOI]

Laureano Moro-Velázquez

CoRR, June, 2025

DeepSeek in Healthcare: A Survey of Capabilities, Risks, and Clinical Applications of Open-Source Large Language Models.

[BibT_eX]

[DOI]

CoRR, June, 2025

SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline.

[BibT_eX]

[DOI]

Laureano Moro-Velázquez

Jesús Villalba

Najim Dehak

CoRR, May, 2025

FlexSED: Towards Open-Vocabulary Sound Event Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2025

SynSonic: Augmenting Sound Event Detection through Text-to-Audio Diffusion ControlNet and Effective Sample Filtering.

[BibT_eX]

[DOI]

Jiarui Hai

Mounya Elhilali

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2025

EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024

Noise-robust Speech Separation with Fast Generative Correction.

[BibT_eX]

[DOI]

Helin Wang

Jesús Villalba

Laureano Moro-Velázquez

Jiarui Hai

Thomas Thebaud

Najim Dehak

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

DreamVoice: Text-Guided Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Investigating Self-Supervised Deep Representations for EEG-Based Auditory Attention Decoding.

[BibT_eX]

[DOI]

Karan Thakkar

Jiarui Hai

Mounya Elhilali

Proceedings of the IEEE International Conference on Acoustics, 2024

DPM-TSE: A Diffusion Probabilistic Model for Target Sound Extraction.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Diff-Pitcher: Diffusion-Based Singing Voice Pitch Correction.

[BibT_eX]

[DOI]

Jiarui Hai

Mounya Elhilali

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Boosting Modality Representation With Pre-Trained Models and Multi-Task Training for Multimodal Sentiment Analysis.

[BibT_eX]

[DOI]

Jiarui Hai

Yu-Jeh Liu

Mounya Elhilali

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022

Progressive Teacher-Student Training Framework for Music Tagging.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Leveraging Natural Language Processing and Time Series Models to Analyze COVID-19 Vaccination Sentiment Dynamics from Tweets.

[BibT_eX]

[DOI]

Proceedings of the AMIA 2022, 2022

Jiarui Hai

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...