Qiushi Zhu

Orcid: 0000-0002-1196-7781

According to our database1, Qiushi Zhu authored at least 22 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
A Joint Speech Enhancement and Self-Supervised Representation Learning Framework for Noise-Robust Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Rep2wav: Noise Robust text-to-speech Using self-supervised representations.
CoRR, 2023

Noise-aware Speech Enhancement using Diffusion Probabilistic Model.
CoRR, 2023

BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with Convolutional Cross Attention in Multi-talker Conditions.
CoRR, 2023

Speech Enhancement with Multi-granularity Vector Quantization.
CoRR, 2023

Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Robust Data2VEC: Noise-Robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023

Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Speech Enhancement with Multi-granularity Vector Quantization.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
VATLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning.
CoRR, 2022

Speech Enhancement Using Self-Supervised Pre-Trained Model and Vector Quantization.
CoRR, 2022

Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR.
CoRR, 2022

A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition.
CoRR, 2022

A Complementary Joint Training Approach Using Unpaired Speech and Text A Complementary Joint Training Approach Using Unpaired Speech and Text.
Proceedings of the Interspeech 2022, 2022

A Noise-Robust Self-Supervised Pre-Training Model Based Speech Representation Learning for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Supervised and Self-Supervised Pretraining Based Covid-19 Detection Using Acoustic Breathing/Cough/Speech Signals.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
An Improved Wav2Vec 2.0 Pre-Training Approach Using Enhanced Local Dependency Modeling for Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

2017
Approximate evaluation of average downtime under an integrated approach of opportunistic maintenance for multi-component systems.
Comput. Ind. Eng., 2017

2015
A condition-based maintenance policy for multi-component systems with a high maintenance setup cost.
OR Spectr., 2015


  Loading...