Szu-Wei Fu

Orcid: 0000-0002-3487-8212

According to our database1, Szu-Wei Fu authored at least 51 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech.
CoRR, 2024

2023
Deep Learning-Based Non-Intrusive Multi-Objective Speech Assessment Model With Cross-Domain Features.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Multi-objective Non-intrusive Hearing-aid Speech Assessment Model.
CoRR, 2023

A Study on Incorporating Whisper for Robust Speech Assessment.
CoRR, 2023

QuAVF: Quality-aware Audio-Visual Fusion for Ego4D Talking to Me Challenge.
CoRR, 2023

Real-Time Speech Interruption Analysis: from Cloud to Client Deployment.
Proceedings of the IEEE International Conference on Acoustics, 2023

Study on the Correlation Between Objective Evaluations and Subjective Speech Quality and Intelligibility.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application.
IEEE Access, 2022

Improving Meeting Inclusiveness using Speech Interruption Analysis.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

MTI-Net: A Multi-Target Speech Intelligibility Prediction Model.
Proceedings of the Interspeech 2022, 2022

OSSEM: one-shot speaker adaptive speech enhancement using meta learning.
Proceedings of the Interspeech 2022, 2022

Boosting Self-Supervised Embeddings for Speech Enhancement.
Proceedings of the Interspeech 2022, 2022

Perceptual Contrast Stretching on Target Feature for Speech Enhancement.
Proceedings of the Interspeech 2022, 2022

MetricGAN-U: Unsupervised Speech Enhancement/ Dereverberation Based Only on Noisy/ Reverberated Speech.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
A Study of Joint Effect on Denoising Techniques and Visual Cues to Improve Speech Intelligibility in Cochlear Implant Simulation.
IEEE Trans. Cogn. Dev. Syst., 2021

SpeechBrain: A General-Purpose Speech Toolkit.
CoRR, 2021

Improving Perceptual Quality by Phone-Fortified Perceptual Loss Using Wasserstein Distance for Speech Enhancement.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

2020
Learning With Learned Loss Function: Speech Enhancement With Quality-Net to Improve Perceptual Evaluation of Speech Quality.
IEEE Signal Process. Lett., 2020

Improving Perceptual Quality by Phone-Fortified Perceptual Loss for Speech Enhancement.
CoRR, 2020

Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing.
CoRR, 2020

iMetricGAN: Intelligibility Enhancement for Speech-in-Noise Using Generative Adversarial Network-Based Metric Learning.
Proceedings of the Interspeech 2020, 2020

STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment Model.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Boosting Objective Scores of a Speech Enhancement Model by MetricGAN Post-processing.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Increasing Compactness of Deep Learning Based Speech Enhancement Models With Parameter Pruning and Quantization Techniques.
IEEE Signal Process. Lett., 2019

Time-Domain Multi-modal Bone/air Conducted Speech Enhancement.
CoRR, 2019

Seeing Voices in Noise: A Study of Audiovisual-Enhanced Vocoded Speech Intelligibility in Cochlear Implant Simulation.
CoRR, 2019

Improving the Intelligibility of Electric and Acoustic Stimulation Speech Using Fully Convolutional Networks Based Speech Enhancement.
CoRR, 2019

Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional Networks.
CoRR, 2019

Noise Reduction in ECG Signals Using Fully Convolutional Denoising Autoencoders.
IEEE Access, 2019

Specialized Speech Enhancement Model Selection Based on Learned Non-Intrusive Quality Assessment Metric.
Proceedings of the Interspeech 2019, 2019

MOSNet: Deep Learning-Based Objective Assessment for Voice Conversion.
Proceedings of the Interspeech 2019, 2019

IA-NET: Acceleration and Compression of Speech Enhancement Using Integer-Adder Deep Neural Network.
Proceedings of the Interspeech 2019, 2019

MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement.
Proceedings of the 36th International Conference on Machine Learning, 2019

2018
End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

A Study on Speech Enhancement Using Exponent-Only Floating Point Quantized Neural Network (EOFP-QNN).
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model Based on BLSTM.
Proceedings of the Interspeech 2018, 2018

2017
Joint Dictionary Learning-Based Non-Negative Matrix Factorization for Voice Conversion to Improve Speech Intelligibility After Oral Surgery.
IEEE Trans. Biomed. Eng., 2017

End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks.
CoRR, 2017

Multi-Metrics Learning for Speech Enhancement.
CoRR, 2017

Complex spectrogram enhancement by convolutional neural network with multi-metrics learning.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

Raw waveform-based speech enhancement by fully convolutional networks.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Collagen image compression using the JPEG-based predictive lossless coding scheme.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Maximum Entropy Learning with Deep Belief Networks.
Entropy, 2016

SNR-Aware Convolutional Neural Network Modeling for Speech Enhancement.
Proceedings of the Interspeech 2016, 2016

2015
Horizontal adaptive disparity estimation scheme for stereoscopic images.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
Compression for the feature points with binary descriptors.
Proceedings of the 19th International Conference on Digital Signal Processing, 2014

Image deblurring using a pyramid-based Richardson-Lucy algorithm.
Proceedings of the 19th International Conference on Digital Signal Processing, 2014

A novel compression algorithm for IMFs of Hilbert-Huang transform.
Proceedings of the 19th International Conference on Digital Signal Processing, 2014

End-point preserved stroke extraction.
Proceedings of the International Conference on Audio, 2014


  Loading...