Jianwei Yu

Orcid: 0000-0002-2449-1436

According to our database1, Jianwei Yu authored at least 83 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Continuous Target Speech Extraction: Enhancing Personalized Diarization and Extraction on Complex Recordings.
CoRR, 2024

SECap: Speech Emotion Captioning with Large Language Model.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Diffsound: Discrete Diffusion Model for Text-to-Sound Generation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Integrating Lattice-Free MMI Into End-to-End Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Music Source Separation With Band-Split RNN.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Consistent and Relevant: Rethink the Query Embedding in General Sound Separation.
CoRR, 2023

Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction.
CoRR, 2023

AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data.
CoRR, 2023

Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker Recognition.
CoRR, 2023

Improved Factorized Neural Transducer Model For text-only Domain Adaptation.
CoRR, 2023

Complexity Scaling for Speech Denoising.
CoRR, 2023

Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression.
CoRR, 2023

Bayes Risk Transducer: Transducer with Controllable Alignment Prediction.
CoRR, 2023

The Sound Demixing Challenge 2023 - Cinematic Demixing Track.
CoRR, 2023

The Sound Demixing Challenge 2023 - Music Demixing Track.
CoRR, 2023

Use of Speech Impairment Severity for Dysarthric Speech Recognition.
CoRR, 2023

The MineTrans Systems for IWSLT 2023 Offline Speech Translation and Speech-to-Speech Translation Tasks.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

Bayes Risk CTC: Controllable CTC Alignment in Sequence-to-Sequence Tasks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Efficient Monaural Speech Enhancement with Universal Sample Rate Band-Split RNN.
Proceedings of the IEEE International Conference on Acoustics, 2023

TSpeech-AI System Description to the 5th Deep Noise Suppression (DNS) Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

Vision for the 12 LABOURS Digital Twin Platform.
Proceedings of the 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2023

2022
Neural Architecture Search for LF-MMI Trained Time Delay Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Improving Mandarin End-to-End Speech Recognition With Word N-Gram Language Model.
IEEE Signal Process. Lett., 2022

IMU-Aided Registration of MLS Point Clouds Using Inertial Trajectory Error Model and Least Squares Optimization.
Remote. Sens., 2022

Ability boosted knowledge tracing.
Inf. Sci., 2022

NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS.
CoRR, 2022

Music Source Separation with Band-split RNN.
CoRR, 2022

FRA-RIR: Fast Random Approximation of the Image-source Method.
CoRR, 2022

LAE: Language-Aware Encoder for Monolingual and Multilingual ASR.
CoRR, 2022

Integrate Lattice-Free MMI into End-to-End Speech Recognition.
CoRR, 2022

On-the-fly Feature Based Speaker Adaptation for Dysarthric and Elderly Speech Recognition.
CoRR, 2022

Improving Target Sound Extraction with Timestamp Information.
Proceedings of the Interspeech 2022, 2022

LAE: Language-Aware Encoder for Monolingual and Multilingual ASR.
Proceedings of the Interspeech 2022, 2022

ASR-Robust Natural Language Understanding on ASR-GLUE dataset.
Proceedings of the Interspeech 2022, 2022

Automatic Prosody Annotation with Pre-Trained Text-Speech Model.
Proceedings of the Interspeech 2022, 2022

Multi-Channel Speaker Diarization Using Spatial Features for Meetings.
Proceedings of the IEEE International Conference on Acoustics, 2022

Mixed Precision DNN Quantization for Overlapped Speech Separation and Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Consistent Training and Decoding for End-to-End Speech Recognition Using Lattice-Free MMI.
Proceedings of the IEEE International Conference on Acoustics, 2022

Audio-Visual Multi-Channel Speech Separation, Dereverberation and Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Audio-Visual Multi-Channel Integration and Recognition of Overlapped Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Mixed Precision Low-Bit Quantization of Neural Network Language Models for Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Recent Progress in the CUHK Dysarthric Speech Recognition System.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Spherically Optimized RANSAC Aided by an IMU for Fisheye Image Matching.
Remote. Sens., 2021

Mixed Precision DNN Qunatization for Overlapped Speech Separation and Recognition.
CoRR, 2021

ASR-GLUE: A New Multi-task Benchmark for ASR-Robust Natural Language Understanding.
CoRR, 2021

Deconvolutional Networks on Graph Data.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Improved End-to-End Dysarthric Speech Recognition via Meta-learning Based Model Re-initialization.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

TeCANet: Temporal-Contextual Attention Network for Environment-Aware Speech Dereverberation.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Adversarial Data Augmentation for Disordered Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Bayesian Parametric and Architectural Domain Adaptation of LF-MMI Trained TDNNs for Elderly and Dysarthric Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

A Joint Training Framework of Multi-Look Separator and Speaker Embedding Extractor for Overlapped Speech.
Proceedings of the IEEE International Conference on Acoustics, 2021

Development of the Cuhk Elderly Speech Recognition System for Neurocognitive Disorder Detection Using the Dementiabank Corpus.
Proceedings of the IEEE International Conference on Acoustics, 2021

Bayesian Transformer Language Models for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Mixed Precision Quantization of Transformer Language Models for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

A Comparative Study of Acoustic and Linguistic Features Classification for Alzheimer's Disease Detection.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Sewer Pipeline Fault Identification Using Anomaly Detection Algorithms on Video Sequences.
IEEE Access, 2020

Bayesian x-vector: Bayesian Neural Network based x-vector System for Speaker Verification.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Dirichlet Graph Variational Autoencoder.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Audio-Visual Multi-Channel Recognition of Overlapped Speech.
Proceedings of the Interspeech 2020, 2020

Exploiting Cross-Domain Visual Feature Generation for Disordered Speech Recognition.
Proceedings of the Interspeech 2020, 2020

Investigation of Data Augmentation Techniques for Disordered Speech Recognition.
Proceedings of the Interspeech 2020, 2020

Audio-Visual Recognition of Overlapped Speech for the LRS2 Dataset.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Low-bit Quantization of Recurrent Neural Network Language Models Using Alternating Direction Methods of Multipliers.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

End-To-End Voice Conversion Via Cross-Modal Knowledge Distillation for Dysarthric Speech Reconstruction.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Adversarial Attacks on GMM I-Vector Based Speaker Verification Systems.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
An Effective Method for Submarine Buried Pipeline Detection via Multi-Sensor Data Fusion.
IEEE Access, 2019

Comparative Study of Parametric and Representation Uncertainty Modeling for Recurrent Neural Network Language Models.
Proceedings of the Interspeech 2019, 2019

Exploiting Visual Features Using Bayesian Gated Neural Networks for Disordered Speech Recognition.
Proceedings of the Interspeech 2019, 2019

LF-MMI Training of Bayesian and Gaussian Process Time Delay Neural Networks for Speech Recognition.
Proceedings of the Interspeech 2019, 2019

The CUHK Dysarthric Speech Recognition Systems for English and Cantonese.
Proceedings of the Interspeech 2019, 2019

Recurrent Neural Network Language Model Training Using Natural Gradient.
Proceedings of the IEEE International Conference on Acoustics, 2019

Speech Emotion Recognition Using Capsule Networks.
Proceedings of the IEEE International Conference on Acoustics, 2019

Gaussian Process Lstm Recurrent Neural Network Language Models for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Bayesian and Gaussian Process Neural Networks for Large Vocabulary Continuous Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

End-to-end Code-switched TTS with Mix of Monolingual Recordings.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus.
Proceedings of the Interspeech 2018, 2018

Gaussian Process Neural Networks for Speech Recognition.
Proceedings of the Interspeech 2018, 2018

Limited-Memory BFGS Optimization of Recurrent Neural Network Language Models for Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2015
A Least Squares Collocation Method for Accuracy Improvement of Mobile LiDAR Systems.
Remote. Sens., 2015

2009
A Case Study on Government Procurement Processes Identifying.
Proceedings of the Second International Workshop on Knowledge Discovery and Data Mining, 2009

2008
Exploring Influencing Factors in E-Commerce Transaction Behaviors.
Proceedings of The International Symposium on Electronic Commerce and Security, 2008


  Loading...