Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, and Pretraining: an Ablation Study.

[BibT_eX]

[DOI]

Daniel Tompkins

Kshitiz Kumar

Jian Wu

Proceedings of the IEEE International Conference on Acoustics, 2022

Unispeech-Sat: Universal Speech Representation Learning With Speaker Aware Pre-Training.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing.

[BibT_eX]

[DOI]

CoRR, 2021

ResNeXt and Res2Net Structures for Speaker Verification.

[BibT_eX]

[DOI]

Tianyan Zhou

Yong Zhao

Jian Wu

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Multi-Channel Automatic Speech Recognition Using Deep Complex Unet.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

IEEE SLT 2021 Alpha-Mini Speech Challenge: Open Datasets, Tracks, Rules and Baselines.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

DESNet: A Multi-Channel Network for Simultaneous Speech Dereverberation, Enhancement and Separation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Investigation of Practical Aspects of Single Channel Speech Separation for ASR.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Ultra Fast Speech Separation Model with Teacher Student Learning.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Sequence-Level Confidence Classifier for ASR Utterance Accuracy and Application to Acoustic Models.

[BibT_eX]

[DOI]

Amber Afshan

Kshitiz Kumar

Jian Wu

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Microsoft Speaker Diarization System for the Voxceleb Speaker Recognition Challenge 2020.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Multi-Dialect Speech Recognition in English Using Attention on Ensemble of Experts.

[BibT_eX]

[DOI]

Amit Das

Kshitiz Kumar

Jian Wu

Proceedings of the IEEE International Conference on Acoustics, 2021

Continuous Speech Separation with Conformer.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020

ResNeXt and Res2Net Structure for Speaker Verification.

[BibT_eX]

[DOI]

Tianyan Zhou

Yong Zhao

Jian Wu

CoRR, 2020

Continuous speech separation: dataset and analysis.

[BibT_eX]

[DOI]

CoRR, 2020

NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker Verification Challenge.

[BibT_eX]

[DOI]

Li Zhang

Jian Wu

Lei Xie

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

An End-to-End Architecture of Online Multi-Channel Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Speaker Attribution with Voice Profiles by Graph-Based Semi-Supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Channel-Wise Subband Input for Better Voice and Accompaniment Separation on High Resolution Music.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Fast and Slow Acoustic Model.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Bandpass Noise Generation and Augmentation for Unified ASR.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

1-D Row-Convolution LSTM: Fast Streaming ASR at Accuracy Parity with LC-BLSTM.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Improving Deep CNN Networks with Long Temporal Context for Text-Independent Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Audio-Visual Recognition of Overlapped Speech for the LRS2 Dataset.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Speaker Diarization with Session-Level Speaker Embedding Refinement Using Graph Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Adaptation of RNN Transducer with Text-To-Speech Technology for Keyword Spotting.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Continuous Speech Separation: Dataset and Analysis.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

End-to-End Multi-Channel Speech Separation.

[BibT_eX]

[DOI]

CoRR, 2019

Improved Speaker-Dependent Separation for CHiME-5 Challenge.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

A Comprehensive Study of Speech Separation: Spectrogram vs Waveform Separation.

[BibT_eX]

[DOI]

Fahimeh Bahmaninezhad

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

CNN with Phonetic Attention for Text-Independent Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Time Domain Audio Visual Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2009

Cross-lingual speech recognition under runtime resource constraints.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated Noise Suppressor.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2008

Improvements on Mel-Frequency Cepstrum Minimum-Mean-Square-Error Noise Suppressor for Robust Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

A minimum-mean-square-error noise reduction algorithm on Mel-frequency cepstra for robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

Adaptation of compressed HMM parameters for resource-constrained speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

2005

Analysis and comparison of two speech feature extraction/compensation algorithms.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2005

Jian Wu

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...