Jian Wu

Affiliations:
  • Microsoft Corporation, USA
  • Northwestern Polytechnical University, Xi'an, China


According to our database1, Jian Wu authored at least 58 papers between 2005 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning.
CoRR, 2023

t-SOT FNT: Streaming Multi-talker ASR with Text-only Domain Adaptation Capability.
CoRR, 2023

Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss.
CoRR, 2023

Simulating Realistic Speech Overlaps Improves Multi-Talker ASR.
Proceedings of the IEEE International Conference on Acoustics, 2023

Target Speaker Voice Activity Detection with Transformers and Its Integration with End-To-End Neural Diarization.
Proceedings of the IEEE International Conference on Acoustics, 2023

Vararray Meets T-Sot: Advancing the State of the Art of Streaming Distant Conversational Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Speech Separation with Large-Scale Self-Supervised Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023

On Decoder-Only Architecture For Speech-to-Text and Large Language Model Integration.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing.
IEEE J. Sel. Top. Signal Process., 2022

Self-supervised learning with bi-label masked speech prediction for streaming multi-talker speech recognition.
CoRR, 2022

Speech separation with large-scale self-supervised learning.
CoRR, 2022

Deploying self-supervised learning in the wild for hybrid automatic speech recognition.
CoRR, 2022

Multilingual Transformer Language Model for Speech Recognition in Low-resource Languages.
Proceedings of the Ninth International Conference on Social Networks Analysis, 2022

Streaming Multi-Talker ASR with Token-Level Serialized Output Training.
Proceedings of the Interspeech 2022, 2022

Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings.
Proceedings of the Interspeech 2022, 2022

Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition?
Proceedings of the Interspeech 2022, 2022

Continuous Speech Separation with Recurrent Selective Attention Network.
Proceedings of the IEEE International Conference on Acoustics, 2022

Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, and Pretraining: an Ablation Study.
Proceedings of the IEEE International Conference on Acoustics, 2022

Unispeech-Sat: Universal Speech Representation Learning With Speaker Aware Pre-Training.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing.
CoRR, 2021

ResNeXt and Res2Net Structures for Speaker Verification.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Multi-Channel Automatic Speech Recognition Using Deep Complex Unet.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

IEEE SLT 2021 Alpha-Mini Speech Challenge: Open Datasets, Tracks, Rules and Baselines.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

DESNet: A Multi-Channel Network for Simultaneous Speech Dereverberation, Enhancement and Separation.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Investigation of Practical Aspects of Single Channel Speech Separation for ASR.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Ultra Fast Speech Separation Model with Teacher Student Learning.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Sequence-Level Confidence Classifier for ASR Utterance Accuracy and Application to Acoustic Models.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Microsoft Speaker Diarization System for the Voxceleb Speaker Recognition Challenge 2020.
Proceedings of the IEEE International Conference on Acoustics, 2021

Multi-Dialect Speech Recognition in English Using Attention on Ensemble of Experts.
Proceedings of the IEEE International Conference on Acoustics, 2021

Continuous Speech Separation with Conformer.
Proceedings of the IEEE International Conference on Acoustics, 2021

A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
ResNeXt and Res2Net Structure for Speaker Verification.
CoRR, 2020

Continuous speech separation: dataset and analysis.
CoRR, 2020

NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker Verification Challenge.
Proceedings of the Interspeech 2020, 2020

An End-to-End Architecture of Online Multi-Channel Speech Separation.
Proceedings of the Interspeech 2020, 2020

Speaker Attribution with Voice Profiles by Graph-Based Semi-Supervised Learning.
Proceedings of the Interspeech 2020, 2020

Channel-Wise Subband Input for Better Voice and Accompaniment Separation on High Resolution Music.
Proceedings of the Interspeech 2020, 2020

Fast and Slow Acoustic Model.
Proceedings of the Interspeech 2020, 2020

Bandpass Noise Generation and Augmentation for Unified ASR.
Proceedings of the Interspeech 2020, 2020

1-D Row-Convolution LSTM: Fast Streaming ASR at Accuracy Parity with LC-BLSTM.
Proceedings of the Interspeech 2020, 2020

DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement.
Proceedings of the Interspeech 2020, 2020

Improving Deep CNN Networks with Long Temporal Context for Text-Independent Speaker Verification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Audio-Visual Recognition of Overlapped Speech for the LRS2 Dataset.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Speaker Diarization with Session-Level Speaker Embedding Refinement Using Graph Neural Networks.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Adaptation of RNN Transducer with Text-To-Speech Technology for Keyword Spotting.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Continuous Speech Separation: Dataset and Analysis.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
End-to-End Multi-Channel Speech Separation.
CoRR, 2019

Improved Speaker-Dependent Separation for CHiME-5 Challenge.
Proceedings of the Interspeech 2019, 2019

A Comprehensive Study of Speech Separation: Spectrogram vs Waveform Separation.
Proceedings of the Interspeech 2019, 2019

CNN with Phonetic Attention for Text-Independent Speaker Verification.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Time Domain Audio Visual Speech Separation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2009
Cross-lingual speech recognition under runtime resource constraints.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated Noise Suppressor.
IEEE Trans. Speech Audio Process., 2008

Improvements on Mel-Frequency Cepstrum Minimum-Mean-Square-Error Noise Suppressor for Robust Speech Recognition.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

A minimum-mean-square-error noise reduction algorithm on Mel-frequency cepstra for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

Adaptation of compressed HMM parameters for resource-constrained speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

2005
Analysis and comparison of two speech feature extraction/compensation algorithms.
IEEE Signal Process. Lett., 2005


  Loading...