Shuai Wang

Orcid: 0000-0003-1523-9631

Affiliations:
  • Shanghai Jiao Tong University, Department of Computer Science and Engineering, China


According to our database1, Shuai Wang authored at least 42 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Attention-Based Encoder-Decoder End-to-End Neural Diarization With Embedding Enhancer.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

2023
Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor.
CoRR, 2023

Wespeaker: A Research and Production Oriented Speaker Embedding Learning Toolkit.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
DF-ResNet: Boosting Speaker Verification Performance with Depth-First Design.
Proceedings of the Interspeech 2022, 2022

Self-Knowledge Distillation via Feature Enhancement for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Audio-Visual Deep Neural Network for Robust Person Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Voice Activity Detection in the Wild: A Data-Driven Approach Using Teacher-Student Training.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Revisiting the Statistics Pooling Layer in Deep Speaker Embedding Learning.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Speaker Embedding Augmentation with Noise Distribution Matching.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Unit Selection Synthesis Based Data Augmentation for Fixed Phrase Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2021

SynAug: Synthesis-Based Data Augmentation for Text-Dependent Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2021

Self-Supervised Learning Based Domain Adaptation for Robust Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Data Augmentation Using Deep Generative Models for Embedding Based Speaker Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

End-to-End Speaker-Dependent Voice Activity Detection.
CoRR, 2020


Dual-Adversarial Domain Adaptation for Generalized Replay Attack Detection.
Proceedings of the Interspeech 2020, 2020

Adversarial Domain Adaptation for Speaker Verification Using Partially Shared Network.
Proceedings of the Interspeech 2020, 2020

Multi-Modality Matters: A Performance Leap on VoxCeleb.
Proceedings of the Interspeech 2020, 2020

Text Adaptation for Speaker Verification with Speaker-Text Factorized Embeddings.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

But System for the Second Dihard Speech Diarization Challenge.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Optimizing Bayesian Hmm Based X-Vector Clustering for the Second Dihard Speech Diarization Challenge.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Channel Invariant Speaker Embedding Learning with Joint Multi-Task and Adversarial Training.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Investigation of Specaugment for Deep Speaker Embedding Learning.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Discriminative Neural Embedding Learning for Short-Duration Text-Independent Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

BUT System Description to VoxCeleb Speaker Recognition Challenge 2019.
CoRR, 2019

The SJTU Robust Anti-Spoofing System for the ASVspoof 2019 Challenge.
Proceedings of the Interspeech 2019, 2019

Data Augmentation Using Variational Autoencoder for Embedding Based Speaker Verification.
Proceedings of the Interspeech 2019, 2019

On the Usage of Phonetic Information for Text-Independent Speaker Embedding Extraction.
Proceedings of the Interspeech 2019, 2019

Cross-Domain Replay Spoofing Attack Detection Using Domain Adversarial Training.
Proceedings of the Interspeech 2019, 2019

Bayesian HMM Based x-Vector Clustering for Speaker Diarization.
Proceedings of the Interspeech 2019, 2019

Knowledge Distillation for Small Foot-print Deep Speaker Embedding.
Proceedings of the IEEE International Conference on Acoustics, 2019

Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Erratum to: Past review, current progress, and challenges ahead on the cocktail party problem.
Frontiers Inf. Technol. Electron. Eng., 2018

Past review, current progress, and challenges ahead on the cocktail party problem.
Frontiers Inf. Technol. Electron. Eng., 2018

Generative Adversarial Networks based X-vector Augmentation for Robust Probabilistic Linear Discriminant Analysis in Speaker Verification.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Deep Discriminant Analysis for i-vector Based Robust Speaker Recognition.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Covariance Based Deep Feature for Text-Dependent Speaker Verification.
Proceedings of the Intelligence Science and Big Data Engineering, 2018

Angular Softmax for Short-Duration Text-independent Speaker Verification.
Proceedings of the Interspeech 2018, 2018

Focal Kl-Divergence Based Dilated Convolutional Neural Networks for Co-Channel Speaker Identification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Joint I-Vector with End-to-End System for Short Duration Text-Independent Speaker Verification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
What Does the Speaker Embedding Encode?
Proceedings of the Interspeech 2017, 2017

Integrating online i-vector into GMM-UBM for text-dependent speaker verification.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017


  Loading...