Bing Han

Orcid: 0000-0002-6319-6755

Affiliations:
  • Shanghai Jiao Tong University, Department of Computer Science and Engineering, Shanghai, China


According to our database1, Bing Han authored at least 39 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Exploring Self-Supervised Audio Models for Generalized Anomalous Sound Detection.
CoRR, August, 2025

FISHER: A Foundation Model for Multi-Modal Industrial Signal Comprehensive Representation.
CoRR, July, 2025

Zero-Shot Streaming Text to Speech Synthesis with Transducer and Auto-Regressive Modeling.
CoRR, May, 2025

Data-Efficient Low-Complexity Acoustic Scene Classification via Distilling and Progressive Pruning.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Flow-TSVAD: Target-Speaker Voice Activity Detection via Latent Flow Matching for Speaker Diarization.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Autoregressive Speech Synthesis without Vector Quantization.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Self-Supervised Learning With Cluster-Aware-DINO for High-Performance Robust Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Attention-Based Encoder-Decoder End-to-End Neural Diarization With Embedding Enhancer.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Advancing speaker embedding learning: Wespeaker toolkit for research and production.
Speech Commun., 2024

Data-Efficient Low-Complexity Acoustic Scene Classification via Distilling and Progressive Pruning.
CoRR, 2024

Flow-TSVAD: Target-Speaker Voice Activity Detection via Latent Flow Matching.
CoRR, 2024

AnoPatch: Towards Better Consistency in Machine Anomalous Sound Detection.
CoRR, 2024

VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment.
CoRR, 2024

Improving Anomalous Sound Detection Via Low-Rank Adaptation Fine-Tuning of Pre-Trained Audio Models.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

Combining Self-Supervised Learning and Adversarial Training Based Domain Adaptation for Speaker Verification.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

InstructME: An Instruction Guided Music Edit Framework with Latent Diffusion Models.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Improving Acoustic Scene Classification via Self-Supervised and Semi-Supervised Learning with Efficient Audio Transformer.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Semi-Supervised Acoustic Scene Classification with Test-Time Adaptation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Leveraging in-the-wild Data for Effective Self-supervised Pretraining in Speaker Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

Robust Cross-Domain Speaker Verification with Multi-Level Domain Adapters.
Proceedings of the IEEE International Conference on Acoustics, 2024

Exploring Large Scale Pre-Trained Models for Robust Machine Anomalous Sound Detection.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
InstructME: An Instruction Guided Music Edit And Remix Framework with Latent Diffusion Models.
CoRR, 2023

Wespeaker baselines for VoxSRC2023.
CoRR, 2023

Build a SRE Challenge System: Lessons from VoxSRC 2022 and CNSRC 2022.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Improving Dino-Based Self-Supervised Speaker Verification with Progressive Cluster-Aware Training.
Proceedings of the IEEE International Conference on Acoustics, 2023

Exploring Binary Classification Loss for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
SJTU-AISPEECH System for VoxCeleb Speaker Recognition Challenge 2022.
CoRR, 2022

The SJTU X-LANCE Lab System for CNSRC 2022.
CoRR, 2022

A Comprehensive Study on Self-Supervised Distillation for Speaker Representation Learning.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

The X-Lance Speaker Diarization System for the Conversational Short-phrase Speaker Diarization Challenge 2022.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

DF-ResNet: Boosting Speaker Verification Performance with Depth-First Design.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Self-Supervised Speaker Verification Using Dynamic Loss-Gate and Label Correction.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

The Sjtu System For Multimodal Information Based Speech Processing Challenge 2021.
Proceedings of the IEEE International Conference on Acoustics, 2022

Local Information Modeling with Self-Attention for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2022

MLP-SVNET: A Multi-Layer Perceptrons Based Network for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
The SJTU System for Short-Duration Speaker Verification Challenge 2021.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

SynAug: Synthesis-Based Data Augmentation for Text-Dependent Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2021


  Loading...