Bing Yang

Orcid: 0000-0002-8978-2322

Affiliations:
  • Peking University, Shenzhen Graduate School, Key Laboratory of Machine Perception, Beijing, China


According to our database1, Bing Yang authored at least 22 papers between 2017 and 2022.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2022
Enhancing direct-path relative transfer function using deep neural network for robust sound source localization.
CAAI Trans. Intell. Technol., 2022

Head-related transfer function-reserved time-frequency masking for robust binaural sound source localization.
CAAI Trans. Intell. Technol., 2022

SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Supervised Direct-Path Relative Transfer Function Learning for Binaural Sound Source Localization.
Proceedings of the IEEE International Conference on Acoustics, 2021

Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
An Adaptive Method Based on Multiscale Dilated Convolutional Network for Binaural Speech Source Localization.
Complex., 2020

Deep Metric Learning-Assisted 3D Audio-Visual Speaker Tracking via Two-Layer Particle Filter.
Complex., 2020

Part-Based Lipreading for Audio-Visual Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Systems, Man, and Cybernetics, 2020

Lip Graph Assisted Audio-Visual Speech Recognition Using Bidirectional Synchronous Fusion.
Proceedings of the Interspeech 2020, 2020

Audio-Visual Speech Recognition Using A Two-Step Feature Fusion Strategy.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Mutual Alignment between Audiovisual Features for End-to-End Audiovisual Speech Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

3D Audio-Visual Speaker Tracking with A Novel Particle Filter.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

A Base-Derivative Framework for Cross-Modality RGB-Infrared Person Re-Identification.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Robust Audio-Visual Speech Recognition Based on Hybrid Fusion.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

2019
Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Robust Interaural Time Difference Estimation Based on Convolutional Neural Network.
Proceedings of the 2019 IEEE International Conference on Robotics and Biomimetics, 2019

Synergistic Optimization based Binaural Time-Frequency Masking for Speech Source Localization.
Proceedings of the 2019 IEEE International Conference on Robotics and Biomimetics, 2019

3D Audio-Visual Speaker Tracking with A Two-Layer Particle Filter.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

2018
Multiple Concurrent Sound Source Tracking Based on Observation-Guided Adaptive Particle Filter.
Proceedings of the Interspeech 2018, 2018

2017
Multiple Sound Source Counting and Localization Based on Spatial Principal Eigenvector.
Proceedings of the Interspeech 2017, 2017

Multiple sound source localization based on TDOA clustering and multi-path matching pursuit.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017


  Loading...