Yakun Zhang
ORCID: 0000-0001-5829-1371
Affiliations:
- Academy of Military Sciences, Defense Innovation Institute, Beijing, China
- Tianjin Artificial Intelligence Innovation Center, Tianjin, China
- Chinese Academy of Sciences, Laboratory of Artificial Neural Networks and High-speed Circuits, Institute of Semiconductors, Beijing, China
- University of Chinese Academy of Sciences, School of Microelectronics, Beijing, China
According to our database, Yakun Zhang authored at least 20 papers between 2019 and 2026.
Bibliography
2026
DBMIF: a deep balanced multimodal iterative fusion framework for air- and bone-conduction speech enhancement.
Appl. Intell., April, 2026
Purification Before Fusion: Toward Mask-Free Speech Enhancement for Robust Audio-Visual Speech Recognition.
CoRR, January, 2026
Sequential viseme-driven visual speech recognition through dual-stream interactive neural architecture.
Neural Networks, 2026
DAP-Whisper: A robust audio-visual speech recognition system via distribution-aware prompting and consistency-gated modulation.
Expert Syst. Appl., 2026
2025
AVE Speech: A Comprehensive Multimodal Dataset for Speech Recognition Integrating Audio, Visual, and Electromyographic Signals.
IEEE Trans. Hum. Mach. Syst., August, 2025
DuAGNet: an unrestricted multimodal speech recognition framework using dual adaptive gating fusion.
Appl. Intell., February, 2025
AVE Speech Dataset: A Comprehensive Benchmark for Multi-Modal Speech Recognition Integrating Audio, Visual, and Electromyographic Signals.
CoRR, January, 2025
IEEE Signal Process. Lett., 2025
Speech Commun., 2025
MsDUNE: A multi-scale masked temporal fusion framework for speaker-independent lipreading via Dirichlet uncertainty estimation.
Neural Networks, 2025
Bridging semantics across modalities: Decoupled representation learning for audio-visual speech recognition.
Knowl. Based Syst., 2025
2024
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
2023
EMG-Based Cross-Subject Silent Speech Recognition Using Conditional Domain Adversarial Network.
IEEE Trans. Cogn. Dev. Syst., December, 2023
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
2022
A novel silent speech recognition approach based on parallel inception convolutional neural network and Mel frequency spectral coefficient.
Frontiers Neurorobotics, September, 2022
AGCNN: Adaptive Gabor Convolutional Neural Networks with Receptive Fields for Vein Biometric Recognition.
Concurr. Comput. Pract. Exp., 2022
Proceedings of the International Conference on Multimodal Interaction, 2022
2021
Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021
2019