Yakun Zhang

ORCID: 0000-0001-5829-1371

Affiliations:
  • Academy of Military Sciences, Defense Innovation Institute, Beijing, China
  • Tianjin Artificial Intelligence Innovation Center, Tianjin, China
  • Chinese Academy of Sciences, Laboratory of Artificial Neural Networks and High-speed Circuits, Institute of Semiconductors, Beijing, China
  • University of Chinese Academy of Sciences, School of Microelectronics, Beijing, China


According to our database, Yakun Zhang authored at least 14 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
AVE Speech: A Comprehensive Multimodal Dataset for Speech Recognition Integrating Audio, Visual, and Electromyographic Signals.
IEEE Trans. Hum. Mach. Syst., August, 2025

DuAGNet: an unrestricted multimodal speech recognition framework using dual adaptive gating fusion.
Appl. Intell., February, 2025

AVE Speech Dataset: A Comprehensive Benchmark for Multi-Modal Speech Recognition Integrating Audio, Visual, and Electromyographic Signals.
CoRR, January, 2025

Neural Chinese silent speech recognition with facial electromyography.
Speech Commun., 2025

MsDUNE: A multi-scale masked temporal fusion framework for speaker-independent lipreading via Dirichlet uncertainty estimation.
Neural Networks, 2025

2024
Landmark-Guided Cross-Speaker Lip Reading with Mutual Information Regularization.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
EMG-Based Cross-Subject Silent Speech Recognition Using Conditional Domain Adversarial Network.
IEEE Trans. Cogn. Dev. Syst., December, 2023

Auxiliary Fine-grained Alignment Constraints for Vision-and-Language Navigation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
A novel silent speech recognition approach based on parallel inception convolutional neural network and Mel frequency spectral coefficient.
Frontiers Neurorobotics, September, 2022

AGCNN: Adaptive Gabor Convolutional Neural Networks with Receptive Fields for Vein Biometric Recognition.
Concurr. Comput. Pract. Exp., 2022

Improved Word-level Lipreading with Temporal Shrinkage Network and NetVLAD.
Proceedings of the International Conference on Multimodal Interaction, 2022

2021
Parallel-Inception CNN Approach for Facial sEMG based Silent Speech Recognition.
Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021

2019
Adaptive Learning Gabor Filter for Finger-Vein Recognition.
IEEE Access, 2019

