Zuheng Ming
Orcid: 0000-0002-1094-3112
According to our database, Zuheng Ming authored at least 43 papers between 2010 and 2025.
  
  
Bibliography
  2025
    CoRR, September, 2025
    
  
    CoRR, September, 2025
    
  
M3ET: Efficient Vision-Language Learning for Robotics based on Multimodal Mamba-Enhanced Transformer.
    
  
    CoRR, September, 2025
    
  
    Knowl. Based Syst., 2025
    
  
Prediction and detection of terminal diseases using Internet of Medical Things: A review.
    
  
    Comput. Biol. Medicine, 2025
    
  
GlobalDoc: A Cross-Modal Vision-Language Framework for Real-World Document Image Retrieval and Classification.
    
  
    Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025
    
  
Enhanced Alzheimer's Diagnosis with a Lightweight Transformer: A Multimodal Fusion of Sagittal MRI and Clinical Data.
    
  
    Proceedings of the 32nd International Conference on Systems, Signals and Image Processing, 2025
    
  
  2024
    Multim. Tools Appl., October, 2024
    
  
    CoRR, 2024
    
  
Harnessing Knowledge Distillation for Enhanced Text-to-Text Translation in Low-Resource Languages.
    
  
    Proceedings of the Speech and Computer - 26th International Conference, 2024
    
  
Multimodal Transformer Using Cross-Channel Attention For Object Detection In Remote Sensing Images.
    
  
    Proceedings of the IEEE International Conference on Image Processing, 2024
    
  
  2023
VLCDoC: Vision-Language contrastive pre-training model for cross-Modal document classification.
    
  
    Pattern Recognit., July, 2023
    
  
    Symmetry, February, 2023
    
  
Multimodal Transformer Using Cross-Channel attention for Object Detection in Remote Sensing Images.
    
  
    CoRR, 2023
    
  
TransferDoc: A Self-Supervised Transferable Document Representation Learning Model Unifying Vision and Language.
    
  
    CoRR, 2023
    
  
MMFormer: Multimodal Transformer Using Multiscale Self-Attention for Remote Sensing Image Classification.
    
  
    CoRR, 2023
    
  
    Proceedings of the 25th IEEE International Workshop on Multimedia Signal Processing, 2023
    
  
RsMmFormer: Multimodal Transformer Using Multiscale Self-attention for Remote Sensing Image Classification.
    
  
    Proceedings of the Artificial Intelligence - Third CAAI International Conference, 2023
    
  
  2022
    Pattern Recognit. Lett., 2022
    
  
    CoRR, 2022
    
  
ViTransPAD: Video Transformer Using Convolution And Self-Attention For Face Presentation Attack Detection.
    
  
    Proceedings of the 2022 IEEE International Conference on Image Processing, 2022
    
  
  2021
    Int. J. Document Anal. Recognit., 2021
    
  
EAML: ensemble self-attention-based mutual learning network for document image classification.
    
  
    Int. J. Document Anal. Recognit., 2021
    
  
    CoRR, 2021
    
  
  2020
A Survey on Anti-Spoofing Methods for Facial Recognition with RGB Cameras of Generic Consumer Devices.
    
  
    J. Imaging, 2020
    
  
A Survey On Anti-Spoofing Methods For Face Recognition with RGB Cameras of Generic Consumer Devices.
    
  
    CoRR, 2020
    
  
    CoRR, 2020
    
  
    Proceedings of the IEEE International Conference on Image Processing, 2020
    
  
    Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
    
  
  2019
FaceLiveNet+: A Holistic Networks For Face Authentication Based On Dynamic Multi-task Convolutional Neural Networks.
    
  
    CoRR, 2019
    
  
    Proceedings of the 13th IAPR International Workshop on Graphics Recognition, 2019
    
  
Face Detection in Camera Captured Images of Identity Documents Under Challenging Conditions.
    
  
    Proceedings of the 8th International Workshop on Camera-Based Document Analysis and Recognition, 2019
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2019
    
  
  2018
    Proceedings of the 2018 IEEE International Geoscience and Remote Sensing Symposium, 2018
    
  
FaceLiveNet: End-to-End Networks Combining Face Verification with Interactive Facial Expression-Based Liveness Detection.
    
  
    Proceedings of the 24th International Conference on Pattern Recognition, 2018
    
  
  2017
Simple Triplet Loss Based on Intra/Inter-Class Metric Learning for Face Verification.
    
  
    Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017
    
  
  2015
Facial Action Units intensity estimation by the fusion of features with multi-kernel Support Vector Machine.
    
  
    Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015
    
  
Synthetic Evidential Study as Augmented Collective Thought Process - Preliminary Report.
    
  
    Proceedings of the Intelligent Information and Database Systems - 7th Asian Conference, 2015
    
  
  2013
    Proceedings of the Auditory-Visual Speech Processing, 2013
    
  
  2012
Mapping de l'espace spectral vers l'espace visuel de la parole : les voyelles du français en langue française parlée complétée (Mapping of the spectral space to the visual speech space for French vowels cued in Cued Speech) [in French].
    
  
    Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012
    
  
  2010
    Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010