We stand with Ukraine

We stand with Ukraine

Zuheng Ming

Orcid: 0000-0002-1094-3112

According to our database¹, Zuheng Ming authored at least 44 papers between 2010 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Toward Better Optimization of Low-Dose CT Enhancement: A Critical Analysis of Loss Functions and Image Quality Assessment Metrics.

[BibT_eX]

[DOI]

,

Azeddine Beghdadi

,

,

CoRR, November, 2025

Indirect Attention: Turning Context Misalignment into a Feature.

[BibT_eX]

[DOI]

Bissmella Bahaduri

,

Hicham Talaoubrid

,

,

,

Anissa Mokraoui

CoRR, September, 2025

LCMF: Lightweight Cross-Modality Mambaformer for Embodied Robotics VQA.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, September, 2025

M3ET: Efficient Vision-Language Learning for Robotics based on Multimodal Mamba-Enhanced Transformer.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, September, 2025

PK-Net: A prior knowledge-driven dual-path network for enhanced glaucoma screening.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Knowl. Based Syst., 2025

Prediction and detection of terminal diseases using Internet of Medical Things: A review.

[BibT_eX]

[DOI]

Akeem Temitope Otapo

,

,

Ghazaleh Khodabandelou

,

Comput. Biol. Medicine, 2025

GlobalDoc: A Cross-Modal Vision-Language Framework for Real-World Document Image Retrieval and Classification.

[BibT_eX]

[DOI]

Souhail Bakkali

,

,

,

Mickaël Coustaty

,

Marçal Rusiñol

,

Oriol Ramos Terrades

,

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

Enhanced Alzheimer's Diagnosis with a Lightweight Transformer: A Multimodal Fusion of Sagittal MRI and Clinical Data.

[BibT_eX]

[DOI]

Akeem Temitope Otapo

,

Ghazaleh Khodabandelou

,

,

Proceedings of the 32nd International Conference on Systems, Signals and Image Processing, 2025

2024

Identifying fraudulent identity documents by analyzing imprinted guilloche patterns.

[BibT_eX]

[DOI]

,

,

,

Petra Gomez-Krämer

,

Mickaël Coustaty

,

,

Jean-Christophe Burie

Multim. Tools Appl., October, 2024

Interactive Masked Image Modeling for Multimodal Object Detection in Remote Sensing.

[BibT_eX]

[DOI]

,

,

,

Bissmella Bahaduri

,

Anissa Mokraoui

CoRR, 2024

Harnessing Knowledge Distillation for Enhanced Text-to-Text Translation in Low-Resource Languages.

[BibT_eX]

[DOI]

Manar Ouled Ahmed

,

,

Proceedings of the Speech and Computer - 26th International Conference, 2024

Multimodal Transformer Using Cross-Channel Attention For Object Detection In Remote Sensing Images.

[BibT_eX]

[DOI]

Bissmella Bahaduri

,

,

,

Anissa Mokraoui

Proceedings of the IEEE International Conference on Image Processing, 2024

2023

VLCDoC: Vision-Language contrastive pre-training model for cross-Modal document classification.

[BibT_eX]

[DOI]

Souhail Bakkali

,

,

Mickaël Coustaty

,

Marçal Rusiñol

,

Oriol Ramos Terrades

Pattern Recognit., July, 2023

A Novel Heteromorphic Ensemble Algorithm for Hand Pose Recognition.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Qammer H. Abbasi

,

Symmetry, February, 2023

Multimodal Transformer Using Cross-Channel attention for Object Detection in Remote Sensing Images.

[BibT_eX]

[DOI]

Bissmella Bahaduri

,

,

,

CoRR, 2023

TransferDoc: A Self-Supervised Transferable Document Representation Learning Model Unifying Vision and Language.

[BibT_eX]

[DOI]

Souhail Bakkali

,

,

,

Mickaël Coustaty

,

Marçal Rusiñol

,

Oriol Ramos Terrades

,

CoRR, 2023

MMFormer: Multimodal Transformer Using Multiscale Self-Attention for Remote Sensing Image Classification.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2023

Guilloche Detection for ID Authentication: A Dataset and Baselines.

[BibT_eX]

[DOI]

,

,

Petra Gomez-Krämer

,

Jean-Christophe Burie

,

Mickaël Coustaty

,

Proceedings of the 25th IEEE International Workshop on Multimedia Signal Processing, 2023

RsMmFormer: Multimodal Transformer Using Multiscale Self-attention for Remote Sensing Image Classification.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Artificial Intelligence - Third CAAI International Conference, 2023

2022

Exploring multi-tasking learning in document attribute classification.

[BibT_eX]

[DOI]

,

,

Pattern Recognit. Lett., 2022

Document Liveness Challenge Dataset (DLC-2021).

[BibT_eX]

[DOI]

Dmitry V. Polevoy

,

Irina V. Sigareva

,

Daria M. Ershova

,

Vladimir V. Arlazarov

,

Dmitry P. Nikolaev

,

,

Muhammad Muzzamil Luqman

,

Jean-Christophe Burie

J. Imaging, 2022

Identity Documents Authentication based on Forgery Detection of Guilloche Pattern.

[BibT_eX]

[DOI]

,

,

Petra Gomez-Krämer

,

Jean-Christophe Burie

CoRR, 2022

Vitranspad: Video Transformer Using Convolution And Self-Attention For Face Presentation Attack Detection.

[BibT_eX]

[DOI]

,

,

,

,

Muhammad Muzzamil Luqman

,

Jean-Christophe Burie

Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

2021

Cross-modal photo-caricature face recognition based on dynamic multi-task learning.

[BibT_eX]

[DOI]

,

Jean-Christophe Burie

,

Muhammad Muzzamil Luqman

Int. J. Document Anal. Recognit., 2021

EAML: ensemble self-attention-based mutual learning network for document image classification.

[BibT_eX]

[DOI]

Souhail Bakkali

,

,

Mickaël Coustaty

,

Marçal Rusiñol

Int. J. Document Anal. Recognit., 2021

MIDV-2020: A Comprehensive Benchmark Dataset for Identity Document Analysis.

[BibT_eX]

[DOI]

Konstantin B. Bulatov

,

Ekaterina Emelianova

,

Daniil V. Tropin

,

Natalya Skoryukina

,

Yulia S. Chernyshova

,

Alexander Sheshkus

,

Sergey A. Usilin

,

,

Jean-Christophe Burie

,

Muhammad Muzzamil Luqman

,

Vladimir V. Arlazarov

CoRR, 2021

2020

A Survey on Anti-Spoofing Methods for Facial Recognition with RGB Cameras of Generic Consumer Devices.

[BibT_eX]

[DOI]

,

,

Muhammad Muzzamil Luqman

,

Jean-Christophe Burie

J. Imaging, 2020

A Survey On Anti-Spoofing Methods For Face Recognition with RGB Cameras of Generic Consumer Devices.

[BibT_eX]

[DOI]

,

,

Muhammad Muzzamil Luqman

,

Jean-Christophe Burie

CoRR, 2020

Cross-modal Multi-task Learning for Graphic Recognition of Caricature Face.

[BibT_eX]

[DOI]

,

Jean-Christophe Burie

,

Muhammad Muzzamil Luqman

CoRR, 2020

Cross-Modal Deep Networks For Document Image Classification.

[BibT_eX]

[DOI]

Souhail Bakkali

,

,

Mickaël Coustaty

,

Marçal Rusiñol

Proceedings of the IEEE International Conference on Image Processing, 2020

Visual and Textual Deep Feature Fusion for Document Image Classification.

[BibT_eX]

[DOI]

Souhail Bakkali

,

,

Mickaël Coustaty

,

Marçal Rusiñol

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Dynamic Multi-Task Learning for Face Recognition with Facial Expression.

[BibT_eX]

[DOI]

,

,

Muhammad Muzzamil Luqman

,

Jean-Christophe Burie

,

CoRR, 2019

FaceLiveNet+: A Holistic Networks For Face Authentication Based On Dynamic Multi-task Convolutional Neural Networks.

[BibT_eX]

[DOI]

,

,

Muhammad Muzzamil Luqman

,

Jean-Christophe Burie

,

CoRR, 2019

Dynamic Deep Multi-task Learning for Caricature-Visual Face Recognition.

[BibT_eX]

[DOI]

,

Jean-Christophe Burie

,

Muhammad Muzzamil Luqman

Proceedings of the 13th IAPR International Workshop on Graphics Recognition, 2019

Face Detection in Camera Captured Images of Identity Documents Under Challenging Conditions.

[BibT_eX]

[DOI]

Souhail Bakkali

,

Muhammad Muzzamil Luqman

,

,

Jean-Christophe Burie

Proceedings of the 8th International Workshop on Camera-Based Document Analysis and Recognition, 2019

Classification of Hyperspectral and Lidar with Deep Rotation Forest.

[BibT_eX]

[DOI]

,

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Multiple Sources Data Fusion Via Deep Forest.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2018 IEEE International Geoscience and Remote Sensing Symposium, 2018

FaceLiveNet: End-to-End Networks Combining Face Verification with Interactive Facial Expression-Based Liveness Detection.

[BibT_eX]

[DOI]

,

Joseph Chazalon

,

Muhammad Muzzamil Luqman

,

,

Jean-Christophe Burie

Proceedings of the 24th International Conference on Pattern Recognition, 2018

2017

Simple Triplet Loss Based on Intra/Inter-Class Metric Learning for Face Verification.

[BibT_eX]

[DOI]

,

Joseph Chazalon

,

Muhammad Muzzamil Luqman

,

,

Jean-Christophe Burie

Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

2015

Facial Action Units intensity estimation by the fusion of features with multi-kernel Support Vector Machine.

[BibT_eX]

[DOI]

,

Aurélie Bugeau

,

,

Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

Synthetic Evidential Study as Augmented Collective Thought Process - Preliminary Report.

[BibT_eX]

[DOI]

Toyoaki Nishida

,

,

,

,

Sutasinee Thovutikul

,

,

Yasser F. O. Mohammad

,

Christian Nitschke

,

Yoshimasa Ohmoto

,

Atsushi Nakazawa

,

,

,

Aurélie Bugeau

,

,

,

Geoffrey Letournel

,

,

Dominique Fourer

Proceedings of the Intelligent Information and Database Systems - 7th Asian Conference, 2015

2013

GMM mapping of visual features of cued speech from speech spectral features.

[BibT_eX]

[DOI]

,

Denis Beautemps

,

Proceedings of the Auditory-Visual Speech Processing, 2013

2012

Mapping de l'espace spectral vers l'espace visuel de la parole : les voyelles du français en langue française parlée complétée (Mapping of the spectral space to the visual speech space for French vowels cued in Cued Speech) [in French].

[BibT_eX]

[DOI]

,

,

Denis Beautemps

Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

2010

Estimation of speech lip features from discrete cosinus transform.

[BibT_eX]

[DOI]

,

Denis Beautemps

,

,

Sébastien Schmerber

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Loading...