Wenkai Zhang

Orcid: 0000-0002-8903-2708

Affiliations:
  • Aerospace Information Research Institute, Chinese Academic of Sciences, Beijing, China


According to our database1, Wenkai Zhang authored at least 47 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
LMF-Net: A Learnable Multimodal Fusion Network for Semantic Segmentation of Remote Sensing Data.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2025

2024
Spatial guided image captioning: Guiding attention with object's spatial interaction.
IET Image Process., October, 2024

Injecting Linguistic Into Visual Backbone: Query-Aware Multimodal Fusion Network for Remote Sensing Visual Grounding.
IEEE Trans. Geosci. Remote. Sens., 2024

Vigen500k: A Sustainable-Expansion Image-Text Aligned Dataset For Remote Sensing.
Proceedings of the IGARSS 2024, 2024

2023
Unsupervised Cross-Scene Aerial Image Segmentation via Spectral Space Transferring and Pseudo-Label Revising.
Remote. Sens., March, 2023

Weakly Supervised Semantic Segmentation in Aerial Imagery via Cross-Image Semantic Mining.
Remote. Sens., February, 2023

Mimicking the Brain's Cognition of Sarcasm From Multidisciplines for Twitter Sarcasm Detection.
IEEE Trans. Neural Networks Learn. Syst., 2023

Hypersphere-Based Remote Sensing Cross-Modal Text-Image Retrieval via Curriculum Learning.
IEEE Trans. Geosci. Remote. Sens., 2023

Efficient and Controllable Remote Sensing Fake Sample Generation Based on Diffusion Model.
IEEE Trans. Geosci. Remote. Sens., 2023

RingMo-SAM: A Foundation Model for Segment Anything in Multimodal Remote-Sensing Images.
IEEE Trans. Geosci. Remote. Sens., 2023

From Plane to Hierarchy: Deformable Transformer for Remote Sensing Image Captioning.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2023

2022
Weakly Supervised Semantic Segmentation in Aerial Imagery via Explicit Pixel-Level Constraints.
IEEE Trans. Geosci. Remote. Sens., 2022

Global Visual Feature and Linguistic State Guided Attention for Remote Sensing Image Captioning.
IEEE Trans. Geosci. Remote. Sens., 2022

Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information.
IEEE Trans. Geosci. Remote. Sens., 2022

A Lightweight Multi-Scale Crossmodal Text-Image Retrieval Method in Remote Sensing.
IEEE Trans. Geosci. Remote. Sens., 2022

Learning to Evaluate Performance of Multimodal Semantic Localization.
IEEE Trans. Geosci. Remote. Sens., 2022

Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval.
IEEE Trans. Geosci. Remote. Sens., 2022

Associatively Segmenting Semantics and Estimating Height From Monocular Remote-Sensing Imagery.
IEEE Trans. Geosci. Remote. Sens., 2022

Multiscale Multiinteraction Network for Remote Sensing Image Captioning.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2022

CSF-Net: Color Spectrum Fusion Network for Semantic Labeling of Airborne Laser Scanning Point Cloud.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2022

Semantic-meshed and content-guided transformer for image captioning.
IET Comput. Vis., 2022

Without detection: Two-step clustering features with local-global attention for image captioning.
IET Comput. Vis., 2022

Learning to Evaluate Performance of Multi-modal Semantic Localization.
CoRR, 2022

MCRN: A Multi-source Cross-modal Retrieval Network for remote sensing.
Int. J. Appl. Earth Obs. Geoinformation, 2022

2021
HECR-Net: Height-Embedding Context Reassembly Network for Semantic Segmentation in Aerial Images.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2021

Reasoning like Humans: On Dynamic Attention Prior in Image Captioning.
Knowl. Based Syst., 2021

Commonalities-, specificities-, and dependencies-enhanced multi-task learning network for judicial decision prediction.
Neurocomputing, 2021

D-MmT: A concise decoder-only multi-modal transformer for abstractive summarization in videos.
Neurocomputing, 2021

An enhanced dynamic interaction network for claim verification.
Neurocomputing, 2021

2020
Boosting Memory with a Persistent Memory Mechanism for Remote Sensing Image Captioning.
Remote. Sens., 2020

An Attention-Based Model Using Character Composition of Entities in Chinese Relation Extraction.
Inf., 2020

Gated hierarchical multi-task learning network for judicial decision prediction.
Neurocomputing, 2020

SA-NLI: A Supervised Attention based framework for Natural Language Inference.
Neurocomputing, 2020

Motion Guided Siamese Trackers for Visual Tracking.
IEEE Access, 2020

Improving Intra- and Inter-Modality Visual Relation for Image Captioning.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Multistage Fusion with Forget Gate for Multimodal Summarization in Open-Domain Videos.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019
LAM: Remote Sensing Image Captioning with Label-Attention Mechanism.
Remote. Sens., 2019

Joint optimisation convex-negative matrix factorisation for multi-modal image collection summarisation based on images and tags.
IET Comput. Vis., 2019

VAA: Visual Aligning Attention Model for Remote Sensing Image Captioning.
IEEE Access, 2019

Chinese NER Using Dynamic Meta-Embeddings.
IEEE Access, 2019

Entity Disambiguation Leveraging Multi-Perspective Attention.
IEEE Access, 2019

Effective Classification of Local Climate Zones Based on Multi-Source Remote Sensing Data.
Proceedings of the 2019 IEEE International Geoscience and Remote Sensing Symposium, 2019

Aerial Image and Map Synthesis Using Generative Adversarial Networks.
Proceedings of the 2019 IEEE International Geoscience and Remote Sensing Symposium, 2019

Effective Fusion of Multi-Modal Data with Group Convolutions for Semantic Segmentation of Aerial Imagery.
Proceedings of the 2019 IEEE International Geoscience and Remote Sensing Symposium, 2019

Geometrical Model for the Layover of Gable-Roofed Buildings and its Application in Building Reconstruction.
Proceedings of the 2019 IEEE International Geoscience and Remote Sensing Symposium, 2019

2018
Effective Fusion of Multi-Modal Remote Sensing Data in a Fully Convolutional Network for Semantic Labeling.
Remote. Sens., 2018

Online Multi-Object Tracking via Combining Discriminative Correlation Filters With Making Decision.
IEEE Access, 2018


  Loading...