Di Wang
Orcid: 0000-0001-8027-4287Affiliations:
- Xidian University, School of Computer Science and Technology, Xi'an, China
According to our database1,
Di Wang authored at least 93 papers
between 2015 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2026
ProVG: Progressive Visual Grounding via Language Decoupling for Remote Sensing Imagery.
CoRR, April, 2026
Multimodal Latent Temporal Modeling for Continuous Engagement Assessment in Online Education.
IEEE Trans. Learn. Technol., 2026
Efficient and Accurate Object Detection With Asymmetric Progressive Semi-Decoupled Head and Harmonic Focal Loss.
IEEE Trans. Image Process., 2026
MoDE: Improving Mixture of Depression Experts With Mutual Information Estimator for Depression Detection.
IEEE Trans. Affect. Comput., 2026
Anatomical Region-Guided Contrastive Decoding: A Plug-and-Play Strategy for Mitigating Hallucinations in Medical VLMs.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
RSVG-ZeroOV: Exploring a Training-Free Framework for Zero-Shot Open-Vocabulary Visual Grounding in Remote Sensing Images.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
Enhanced Mask2Former With Multi-Scale and High-Resolution for Concrete Bridge Crack Semantic Segmentation.
IEEE Trans. Intell. Transp. Syst., December, 2025
Improving Few-Shot Change Detection Visual Question Answering via Decision-Ambiguity-guided Reinforcement Fine-Tuning.
CoRR, December, 2025
CoRR, December, 2025
CheXPO-v2: Preference Optimization for Chest X-ray VLMs with Knowledge Graph Consistency.
CoRR, December, 2025
Plugging and Breathing on the Air: A Practical Defense System for Deep Learning-Based Wireless Semantic Communications.
IEEE Trans. Mob. Comput., September, 2025
Enhancing Cross-View Geo-Localization Generalization via Global-Local Consistency and Geometric Equivariance.
CoRR, September, 2025
EvoFormer: Learning Dynamic Graph-Level Representations with Structural and Temporal Bias Correction.
CoRR, August, 2025
A Reinforcement Learning Framework for Efficient Task Allocation Among AGVs in Smart Warehouse.
IEEE Internet Things J., June, 2025
CoRR, May, 2025
IEEE Trans. Multim., 2025
IEEE Trans. Inf. Forensics Secur., 2025
CAETFN: Context Adaptively Enhanced Text-Guided Fusion Network for Multimodal Sentiment Analysis.
IEEE Trans. Affect. Comput., 2025
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2025
Pattern Recognit. Lett., 2025
Pattern Recognit., 2025
Pattern Recognit., 2025
Pattern Recognit., 2025
Fine-grained knowledge fusion for retrieval-augmented medical visual question answering.
Inf. Fusion, 2025
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
DDFD: Diffusion-Based Denoising Fusion for Object Detection in Infrared-Visible Images.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
Proceedings of the 2025 International Conference on Multimedia Retrieval, 2025
Proceedings of the 2025 International Conference on Multimedia Retrieval, 2025
MambaPose: Efficient 2D Human Pose Estimation with Pose-Prior Guided State Space Model.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025
Uncertainty-Driven Expert Control: Enhancing the Reliability of Medical Vision-Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
DynPose: Largely Improving the Efficiency of Human Pose Estimation by a Simple Dynamic Framework.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
EvoFormer: Learning Dynamic Graph-Level Representations with Structural and Temporal Bias Correction.
Proceedings of the 34th ACM International Conference on Information and Knowledge Management, 2025
Predicting Depression in Screening Interviews from Interactive Multi-Theme Collaboration.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
CognitionCapturer: Decoding Visual Stimuli from Human EEG Signal with Multimodal Information.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025
FD2-Net: Frequency-Driven Feature Decomposition Network for Infrared-Visible Object Detection.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025
2024
IEEE Trans. Neural Networks Learn. Syst., November, 2024
Part-of-speech- and syntactic-aware graph convolutional network for aspect-level sentiment classification.
Multim. Tools Appl., March, 2024
Multimodal transformer with adaptive modality weighting for multimodal sentiment analysis.
Neurocomputing, March, 2024
Pattern Recognit., January, 2024
IEEE Trans. Multim., 2024
Gist, Content, Target-Oriented: A 3-Level Human-Like Framework for Video Moment Retrieval.
IEEE Trans. Multim., 2024
VLDadaptor: Domain Adaptive Object Detection With Vision-Language Model Distillation.
IEEE Trans. Multim., 2024
IEEE Trans. Geosci. Remote. Sens., 2024
Multiscale Spectral-Spatial Attention Residual Fusion Network for Multisource Remote Sensing Data Classification.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2024
GR-GAN: A unified adversarial framework for single image glare removal and denoising.
Pattern Recognit., 2024
DiagSWin: A multi-scale vision transformer with diagonal-shaped windows for object detection and segmentation.
Neural Networks, 2024
IEEE Geosci. Remote. Sens. Lett., 2024
Candidate-Heuristic In-Context Learning: A new framework for enhancing medical visual question answering with LLMs.
Inf. Process. Manag., 2024
Neurocomputing, 2024
Expert Syst. Appl., 2024
Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection.
CoRR, 2024
Divide and Conquer: Isolating Normal-Abnormal Attributes in Knowledge Graph-Enhanced Radiology Report Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Fine-grained Semantics-aware Representation Learning for Text-based Person Retrieval.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024
Leveraging Coarse-to-Fine Grained Representations in Contrastive Learning for Differential Medical Visual Question Answering.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024
Proceedings of the IGARSS 2024, 2024
Proceedings of the IGARSS 2024, 2024
Proceedings of the IGARSS 2024, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Unleashing Channel Potential: Space-Frequency Selection Convolution for SAR Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
TMFN: A Target-oriented Multi-grained Fusion Network for End-to-end Aspect-based Multimodal Sentiment Analysis.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Refining Latent Homophilic Structures over Heterophilic Graphs for Robust Graph Convolution Networks.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Pattern Recognit., August, 2023
Pattern Recognit., April, 2023
IEEE Trans. Multim., 2023
IEEE Trans. Multim., 2023
Mixing Self-Attention and Convolution: A Unified Framework for Multisource Remote Sensing Data Classification.
IEEE Trans. Geosci. Remote. Sens., 2023
Chained-Center-Tracker: an Efficient End-to-End Neural Network for Automated Multi-Object tracking.
Int. J. Robotics Autom., 2023
Relation-Aware Multi-Positive Contrastive Knowledge Graph Completion with Embedding Dimension Scaling.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
2022
IEEE Trans. Cybern., 2022
2021
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021
Proceedings of the 7th IEEE International Conference on Cloud Computing and Intelligent Systems, 2021
2020
Pattern Recognit. Lett., 2020
Joint and individual matrix factorization hashing for large-scale cross-modal retrieval.
Pattern Recognit., 2020
IEEE Access, 2020
Online Collective Matrix Factorization Hashing for Large-Scale Cross-Media Retrieval.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020
2019
Label Consistent Matrix Factorization Hashing for Large-Scale Cross-Modal Similarity Search.
IEEE Trans. Pattern Anal. Mach. Intell., 2019
Semi-paired and semi-supervised multimodal hashing via cross-modality label propagation.
Multim. Tools Appl., 2019
Robust joint learning network: improved deep representation learning for person re-identification.
Multim. Tools Appl., 2019
IET Image Process., 2019
2018
IEEE Trans. Circuits Syst. Video Technol., 2018
Proceedings of the Big Data - 6th CCF Conference, 2018
2016
IEEE Trans. Image Process., 2016
IEEE Trans. Cybern., 2016
Neurocomputing, 2016
2015
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015
Proceedings of the Computer Vision - CCF Chinese Conference, 2015