Wenhui Jiang

Orcid: 0000-0002-4144-6725

Affiliations:
  • Jiangxi University of Finance and Economics, School of Computing and Artificial Intelligence, Nanchang, China
  • Beijing University of Posts and Telecommunications, Beijing, China (PhD)


According to our database, Wenhui Jiang authored at least 35 papers between 2015 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Text-Conditional Visual-Language Alignment for Video Captioning.
IEEE Trans. Circuits Syst. Video Technol., March, 2026

DSRAS: Dual-Stage Reasoning and Answer Selection for Video-Text Visual Question Answering.
IEEE Signal Process. Lett., 2026

Retrieval Augmented video captioning with quality-aware re-ranking and cross-gating fusion.
Displays, 2026

2025
Learning Comprehensive Visual Grounding for Video Captioning.
IEEE Trans. Circuits Syst. Video Technol., April, 2025

Learning Guided Implicit Depth Function With Scale-Aware Feature Fusion.
IEEE Trans. Image Process., 2025

Omnidirectional Image Quality Captioning: A Large-Scale Database and a New Model.
IEEE Trans. Image Process., 2025

Separate, Locate, and Align: Determine Context Relation of Scene Text From Multiple Perspectives in TextVQA.
IEEE Trans. Circuits Syst. Video Technol., 2025

Learning Stage-wise Fusion Transformer for light field saliency detection.
Pattern Recognit. Lett., 2025

Opinion-unaware blind stereoscopic image quality assessment: A comprehensive study.
Pattern Recognit., 2025

Weak-shot Keypoint Estimation via Keyness and Correspondence Transfer.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

What Happens in the Surroundings: A Benchmark for 360° image Captioning.
Proceedings of the International Joint Conference on Neural Networks, 2025

2024
Revisiting the robustness of spatio-temporal modeling in video quality assessment.
Displays, January, 2024

CFNet: Conditional filter learning with dynamic noise estimation for real image denoising.
Knowl. Based Syst., 2024

PosCap: Boosting Video Captioning with Part-of-Speech Guidance.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Comprehensive Visual Grounding for Video Description.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Lesion-aware network for diabetic retinopathy diagnosis.
Int. J. Imaging Syst. Technol., November, 2023

UDNet: Uncertainty-aware deep network for salient object detection.
Pattern Recognit., 2023

Feature Adaptive YOLO for Remote Sensing Detection in Adverse Weather Conditions.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2023

2022
Visual Cluster Grounding for Image Captioning.
IEEE Trans. Image Process., 2022

Revisiting image captioning via maximum discrepancy competition.
Pattern Recognit., 2022

Hybrid attention network for image captioning.
Displays, 2022

CFNet: Conditional Filter Learning with Dynamic Noise Estimation for Real Image Denoising.
CoRR, 2022

Dual-stream Self-attention Network for Image Captioning.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

Bilinear CNNs for Blind Quality Assessment of Fine-Grained Images.
Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022

Informative Attention Supervision for Grounded Video Description.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Superpixel-Based Quality Assessment of Multi-Exposure Image Fusion for Both Static and Dynamic Scenes.
IEEE Trans. Image Process., 2021

Visual attention prediction for Autism Spectrum Disorder with hierarchical semantic fusion.
Signal Process. Image Commun., 2021

Dynamic proposal sampling for weakly supervised object detection.
Neurocomputing, 2021

Anomaly detection in video sequences: A benchmark and computational model.
IET Image Process., 2021

2018
Weakly supervised detection with decoupled attention-based deep representation.
Multim. Tools Appl., 2018

2017
Optimizing Region Selection for Weakly Supervised Object Detection.
CoRR, 2017

Ego-Motion Classification for Driving Vehicle.
Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

2016
Bayes pooling of visual phrases for object retrieval.
Multim. Tools Appl., 2016

ALADDIN: A locality aligned deep model for instance search.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Part-based deep network for pedestrian detection in surveillance videos.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

