Zhenzhen Hu

Orcid: 0000-0003-1042-8361

Affiliations:
  • Hefei University of Technology, China


According to our database1, Zhenzhen Hu authored at least 49 papers between 2013 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
CLAIP-Emo: Parameter-Efficient Adaptation of Language-supervised models for In-the-Wild Audiovisual Emotion Recognition.
CoRR, September, 2025

Generalizable Engagement Estimation in Conversation via Domain Prompting and Parallel Attention.
CoRR, August, 2025

Listening to the Unspoken: Exploring 365 Aspects of Multimodal Interview Performance Assessment.
CoRR, July, 2025

Traits Run Deep: Enhancing Personality Assessment via Psychology-Guided LLM Representations and Multimodal Apparent Behaviors.
CoRR, July, 2025

Contrastive Alignment with Semantic Gap-Aware Corrections in Text-Video Retrieval.
CoRR, May, 2025

VAEmo: Efficient Representation Learning for Visual-Audio Emotion with Knowledge Injection.
CoRR, May, 2025

Grid Jigsaw Representation with CLIP: a new perspective on image clustering.
Multim. Syst., April, 2025

Video Flow as Time Series: Discovering Temporal Consistency and Variability for VideoQA.
CoRR, April, 2025

Multi-Modal Prior-Guided Diffusion Model for Blind Image Super-Resolution.
IEEE Signal Process. Lett., 2025

EPDiff: Enhancing Prior-guided Diffusion model for Real-world Image Super-Resolution.
Comput. Vis. Image Underst., 2025

Concept Drift Guided LayerNorm Tuning for Efficient Multimodal Metaphor Identification.
Proceedings of the 2025 International Conference on Multimedia Retrieval, 2025

Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Efficiently Gluing Pre-Trained Language and Vision Models for Image Captioning.
ACM Trans. Intell. Syst. Technol., December, 2024

Exploring and exploiting model uncertainty for robust visual question answering.
Multim. Syst., December, 2024

Math Word Problem Generation via Disentangled Memory Retrieval.
ACM Trans. Knowl. Discov. Data, June, 2024

Embedded Heterogeneous Attention Transformer for Cross-Lingual Image Captioning.
IEEE Trans. Multim., 2024

Decomposing Relationship from 1-to-N into N 1-to-1 for Text-Video Retrieval.
CoRR, 2024

UniLearn: Enhancing Dynamic Facial Expression Recognition through Unified Pre-Training and Fine-Tuning on Images and Videos.
CoRR, 2024

Seeing is Believing? Enhancing Vision-Language Navigation using Visual Perturbations.
CoRR, 2024

Dual-Stream Keyframe Enhancement for Video Question Answering.
Proceedings of the 6th ACM International Conference on Multimedia in Asia, 2024

2023
Efficient and self-adaptive rationale knowledge base for visual commonsense reasoning.
Multim. Syst., October, 2023

A Text-Guided Generation and Refinement Model for Image Captioning.
IEEE Trans. Multim., 2023

Embedded Heterogeneous Attention Transformer for Cross-lingual Image Captioning.
CoRR, 2023

Grid Feature Jigsaw for Self-supervised Image Clustering.
Proceedings of the International Joint Conference on Neural Networks, 2023

Dual Video Summarization: From Frames to Captions.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

CITE: Compact Interactive TransformEr for Multilingual Image Captioning.
Proceedings of the 6th International Conference on Image and Graphics Processing, 2023

2022
Visual feature synthesis with semantic reconstructor for traditional and generalized zero-shot object classification.
Int. J. Intell. Syst., 2022

Compact Bidirectional Transformer for Image Captioning.
CoRR, 2022

Math Word Problem Generation with Memory Retrieval.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

OCR-oriented Master Object for Text Image Captioning.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

2021
Adversarial co-distillation learning for image recognition.
Pattern Recognit., 2021

Sequential image encoding for vision-to-language problems.
Multim. Tools Appl., 2021

Semi-Autoregressive Transformer for Image Captioning.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

2020
The Balanced Loss Curriculum Learning.
IEEE Access, 2020

WFN-PSC: weighted-fusion network with poly-scale convolution for image dehazing.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

A Text-Guided Graph Structure for Image Captioning.
Proceedings of the 2020 IEEE International Conference on Multimedia & Expo Workshops, 2020

More Grounded Image Captioning by Distilling Image-Text Matching Model.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Quality-Aware Unpaired Image-to-Image Translation.
IEEE Trans. Multim., 2019

2018
Video Captioning Based on the Spatial-Temporal Saliency Tracing.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Semantic Image Inpainting with Progressive Generative Networks.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Speeding-Up Age Estimation in Intelligent Demographics System via Network Optimization.
Proceedings of the 2018 IEEE International Conference on Communications, 2018

Enhanced Text-Guided Attention Model for Image Captioning.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

2017
Visual Classification of Furniture Styles.
ACM Trans. Intell. Syst. Technol., 2017

Facial Age Estimation With Age Difference.
IEEE Trans. Image Process., 2017

2016
Multi-View Object Retrieval via Multi-Scale Topic Models.
IEEE Trans. Image Process., 2016

2015
Understanding Blooming Human Groups in Social Networks.
IEEE Trans. Multim., 2015

2014
Fashion Parsing With Weak Color-Category Labels.
IEEE Trans. Multim., 2014

PicWords: Render a Picture by Packing Keywords.
IEEE Trans. Multim., 2014

2013
eHeritage of shadow puppetry: creation and manipulation.
Proceedings of the ACM Multimedia Conference, 2013


  Loading...