Yujia Zhang

ORCID: 0000-0002-2335-7657

Affiliations:
  • Chinese Academy of Sciences, Institute of Automation, State Key Laboratory of Multimodal Artificial Intelligence Systems, Beijing, China


According to our database, Yujia Zhang authored at least 28 papers between 2017 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
HierGAT: hierarchical spatial-temporal network with graph and transformer for video HOI detection.
Multim. Syst., February, 2025

Prompt-guided bidirectional deep fusion network for referring image segmentation.
Neurocomputing, 2025

FAWL: Weakly-Supervised Video Corpus Moment Retrieval with Frame-Wise Auxiliary Alignment and Weighted Contrastive Learning.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

RefCap: Zero-shot Video Corpus Moment Retrieval Based on Refined Dense Video Captioning.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
Multi-Stage Image-Language Cross-Generative Fusion Network for Video-Based Referring Expression Comprehension.
IEEE Trans. Image Process., 2024

PSAIR: A Neuro-Symbolic Approach to Zero-Shot Visual Grounding.
Proceedings of the International Joint Conference on Neural Networks, 2024

Coarse-to-Fine Recurrently Aligned Transformer with Balance Tokens for Video Moment Retrieval and Highlight Detection.
Proceedings of the International Joint Conference on Neural Networks, 2024

2023
Visual enhanced hierarchical network for sentence-based video thumbnail generation.
Appl. Intell., October, 2023

2022
Beyond Crack: Fine-Grained Pavement Defect Segmentation Using Three-Stream Neural Networks.
IEEE Trans. Intell. Transp. Syst., 2022

Cross-modality synergy network for referring expression comprehension and segmentation.
Neurocomputing, 2022

Generalized zero-shot emotion recognition from body gestures.
Appl. Intell., 2022

Multi-Task Learning for Pavement Disease Segmentation Using Wavelet Transform.
Proceedings of the International Joint Conference on Neural Networks, 2022

Hand Acupoint Detection from Images Based on Improved HRNet.
Proceedings of the International Joint Conference on Neural Networks, 2022

2021
Rethinking semantic-visual alignment in zero-shot object detection via a softplus margin focal loss.
Neurocomputing, 2021

Robot learning through observation via coarse-to-fine grained video summarization.
Appl. Soft Comput., 2021

TB-Net: A Three-Stream Boundary-Aware Network for Fine-Grained Pavement Disease Segmentation.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Stress Detection Using Wearable Devices based on Transfer Learning.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021

2020
Unsupervised object-level video summarization with online motion auto-encoder.
Pattern Recognit. Lett., 2020

A Generalized Zero-Shot Framework for Emotion Recognition from Body Gestures.
CoRR, 2020

A Prototype-Based Generalized Zero-Shot Learning Framework for Hand Gesture Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

2019
ConnNet: A Long-Range Relation-Aware Pixel-Connectivity Network for Salient Segmentation.
IEEE Trans. Image Process., 2019

Dilated temporal relational adversarial network for generic video summarization.
Multim. Tools Appl., 2019

Rethinking Knowledge Graph Propagation for Zero-Shot Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

DTR-GAN: dilated temporal relational adversarial network for video summarization.
Proceedings of the ACM Turing Celebration Conference - China, 2019

2018
DTR-GAN: Dilated Temporal Relational Adversarial Network for Video Summarization.
CoRR, 2018

Query-Conditioned Three-Player Adversarial Network for Video Summarization.
Proceedings of the British Machine Vision Conference 2018, 2018

2017
A robot Pose estimation Approach based on Object tracking in Monitoring Scenes.
Int. J. Robotics Autom., 2017

Vision-based illegal human ladder climbing action recognition in substation.
Proceedings of the Ninth International Conference on Advanced Computational Intelligence, 2017

