Shihao Zou

Orcid: 0000-0002-9042-6069

According to our database1, Shihao Zou authored at least 44 papers between 2018 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
The Fourth Challenge on Image Super-Resolution (⨉4) at NTIRE 2026: Benchmark Results and Method Overview.
CoRR, April, 2026

Surgical Data Science in Time-Critical Contexts: A Roadmap Toward Brain-Inspired Computing.
J. Comput. Sci. Technol., January, 2026

Multi-Modal Motion Retrieval by Learning a Fine-Grained Joint Embedding Space.
IEEE Trans. Multim., 2026

Appearance Discrepancy-guided Sequence Hybrid Masking for Robust Scene Text Recognition.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Surgical Scene Segmentation using a Spike-Driven Video Transformer with Real-Time Potential.
CoRR, December, 2025

SAM3-I: Segment Anything with Instructions.
CoRR, December, 2025

Highly Efficient 3D Human Pose Tracking From Events With Spiking Spatiotemporal Transformer.
IEEE Trans. Circuits Syst. Video Technol., October, 2025

Intentional tendency-based dynamic heterogeneous graph network for emotion recognition in conversations.
J. Intell. Inf. Syst., June, 2025

Generating High-Fidelity Clothed Human Dynamics with Temporal Diffusion.
ACM Trans. Multim. Comput. Commun. Appl., March, 2025

Spatial-Aware Multi-Level Parsing Network for Human-Object Interaction.
Int. J. Interact. Multim. Artif. Intell., 2025

Semantics-aware human motion generation from audio instructions.
Graph. Model., 2025

Multi-Condition Guided Diffusion Network for Multimodal Emotion Recognition in Conversation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Cerebrovascular Diseases Screening from Color Fundus Photography via Cross-View Fusion and Graph-Based Discrimination.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

SpikeVideoFormer: An Efficient Spike-Driven Video Transformer with Hamming Attention and O(T) Complexity.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Medical Open Set Recognition via Intra-Class Clustering.
Proceedings of the International Conference on Cyberworlds, 2025

Modal Feature Optimization Network with Prompt for Multimodal Sentiment Analysis.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Distance-Aware and Knowledge-Driven Vision Mamba U-Net for Radiotherapy Dose Prediction.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2025

TempDiffReg: Temporal Diffusion Model for Non-Rigid 2D-3D Vascular Registration.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2025

2024
PSAN: Prompt Semantic Augmented Network for aspect-based sentiment analysis.
Expert Syst. Appl., March, 2024

Improving conversational recommender systems via multi-preference modelling and knowledge-enhanced.
Knowl. Based Syst., 2024

Multimodal Knowledge-enhanced Interactive Network with Mixed Contrastive Learning for Emotion Recognition in Conversation.
Neurocomputing, 2024

Multi-behavior collaborative contrastive learning for sequential recommendation.
Complex Intell. Syst., 2024

C³LPGCN:Integrating Contrastive Learning and Cooperative Learning with Prompt into Graph Convolutional Network for Aspect-based Sentiment Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

RACon: Retrieval-Augmented Simulated Character Locomotion Control.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Tri-Modal Motion Retrieval by Learning a Joint Embedding Space.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Snipper: A Spatiotemporal Transformer for Simultaneous Multi-Person 3D Pose Estimation Tracking and Forecasting on a Video Snippet.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

Human Pose and Shape Estimation From Single Polarization Images.
IEEE Trans. Multim., 2023

Event-based Human Pose Tracking by Spiking Spatiotemporal Transformer.
CoRR, 2023

Bi-directional Frame Interpolation for Unsupervised Video Anomaly Detection.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Multimodal Prompt Transformer with Hybrid Contrastive Learning for Emotion Recognition in Conversation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

2022
Speckle Noise Spectrum at Near-Nadir Incidence Angles for a Time-Varying Sea Surface.
IEEE Trans. Geosci. Remote. Sens., 2022

Improving multimodal fusion with Main Modal Transformer for emotion recognition in conversation.
Knowl. Based Syst., 2022

Action2video: Generating Videos of Human 3D Actions.
Int. J. Comput. Vis., 2022

Generating Diverse and Natural 3D Human Motions from Text.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Human Pose and Shape Estimation from Single Polarization Images.
CoRR, 2021

EventHPE: Event-based 3D Human Pose and Shape Estimation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Polarization Human Shape and Pose Dataset.
CoRR, 2020

Action2Motion: Conditioned Generation of 3D Human Motions.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

3D Human Shape Reconstruction from a Polarization Image.
Proceedings of the Computer Vision - ECCV 2020, 2020

Learning to Communicate Implicitly by Actions.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
A Regularized Opponent Model with Maximum Entropy Objective.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

MarlRank: Multi-agent Reinforced Learning to Rank.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

2018
Learning Multi-agent Implicit Communication Through Actions: A Case Study in Contract Bridge, a Collaborative Imperfect-Information Game.
CoRR, 2018

On the Equilibrium of Query Reformulation and Document Retrieval.
Proceedings of the 2018 ACM SIGIR International Conference on Theory of Information Retrieval, 2018


  Loading...