Shihao Zou

Orcid: 0000-0002-9042-6069

According to our database¹, Shihao Zou authored at least 45 papers between 2018 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Towards Unified Surgical Scene Understanding:Bridging Reasoning and Grounding via MLLMs.

[BibT_eX]

[DOI]

CoRR, May, 2026

The Fourth Challenge on Image Super-Resolution (⨉4) at NTIRE 2026: Benchmark Results and Method Overview.

[BibT_eX]

[DOI]

et al.

CoRR, April, 2026

Surgical Data Science in Time-Critical Contexts: A Roadmap Toward Brain-Inspired Computing.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., January, 2026

Multi-Modal Motion Retrieval by Learning a Fine-Grained Joint Embedding Space.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2026

SAM3-I: Segment Anything with Instructions.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

Appearance Discrepancy-guided Sequence Hybrid Masking for Robust Scene Text Recognition.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Surgical Scene Segmentation using a Spike-Driven Video Transformer with Real-Time Potential.

[BibT_eX]

[DOI]

CoRR, December, 2025

Highly Efficient 3D Human Pose Tracking From Events With Spiking Spatiotemporal Transformer.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., October, 2025

Intentional tendency-based dynamic heterogeneous graph network for emotion recognition in conversations.

[BibT_eX]

[DOI]

Xinyi Gan

Xianying Huang

Shihao Zou

J. Intell. Inf. Syst., June, 2025

Generating High-Fidelity Clothed Human Dynamics with Temporal Diffusion.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., March, 2025

Spatial-Aware Multi-Level Parsing Network for Human-Object Interaction.

[BibT_eX]

[DOI]

Int. J. Interact. Multim. Artif. Intell., 2025

Semantics-aware human motion generation from audio instructions.

[BibT_eX]

[DOI]

Graph. Model., 2025

Multi-Condition Guided Diffusion Network for Multimodal Emotion Recognition in Conversation.

[BibT_eX]

[DOI]

Wenjin Tian

Xianying Huang

Shihao Zou

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Cerebrovascular Diseases Screening from Color Fundus Photography via Cross-View Fusion and Graph-Based Discrimination.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

SpikeVideoFormer: An Efficient Spike-Driven Video Transformer with Hamming Attention and O(T) Complexity.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Medical Open Set Recognition via Intra-Class Clustering.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Cyberworlds, 2025

Modal Feature Optimization Network with Prompt for Multimodal Sentiment Analysis.

[BibT_eX]

[DOI]

Xiangmin Zhang

Wei Wei

Shihao Zou

Proceedings of the 31st International Conference on Computational Linguistics, 2025

Distance-Aware and Knowledge-Driven Vision Mamba U-Net for Radiotherapy Dose Prediction.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2025

TempDiffReg: Temporal Diffusion Model for Non-Rigid 2D-3D Vascular Registration.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2025

2024

PSAN: Prompt Semantic Augmented Network for aspect-based sentiment analysis.

[BibT_eX]

[DOI]

Expert Syst. Appl., March, 2024

Improving conversational recommender systems via multi-preference modelling and knowledge-enhanced.

[BibT_eX]

[DOI]

Knowl. Based Syst., 2024

Multimodal Knowledge-enhanced Interactive Network with Mixed Contrastive Learning for Emotion Recognition in Conversation.

[BibT_eX]

[DOI]

Neurocomputing, 2024

Multi-behavior collaborative contrastive learning for sequential recommendation.

[BibT_eX]

[DOI]

Complex Intell. Syst., 2024

C³LPGCN:Integrating Contrastive Learning and Cooperative Learning with Prompt into Graph Convolutional Network for Aspect-based Sentiment Analysis.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

RACon: Retrieval-Augmented Simulated Character Locomotion Control.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Tri-Modal Motion Retrieval by Learning a Joint Embedding Space.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Snipper: A Spatiotemporal Transformer for Simultaneous Multi-Person 3D Pose Estimation Tracking and Forecasting on a Video Snippet.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., September, 2023

Human Pose and Shape Estimation From Single Polarization Images.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Event-based Human Pose Tracking by Spiking Spatiotemporal Transformer.

[BibT_eX]

[DOI]

CoRR, 2023

Bi-directional Frame Interpolation for Unsupervised Video Anomaly Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Multimodal Prompt Transformer with Hybrid Contrastive Learning for Emotion Recognition in Conversation.

[BibT_eX]

[DOI]

Shihao Zou

Xianying Huang

Xudong Shen

Proceedings of the 31st ACM International Conference on Multimedia, 2023

2022

Speckle Noise Spectrum at Near-Nadir Incidence Angles for a Time-Varying Sea Surface.

[BibT_eX]

[DOI]

IEEE Trans. Geosci. Remote. Sens., 2022

Improving multimodal fusion with Main Modal Transformer for emotion recognition in conversation.

[BibT_eX]

[DOI]

Knowl. Based Syst., 2022

Action2video: Generating Videos of Human 3D Actions.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2022

Generating Diverse and Natural 3D Human Motions from Text.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Human Pose and Shape Estimation from Single Polarization Images.

[BibT_eX]

[DOI]

CoRR, 2021

EventHPE: Event-based 3D Human Pose and Shape Estimation.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020

Polarization Human Shape and Pose Dataset.

[BibT_eX]

[DOI]

CoRR, 2020

Action2Motion: Conditioned Generation of 3D Human Motions.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

3D Human Shape Reconstruction from a Polarization Image.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Learning to Communicate Implicitly by Actions.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

A Regularized Opponent Model with Maximum Entropy Objective.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

MarlRank: Multi-agent Reinforced Learning to Rank.

[BibT_eX]

[DOI]

Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

2018

Learning Multi-agent Implicit Communication Through Actions: A Case Study in Contract Bridge, a Collaborative Imperfect-Information Game.

[BibT_eX]

[DOI]

CoRR, 2018

On the Equilibrium of Query Reformulation and Document Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM SIGIR International Conference on Theory of Information Retrieval, 2018

Shihao Zou

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...