Zhaobo Qi

Orcid: 0000-0001-9196-9818

According to our database, Zhaobo Qi authored at least 16 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
KN-VLM: KNowledge-guided Vision-and-Language Model for visual abductive reasoning.
Multim. Syst., April, 2025

Dual-guided multi-modal bias removal strategy for temporal sentence grounding in video.
Multim. Syst., April, 2025

Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Learning Fine-Grained Representations through Textual Token Disentanglement in Composed Video Retrieval.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Enhancing Pre-trained Representation Classifiability can Boost its Interpretability.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Video Language Model Pretraining with Spatio-temporal Masking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Procedure Knowledge Decoupled Distillation Strategy for Procedure Planning in Instructional Videos.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Uncertainty-Boosted Robust Video Activity Anticipation.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Collaborative Debias Strategy for Temporal Sentence Grounding in Video.
IEEE Trans. Circuits Syst. Video Technol., November, 2024

Improving Sequential DeepFake Detection with Local information enhancement.
Proceedings of the 6th ACM International Conference on Multimedia in Asia, 2024

Bias-Conflict Sample Synthesis and Adversarial Removal Debias Strategy for Temporal Sentence Grounding in Video.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Temporal Dynamic Concept Modeling Network for Explainable Video Event Recognition.
ACM Trans. Multim. Comput. Commun. Appl., November, 2023

Self-Regulated Learning for Egocentric Video Activity Anticipation.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Semantic-Aware Dynamic Feature Selection and Fusion for Object Detection in UAV Videos.
Proceedings of the ACM Multimedia Asia 2023, 2023

2020
Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Towards More Explainability: Concept Knowledge Mining Network for Event Recognition.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

