Qingpei Guo

Orcid: 0000-0002-8638-6594

According to our database, Qingpei Guo authored at least 18 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
SNP-S3: Shared Network Pre-Training and Significant Semantic Strengthening for Various Video-Text Tasks.
IEEE Trans. Circuits Syst. Video Technol., April, 2024

M2-RAAP: A Multi-Modal Recipe for Advancing Adaptation-based Pre-training towards Effective and Efficient Zero-shot Video-text Retrieval.
CoRR, 2024

SNP-S3: Shared Network Pre-training and Significant Semantic Strengthening for Various Video-Text Tasks.
CoRR, 2024

M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining.
CoRR, 2024

Knowledge-enhanced Multi-perspective Video Representation Learning for Scene Recognition.
CoRR, 2024

SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment.
CoRR, 2024

2023
Text as Image: Learning Transferable Adapter for Multi-Label Classification.
CoRR, 2023

Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs.
CoRR, 2023

EVE: Efficient zero-shot text-based Video Editing with Depth Map Guidance and Temporal Consistency Constraints.
CoRR, 2023

Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Temporal Sentence Grounding in Streaming Videos.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Boundary-aware Backward-Compatible Representation via Adversarial Learning in Image Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CNVid-3.5M: Build, Filter, and Pre-Train the Large-Scale Public Chinese Video-Text Dataset.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
LPSNet: A Lightweight Solution for Fast Panoptic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Automatic Car Damage Assessment System: Reading and Understanding Videos as Professional Insurance Inspectors.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2017
Non-Frontal Facial Expression Recognition Using a Depth-Patch Based Deep Neural Network.
J. Comput. Sci. Technol., 2017

2015
The Implementation of Hadoop-based Crawler System and Graphlite-based PageRank-Calculation In Search Engine.
CoRR, 2015

