Chaolei Tan

Orcid: 0009-0005-2864-0696

According to our database1, Chaolei Tan authored at least 16 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
TubeRMC: Tube-conditioned Reconstruction with Mutual Constraints for Weakly-supervised Spatio-Temporal Video Grounding.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Image-to-Video Transfer Learning based on Image-Language Foundation Models: A Comprehensive Survey.
CoRR, October, 2025

HLFormer: Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning.
CoRR, July, 2025

Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation.
CoRR, May, 2025

ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

SAUGE: Taming SAM for Uncertainty-Aligned Multi-Granularity Edge Detection.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Siamese Learning with Joint Alignment and Regression for Weakly-Supervised Video Paragraph Grounding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Ranking Distillation for Open-Ended Video Question Answering with Insufficient Labels.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Hierarchical Semantic Correspondence Networks for Video Paragraph Grounding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Collaborative Static and Dynamic Vision-Language Streams for Spatio-Temporal Video Grounding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
STVGFormer: Spatio-Temporal Video Grounding with Static-Dynamic Cross-Modal Understanding.
CoRR, 2022

Context Alignment Network for Video Moment Retrieval.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

Matching and Localizing: A Simple yet Effective Framework for Human-Centric Spatio-Temporal Video Grounding.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

2021
Augmented 2D-TAN: A Two-stage Approach for Human-centric Spatio-Temporal Video Grounding.
CoRR, 2021


  Loading...