Chaolei Tan

Orcid: 0009-0005-2864-0696

According to our database¹, Chaolei Tan authored at least 16 papers between 2021 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

TubeRMC: Tube-conditioned Reconstruction with Mutual Constraints for Weakly-supervised Spatio-Temporal Video Grounding.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Image-to-Video Transfer Learning based on Image-Language Foundation Models: A Comprehensive Survey.

[BibT_eX]

[DOI]

CoRR, October, 2025

HLFormer: Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning.

[BibT_eX]

[DOI]

CoRR, July, 2025

Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation.

[BibT_eX]

[DOI]

CoRR, May, 2025

ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

SAUGE: Taming SAM for Uncertainty-Aligned Multi-Granularity Edge Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Siamese Learning with Joint Alignment and Regression for Weakly-Supervised Video Paragraph Grounding.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Ranking Distillation for Open-Ended Video Question Answering with Insufficient Labels.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Hierarchical Semantic Correspondence Networks for Video Paragraph Grounding.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Collaborative Static and Dynamic Vision-Language Streams for Spatio-Temporal Video Grounding.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

STVGFormer: Spatio-Temporal Video Grounding with Static-Dynamic Cross-Modal Understanding.

[BibT_eX]

[DOI]

CoRR, 2022

Context Alignment Network for Video Moment Retrieval.

[BibT_eX]

[DOI]

Chaolei Tan

Jian-Fang Hu

Wei-Shi Zheng

Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

Matching and Localizing: A Simple yet Effective Framework for Human-Centric Spatio-Temporal Video Grounding.

[BibT_eX]

[DOI]

Chaolei Tan

Jian-Fang Hu

Wei-Shi Zheng

Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

2021

Augmented 2D-TAN: A Two-stage Approach for Human-centric Spatio-Temporal Video Grounding.

[BibT_eX]

[DOI]

CoRR, 2021

Chaolei Tan

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...