Yizhuo Li

Orcid: 0000-0001-8463-979X

Affiliations:
  • Shanghai Jiao Tong University, China


According to our database1, Yizhuo Li authored at least 18 papers between 2020 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
HAKE: A Knowledge Engine Foundation for Human Activity Understanding.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

MVBench: A Comprehensive Multi-modal Video Understanding Benchmark.
CoRR, 2023

Harvest Video Foundation Models via Efficient Post-Pretraining.
CoRR, 2023

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation.
CoRR, 2023

VideoChat: Chat-Centric Video Understanding.
CoRR, 2023

Unmasked Teacher: Towards Training-Efficient Video Foundation Models.
CoRR, 2023

UniFormerV2: Unlocking the Potential of Image ViTs for Video Understanding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Unmasked Teacher: Towards Training-Efficient Video Foundation Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
InternVideo: General Video Foundation Models via Generative and Discriminative Learning.
CoRR, 2022

UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer.
CoRR, 2022

InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges.
CoRR, 2022

Modeling Human Memory in Multi-Object Tracking with Transformers.
Proceedings of the IEEE International Conference on Acoustics, 2022

Unsupervised Representation for Semantic Segmentation by Implicit Cycle-Attention Contrastive Learning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Test-Time Personalization with a Transformer for Human Pose Estimation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

PGT: A Progressive Method for Training Models on Long Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

TDAF: Top-Down Attention Framework for Vision Tasks.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
HOI Analysis: Integrating and Decomposing Human-Object Interaction.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

TubeTK: Adopting Tubes to Track Multi-Object in a One-Step Training Model.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020


  Loading...