Yinan He

According to our database1, Yinan He authored at least 27 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding.
CoRR, 2024

VideoMamba: State Space Model for Efficient Video Understanding.
CoRR, 2024

From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities.
CoRR, 2024

2023
VBench: Comprehensive Benchmark Suite for Video Generative Models.
CoRR, 2023

MVBench: A Comprehensive Multi-modal Video Understanding Benchmark.
CoRR, 2023

Harvest Video Foundation Models via Efficient Post-Pretraining.
CoRR, 2023

LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models.
CoRR, 2023

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation.
CoRR, 2023

VideoChat: Chat-Centric Video Understanding.
CoRR, 2023

InternGPT: Solving Vision-Centric Tasks by Interacting with Chatbots Beyond Language.
CoRR, 2023

Unmasked Teacher: Towards Training-Efficient Video Foundation Models.
CoRR, 2023

UniFormerV2: Unlocking the Potential of Image ViTs for Video Understanding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Unmasked Teacher: Towards Training-Efficient Video Foundation Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
InternVideo: General Video Foundation Models via Generative and Discriminative Learning.
CoRR, 2022

Exploring adaptation of VideoMAE for Audio-Visual Diarization & Social @ Ego4d Looking at me Challenge.
CoRR, 2022

UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer.
CoRR, 2022

InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges.
CoRR, 2022

X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation.
CoRR, 2022

TreMo: Continuous Vital Sign Monitoring Based on Subtle Intrinsic Tremors with COTS Mobile Devices.
Proceedings of the IEEE International Conference on Communications, 2022

X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
ForgeryNet - Face Forgery Analysis Challenge 2021: Methods and Results.
CoRR, 2021

INTERN: A New Learning Paradigm Towards General Vision.
CoRR, 2021

ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2019
Inter-frame Relationship Graph Based Near-Duplicate Video Clip Detection Method.
Proceedings of the Image and Graphics Technologies and Applications, 2019

2017
Highly Portable, Sensor-Based System for Human Fall Monitoring.
Sensors, 2017

2009
Height Servo System for Straw-Checkerboard Sand Barriers Paving Robot.
Proceedings of the 2009 Second International Symposium on Computational Intelligence and Design, 2009


  Loading...