Guo Chen
Affiliations: Nanjing University, State Key Laboratory for Novel Software Technology, China
According to our database, Guo Chen authored at least 36 papers between 2022 and 2025.
Bibliography
2025
EgoExoBench: A Benchmark for First- and Third-person View Video Understanding in MLLMs.
CoRR, July, 2025
CoRR, July, 2025
Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric Vision.
CoRR, June, 2025
AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs.
CoRR, June, 2025
CoRR, April, 2025
CoRR, April, 2025
CoRR, March, 2025
Eagle 2: Building Post-Training Data Strategies from Scratch for Frontier Vision-Language Models.
CoRR, January, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
2024
Int. J. Comput. Vis., September, 2024
Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding.
CoRR, 2024
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
Comput. Vis. Image Underst., July, 2023
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks.
CoRR, 2023
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
2022
InternVideo: General Video Foundation Models via Generative and Discriminative Learning.
CoRR, 2022
Exploring State Change Capture of Heterogeneous Backbones @ Ego4D Hands and Objects Challenge 2022.
CoRR, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022