Yi Wang
ORCID: 0000-0002-1728-9563
Affiliations:
- Shanghai AI Laboratory, China
According to our database, Yi Wang authored at least 72 papers between 2012 and 2025.
Bibliography
2025
CoRR, July, 2025
InternSpatial: A Comprehensive Dataset for Spatial Reasoning in Vision-Language Models.
CoRR, June, 2025
CoRR, June, 2025
Int. J. Comput. Vis., May, 2025
TraveLLaMA: Facilitating Multi-modal Large Language Models to Understand Urban Scenes and Provide Travel Assistance.
CoRR, April, 2025
CoRR, April, 2025
MSFM-UNET: enhancing medical image segmentation with multi-scale and multi-view frequency fusion.
Pattern Anal. Appl., March, 2025
CoRR, March, 2025
CoRR, January, 2025
CoRR, January, 2025
CoRR, January, 2025
Scale-View Co-Awareness Framework for Simultaneous Segmentation of Pancreas and Tumors.
IEEE Trans. Instrum. Meas., 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2024
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling.
CoRR, 2024
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text.
CoRR, 2024
CoRR, 2024
From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities.
CoRR, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
OSFENet: Object Spatiotemporal Feature Enhanced Network for Surgical Phase Recognition.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
Int. J. Comput. Vis., October, 2023
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation.
CoRR, 2023
InternGPT: Solving Vision-Centric Tasks by Interacting with Chatbots Beyond Language.
CoRR, 2023
TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
NeRFLiX: High-Quality Neural View Synthesis by Learning a Degradation-Driven Inter-viewpoint MiXer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Learning Open-Vocabulary Semantic Segmentation Models From Natural Language Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2023
2022
IEEE Trans. Pattern Anal. Mach. Intell., 2022
InternVideo: General Video Foundation Models via Generative and Discriminative Learning.
CoRR, 2022
CoRR, 2022
CoRR, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
2021
Int. J. Pattern Recognit. Artif. Intell., 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2020
Automatic liver tumour segmentation in CT combining FCN and NMF-based deformable model.
Comput. Methods Biomech. Biomed. Eng. Imaging Vis., 2020
Proceedings of the Computer Vision - ECCV 2020, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
2019
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
2018
Wavelet-based extended morphological profile and deep autoencoder for hyperspectral image classification.
Int. J. Wavelets Multiresolution Inf. Process., 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
2017
Automatic Liver Lesion Segmentation in CT Combining Fully Convolutional Networks and Non-negative Matrix Factorization.
Proceedings of the Imaging for Patient-Customized Simulations and Systems for Point-of-Care Ultrasound, 2017
A novel variational method for liver segmentation based on statistical shape model prior and enforced local statistical feature.
Proceedings of the 14th IEEE International Symposium on Biomedical Imaging, 2017
2016
Face recognition via collaborative representation based multiple one-dimensional embedding.
Int. J. Wavelets Multiresolution Inf. Process., 2016
2014
An effective and robust method for modeling multi-furcation liver vessel by using Gap Border Pairing.
Comput. Medical Imaging Graph., 2014
2013
Automatic Multi-Scale Segmentation of Intrahepatic Vessel in CT Images for Liver Surgery Planning.
Int. J. Pattern Recognit. Artif. Intell., 2013
2012
Proceedings of the 38th Annual Conference on IEEE Industrial Electronics Society, 2012