Xingjian He

Orcid: 0000-0001-5396-6253

According to our database1, Xingjian He authored at least 22 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models.
CoRR, 2024

Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with Human Intentions.
CoRR, 2024

2023
Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation.
CoRR, 2023

EAVL: Explicitly Align Vision and Language for Referring Image Segmentation.
CoRR, 2023

COSA: Concatenated Sample Pretrained Vision-Language Foundation Model.
CoRR, 2023

MMNet: Multi-Mask Network for Referring Image Segmentation.
CoRR, 2023

VLAB: Enhancing Video Language Pre-training by Feature Adapting and Blending.
CoRR, 2023

CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation.
CoRR, 2023

VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset.
CoRR, 2023

MAMO: Fine-Grained Vision-Language Representations Learning with Masked Multimodal Modeling.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

CSDNet: Contrastive Similarity Distillation Network for Multi-lingual Image-Text Retrieval.
Proceedings of the Image and Graphics - 12th International Conference, 2023

WL-MSR: Watch and Listen for Multimodal Subtitle Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
An Efficient Sampling-Based Attention Network for Semantic Segmentation.
IEEE Trans. Image Process., 2022

MAMO: Masked Multimodal Modeling for Fine-Grained Vision-Language Representation Learning.
CoRR, 2022

2021
Exploiting Spatial-Temporal Semantic Consistency for Video Scene Parsing.
CoRR, 2021

Global-Local Propagation Network for RGB-D Semantic Segmentation.
CoRR, 2021

Dynamic Warping Network for Semantic Video Segmentation.
Complex., 2021

Consistent-Separable Feature Representation for Semantic Segmentation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Non-Autoregressive Image Captioning with Counterfactuals-Critical Multi-Agent Learning.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

2019
Image fusion method based on simultaneous sparse representation with non-subsampled contourlet transform.
IET Comput. Vis., 2019

2017
Validation of the merged co-variation signal in interacting protein pairs by mirror-dendrogram.
Int. J. Data Min. Bioinform., 2017


  Loading...