Xinhang Song

Orcid: 0000-0002-0895-1076

According to our database, Xinhang Song authored at least 43 papers between 2013 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
Multi-Object Navigation Using Potential Target Position Policy Function.
IEEE Trans. Image Process., 2023

Composite Object Relation Modeling for Few-Shot Scene Recognition.
IEEE Trans. Image Process., 2023

Dataset Bias in Few-Shot Image Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Phase Discriminated Multi-Policy for Visual Room Rearrangement.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2023

CaMP: Causal Multi-policy Planning for Interactive Navigation in Multi-room Scenes.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Generating Explanations for Embodied Action Decision from Visual Observation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Long-Short Term Policy for Visual Object Navigation.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2023

Layout-based Causal Inference for Object Navigation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Bi-Level Meta-Learning for Few-Shot Domain Generalization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Amorphous Region Context Modeling for Scene Recognition.
IEEE Trans. Multim., 2022

Generative Meta-Adversarial Network for Unseen Object Navigation.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
See More for Scene: Pairwise Consistency Learning for Scene Classification.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

ION: Instance-level Object Navigation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Hierarchical Object-to-Zone Graph for Object Navigation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Learning Scene Attribute for Scene Recognition.
IEEE Trans. Multim., 2020

Image Representations With Spatial Object-to-Object Relations for RGB-D Scene Recognition.
IEEE Trans. Image Process., 2020

Spatio-Temporal Memory Attention for Image Captioning.
IEEE Trans. Image Process., 2020

Scene Recognition With Prototype-Agnostic Scene Layout.
IEEE Trans. Image Process., 2020

Image captioning via semantic element embedding.
Neurocomputing, 2020

Generalized Zero-shot Learning with Multi-source Semantic Embeddings for Scene Recognition.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

2019
Deep Patch Representations with Shared Codebook for Scene Classification.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Learning Effective RGB-D Representations for Scene Recognition.
IEEE Trans. Image Process., 2019

Aberrance-aware Gradient-sensitive Attentions for Scene Recognition with RGB-D Videos.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

MUCH: Mutual Coupling Enhancement of Scene Recognition and Dense Captioning.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

A Real-Time Scene Recognition System Based on RGB-D Video Streams.
Proceedings of the International Conference on Multimodal Interaction, 2019

2018
Focal Loss for Region Proposal Network.
Proceedings of the Pattern Recognition and Computer Vision - First Chinese Conference, 2018

2017
Multi-Scale Multi-Feature Context Modeling for Scene Recognition in the Semantic Manifold.
IEEE Trans. Image Process., 2017

RGB-D Scene Recognition with Object-to-Object Relation.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Combining Models from Multiple Sources for RGB-D Scene Recognition.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Keyword-driven image captioning via Context-dependent Bilateral LSTM.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Depth CNNs for RGB-D Scene Recognition: Learning from Scratch Better than Transferring from RGB-CNNs.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Category co-occurrence modeling for large scale scene recognition.
Pattern Recognit., 2016

Image Captioning with both Object and Scene Information.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Joint Learning of CNN and LSTM for Image Captioning.
Proceedings of the Working Notes of CLEF 2016, 2016

2015
Geolocalized Modeling for Dish Recognition.
IEEE Trans. Multim., 2015

Polysemious visual representation based on feature aggregation for large scale image applications.
Multim. Tools Appl., 2015

Rich Image Description Based on Regions.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Joint multi-feature spatial context for scene recognition in the semantic manifold.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Relative image similarity learning with contextual information for Internet cross-media retrieval.
Multim. Syst., 2014

Multipath Convolutional-Recursive Neural Networks for Object Recognition.
Proceedings of the Intelligent Information Processing VII, 2014

Semantic Features for Food Image Recognition with Geo-Constraints.
Proceedings of the 2014 IEEE International Conference on Data Mining Workshops, 2014

2013
Cross Concept Local Fisher Discriminant Analysis for Image Classification.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

MIAR ICT Participation at Robot Vision 2013.
Proceedings of the Working Notes for CLEF 2013 Conference, 2013

