Yaoxian Song

Orcid: 0000-0002-8146-2236

According to our database, Yaoxian Song authored at least 18 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning.
CoRR, July, 2025

Learning 6-DoF Fine-Grained Grasp Detection Based on Part Affordance Grounding.
IEEE Trans. Autom. Sci. Eng., 2025

Evaluating Semantic Variation in Text-to-Image Synthesis: A Causal Perspective.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Scene-Driven Multimodal Knowledge Graph Construction for Embodied AI.
IEEE Trans. Knowl. Data Eng., November, 2024

3D Question Answering for City Scene Understanding.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Multi-task Domain Adaptation for Language Grounding with 3D Objects.
Proceedings of the Computer Vision - ECCV 2024, 2024

Flickr30K-CFQ: A Compact and Fragmented Query Dataset for Text-image Retrieval.
Proceedings of the Database Systems for Advanced Applications, 2024

Towards Coarse-grained Visual Language Navigation Task Planning Enhanced by Event Knowledge Graph.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

2023
Vision-force-fused curriculum learning for robotic contact-rich assembly tasks.
Frontiers Neurorobotics, June, 2023

Learning 6-DoF Fine-grained Grasp Detection Based on Part Affordance Grounding.
CoRR, 2023

Out-of-Distribution Generalization in Natural Language Processing: Past, Present, and Future.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Human-in-the-loop Robotic Grasping Using BERT Scene Representation.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021
Tactile-Visual Fusion Based Robotic Grasp Detection Method with a Reproducible Sensor.
Int. J. Comput. Intell. Syst., 2021

2020
Multimodal Aggregation Approach for Memory Vision-Voice Indoor Navigation with Meta-Learning.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

2019
Deep Robotic Prediction with hierarchical RGB-D Fusion.
CoRR, 2019

2.5D Image based Robotic Grasping.
CoRR, 2019

UG-Net for Robotic Grasping using Only Depth Image.
Proceedings of the 2019 IEEE International Conference on Real-time Computing and Robotics, 2019

2.5D Image-based Robotic Grasping.
Proceedings of the 2019 Australian & New Zealand Control Conference (ANZCC), 2019

