Penglei Sun

Orcid: 0009-0005-2290-0944

According to our database, Penglei Sun authored at least 19 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
SpatialGrammar: A Domain-Specific Language for LLM-Based 3D Indoor Scene Generation.
CoRR, April, 2026

Rule-VLN: Bridging Perception and Compliance via Semantic Reasoning and Geometric Rectification.
CoRR, April, 2026

WaterVideoQA: ASV-Centric Perception and Rule-Compliant Reasoning via Multi-Modal Agents.
CoRR, February, 2026

Janus-Q: End-to-End Event-Driven Trading via Hierarchical-Gated Reward Modeling.
CoRR, February, 2026

OmniReview: A Large-scale Benchmark and LLM-enhanced Framework for Realistic Reviewer Recommendation.
CoRR, February, 2026

2025
FinKario: Event-Enhanced Automated Construction of Financial Knowledge Graph.
CoRR, August, 2025

MRD-RAG: Enhancing Medical Diagnosis with Multi-Round Retrieval-Augmented Generation.
CoRR, April, 2025

Learning 6-DoF Fine-Grained Grasp Detection Based on Part Affordance Grounding.
IEEE Trans. Autom. Sci. Eng., 2025

City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Evaluating Semantic Variation in Text-to-Image Synthesis: A Causal Perspective.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Perovskite-LLM: Knowledge-Enhanced Large Language Models for Perovskite Solar Cell Research.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

The Multi-Round Diagnostic RAG Framework for Emulating Clinical Reasoning.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2025

2024
Scene-Driven Multimodal Knowledge Graph Construction for Embodied AI.
IEEE Trans. Knowl. Data Eng., November, 2024

Multi-Modal Knowledge Graph Construction and Application: A Survey.
IEEE Trans. Knowl. Data Eng., February, 2024

3D Question Answering for City Scene Understanding.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Multi-task Domain Adaptation for Language Grounding with 3D Objects.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
A Contrastive Compositional Benchmark for Text-to-Image Synthesis: A Study with Unified Text-to-Image Fidelity Metrics.
CoRR, 2023

Learning 6-DoF Fine-grained Grasp Detection Based on Part Affordance Grounding.
CoRR, 2023

2022
Human-in-the-loop Robotic Grasping Using BERT Scene Representation.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

