Zirui Song

Orcid: 0009-0003-5698-6315

According to our database1, Zirui Song authored at least 27 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
FineState-Bench: A Comprehensive Benchmark for Fine-Grained State Control in GUI Agents.
CoRR, August, 2025

Under the Shadow of Babel: How Language Shapes Reasoning in LLMs.
CoRR, June, 2025

Evaluating and mitigating bias in AI-based medical text generation.
Nat. Comput. Sci., May, 2025

SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models.
CoRR, May, 2025

ManipLVM-R1: Reinforcement Learning for Reasoning in Embodied Manipulation with Large Vision-Language Models.
CoRR, May, 2025

Divide-Fuse-Conquer: Eliciting "Aha Moments" in Multi-Scenario Games.
CoRR, May, 2025

Evaluate Bias without Manual Test Sets: A Concept Representation Perspective for LLMs.
CoRR, May, 2025

Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models.
CoRR, May, 2025

Motion Anything: Any to Motion Generation.
CoRR, March, 2025

PedDet: Adaptive Spectral Optimization for Multimodal Pedestrian Detection.
CoRR, February, 2025

Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework.
CoRR, February, 2025

Injecting Domain-Specific Knowledge into Large Language Models: A Comprehensive Survey.
CoRR, February, 2025

Hazards in Daily Life? Enabling Robots to Proactively Detect and Resolve Anomalies.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Word Form Matters: LLMs' Semantic Reconstruction under Typoglycemia.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Bin2Summary: Beyond Function Name Prediction in Stripped Binaries with Functionality-Specific Code Embeddings.
Proc. ACM Softw. Eng., 2024

Foundations and Recent Trends in Multimodal Mobile Agents: A Survey.
CoRR, 2024

From a Tiny Slip to a Giant Leap: An LLM-Based Simulation for Fake News Evolution.
CoRR, 2024

MMAC-Copilot: Multi-modal Agent Collaboration Operating System Copilot.
CoRR, 2024

A Dynamical System Approach to Robotic Ultrasound Imaging: Towards Intrinsically Stable Robotic Sonography.
Proceedings of the IEEE International Conference on Real-time Computing and Robotics, 2024

TypeFSL: Type Prediction from Binaries via Inter-procedural Data-flow Analysis and Few-shot Learning.
Proceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering, 2024

Efficient Reinforcement Learning via Decoupling Exploration and Utilization.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024

MedINST: Meta Dataset of Biomedical Instructions.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

BenchLMM: Benchmarking Cross-Style Visual Capability of Large Multimodal Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

LiftFuzz: Validating Binary Lifters through Context-aware Fuzzing with GPT.
Proceedings of the 2024 on ACM SIGSAC Conference on Computer and Communications Security, 2024

2023
Optimistic and Pessimistic Actor in RL: Decoupling Exploration and Utilization.
CoRR, 2023

BenchLMM: Benchmarking Cross-style Visual Capability of Large Multimodal Models.
CoRR, 2023

2022
LiCA: A Fine-grained and Path-sensitive Linux Capability Analysis Framework.
Proceedings of the 25th International Symposium on Research in Attacks, 2022


  Loading...