Zirui Song

Orcid: 0009-0003-5698-6315

According to our database, Zirui Song authored at least 43 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
SPGA: graph representation learning and attention fusion for enhanced disease-associated snoRNA prediction.
BMC Bioinform., December, 2026

TextAlign: Preference Alignment for Text Rendering with Hierarchical Rewards.
CoRR, May, 2026

The Cylindrical Representation Hypothesis for Language Model Steering.
CoRR, May, 2026

FineState-Bench: Benchmarking State-Conditioned Grounding for Fine-grained GUI State Setting.
CoRR, April, 2026

ServImage: An Image Generation and Editing Benchmark from Real-world Commercial Imaging Services.
CoRR, April, 2026

Temporal Contrastive Decoding: A Training-Free Method for Large Audio-Language Models.
CoRR, April, 2026

FlashSign: Pose-Free Guidance for Efficient Sign Language Video Generation.
CoRR, March, 2026

ManipLVM-R1: Reinforcement Learning for Reasoning in Embodied Manipulation with Large Vision-Language Models.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
When Personalization Tricks Detectors: The Feature-Inversion Trap in Machine-Generated Text Detection.
CoRR, October, 2025

Beyond Survival: Evaluating LLMs in Social Deduction Games with Human-Aligned Strategies.
CoRR, October, 2025

Do LLMs "Feel"? Emotion Circuits Discovery and Control.
CoRR, October, 2025

DyFlow: Dynamic Workflow Framework for Agentic Reasoning.
CoRR, September, 2025

FineState-Bench: A Comprehensive Benchmark for Fine-Grained State Control in GUI Agents.
CoRR, August, 2025

Evaluating and mitigating bias in AI-based medical text generation.
Nat. Comput. Sci., May, 2025

SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models.
CoRR, May, 2025

ManipLVM-R1: Reinforcement Learning for Reasoning in Embodied Manipulation with Large Vision-Language Models.
CoRR, May, 2025

Divide-Fuse-Conquer: Eliciting "Aha Moments" in Multi-Scenario Games.
CoRR, May, 2025

Evaluate Bias without Manual Test Sets: A Concept Representation Perspective for LLMs.
CoRR, May, 2025

Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models.
CoRR, May, 2025

Motion Anything: Any to Motion Generation.
CoRR, March, 2025

Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework.
CoRR, February, 2025

Multitemporal Thick Cloud Removal via Temporal Smoothness in Image and Gradient Domains.
IEEE Trans. Geosci. Remote. Sens., 2025

A Physics-Informed Neural Network Aided Venturi-Microwave Co-Sensing Method for Three-Phase Metering.
Comput., 2025

DyFlow: Dynamic Workflow Framework for Agentic Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Hazards in Daily Life? Enabling Robots to Proactively Detect and Resolve Anomalies.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Under the Shadow of Babel: How Language Shapes Reasoning in LLMs.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Injecting Domain-Specific Knowledge into Large Language Models: A Comprehensive Survey.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

The Stepwise Deception: Simulating the Evolution from True News to Fake News with LLM Agents.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

PedDet: Adaptive Spectral Optimization for Multimodal Pedestrian Detection.
Proceedings of the ECAI 2025 - 28th European Conference on Artificial Intelligence, 25-30 October 2025, Bologna, Italy, 2025

Word Form Matters: LLMs' Semantic Reconstruction under Typoglycemia.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Bin2Summary: Beyond Function Name Prediction in Stripped Binaries with Functionality-Specific Code Embeddings.
Proc. ACM Softw. Eng., 2024

Foundations and Recent Trends in Multimodal Mobile Agents: A Survey.
CoRR, 2024

From a Tiny Slip to a Giant Leap: An LLM-Based Simulation for Fake News Evolution.
CoRR, 2024

MMAC-Copilot: Multi-modal Agent Collaboration Operating System Copilot.
CoRR, 2024

A Dynamical System Approach to Robotic Ultrasound Imaging: Towards Intrinsically Stable Robotic Sonography.
Proceedings of the IEEE International Conference on Real-time Computing and Robotics, 2024

TypeFSL: Type Prediction from Binaries via Inter-procedural Data-flow Analysis and Few-shot Learning.
Proceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering, 2024

Efficient Reinforcement Learning via Decoupling Exploration and Utilization.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024

MedINST: Meta Dataset of Biomedical Instructions.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

BenchLMM: Benchmarking Cross-Style Visual Capability of Large Multimodal Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

LiftFuzz: Validating Binary Lifters through Context-aware Fuzzing with GPT.
Proceedings of the 2024 on ACM SIGSAC Conference on Computer and Communications Security, 2024

2023
Optimistic and Pessimistic Actor in RL: Decoupling Exploration and Utilization.
CoRR, 2023

BenchLMM: Benchmarking Cross-style Visual Capability of Large Multimodal Models.
CoRR, 2023

2022
LiCA: A Fine-grained and Path-sensitive Linux Capability Analysis Framework.
Proceedings of the 25th International Symposium on Research in Attacks, 2022

