Sirui Han

Orcid: 0000-0001-7303-0671

According to our database1, Sirui Han authored at least 73 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
When Text Hijacks Vision: Benchmarking and Mitigating Text Overlay-Induced Hallucination in Vision Language Models.
CoRR, April, 2026

ContextLens: Modeling Imperfect Privacy and Safety Context for Legal Compliance.
CoRR, April, 2026

Bit-by-Bit: Progressive QAT Strategy with Outlier Channel Splitting for Stable Low-Bit LLMs.
CoRR, April, 2026

QaRL: Rollout-Aligned Quantization-Aware RL for Fast and Stable Training under Training-Inference Mismatch.
CoRR, April, 2026

Not Just the Destination, But the Journey: Reasoning Traces Causally Shape Generalization Behaviors.
CoRR, March, 2026

LABSHIELD: A Multimodal Benchmark for Safety-Critical Reasoning and Planning in Scientific Laboratories.
CoRR, March, 2026

DC-W2S: Dual-Consensus Weak-to-Strong Training for Reliable Process Reward Modeling in Biological Reasoning.
CoRR, March, 2026

Unlocking Data Value in Finance: A Study on Distillation and Difficulty-Aware Training.
CoRR, March, 2026

TwinRL-VLA: Digital Twin-Driven Reinforcement Learning for Real-World Robotic Manipulation.
CoRR, February, 2026

MemFly: On-the-Fly Memory Optimization via Information Bottleneck.
CoRR, February, 2026

Learning While Staying Curious: Entropy-Preserving Supervised Fine-Tuning via Adaptive Self-Distillation for Large Reasoning Models.
CoRR, February, 2026

Pushing the Boundaries of Natural Reasoning: Interleaved Bonus from Formal-Logic Verification.
CoRR, January, 2026

Glance-or-Gaze: Incentivizing LMMs to Adaptively Focus Search via Reinforcement Learning.
CoRR, January, 2026

LRAS: Advanced Legal Reasoning with Agentic Search.
CoRR, January, 2026

Reinforcement Learning of Large Language Models for Interpretable Credit Card Fraud Detection.
CoRR, January, 2026

AM<sup>3</sup>Safety: Towards Data Efficient Alignment of Multi-modal Multi-turn Safety for MLLMs.
CoRR, January, 2026

MMFCTUB: Multi-Modal Financial Credit Table Understanding Benchmark.
CoRR, January, 2026

Wow, wo, val! A Comprehensive Embodied World Model Evaluation Turing Test.
CoRR, January, 2026

CoCoGesture: Towards coherent co-speech 3D gesture generation in the wild.
Inf. Fusion, 2026

An image steganography algorithm using selective timestep embedding and diffusion model.
Expert Syst. Appl., 2026

CareerCraft: Supporting New Graduates on Job Hunting with LLM-Assisted Self-Construction of Career Profile.
Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems, 2026

Reimagining Legal Fact Verification with GenAI: Toward Effective Human-AI Collaboration.
Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems, 2026

Outlier Matters: Efficient Long-to-Short Reasoning via Outlier-Guided Model Merging.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

What, Whether and How? Unveiling Process Reward Models for Thinking with Images Reasoning.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Sub-MoE: Efficient Mixture-of-Expert LLMs Compression via Subspace Expert Merging.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

ManipDreamer3D: Synthesizing Plausible Robotic Manipulation Video with Occupancy-aware 3D Trajectory.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
MedInsightBench: Evaluating Medical Analytics Agents Through Multi-Step Insight Discovery in Multimodal Medical Data.
CoRR, December, 2025

InsightEval: An Expert-Curated Benchmark for Assessing Insight Discovery in LLM-Driven Data Agents.
CoRR, November, 2025

Perception, Understanding and Reasoning, A Multimodal Benchmark for Video Fake News Detection.
CoRR, October, 2025

SafeMT: Multi-turn Safety for Multimodal Language Models.
CoRR, October, 2025

WristWorld: Generating Wrist-Views via 4D World Models for Robotic Manipulation.
CoRR, October, 2025

Can World Models Benefit VLMs for World Dynamics?
CoRR, October, 2025

GSPR: Aligning LLM Safeguards as Generalizable Safety Policy Reasoners.
CoRR, September, 2025

WoW: Towards a World omniscient World model Through Embodied Interaction.
CoRR, September, 2025

DanceEditor: Towards Iterative Editable Music-driven Dance Generation with Open-Vocabulary Descriptions.
CoRR, August, 2025

EgoTwin: Dreaming Body and View in First Person.
CoRR, August, 2025

HKGAI-V1: Towards Regional Sovereign Large Language Model for Hong Kong.
CoRR, July, 2025

Semantic-guided Diverse Decoding for Large Language Model.
CoRR, June, 2025

IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering.
CoRR, June, 2025

Sub-MoE: Efficient Mixture-of-Expert LLMs Compression via Subspace Expert Merging.
CoRR, June, 2025

MinD: Unified Visual Imagination and Control via Hierarchical World Models.
CoRR, June, 2025

BTC-LLM: Efficient Sub-1-Bit LLM Quantization via Learnable Transformation and Binary Codebook.
CoRR, June, 2025

SafeLawBench: Towards Safe Alignment of Large Language Models.
CoRR, June, 2025

Follow-Your-Motion: Video Motion Transfer via Efficient Spatial-Temporal Decoupled Finetuning.
CoRR, June, 2025

InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback.
CoRR, May, 2025

The Mirage of Multimodality: Where Truth is Tested and Honesty Unravels.
CoRR, May, 2025

Mitigating Deceptive Alignment via Self-Monitoring.
CoRR, May, 2025

Generative RLHF-V: Learning Principles from Multi-modal Human Preference.
CoRR, May, 2025

J1: Exploring Simple Test-Time Scaling for LLM-as-a-Judge.
CoRR, May, 2025

Measuring Hong Kong Massive Multi-Task Language Understanding.
CoRR, May, 2025

DIDS: Domain Impact-aware Data Sampling for Large Language Model Training.
CoRR, April, 2025

Benchmarking Multi-National Value Alignment for Large Language Models.
CoRR, April, 2025

Safe RLHF-V: Safe Reinforcement Learning from Human Feedback in Multimodal Large Language Models.
CoRR, March, 2025

ThinkPatterns-21k: A Systematic Study on the Impact of Thinking Patterns in LLMs.
CoRR, March, 2025

Outlier-Aware Model Merging for Efficient Multitask Inference.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Consistent and Invariant Generalization Learning for Short-video Misinformation Detection.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

DanceEditor: Towards Iterative Editable Music-Driven Dance Generation with Open-Vocabulary Descriptions.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

AIRA: Activation-Informed Low-Rank Adaptation for Large Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Efficient Fine-Tuning of Large Models Via Nested Low-Rank Adaptation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

DIDS: Domain Impact-aware Data Sampling for Large Language Model Training.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Automate Strategy Finding with LLM in Quant Investment.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Context Reasoner: Incentivizing Reasoning Capability for Contextualized Privacy and Safety Compliance via Reinforcement Learning.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Towards Advanced Mathematical Reasoning for LLMs via First-Order Logic Theorem Proving.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Scenario, Role, and Persona: A Scoping Review of Design Strategies for Socially Intelligent AI Agents.
Proceedings of the Extended Abstracts of the CHI Conference on Human Factors in Computing Systems, 2025

LegalReasoner: Step-wised Verification-Correction for Legal Judgment Reasoning.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Out-of-Distribution Detection via LLM-Guided Outlier Generation for Text-attributed Graph.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Benchmarking Multi-National Value Alignment for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

PKU-SafeRLHF: Towards Multi-Level Safety Alignment for LLMs with Human Preference.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Boosting Policy and Process Reward Models with Monte Carlo Tree Search in Open-Domain QA.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

SafeLawBench: Towards Safe Alignment of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

PrivaCI-Bench: Evaluating Privacy with Contextual Integrity and Legal Compliance.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

FinMME: Benchmark Dataset for Financial Multi-Modal Reasoning Evaluation.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2023
Extracting Scenarios from Autonomous-Vehicle-Involved Crashes.
Proceedings of the 18th International Conference on Intelligent Systems and Knowledge Engineering, 2023


  Loading...