Sirui Han
ORCID: 0000-0001-7303-0671
According to our database, Sirui Han authored at least 73 papers between 2023 and 2026.
Bibliography
2026
When Text Hijacks Vision: Benchmarking and Mitigating Text Overlay-Induced Hallucination in Vision Language Models.
CoRR, April, 2026
CoRR, April, 2026
Bit-by-Bit: Progressive QAT Strategy with Outlier Channel Splitting for Stable Low-Bit LLMs.
CoRR, April, 2026
QaRL: Rollout-Aligned Quantization-Aware RL for Fast and Stable Training under Training-Inference Mismatch.
CoRR, April, 2026
Not Just the Destination, But the Journey: Reasoning Traces Causally Shape Generalization Behaviors.
CoRR, March, 2026
LABSHIELD: A Multimodal Benchmark for Safety-Critical Reasoning and Planning in Scientific Laboratories.
CoRR, March, 2026
DC-W2S: Dual-Consensus Weak-to-Strong Training for Reliable Process Reward Modeling in Biological Reasoning.
CoRR, March, 2026
Unlocking Data Value in Finance: A Study on Distillation and Difficulty-Aware Training.
CoRR, March, 2026
TwinRL-VLA: Digital Twin-Driven Reinforcement Learning for Real-World Robotic Manipulation.
CoRR, February, 2026
CoRR, February, 2026
Learning While Staying Curious: Entropy-Preserving Supervised Fine-Tuning via Adaptive Self-Distillation for Large Reasoning Models.
CoRR, February, 2026
Pushing the Boundaries of Natural Reasoning: Interleaved Bonus from Formal-Logic Verification.
CoRR, January, 2026
Glance-or-Gaze: Incentivizing LMMs to Adaptively Focus Search via Reinforcement Learning.
CoRR, January, 2026
Reinforcement Learning of Large Language Models for Interpretable Credit Card Fraud Detection.
CoRR, January, 2026
AM³Safety: Towards Data Efficient Alignment of Multi-modal Multi-turn Safety for MLLMs.
CoRR, January, 2026
CoRR, January, 2026
CoRR, January, 2026
Inf. Fusion, 2026
An image steganography algorithm using selective timestep embedding and diffusion model.
Expert Syst. Appl., 2026
CareerCraft: Supporting New Graduates on Job Hunting with LLM-Assisted Self-Construction of Career Profile.
Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems, 2026
Reimagining Legal Fact Verification with GenAI: Toward Effective Human-AI Collaboration.
Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems, 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
What, Whether and How? Unveiling Process Reward Models for Thinking with Images Reasoning.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
ManipDreamer3D: Synthesizing Plausible Robotic Manipulation Video with Occupancy-aware 3D Trajectory.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
MedInsightBench: Evaluating Medical Analytics Agents Through Multi-Step Insight Discovery in Multimodal Medical Data.
CoRR, December, 2025
InsightEval: An Expert-Curated Benchmark for Assessing Insight Discovery in LLM-Driven Data Agents.
CoRR, November, 2025
Perception, Understanding and Reasoning, A Multimodal Benchmark for Video Fake News Detection.
CoRR, October, 2025
CoRR, October, 2025
CoRR, September, 2025
CoRR, September, 2025
DanceEditor: Towards Iterative Editable Music-driven Dance Generation with Open-Vocabulary Descriptions.
CoRR, August, 2025
CoRR, July, 2025
IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering.
CoRR, June, 2025
CoRR, June, 2025
CoRR, June, 2025
BTC-LLM: Efficient Sub-1-Bit LLM Quantization via Learnable Transformation and Binary Codebook.
CoRR, June, 2025
Follow-Your-Motion: Video Motion Transfer via Efficient Spatial-Temporal Decoupled Finetuning.
CoRR, June, 2025
CoRR, May, 2025
CoRR, May, 2025
CoRR, May, 2025
CoRR, April, 2025
CoRR, April, 2025
Safe RLHF-V: Safe Reinforcement Learning from Human Feedback in Multimodal Large Language Models.
CoRR, March, 2025
CoRR, March, 2025
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
Consistent and Invariant Generalization Learning for Short-video Misinformation Detection.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
DanceEditor: Towards Iterative Editable Music-Driven Dance Generation with Open-Vocabulary Descriptions.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
Context Reasoner: Incentivizing Reasoning Capability for Contextualized Privacy and Safety Compliance via Reinforcement Learning.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
Towards Advanced Mathematical Reasoning for LLMs via First-Order Logic Theorem Proving.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
Scenario, Role, and Persona: A Scoping Review of Design Strategies for Socially Intelligent AI Agents.
Proceedings of the Extended Abstracts of the CHI Conference on Human Factors in Computing Systems, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Out-of-Distribution Detection via LLM-Guided Outlier Generation for Text-attributed Graph.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Boosting Policy and Process Reward Models with Monte Carlo Tree Search in Open-Domain QA.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
2023
Proceedings of the 18th International Conference on Intelligent Systems and Knowledge Engineering, 2023