Zhenhong Zhou
Orcid: 0000-0003-4065-1740
According to our database1,
Zhenhong Zhou authored at least 49 papers
between 2021 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
CoRR, May, 2026
A Survey of Large Audio Language Models: Generalization, Trustworthiness, and Outlook.
CoRR, May, 2026
Explaining and Breaking the Safety-Helpfulness Ceiling via Preference Dimensional Expansion.
CoRR, May, 2026
CoRR, April, 2026
CoRR, March, 2026
LARFT: Closing the Cognition-Action Gap for Length Instruction Following in Large Language Models.
CoRR, March, 2026
MCPShield: A Security Cognition Layer for Adaptive Trust Calibration in Model Context Protocol Agents.
CoRR, February, 2026
Omni-Safety under Cross-Modality Conflict: Vulnerabilities, Dynamics Mechanisms and Efficient Alignment.
CoRR, February, 2026
RECUR: Resource Exhaustion Attack via Recursive-Entropy Guided Counterfactual Utilization and Reflection.
CoRR, February, 2026
From Helpfulness to Toxic Proactivity: Diagnosing Behavioral Misalignment in LLM Agents.
CoRR, February, 2026
CoRR, January, 2026
SEE: Signal Embedding Energy for Quantifying Noise Interference in Large Audio Language Models.
CoRR, January, 2026
ChronosAudio: A Comprehensive Long-Audio Benchmark for Evaluating Audio-Large Language Models.
CoRR, January, 2026
CSSBench: Evaluating the Safety of Lightweight LLMs against Chinese-Specific Adversarial Patterns.
CoRR, January, 2026
Hidden in the Noise: Unveiling Backdoors in Audio LLMs Alignment Through Latent Acoustic Pattern Triggers.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
LeechHijack: Covert Computational Resource Exploitation in Intelligent Agent Systems.
CoRR, December, 2025
Backdoor Collapse: Eliminating Unknown Threats via Known Backdoor Aggregation in Language Models.
CoRR, October, 2025
DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models.
CoRR, September, 2025
CoRR, September, 2025
Jailbreaking Large Language Diffusion Models: Revealing Hidden Safety Flaws in Diffusion-Based Text Generation.
CoRR, July, 2025
CoRR, July, 2025
Goal-Aware Identification and Rectification of Misinformation in Multi-Agent Systems.
CoRR, June, 2025
PD<sup>3</sup>F: A Pluggable and Dynamic DoS-Defense Framework Against Resource Consumption Attacks Targeting Large Language Models.
CoRR, May, 2025
A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment.
CoRR, April, 2025
CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models.
CoRR, February, 2025
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025
Proceedings of the Forty-second International Conference on Machine Learning, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
DemonAgent: Dynamically Encrypted Multi-Backdoor Implantation Attack on LLM-based Agent.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
PD³F: A Pluggable and Dynamic DoS-Defense Framework against resource consumption attacks targeting Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
Crabs: Consuming Resource via Auto-generation for LLM-DoS Attack under Black-box Settings.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
2024
Future Gener. Comput. Syst., 2024
Crabs: Consuming Resrouce via Auto-generation for LLM-DoS Attack under Black-box Settings.
CoRR, 2024
Alignment-Enhanced Decoding:Defending via Token-Level Adaptive Refining of Probability Distributions.
CoRR, 2024
Speak Out of Turn: Safety Vulnerability of Large Language Models in Multi-turn Dialogue.
CoRR, 2024
How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden States.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024
Alignment-Enhanced Decoding: Defending Jailbreaks via Token-Level Adaptive Refining of Probability Distributions.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2021
Three-Dimensional Reconstruction of Huizhou Landscape Combined with Multimedia Technology and Geographic Information System.
Mob. Inf. Syst., 2021