Yujun Zhou

Orcid: 0000-0003-1376-5187

Affiliations:
  • King Abdullah University of Science and Technology, Thuwal, Saudi Arabia


According to our database1, Yujun Zhou authored at least 17 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Position: General Alignment Has Hit a Ceiling; Edge Alignment Must Be Taken Seriously.
CoRR, February, 2026

Capability-Oriented Training Induced Alignment Risk.
CoRR, February, 2026

2025
Exploring Multi-Temperature Strategies for Token- and Rollout-Level Control in RLVR.
CoRR, October, 2025

Beyond Single-Value Metrics: Evaluating and Enhancing LLM Unlearning with Cognitive Diagnosis.
CoRR, February, 2025

Artificial Intelligence in Spectroscopy: Advancing Chemistry from Prediction to Generation and Beyond.
CoRR, February, 2025

Position: We Need An Adaptive Interpretation of Helpful, Honest, and Harmless Principles.
CoRR, February, 2025

Dissecting Logical Reasoning in LLMs: A Fine-Grained Evaluation and Supervision Study.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

2024
Social Science Meets LLMs: How Reliable Are Large Language Models in Social Simulations?
CoRR, 2024

LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs.
CoRR, 2024

Defending Jailbreak Prompts via In-Context Adversarial Game.
CoRR, 2024

Can LLMs Solve Molecule Puzzles? A Multimodal Benchmark for Molecular Structure Elucidation.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Attack-free Evaluating and Enhancing Adversarial Robustness on Categorical Data.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Defending Jailbreak Prompts via In-Context Adversarial Game.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

SceMQA: A Scientific College Entrance Level Multimodal Question Answering Benchmark.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2024

2023
Towards Efficient and Domain-Agnostic Evasion Attack with High-Dimensional Categorical Inputs.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Towards Understanding the Robustness Against Evasion Attack on Categorical Data.
Proceedings of the Tenth International Conference on Learning Representations, 2022

AdvCat: Domain-Agnostic Robustness Assessment for Cybersecurity-Critical Applications with Categorical Inputs.
Proceedings of the IEEE International Conference on Big Data, 2022


  Loading...