Zhichen Dong

According to our database, Zhichen Dong authored at least 9 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
SafeWork-R1: Coevolving Safety and Intelligence under the AI-45° Law.
CoRR, July, 2025

Emergent Response Planning in LLM.
CoRR, February, 2025

2024
Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models.
CoRR, 2024

Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey.
CoRR, 2024

Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Towards Imitation Learning to Branch for MIP: A Hybrid Reinforcement Learning based Sample Augmentation Approach.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

L2P-MIP: Learning to Presolve for Mixed Integer Programming.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

