Ronghui Mu

Orcid: 0000-0001-6150-4948

According to our database1, Ronghui Mu authored at least 20 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Safeguarding large language models: a survey.
Artif. Intell. Rev., December, 2025

POT: Inducing Overthinking in LLMs via Black-Box Iterative Optimization.
CoRR, August, 2025

ThermoRL:Structure-Aware Reinforcement Learning for Protein Mutation Design to Enhance Thermostability.
CoRR, July, 2025

Principal Eigenvalue Regularization for Improved Worst-Class Certified Robustness of Smoothed Classifiers.
CoRR, March, 2025

Invariant Correlation of Representation With Label.
IEEE Trans. Inf. Forensics Secur., 2025

Safety of Embodied Navigation: A Survey.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Enhancing Robust Fairness via Confusional Spectral Regularization.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
A survey of safety and trustworthiness of large language models through the lens of verification and validation.
Artif. Intell. Rev., July, 2024

Nrat: towards adversarial training with inherent label noise.
Mach. Learn., June, 2024

3DVerifier: efficient robustness verification for 3D point cloud models.
Mach. Learn., April, 2024

Enhancing robustness in video recognition models: Sparse adversarial attacks and beyond.
Neural Networks, 2024

Building Guardrails for Large Language Models.
CoRR, 2024

PRASS: Probabilistic Risk-averse Robust Learning with Stochastic Search.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Position: Building Guardrails for Large Language Models Requires Systematic Design.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

DeepGRE: Global Robustness Evaluation of Deep Neural Networks.
Proceedings of the IEEE International Conference on Acoustics, 2024

Towards Fairness-Aware Adversarial Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Reward Certification for Policy Smoothed Reinforcement Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Randomized Adversarial Training via Taylor Expansion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Certified Policy Smoothing for Cooperative Multi-Agent Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2021
Sparse Adversarial Video Attacks with Spatial Transformations.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021


  Loading...