Zhihao Xu

Affiliations:
  • Renmin University of China, Beijing, China


According to our database, Zhihao Xu authored at least 5 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
Internal Value Alignment in Large Language Models through Controlled Value Vector Activation.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Evaluating Concept-based Explanations of Language Models: A Study on Faithfulness and Readability.
CoRR, 2024

Uncovering Safety Risks in Open-source LLMs through Concept Activation Vector.
CoRR, 2024

Uncovering Safety Risks of Large Language Models through Concept Activation Vector.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Evaluating Readability and Faithfulness of Concept-based Explanations.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

