Xinwei Wu

ORCID: 0009-0001-2167-128X

Affiliations:
  • Tianjin University, College of Intelligence and Computing, China
  • Jilin University, College of Computer Science and Technology, Changchun, China (former)


According to our database, Xinwei Wu authored at least 12 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
TaP: A Taxonomy-Guided Framework for Automated and Scalable Preference Data Generation.
CoRR, June 2025

CONTRANS: Weak-to-Strong Alignment Engineering via Concept Transplantation.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

2024
Large Language Model Safety: A Holistic Survey.
CoRR, 2024

Exploring Multilingual Human Value Concepts in Large Language Models: Is Value Alignment Consistent, Transferable and Controllable across Languages?
CoRR, 2024

IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware Neurons.
Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Exploring Multilingual Concepts of Human Values in Large Language Models: Is Value Alignment Consistent, Transferable and Controllable across Languages?
Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Mitigating Privacy Seesaw in Large Language Models: Augmented Privacy Neuron Editing via Activation Patching.
Findings of the Association for Computational Linguistics: ACL 2024, 2024

2023
Large Language Model Alignment: A Survey.
CoRR, 2023

DEPN: Detecting and Editing Privacy Neurons in Pretrained Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
FewFedWeight: Few-shot Federated Learning Framework across Multiple NLP Tasks.
CoRR, 2022

Swing Distillation: A Privacy-Preserving Knowledge Distillation Framework.
CoRR, 2022

2021
Unbiased Learning to Rank in Feeds Recommendation.
Proceedings of the 14th ACM International Conference on Web Search and Data Mining (WSDM '21), 2021
