Xinwei Wu

Orcid: 0009-0001-2167-128X

Affiliations:
  • Tianjin University, College of Intelligence and Computing, China
  • Jilin University, College of Computer Science and Technology, Changchun, China (former)


According to our database, Xinwei Wu authored at least 19 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2026
Incentivizing Parametric Knowledge via Reinforcement Learning with Verifiable Rewards for Cross-Cultural Entity Translation.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

M²PO: Multi-Perspective Multi-Pair Preference Optimization for Machine Translation.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

From Insight to Action: A Novel Framework for Interpretability-Guided Data Selection in Large Language Models.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

Finding the Translation Switch: Discovering and Exploiting the Task-Initiation Features in LLMs.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Challenging Multilingual LLMs: A New Taxonomy and Benchmark for Unraveling Hallucination in Translation.
CoRR, October, 2025

TaP: A Taxonomy-Guided Framework for Automated and Scalable Preference Data Generation.
CoRR, June, 2025

DiplomacyAgent: Do LLMs Balance Interests and Ethical Principles in International Events?
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Towards a Unified Paradigm of Concept Editing in Large Language Models.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

CONTRANS: Weak-to-Strong Alignment Engineering via Concept Transplantation.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

2024
Large Language Model Safety: A Holistic Survey.
CoRR, 2024

Exploring Multilingual Human Value Concepts in Large Language Models: Is Value Alignment Consistent, Transferable and Controllable across Languages?
CoRR, 2024

IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware Neurons.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Exploring Multilingual Concepts of Human Values in Large Language Models: Is Value Alignment Consistent, Transferable and Controllable across Languages?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Mitigating Privacy Seesaw in Large Language Models: Augmented Privacy Neuron Editing via Activation Patching.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Large Language Model Alignment: A Survey.
CoRR, 2023

DEPN: Detecting and Editing Privacy Neurons in Pretrained Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
FewFedWeight: Few-shot Federated Learning Framework across Multiple NLP Tasks.
CoRR, 2022

Swing Distillation: A Privacy-Preserving Knowledge Distillation Framework.
CoRR, 2022

2021
Unbiased Learning to Rank in Feeds Recommendation.
Proceedings of WSDM '21, 2021

