Xinpeng Wang

Affiliations:
  • LMU Munich, Germany


According to our database, Xinpeng Wang authored at least 13 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Refusal Direction is Universal Across Safety-Aligned Languages.
CoRR, May, 2025

Think Before Refusal: Triggering Safety Reflection in LLMs to Mitigate False Refusal Behavior.
CoRR, March, 2025

Surgical, Cheap, and Flexible: Mitigating False Refusal in Language Models via Single Vector Ablation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Algorithmic Fidelity of Large Language Models in Generating Synthetic German Public Opinions: A Case Study.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Understanding When Tree of Thoughts Succeeds: Larger Models Excel in Generation, Not Discrimination.
CoRR, 2024

FinerCut: Finer-grained Interpretable Layer Pruning for Large Language Models.
CoRR, 2024

Look at the Text: Instruction-Tuned Language Models are More Robust Multiple Choice Selectors than You Think.
CoRR, 2024

The Potential and Challenges of Evaluating Attitudes, Opinions, and Values in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

"Seeing the Big through the Small": Can LLMs Approximate Human Judgment Distributions on NLI from a Few Explanations?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

"My Answer is C": First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2024, 2024

2023
ACTOR: Active Learning with Annotator-specific Classification Heads to Embrace Human Label Variation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2021
SceneFormer: Indoor Scene Generation with Transformers.
Proceedings of the International Conference on 3D Vision, 2021
