Wei Jie Yeo

According to our database1, Wei Jie Yeo authored at least 10 papers between 2024 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Beyond I'm Sorry, I Can't: Dissecting Large-Language-Model Refusal.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Mitigating Jailbreaks with Intent-Aware LLMs.
CoRR, August, 2025

A comprehensive review on financial explainable AI.
Artif. Intell. Rev., June, 2025

Debiasing CLIP: Interpreting and Correcting Bias in Attention Heads.
CoRR, May, 2025

SusGen-GPT: A Data-Centric LLM for Financial NLP and Sustainability Report Generation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Towards Faithful Natural Language Explanations: A Study Using Activation Patching in Large Language Models.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Understanding Refusal in Language Models with Sparse Autoencoders.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

2024
How Interpretable are Reasoning Explanations from Prompting Large Language Models?
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Self-training Large Language Models through Knowledge Detection.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Plausible Extractive Rationalization through Semi-Supervised Entailment Signal.
Proceedings of the Findings of the Association for Computational Linguistics, 2024


  Loading...