Wei Jie Yeo

According to our database1, Wei Jie Yeo authored at least 9 papers between 2024 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Mitigating Jailbreaks with Intent-Aware LLMs.
CoRR, August, 2025

A comprehensive review on financial explainable AI.
Artif. Intell. Rev., June, 2025

Understanding Refusal in Language Models with Sparse Autoencoders.
CoRR, May, 2025

Debiasing CLIP: Interpreting and Correcting Bias in Attention Heads.
CoRR, May, 2025

SusGen-GPT: A Data-Centric LLM for Financial NLP and Sustainability Report Generation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

2024
Towards Faithful Natural Language Explanations: A Study Using Activation Patching in Large Language Models.
CoRR, 2024

How Interpretable are Reasoning Explanations from Prompting Large Language Models?
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Self-training Large Language Models through Knowledge Detection.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Plausible Extractive Rationalization through Semi-Supervised Entailment Signal.
Proceedings of the Findings of the Association for Computational Linguistics, 2024


  Loading...