Shawn Im

According to our database1, Shawn Im authored at least 11 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
How Do Transformers Learn to Associate Tokens: Gradient Leading Terms Bring Mechanistic Interpretability.
CoRR, January, 2026

How Well Can Preference Optimization Generalize Under Noisy Feedback?
Trans. Mach. Learn. Res., 2026

2025
How Well Can Preference Optimization Generalize Under Noisy Feedback?
CoRR, October, 2025

Towards Interpretability Without Sacrifice: Faithful Dense Layer Decomposition with Mixture of Decoders.
CoRR, May, 2025

Visual Instruction Bottleneck Tuning.
CoRR, May, 2025

A Unified Understanding and Evaluation of Steering Methods.
CoRR, February, 2025

Position: Challenges and Future Directions of Data-Centric AI Alignment.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024
On the Generalization of Preference Learning with DPO.
CoRR, 2024

Understanding the Learning Dynamics of Alignment with Human Feedback.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Evaluating the Utility of Model Explanations for Model Development.
CoRR, 2023


  Loading...