Shawn Im

According to our database, Shawn Im authored at least 13 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Cyclical Entropy Eruption: Entropy Dynamics in Agent Reinforcement Learning.
CoRR, May, 2026

How Do Transformers Learn to Associate Tokens: Gradient Leading Terms Bring Mechanistic Interpretability.
CoRR, January, 2026

How Well Can Preference Optimization Generalize Under Noisy Feedback?
Trans. Mach. Learn. Res., 2026

2025
How Well Can Preference Optimization Generalize Under Noisy Feedback?
CoRR, October, 2025

Visual Instruction Bottleneck Tuning.
CoRR, May, 2025

A Unified Understanding and Evaluation of Steering Methods.
CoRR, February, 2025

Towards Interpretability Without Sacrifice: Faithful Dense Layer Decomposition with Mixture of Decoders.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Visual Instruction Bottleneck Tuning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Position: Challenges and Future Directions of Data-Centric AI Alignment.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024
On the Generalization of Preference Learning with DPO.
CoRR, 2024

Understanding the Learning Dynamics of Alignment with Human Feedback.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Evaluating the Utility of Model Explanations for Model Development.
CoRR, 2023

