Andrew Lee

Affiliations:
  • Harvard University, School of Engineering and Applied Sciences, Columbia, MA, USA
  • University of Michigan, Ann Arbor, MI, USA (PhD 2024)


According to our database1, Andrew Lee authored at least 20 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
The Geometry of Self-Verification in a Task-Specific Reasoning Model.
CoRR, April, 2025

Shared Global and Local Geometry of Language Model Embeddings.
CoRR, March, 2025

Eeyore: Realistic Depression Simulation via Supervised and Preference Optimization.
CoRR, March, 2025

ICLR: In-Context Learning of Representations.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Eeyore: Realistic Depression Simulation via Expert-in-the-Loop Supervised and Preference Optimization.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Interpretability and Controllability in the Age of Language Models
PhD thesis, 2024

Emergence of Hidden Capabilities: Exploring Learning Dynamics in Concept Space.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

A Comparative Multidimensional Analysis of Empathetic Systems.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Towards Algorithmic Fidelity: Mental Health Representation across Demographics in Synthetic vs. Human-generated Data.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Has It All Been Solved? Open NLP Research Questions Not Solved by Large Language Models.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Some things are more CRINGE than others: Preference Optimization with the Pairwise Cringe Loss.
CoRR, 2023

A PhD Student's Perspective on Research in NLP in the Era of Very Large Language Models.
CoRR, 2023

Empathy Identification Systems are not Accurately Accounting for Context.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Emergent Linear Representations in World Models of Self-Supervised Sequence Models.
Proceedings of the 6th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP, 2023

2022
Improving Chess Commentaries by Combining Language Models with Symbolic Reasoning Engines.
CoRR, 2022

Augmenting Task-Oriented Dialogue Systems with Relation Extraction.
CoRR, 2022

2021
Micromodels for Efficient, Explainable, and Reusable Systems: A Case Study on Mental Health.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

2019
Outlier Detection for Improved Data Quality and Diversity in Dialog Systems.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019


  Loading...