Hannah Cyberey
Affiliations: University of Virginia, Department of Computer Science, Charlottesville, VA, USA
According to our database, Hannah Cyberey authored at least 10 papers between 2020 and 2026.
Bibliography
2026
2025
Steering the CensorShip: Uncovering Representation Vectors for LLM "Thought" Control.
CoRR, April, 2025
Sensing and Steering Stereotypes: Extracting and Applying Gender Representation Vectors in LLMs.
CoRR, February, 2025
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
2024
The Mismeasure of Man and Models: Evaluating Allocational Harms in Large Language Models.
CoRR, 2024
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024
2022
Balanced Adversarial Training: Balancing Tradeoffs between Fickleness and Obstinacy in NLP Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
2020
Finding Friends and Flipping Frenemies: Automatic Paraphrase Dataset Augmentation Using Graph Theory.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2020