Hannah Cyberey
Affiliations:- University of Virginia, Department of Computer Science, Charlottesville, VA, USA
According to our database1,
Hannah Cyberey
authored at least 7 papers
between 2020 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on linkedin.com
-
on twitter.com
-
on github.com
On csauthors.net:
Bibliography
2025
Steering the CensorShip: Uncovering Representation Vectors for LLM "Thought" Control.
CoRR, April, 2025
Sensing and Steering Stereotypes: Extracting and Applying Gender Representation Vectors in LLMs.
CoRR, February, 2025
2024
The Mismeasure of Man and Models: Evaluating Allocational Harms in Large Language Models.
CoRR, 2024
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024
2022
Balanced Adversarial Training: Balancing Tradeoffs between Fickleness and Obstinacy in NLP Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
2020
Finding Friends and Flipping Frenemies: Automatic Paraphrase Dataset Augmentation Using Graph Theory.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2020