Hannah Cyberey

Affiliations:
  • University of Virginia, Department of Computer Science, Charlottesville, VA, USA


According to our database1, Hannah Cyberey authored at least 10 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Aligning Language Model Benchmarks with Pairwise Preferences.
CoRR, February, 2026

White-Box Sensitivity Auditing with Steering Vectors.
CoRR, January, 2026

2025
Steering the CensorShip: Uncovering Representation Vectors for LLM "Thought" Control.
CoRR, April, 2025

Sensing and Steering Stereotypes: Extracting and Applying Gender Representation Vectors in LLMs.
CoRR, February, 2025

Unsupervised Concept Vector Extraction for Bias Control in LLMs.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
The Mismeasure of Man and Models: Evaluating Allocational Harms in Large Language Models.
CoRR, 2024

Addressing Both Statistical and Causal Gender Fairness in NLP Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

2022
Balanced Adversarial Training: Balancing Tradeoffs between Fickleness and Obstinacy in NLP Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2020
Finding Friends and Flipping Frenemies: Automatic Paraphrase Dataset Augmentation Using Graph Theory.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Pointwise Paraphrase Appraisal is Potentially Problematic.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2020


  Loading...