Angelina Wang

Orcid: 0000-0001-9140-3523

According to our database1, Angelina Wang authored at least 45 papers between 2017 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Queer NLP: A Critical Survey on Literature Gaps, Biases and Trends.
CoRR, February, 2026

The Limits of AI Data Transparency Policy: Three Disclosure Fallacies.
CoRR, January, 2026

Ambiguity Collapse by LLMs: A Taxonomy of Epistemic Risks.
Proceedings of the 2026 ACM Conference on Fairness, Accountability, and Transparency, 2026

Scrutinizing Index-Based Risk Assessments: A Case Study in NYC Decision-making for Heat Emergency Management.
Proceedings of the 2026 ACM Conference on Fairness, Accountability, and Transparency, 2026

SCENEBench: An Audio Understanding Benchmark Grounded in Assistive and Industrial Use Cases.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

Whose Knowledge Counts? Co-Designing Community-Centered AI Auditing Tools with Educators in Hawai'i.
Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems, 2026

2025
Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations.
CoRR, November, 2025

Disclosure and Evaluation as Fairness Interventions for General-Purpose AI.
CoRR, October, 2025

The Inadequacy of Offline LLM Evaluations: A Need to Account for Personalization in Model Behavior.
CoRR, September, 2025

Bridging Prediction and Intervention Problems in Social Systems.
CoRR, July, 2025

The California Report on Frontier AI Policy.
CoRR, June, 2025

Rigor in AI: Doing Rigorous AI Work Requires a Broader, Responsible AI-Informed Conception of Rigor.
CoRR, June, 2025

Measurement to Meaning: A Validity-Centered Framework for AI Evaluation.
CoRR, May, 2025

Toward an Evaluation Science for Generative AI Systems.
CoRR, March, 2025

The inadequacy of offline large language model evaluations: A need to account for personalization in model behavior.
Patterns, 2025

Large language models that replace human participants can harmfully misportray and flatten identity groups.
Nat. Mac. Intell., 2025


Measuring Machine Learning Harms from Stereotypes Requires Understanding Who Is Harmed by Which Errors in What Ways.
Proceedings of the 2025 ACM Conference on Fairness, Accountability, and Transparency, 2025

Identities are not Interchangeable: The Problem of Overgeneralization in Fair Machine Learning.
Proceedings of the 2025 ACM Conference on Fairness, Accountability, and Transparency, 2025

Fairness through Difference Awareness: Measuring Desired Group Discrimination in LLMs.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Operationalizing Responsible Machine Learning: From Equality Towards Equity
PhD thesis, 2024

Benchmark suites instead of leaderboards for evaluating AI fairness.
Patterns, 2024

Evaluating Generative AI Systems is a Social Science Measurement Challenge.
CoRR, 2024

Measuring machine learning harms from stereotypes: requires understanding who is being harmed by which errors in what ways.
CoRR, 2024

Measuring Implicit Bias in Explicitly Unbiased Large Language Models.
CoRR, 2024

Large language models cannot replace human participants because they cannot portray identity groups.
CoRR, 2024

Visions of a Discipline: Analyzing Introductory AI Courses on YouTube.
Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, 2024

Strategies for Increasing Corporate Responsible AI Prioritization.
Proceedings of the Seventh AAAI/ACM Conference on AI, Ethics, and Society (AIES-24) - Full Archival Papers, October 21-23, 2024, San Jose, California, USA, 2024

2023
Manipulative tactics are the norm in political emails: Evidence from 300K emails from the 2020 US election cycle.
Big Data Soc., January, 2023

Overcoming Bias in Pretrained Models by Manipulating the Finetuning Dataset.
CoRR, 2023

Overwriting Pretrained Bias with Finetuning Data.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Gender Artifacts in Visual Datasets.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Against Predictive Optimization: On the Legitimacy of Decision-Making Algorithms that Optimize Predictive Accuracy.
Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, 2023

Taxonomizing and Measuring Representational Harms: A Look at Image Tagging.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
REVISE: A Tool for Measuring and Mitigating Bias in Visual Datasets.
Int. J. Comput. Vis., 2022

Towards Intersectionality in Machine Learning: Including More Identities, Handling Underrepresentation, and Performing Evaluation.
Proceedings of the FAccT '22: 2022 ACM Conference on Fairness, Accountability, and Transparency, Seoul, Republic of Korea, June 21, 2022

Measuring Representational Harms in Image Captioning.
Proceedings of the FAccT '22: 2022 ACM Conference on Fairness, Accountability, and Transparency, Seoul, Republic of Korea, June 21, 2022

2021
The Limits of Global Inclusion in AI Development.
CoRR, 2021

Directional Bias Amplification.
Proceedings of the 38th International Conference on Machine Learning, 2021

Understanding and Evaluating Racial Biases in Image Captioning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
ViBE: A Tool for Measuring and Mitigating Bias in Image Datasets.
CoRR, 2020

REVISE: A Tool for Measuring and Mitigating Bias in Visual Datasets.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Learning Robotic Manipulation through Visual Planning and Acting.
CoRR, 2019

Learning Robotic Manipulation through Visual Planning and Acting.
Proceedings of the Robotics: Science and Systems XV, 2019

2017
Safer Classification by Synthesis.
CoRR, 2017


  Loading...