Karina Nguyen

According to our database, Karina Nguyen authored at least 10 papers between 2022 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
Evaluating and Mitigating Discrimination in Language Model Decisions.
CoRR, 2023

Specific versus General Principles for Constitutional AI.
CoRR, 2023

Studying Large Language Model Generalization with Influence Functions.
CoRR, 2023

Measuring Faithfulness in Chain-of-Thought Reasoning.
CoRR, 2023

Question Decomposition Improves the Faithfulness of Model-Generated Reasoning.
CoRR, 2023

Vision Transformers for Mobile Applications: A Short Survey.
CoRR, 2023

FAIR-Ensemble: When Fairness Naturally Emerges From Deep Ensembling.
CoRR, 2023

The Capacity for Moral Self-Correction in Large Language Models.
CoRR, 2023


2022
Discovering Language Model Behaviors with Model-Written Evaluations.
CoRR, 2022

