Richard Ren

Remote Labor Index: Measuring AI Automation of Remote Work.

Mantas Mazeika, Alice Gatti, Cristina Menghini, Udari Madhushani Sehwag, ..., Ed-Yeremai Hernandez-Cardona, ...

CoRR, October, 2025

The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems.

CoRR, March, 2025

Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs.

CoRR, February, 2025

Deep Reinforcement Learning for Decentralized Multi-Robot Exploration With Macro Actions.

Aaron Hao Tan, Federico Pizarro Bejarano, Yuhan Zhu, Richard Ren, Goldie Nejat

IEEE Robotics Autom. Lett., 2023

Localizing Lying in Llama: Understanding Instructed Dishonesty on True-False Questions Through Prompting, Probing, and Patching.

James Campbell, Richard Ren, Phillip Guo

CoRR, 2023

Representation Engineering: A Top-Down Approach to AI Transparency.

CoRR, 2023
