
Anthropic
—
Member of Technical Staff
Links
Focus
Control, Model Organisms, Red-Teaming, Scheming and Deception
Stream
Anthropic
Fabien Roger is an AI safety researcher at Anthropic and previously worked at Redwood Research. Fabien’s research focuses on AI control and dealing with alignment faking.