Nicholas Carlini

Anthropic

—

Research Scientist

Links

Focus

Control, Model Organisms, Red-Teaming, Scheming and Deception

Stream

Anthropic

Nicholas is a research scientist at Google DeepMind researching adversarial machine learning; he likes to break things.