Anthropic
—
Research Scientist
Links
Focus
Control, Model Organisms, Red-Teaming, Scheming and Deception
Stream
Nicholas is a research scientist at Google DeepMind researching adversarial machine learning; he likes to break things.