
Anthropic
—
Research Scientist
Links
Focus
Control, Model Organisms, Red-Teaming, Scheming and Deception
H-index
61
Nicholas is a researcher working at the intersection of machine learning and computer security. Currently he works at Anthropic studying what bad things you could do with, or do to, language models; he likes to break things.