Dylan Sam

OpenAI

Member of Technical Staff

Focus

Scalable Oversight, Control, Monitoring, Interpretability, Adversarial Robustness, Red-Teaming, Alignment Training, Security, Scheming and Deception, Multi-Agent Safety

Dylan is a safety researcher at OpenAI, where he works on curating better and safer training data and on monitoring models for harmful behavior.

Before that, he completed a PhD in the Machine Learning Department at CMU.