Maksym Andriushchenko

This stream focuses on critical challenges in AI safety and alignment, including risks from automating AI research, bottlenecks to recursive self-improvement, and the automation of safety and alignment research. Priority topics also include AGI privacy, measuring long-horizon agentic capabilities, developing new alignment methods, and advancing the science of post-training.

Apply

View all streams

Stream overview

I'm interested in all areas of AI safety and alignment, but my priority directions are:

Risks from automating AI research
Bottlenecks to recursive self-improvement
Automating safety and alignment research
AGI privacy
Measuring long-horizon agentic capabilities
New alignment methods
Science of post-training

Mentors

Maksym Andriushchenko

ELLIS Institute Tübingen

Principal Investigator (AI Safety and Alignment Group)

Tübingen

—

Agent Foundations

Adversarial Robustness

Monitoring

Scalable Oversight

Scheming and Deception

I am a principal investigator at the ELLIS Institute Tübingen and the Max Planck Institute for Intelligent Systems, where I lead the AI Safety and Alignment group. I also serve as chapter lead for the new edition of the International AI Safety Report chaired by Prof. Yoshua Bengio. I have worked on AI safety with leading organizations in the field (OpenAI, Anthropic, UK AI Safety Institute, Center for AI Safety, Gray Swan AI). I obtained my PhD in machine learning from EPFL in 2024 advised by Prof. Nicolas Flammarion. My PhD thesis was awarded the Patrick Denantes Memorial Prize for the best thesis in the CS department of EPFL and was supported by the Google and Open Phil AI PhD Fellowships.

Mentorship style

I usually spend at least 30 min per week in one-one-one meetings with my mentees. We can also discuss longer time slots if necessary. Besides these time slots, I try to be as responsive as possible over Slack (>2 comprehensive responses per day) and read relevant papers between weekly meetings.

Fellows we are looking for

I'm looking for the following skills:

Prior research experience in a topic related to AI safety (at least one completed project with first-author contribution)
Independent, self-driven personality
Strong general computer science background
Ideally, a good software engineering background
Familiarity with deep learning frameworks
Clear communication

No constraints here. I'm fine with both internal (i.e., within MATS) and external collaborators. I can also pair MATS scholars with PhD students in my group, if it's useful.

Project selection

I would prefer to set the overall direction, but I will listen closely to scholars about their preferences within a broad direction. Converging on a particular topic is expected to be a collaborative process.