The Summer 2022 cohort was MATS's first full-scale program, with 31 fellows and 7 mentors from leading AI safety organizations including OpenAI, ARC, MIRI, EleutherAI, and Aligned AI. The program ran 5 weeks online followed by 8 weeks in-person in Berkeley, where fellows conducted independent research under expert mentorship, participated in educational seminars, and built community with peers in the Berkeley AI safety ecosystem.
Applications are now open. Apply by June 7th.

Key dates for the application and admissions timeline
General Application (May 12th to June 7th)
Applicants fill out a general application to individual tracks which should take 1-2 hours. Applications are due by June 7th EOD AOE.
Additional Evaluations (June 7th to late July)
After an initial evaluation, applicants will apply to individual streams listed below. Additionally, applicants undergo a variety of track specific evaluations including coding tests, writing reviews, work tests, and interviews. Which evaluations you will undergo depend on the tracks, streams and mentors you apply to.
Admissions Decisions (Late July to early August)
Selected applicants are notified of their acceptance and anticipated mentor later in the application cycle.
The main program takes place from September 28th to December 4th of 2026. It is an intensive research phase, where fellows work full time on a research project in AI alignment, security, field-building, or governance. Fellows' research directions will typically be chosen through a collaborative process with their mentors, and fellows are expected to develop their independent research direction as the program continues.
While mentor support will vary depending on the project and mentors, mentors are expected to spend at least 1 hour/week working with each of their scholars, and some spend much more time. Scholars will also receive support from MATS’s Research Management team, who help to scope out and structure research direction. Depending on which stream you participate in, you may collaborate with other fellows in your stream.
By the middle of the program, fellows will be expected to write a report on their projects’ threat model, theory of change, and project deliverables. At the end of the program scholars will be expected to have a tangible research output. In past cohorts, this has involved presenting at a fellow symposium on work conducted over the course of MATS.
Educational seminars and workshops will be held 2-3 times per week. Previously, speakers have included Buck Shlegeris from Redwood Research, Adam Gleave from FAR AI, Neel Nanda from Google DeepMind, William Saunders from OpenAI, Andrew Critch from CHAI, Lennart Heim from GovAI, Ajeya Cotra from Open Philanthropy, and more.
The extension phase starts in December of 2026, soon after the end of the main program. Fellows who demonstrate promise as independent researchers during the main program can apply for the MATS extension phase. Acceptance into the extension is based on mentor evaluation and MATS review of proposed research.
In recent cohorts, ~80% of fellows who apply have been accepted. The extension phase offers a default additional 6-months of funding, with the ability to later apply for a 6-month continuation.
Extension fellows primarily work from the MATS London or Berkeley offices, with the possibility of working from other AI safety hubs or fully remotely.For accepted extension fellows, MATS arranges funding for stipends and housing ($7,680/month), as well as for compute ($8,000/mo), creating a seamless transition into this advanced phase of the program.
MATS aims to accelerate researchers who will:
MATS alumni have gone on to publish safety research, join alignment organizations, including Anthropic and MIRI, and found an alignment research lab. You can read more about MATS alumni here.
SolidGoldMagikarp (plus, prompt generation)
Anomalous tokens: a mysterious failure mode for GPT (which reliably insulted Matthew)
Authors:
Jessica Cooper (Rumbelow), Matthew Watkins
Jessica Cooper, Matthew Watkins
Date:
Apr 23, 2026
Citations:
18
A Toy Model of Universality: Reverse Engineering How Networks Learn Group Operations
Universality is a key hypothesis in mechanistic interpretability -- that different models learn similar features and circuits when trained on similar tasks. In this work, we study the universality hypothesis by examining how small neural networks learn to implement group composition. We present a novel algorithm by which neural networks may implement composition for any finite group via mathematical representation theory. We then show that networks consistently learn this algorithm by reverse engineering model logits and weights, and confirm our understanding using ablations. By studying networks of differing architectures trained on various groups, we find mixed evidence for universality: using our algorithm, we can completely characterize the family of circuits and features that networks learn on this task, but for a given network the precise circuits learned -- as well as the order they develop -- are arbitrary.
Authors:
Bilal Chughtai
Bilal Chughtai, Lawrence Chan, Neel Nanda
Date:
May 7, 2026
Citations:
144
The MATS Program is supported by a diverse and highly respected group of mentors — top-tier researchers, engineers, and thinkers working across AI alignment, governance, interpretability, and security.
MATS Research phase provides scholars with a community of peers.

Scholars work out of a shared office and are supported by the Community Team.
MATS alumni report that the connections with peers that they made during MATS have had the largest impact on them years later. Our full-time Community Team works to facilitate these connections and also provide general well-being support. Weekly lightning talks, scholar-led discussion groups, game nights, and outings to SF are some examples of MATS events.