Join a pioneering team at the forefront of AI safety research, focusing on mechanistic interpretability. This role offers the chance to work with exceptional researchers and contribute to groundbreaking advancements in ensuring AI models are safe and reliable. You will have the autonomy to explore ambitious research questions, supported by unparalleled resources and a strong culture of learning and development. If you are passionate about tackling the challenges of AI safety and want to make a significant impact, this opportunity is perfect for you.
AISI is launching a brand-new Mechanistic Interpretability team to research a fundamental question: how can we tell if a model is scheming? This is an ambitious bet to bring interpretability as a field into prime time. We believe this is a vital challenge that mechanistic interpretability can help solve, ensuring that dangerous capability evaluations can reliably determine whether models are safe to release, even when the models themselves are capable of gaming the evals. We also think it can lead to an entirely new field of alignment evaluations and make substantial contributions to the problem of technical AI safety.
To launch this project, we're looking for a team lead, research scientists and research engineers. Apply now to join the largest technical AI safety lab on the planet and help us make this happen!
This team will have a large amount of scientific autonomy, with the ability to chase ambitious research bets. Your responsibilities may involve any of the following:
You’ll receive coaching from your manager and mentorship from the research directors at AISI (including Geoffrey Irving and Yarin Gal). You will also regularly interact with world-famous researchers and other incredible staff (including alumni from Anthropic, DeepMind, OpenAI and ML professors from Oxford and Cambridge). We have a very strong learning & development culture to support this, including Friday afternoons devoted to deep reading and multiple weekly paper reading groups. From a compute perspective, you'll have unparalleled access to resources including 5,448 Nvidia Grace-Hopper GPUs (e.g., H100s).
You may be a good fit if you have some of the following skills, experience and attitudes:
We are hiring individuals at all ranges of seniority and experience within the research unit, and this advert allows you to apply for any of the roles within this range. We will discuss and calibrate with you as part of the process. The full range of salaries available is as follows:
A range of pension options is available; details can be found through the Civil Service website.
In accordance with the Civil Service Commission rules, the following list contains all selection criteria for the interview process.
We select based on skills and experience regarding the following areas:
We additionally may factor in experience with any of the areas that our work-streams specialise in: