Senior / Staff Software Engineer (AI / Compiler)

ZipRecruiter

London

On-site

GBP 145,000 - 167,000

Full time

19 days ago

An innovative company is seeking Senior and Staff Software Engineers to build cutting-edge high-performance computing infrastructure. This role involves designing and implementing systems that run distributed, low-latency AI workloads, collaborating closely with hardware and ML teams. Ideal candidates will have a strong background in performance-critical systems, with expertise in programming languages such as C++ and Python. Join a dynamic team that is redefining AI capabilities and be part of a rapidly growing organization at the forefront of optical computing. If you thrive in a fast-paced environment and are passionate about technology, this opportunity is perfect for you.

Benefits

Stock options
Comprehensive healthcare insurance
25 days PTO plus bank holidays
Bonus of £12,000 per year if based within a 20-minute commute of the office
Private use of 3D printer

Qualifications

5+ years in HPC, HFT, or AI infrastructure.
Strong skills in C++ and Python for performance-sensitive applications.
Experience with ML compilers and optimizations.

Responsibilities

Design high-performance systems for AI/ML workloads.
Optimize for ultra-low latency and real-time inference.
Collaborate with teams to enhance software stack performance.

Skills

C++
Python
Distributed Systems
Performance Tuning
ML Compilers (LLVM, MLIR)
Debugging Skills
AI Workloads Scaling

Education

Degree in Computer Science
Degree in Engineering
Degree in Mathematics

Tools

PyTorch
ONNX
OpenXLA

Job Description

Company Overview

Flux is pioneering a new class of AI accelerators called Optical Tensor Processing Units (OTPUs). We’ve already developed functioning prototypes and are now scaling our operations in London. Our work environment rewards innovation, speed, and bold thinking.

The role

We’re hiring Senior and Staff Software Engineers to build the high-performance computing infrastructure that powers our Optical Tensor Processing Units (OTPUs). This isn’t just about scaling models—it’s about rethinking how AI workloads are executed at speed and scale.

You’ll lead the design and implementation of software systems that run distributed, low-latency inference across clusters. You’ll work closely with hardware and ML teams to optimise every layer of the stack—from model representation and execution to data movement and scheduling. Whether it’s through compiler techniques, systems-level tuning, or custom runtime design, you’ll play a critical role in shaping the performance layer of our AI platform. This is a role for engineers who think in microseconds, not just model accuracy. If you’ve worked in HFT, large-scale scientific compute, or AI infrastructure at serious scale, we’d love to talk.

Responsibilities

Design and build high-performance systems for running AI/ML workloads across distributed compute clusters
Optimise for ultra-low latency and real-time inference at scale—profiling, tuning, and rewriting critical systems as needed
Identify and resolve performance bottlenecks across the stack, from model execution and scheduling to hardware-level constraints
Collaborate with compiler engineers to improve code, execution paths, and memory layouts using tools like LLVM or MLIR
Work with hardware teams to ensure the software stack fully leverages the capabilities of our OTPU architecture
Extend ML frameworks (e.g. PyTorch, ONNX, OpenXLA) to better support performance-critical inference paths
Lead design reviews, mentor engineers, and promote best practices in HPC and performance engineering
Stay on the frontier of new developments in AI infrastructure, compute systems, and compiler tooling

Skills & Experience

5+ years of experience building performance-critical systems in HPC, HFT, large-scale simulation, or AI infrastructure
Deep understanding of distributed systems, with a focus on real-time or near real-time data processing
Strong programming skills in C++ and Python, especially for performance-sensitive applications
Hands-on experience with ML compilers (e.g. LLVM, MLIR), and knowledge of runtime and scheduling optimisations
Practical knowledge of ML frameworks like PyTorch, ONNX, or OpenXLA, and how to optimise their execution
Experience scaling AI workloads across clusters or custom infrastructure—not just deploying on standard cloud setups
Strong debugging, profiling, and performance-tuning skills across the stack
Degree in Computer Science, Engineering, Mathematics, or a related field

Details

Competitive salary starting at £145k, depending on experience.
Stock options in a rapidly growing AI company.
Comprehensive healthcare insurance.
25 days of PTO plus bank holidays.
Based in our new 5,000 square foot office in the AI hub of King's Cross, London.
Additional salary bonus of £12,000 per year if you live within a 20-minute commute of the office.
Private use of our 3D printer.

If you’re passionate about compilers, high-performance computing, and redefining what’s possible in AI, we’d love to talk. Apply now to join Flux and help shape the future of optical computing.
