Enable job alerts via email!

Director of Engineering

TN United Kingdom

Cambridge

On-site

GBP 165,000

Full time

30 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative firm is seeking a Director of Engineering to lead a global team in High-Performance Computing and Engineering collaboration. This role involves managing a large-scale HPC environment and implementing DevOps best practices to enhance development workflows. You will oversee critical technical areas, including Linux platforms and virtualization, while fostering a culture of innovation and collaboration. With a focus on strategic roadmap development and budget management, this position offers an exciting opportunity to make a significant impact in a supportive and dynamic environment. Join a team that values efficiency and results in a hybrid working model.

Qualifications

  • Strong experience managing large-scale HPC systems with LSF or similar schedulers.
  • Proficient in DevOps methodologies and tools like Terraform and Ansible.

Responsibilities

  • Manage and lead a large-scale HPC environment ensuring high availability and operational efficiency.
  • Drive the implementation of DevOps best practices to automate infrastructure.

Skills

High-Performance Computing (HPC)
DevOps
Infrastructure as Code (IaC)
Cloud Platforms (AWS, GCP, Azure)
Leadership
Budget Management
Product Engineering

Tools

LSF
Terraform
Ansible
GitLab
Atlassian Suite (Jira, Confluence)
VMware
Kubernetes
OpenText ETX

Job description

Position: Director of Engineering HPC Team, Cambridge

We are looking for an experienced and innovative Director of Engineering to lead our clients global engineering team. This key leadership role is part of the Engineering IT Leadership team and will be responsible for overseeing several critical technical areas, including High-Performance Computing (HPC), Engineering Platform Access, Engineering Collaboration and Linux Platforms. You will lead a global team to ensure seamless product development by maintaining and improving the infrastructure that supports engineering teams.

Key Responsibilities:

  • High-Performance Computing (HPC): Manage and lead a large-scale HPC environment (handling half a million cores), using LSF (or similar schedulers) to ensure high availability, scalability, and operational efficiency.
  • DevOps & Automation: Drive the implementation of DevOps best practices (CI/CD, Terraform, Ansible, GitLab) to automate infrastructure and improve the efficiency of development workflows.
  • Engineering Collaboration Tools: Manage and optimize the Atlassian suite (Jira, Confluence) for enhanced engineering collaboration and compliance.
  • Linux Platform Leadership: Oversee the Linux Platform team responsible for managing Linux-based infrastructure, especially for HPC servers.
  • Virtualization & Kubernetes: Lead virtualization efforts involving VMware and Kubernetes clusters, ensuring efficient orchestration and resource utilization.
  • Platform Access & Security: Lead teams handling login servers and user access solutions, ensuring seamless authentication experiences for engineers using OpenText ETX.
  • Strategic Roadmap: Define and implement a clear roadmap for the Engineering Platform that aligns with business goals and engineering needs.
  • Team Leadership: Provide technical leadership, mentorship, and guidance to highly skilled teams, fostering a culture of innovation and continuous improvement.
  • Cross-Functional Collaboration: Work closely with key stakeholders from engineering, IT security, and infrastructure teams to drive best practices and ensure excellent service delivery.
  • Budget Management: Ensure cost-effective investments in technology while meeting the organization's strategic goals.

Required Skills & Experience:

  • Expertise in HPC Environments: Strong experience managing large-scale HPC systems, preferably with LSF or similar schedulers.
  • DevOps & Infrastructure as Code (IaC): Proficient in DevOps methodologies, CI/CD pipelines, and tools such as Terraform, Ansible, and GitLab.
  • Experience with Cloud Platforms: In-depth knowledge of cloud platforms (AWS, GCP, Azure), with AWS being the primary focus.
  • Leadership: Demonstrated ability to lead and inspire large, technically diverse teams (30-40 people) in a fast-paced environment.
  • Background in Product Engineering: Experience in software development, especially in Python, and a product ownership mindset.
  • Budget and Resource Management: Proven ability to manage budgets and resources effectively.

Preferred Backgrounds:

  • Candidates from semiconductor companies or those with experience in high-performance computing (HPC) environments, or Oil and Gas etc. are highly preferred.
  • Experience in large-scale infrastructure management, such as virtualized environments and containerization (Kubernetes).

The culture is collaborative and supportive, with high expectations for delivering results. You will be joining a team that values innovation, efficiency, and seamless collaboration across functions.

The client is looking to pay up to £165,000 per annum + benefits. This is a hybrid working role with a minimum of 3 days in the office per week in Cambridge.

For more information please send your CV to me on kamni.sharmalafosse

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.