Senior Site Reliability Engineer

Be among the first applicants.
ZipRecruiter
Chelmsford
GBP 50,000 - 90,000
Be among the first applicants.
5 days ago
Job description

Job Description

This is an amazing opportunity for an experienced Site Reliability Engineering (SRE) to take the next step in their career working for an exciting, growing health-tech company driving digital transformation in UK healthcare.

As a senior SRE team member of our Azure-based infrastructure team, you will be responsible for ensuring the reliability, availability, and performance of our clients' applications and services. You will work closely with engineering, operations, and product teams to implement best practices for cloud infrastructure management and operational excellence.

Responsibilities:

  1. Assist with the design, implementation, and management of scalable, resilient, and secure Azure infrastructure.
  2. Oversee the deployment, configuration, and maintenance of cloud resources, ensuring they meet performance and reliability standards.
  3. Lead incident response efforts, ensuring quick resolution and minimizing impact on service availability.
  4. Conduct post-incident reviews and implement corrective actions to prevent future occurrences.
  5. Drive automation initiatives to reduce manual intervention, improve efficiency, and enhance system reliability.
  6. Implement Infrastructure as Code (IaC) using tools like Terraform or Azure Bicep.
  7. Continuously assess and improve system performance, capacity, and reliability.
  8. Build new customer deployments and perform maintenance on existing installations.

Experience Required:

  1. Proven experience in a similar role, with a strong focus on Azure cloud infrastructure.
  2. Extensive knowledge of Azure services, including but not limited to Azure Virtual Machines, Azure Kubernetes Service (AKS), Azure App Services, and Azure Storage.
  3. Proficiency in scripting and automation tools (e.g., PowerShell, Python, Bash).
  4. Experience with monitoring and observability tools (e.g., Azure Monitor, Prometheus, Grafana).
  5. Strong understanding of networking, security, and compliance in cloud environments.
  6. Excellent problem-solving skills and the ability to handle high-pressure situations.
  7. Automation experience (such as Bash, Terraform, or Ansible).

Desirable:

  1. Azure certifications (e.g., Microsoft Certified: Azure Solutions Architect, Azure DevOps Engineer).
  2. Experience with DevOps practices and tools (e.g., CI/CD pipelines, Git, Azure DevOps).
  3. Experience with containerization and orchestration (e.g., Docker, Kubernetes).
  4. BSc/BA in Computer Science or a related degree.
  5. Office 365 administration / Azure skills.
  6. Database (NoSQL) skills.
Get a free, confidential resume review.
Select file or drag and drop it
Avatar
Free online coaching
Improve your chances of getting that interview invitation!
Be the first to explore new Senior Site Reliability Engineer jobs in Chelmsford