Senior Site Reliability Engineer (SSRE) – Remote (12-Month Contract)
We are looking for an experienced Senior Site Reliability Engineer (SSRE) to join a dynamic and innovative team. This is a fully remote contract role where you will be responsible for building and maintaining scalable, secure, and high-performance cloud infrastructure on Azure.
Role Overview
As an SSRE, you will play a critical role in designing, automating, and optimizing cloud infrastructure to ensure the availability, reliability, and security of a large-scale data platform. You will work closely with cross-functional teams to implement best practices, enhance observability, and improve deployment pipelines.
Key Responsibilities
- Design, deploy, and maintain Azure cloud infrastructure for high-availability and scalability.
- Automate infrastructure management using Terraform, Azure Bicep, and ARM templates.
- Develop and optimise CI/CD pipelines using Azure DevOps to streamline deployments.
- Enhance system monitoring, logging, and observability with Azure Monitor, Prometheus, and Grafana.
- Implement security best practices, conduct security assessments, and mitigate risks.
- Develop custom tools and libraries in Python/PySpark for Databricks environments.
- Optimise resource utilisation and cloud costs while ensuring system performance.
Qualifications & Experience
- 5+ years of experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles.
- Strong expertise in Azure cloud services (VMs, Networking, Security, Storage, Databases, etc.).
- Proficiency in Infrastructure as Code (IaC) – Terraform, Azure Bicep, ARM Templates, PowerShell.
- Hands-on experience with CI/CD automation in Azure DevOps or similar tools.
- Strong programming skills in Python, PowerShell, or Bash for automation.
- Experience with monitoring and observability tools (Azure Monitor, Splunk, Prometheus, Grafana).
- Familiarity with security frameworks and cloud security best practices.
- Strong problem-solving and troubleshooting skills in complex cloud environments.
Nice to Have
- Experience with Kubernetes (AKS) and container orchestration.
- Knowledge of FinOps and cloud cost optimisation strategies.
- Exposure to MLOps workflows in Databricks.
Why Join?
100% Remote Work – Collaborate with a top-tier engineering team from anywhere.
Exciting Cloud & DevOps Projects – Work on cutting-edge Azure architectures.
Growth & Learning – Access to continuous training and professional development.
If you are not contacted within 10 days of your application, please consider your application unsuccessful.