Senior Site Reliability Engineer

Be among the first applicants.
WatersEdge Solutions
Gauteng
Remote
ZAR 300 000 - 400 000
Be among the first applicants.
2 days ago
Job description

Senior Site Reliability Engineer (SSRE) – Remote (12-Month Contract)

We are looking for an experienced Senior Site Reliability Engineer (SSRE) to join a dynamic and innovative team. This is a fully remote contract role where you will be responsible for building and maintaining scalable, secure, and high-performance cloud infrastructure on Azure.

Role Overview

As an SSRE, you will play a critical role in designing, automating, and optimizing cloud infrastructure to ensure the availability, reliability, and security of a large-scale data platform. You will work closely with cross-functional teams to implement best practices, enhance observability, and improve deployment pipelines.

Key Responsibilities

  • Design, deploy, and maintain Azure cloud infrastructure for high-availability and scalability.
  • Automate infrastructure management using Terraform, Azure Bicep, and ARM templates.
  • Develop and optimise CI/CD pipelines using Azure DevOps to streamline deployments.
  • Enhance system monitoring, logging, and observability with Azure Monitor, Prometheus, and Grafana.
  • Implement security best practices, conduct security assessments, and mitigate risks.
  • Develop custom tools and libraries in Python/PySpark for Databricks environments.
  • Optimise resource utilisation and cloud costs while ensuring system performance.

Qualifications & Experience

  • 5+ years of experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles.
  • Strong expertise in Azure cloud services (VMs, Networking, Security, Storage, Databases, etc.).
  • Proficiency in Infrastructure as Code (IaC) – Terraform, Azure Bicep, ARM Templates, PowerShell.
  • Hands-on experience with CI/CD automation in Azure DevOps or similar tools.
  • Strong programming skills in Python, PowerShell, or Bash for automation.
  • Experience with monitoring and observability tools (Azure Monitor, Splunk, Prometheus, Grafana).
  • Familiarity with security frameworks and cloud security best practices.
  • Strong problem-solving and troubleshooting skills in complex cloud environments.

Nice to Have

  • Experience with Kubernetes (AKS) and container orchestration.
  • Knowledge of FinOps and cloud cost optimisation strategies.
  • Exposure to MLOps workflows in Databricks.

Why Join?

100% Remote Work – Collaborate with a top-tier engineering team from anywhere.
Exciting Cloud & DevOps Projects – Work on cutting-edge Azure architectures.
Growth & Learning – Access to continuous training and professional development.

If you are not contacted within 10 days of your application, please consider your application unsuccessful.

Get a free, confidential resume review.
Select file or drag and drop it
Avatar
Free online coaching
Improve your chances of getting that interview invitation!
Be the first to explore new Senior Site Reliability Engineer jobs in Gauteng