As a Senior DevOps Specialist, you will be responsible for building, monitoring, and operating a Modern Data Platform (MDP).
You will ensure a seamless flow of code through development, testing, deployment, and production environments while maintaining system reliability and scalability.
This role requires a strong technical background in cloud architecture and engineering, CI / CD automation, data engineering, and big data technologies (such as Spark, Databricks, Azure Data Lake, and Azure Data Factory).
The Senior DevOps Specialist will work closely with other teams, such as data product development, DevOps, data management and governance, and application operations, to ensure that the data platform is aligned with the overall goals and objectives of the company.
Key Responsibilities :
CI / CD Pipeline Management : Design, implement, and maintain CI / CD pipelines for automated software testing, deployment, and monitoring across various environments.
Databricks : Work with Databricks (Catalogs, Databricks Notebooks, Databricks Workflows, Databricks Asset Bundles) and perform Databricks administration.
Automation & Scripting : Develop and maintain scripts and tools to automate manual tasks in infrastructure management, deployment, and system administration.
Infrastructure as Code (IaC) : Implement infrastructure using IaC tools such as Terraform, ensuring systems are scalable, secure, and easily replicable.
Monitoring & Logging : Set up, configure, and manage monitoring tools (e.g., Prometheus, Grafana, Datadog) to proactively detect and address system performance issues, application errors, and other operational concerns.
Cloud Platform Management : Manage cloud-based infrastructure (Azure) including provisioning, scaling, security, and cost optimization.
Collaboration : Work closely with development, QA, and operations teams to ensure smooth deployment and high availability of services, promoting best practices in DevOps and software development.
Incident Response & Troubleshooting : Quickly respond to and troubleshoot production issues, minimizing downtime and improving overall system reliability.
Performance Optimization : Continuously improve the performance and scalability of the modern data platform and its underlying infrastructure by identifying bottlenecks and implementing solutions.
Version Control Management : Manage source control tools (e.g., Git, Azure DevOps) and implement branching strategies that promote collaboration and efficient release management.
Job Requirements : Must Haves
Proficient in Databricks administration and Microsoft Azure
Strong knowledge of Azure DevOps CI / CD and Git
Proficient in working with Databricks (Catalogs, Databricks Notebooks, Databricks Workflows, Databricks Asset Bundles)
Familiarity with Terraform as an infrastructure-as-code tool