Software Principal Engineer - SRE

Boomi Software
Kolkata, Bengaluru, Pune City
INR 12,00,000 - 24,00,000
Job description

As a Senior Site Reliability Engineer, you will be responsible for developing sophisticated systems and software based on the customer's business goals, needs, and general business environment. You will work with product management, other engineering teams, customer success, and support on developing cutting-edge new product features and enhancements across various areas of Boomi offerings.

You will:

  1. Participate actively in detecting, remediating, and reporting on Production incidents, ensuring the SLAs/SLOs are defined and met.
  2. Participate in on-call rotation to ensure coverage for planned/unplanned events.
  3. Engage with other Engineering organizations to implement processes, identify improvements, and drive consistent results.
  4. Work with your SRE and Engineering counterparts for driving DR exercises, Game days, training, and other response readiness efforts.
  5. Collaborate with Service Engineering organizations to build and automate tooling, implement best practices on Observability, and manage the Boomi services in production to consistently achieve our market-leading SLA.
  6. Improve the scalability and reliability of Boomi's systems in production.
  7. Automate the provisioning and maintenance of Boomi's infrastructure.
  8. Work independently with a minimal level of guidance from technical leadership.
  9. Mentor other Boomi engineers, including design collaboration and code reviews.

Take the first step towards your dream career with Boomi

Essential Requirements

  1. Expert in defining, measuring, and improving Reliability Metrics (SLO/SLI/Error budgets).
  2. Strong in implementing observability practices (Monitoring, Logging, Distributed Tracing etc.), preferably using Splunk and New Relic.
  3. Experience not limited to using dashboards, but creating them from scratch.
  4. Passionate about SRD Automation and infrastructure platforms. Expert in developing Ansible playbooks and automation for Infrastructure as code using Terraform and Cloud Formation Templates and Python.
  5. Experience in conducting and automating DR exercises in AWS cloud, thus validating RPOs and RTOs.
  6. Strong understanding and working experience with AWS components.
  7. Ability to design and implement APIs for use by internal teams.

Desirable Requirements

  1. 7+ years experience in the software engineering industry, with experience supporting large scale software systems in production.
  2. Experience actively in detecting, remediating, and reporting on Production incidents, ensuring the SLAs/SLOs are defined and met and participate in on-call rotation to ensure coverage for planned/unplanned events.
  3. Certified in Cloud (AWS/Azure/GCP/Oracle), experience in using services such as computers, containers, and databases.
  4. Experience in Observability, creating dashboards for SLA/SLI/SLO.
  5. Experience in Ansible/Terraform and Python.
  6. A grasp of Cloud Native concepts, containerization best practices, and security awareness in Cloud will be a strong plus.
Get a free, confidential resume review.
Select file or drag and drop it
Avatar
Free online coaching
Improve your chances of getting that interview invitation!
Be the first to explore new Software Principal Engineer - SRE jobs in Kolkata, Bengaluru, Pune City