Site Reliability Engineer

Presight AI Ltd
Abu Dhabi
AED 200,000 - 400,000
Job description

Presight, an ADX-listed public company limited by shares whose majority shareholder is Abu Dhabi company G42, is the region’s leading big data analytics company powered by Artificial Intelligence (“AI”). It combines big data, analytics, and AI expertise to serve every sector, of every scale, to create business and positive societal impact. With its world-class computer vision, AI and omni-analytics platform as its engine, Presight leverages all-source data to support insight-driven decision making that shapes policy and creates safer, healthier, happier, and more sustainable societies.

The Opportunity

Seeking a meticulous and expert Engineer - Site Reliability to build and support the Presight delivery model that empowers product & technology teams to develop & deliver high-quality products, improve platform infrastructure and strengthen the reliability of products and solutions.

Key Responsibilities:

As an Engineer - Site Reliability in our team, you will be responsible for working collaboratively with software engineering to deploy and operate our systems. You will help automate and streamline our operations and processes; building and maintaining tools for deployment, monitoring and operations.

Functional

  • Manage the infrastructure required to run our solutions deployed to public or private cloud (air-gapped).
  • Deploy application updates.
  • Ensure the health of the environment by monitoring technical and business metrics, setting up alerts for things going wrong, acting proactively to prevent disasters.
  • Ensure emergency events can be responded to, quickly and precisely.
  • Enable the engineering team to execute the roadmap addressing roadblocks as needed.
  • Identify, evaluate, and conduct proof-of-concepts for new technologies.
  • Contribute to the knowledge base.
  • Analyze service performance, identify bottlenecks, and provide measurable improvement plans.
  • Serve as primary point of contact for all matters concerning the customer and develop a trusted advisor relationship with executive sponsors by understanding their business needs and technical challenges.
  • Comply with QHSE (Quality Health Safety and Environment), Business Continuity, Information Security, Privacy, Risk, Compliance Management and Governance of Organizations policies, procedures, plans and related risk assessments.

Desired Candidate Profile

Requirements

  • Bachelor’s degree in business Analytics, Data Science, Computer Science, Engineering, or related field.
  • 3+ years of experience in managing Kubernetes clusters.
  • 3+ years experience in configuring/tuning observability platforms (preferably Prometheus; any of the following is also relevant: Datadog / Splunk / NewRelic / CloudWatch).
  • Very good Linux knowledge.
  • Good understanding of message queues (preferably RabbitMQ; any of the following is also relevant: ActiveMQ / Kafka / SNS / SQS).
  • Experience using Elasticsearch (ELK stack).
  • Good understanding of network concepts.
  • Basic knowledge of at least one programming language (preferably Python).

Ideally, you’ll also need

  • A highly detail-oriented and methodical approach to problem solving.
  • A passion for technology, troubleshooting and customer service.
  • A strongly analytical mind.
  • Great verbal and written communication skills.

What we look for

If you are a performance-driven, inquisitive mind with the agility to adapt to ambiguity, you will fit right in. You should be eager to explore opportunities to build meaningful collaborations with stakeholders and aspire to create unique customer-centric solutions. Bias for action and a passion to conquer new frontiers in the AI space is at the heart of the Presight community.

What working at Presight offers:

Culture: An open, diverse and inclusive environment with a global vision that encourages personal growth and focuses on ground-breaking, industry-first innovations.

Career: Outstanding learning, development & growth opportunities via structured training programs and innovative, high-tech projects.

Rewards: A competitive remuneration package with a host of perks including healthcare, education support, leave benefits and more.

Employment Type: Full Time

Company Industry: IT - Software Services

Department / Functional Area:

Keywords: Python, Elastic Search, SRE, Site Reliability Engineer, DevOps

Get a free, confidential resume review.
Select file or drag and drop it
Avatar
Free online coaching
Improve your chances of getting that interview invitation!
Be the first to explore new Site Reliability Engineer jobs in Abu Dhabi