Enable job alerts via email!

Site Reliability Engineer

TwinStream

Ledbury

Hybrid

GBP 100,000 - 125,000

28 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is seeking a skilled Site Reliability Engineer to enhance the reliability of their cross-domain services used by high-profile government organizations. In this hybrid role, you will collaborate with software engineers and system administrators to improve system performance, automate processes, and ensure the availability of crucial services. This opportunity allows you to work with cutting-edge technologies while supporting a mission-critical environment, making a significant impact on the quality of services delivered to clients. Join a forward-thinking team committed to excellence and innovation in the tech landscape.

Qualifications

  • Strong experience with Terraform and Docker for cloud infrastructure.
  • Proficient in CI/CD tools and monitoring solutions for system reliability.

Responsibilities

  • Collaborate with teams to enhance system reliability and performance.
  • Automate processes and improve observability to prevent issues.

Skills

Configuration Management Tools (Ansible, Chef)

Terraform

Docker Containers

Container Orchestration (Kubernetes, OpenShift)

CI/CD Tools (Jenkins)

Monitoring Tools (InfluxDB, Prometheus, Grafana)

Event-driven Integration (RabbitMQ)

Relational Databases and SQL

Linux Command Line and Shell Scripting

Network Security Protocols

Cloud Hosting Services (AWS)

Job description

Who are we:

In 2019, our founders were working as engineers solving complex cross domain problems in defence and security organisations.

TwinStream was formed to consolidate their collective expertise and experience into one business, providing technical excellence and exceptional service to their clients. We have teams working both on-site with clients and remotely from home.

Day Rate: £500 - £600

Location: Hybrid working near Ledbury with possible 24/7 call out when on rota

Security Clearance: Eligible for DV Clearance

About the role:

Our cross-domain services are used in high profile government organisations. The demand for these services continues to grow in both scope and scale. We are seeking an experienced Site Reliability Engineer to help satisfy that demand. As an SRE you will be responsible for ensuring the availability, performance and cost effectiveness of these services. You will be working with multiple feature development teams and the BAU/Support team to define and evolve our cloud & on-prem infrastructure & delivery pipelines, improving system observability, demonstrating performance and capacity improvements and proactively identifying and mitigating reliability risks.

Key Responsibilities of the Site Reliability Engineer:

  1. Collaborate with Software Engineers to improve reliability and performance in their subsystems
  2. Partner with System Administrators in automating toil and eliminating alerts
  3. Evolve observability and monitoring capabilities to identify and solve problems before they impact the business
  4. Support development environments to help us achieve our delivery and quality goals
  5. Research and evaluate technologies, tools and services to influence buy-vs-build decisions
  6. Develop expertise in diverse technical and business domains
  7. Expand your knowledge of the technical stacks used

Skills & Experience Required:

  1. Experience using modern configuration management tools (such as Ansible, Chef or similar)
  2. Experience working with Terraform
  3. Experience working with docker containers & container orchestration tools (such as Kubernetes, OpenShift or Docker Swarm)
  4. Experience both using and maintaining CI / CD tools (such as Jenkins or similar)
  5. Experience with monitoring tools such as InfluxDB, Prometheus or Grafana.
  6. Experience of event-driven integration with MQ messaging (RabbitMQ or similar AMQP solution)
  7. Good understanding of relational databases and SQL
  8. Linux command line, administration and shell scripting
  9. Working knowledge of network security protocols
  10. Experience using, developing with and maintaining cloud hosting services (ideally AWS EC2, RDS, S3, Lambda)

Desirable Skills:

  1. Industry experience writing well-tested code in one of our platform languages (Java, Go, Python or similar)
  2. Knowledge of cross domain principles & technologies
  3. Experience of working in a service management environment
  4. Practical applications of using observability patterns in previous systems
  5. Creating and monitoring system availability metrics and using those to drive work that reduces downtime

Further Information:

To meet the security requirements of certain clients and industries we serve, any job offer will be contingent upon the successful completion of a security screening process.

At TwinStream, we take pride in being an equal opportunity employer. We celebrate diversity and are committed to fostering an inclusive environment where all individuals are valued and respected. We welcome applications from qualified candidates regardless of race, religion, disability, age, sexual orientation, or gender.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Site Reliability Engineer

Only for registered members

England

Remote

GBP 100.000 - 125.000

11 days ago

Site Reliability Engineer - Contract

Only for registered members

Crawley

Remote

GBP 100.000 - 125.000

14 days ago

Site Reliability Engineer IOE: Cardano

Only for registered members

Remote

GBP 100.000 - 125.000

7 days ago
Be an early applicant

Site Reliability Engineer

Only for registered members

Birmingham

On-site

GBP 100.000 - 125.000

12 days ago

Site Reliability Engineer

Only for registered members

Remote

GBP 100.000 - 125.000

30 days ago

Senior Site Reliability Engineer - EMEA

Only for registered members

Greater London

Remote

GBP 100.000 - 125.000

29 days ago

Remote Site Reliability Engineer

Only for registered members

Remote

GBP 100.000 - 125.000

30 days ago

Site Reliability Engineer, Compute

Only for registered members

Remote

GBP 100.000 - 125.000

30+ days ago

Site Reliability Engineer

Only for registered members

Remote

GBP 100.000 - 125.000

30+ days ago