Enable job alerts via email!

Site Reliability Engineer- Global Crypto Trading- New York/ London/ Singapore

Venture Search

Greater London

On-site

GBP 100,000 - 125,000

12 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player in high-frequency crypto trading is on the lookout for a Site Reliability Engineer to enhance their trading platform's stability and performance. In this pivotal role, you will design and build production tools, automate deployment processes, and improve system efficiency. Collaborating closely with trading and development teams, you will ensure the seamless operation of live trading systems across a robust AWS infrastructure. If you thrive in a dynamic environment and possess strong programming skills in Python, this opportunity is perfect for you to make a significant impact in a cutting-edge field.

Qualifications

  • Strong programming skills in Python and ability to read C/C++ code.
  • Deep understanding of Linux systems and AWS for deployments.

Responsibilities

  • Develop scalable production tools for deployment and monitoring.
  • Ensure system reliability, performance, and automation.

Skills

Python

C/C++

Linux systems

AWS

networking fundamentals

scripting languages (e.g., Python, Bash)

monitoring, logging, and alerting solutions

Tools

Terraform

Ansible

Job description

Site Reliability Engineer- Global Crypto Trading- New York/ London/ Singapore

Our client, a leading high-frequency crypto trading firm, is seeking a Site Reliability Engineer (SRE) to design and build production configuration and deployment tools for their high-frequency trading (HFT) platform. This role is critical in ensuring the stability, scalability, and automation of our infrastructure. The ideal candidate will have extensive experience creating complex, production-focused tools, with an emphasis on reliability and performance.

Key Responsibilities

  • Develop and maintain scalable production tools to automate deployment, monitoring, and infrastructure management.
  • Improve system reliability, performance, and efficiency through automation and tooling.
  • Work closely with trading and development teams to ensure seamless operation of our live trading systems.
  • Manage configuration and deployment processes across AWS-based infrastructure.
  • Implement observability tools to enhance system monitoring and debugging capabilities.
  • Ensure fault tolerance, redundancy, and high availability for critical trading systems.
  • Support and enhance infrastructure for both C++ and Rust-based trading systems, ensuring seamless integration.

Required Qualifications

  • Strong programming skills in Python, with the ability to read and understand C/C++ code.
  • Deep understanding of Linux systems.
  • Experience managing deployments and configuration management in AWS and/or on-premise clusters.
  • Proficiency in monitoring, logging, and alerting solutions to maintain high system uptime.
  • Strong background in networking fundamentals, including TCP/IP and system performance tuning.
  • Experience with scripting languages (e.g., Python, Bash) for automation.

Preferred Skills

  • Familiarity with IaC tools such as Terraform or Ansible for infrastructure automation.
  • Experience in low-latency or high-performance environments is a plus but not required.
  • Strong problem-solving skills and the ability to work in a highly collaborative team.

Location

  • In-office only – offices available in New York City, London, and Singapore.

Seniority Level: Mid-Senior level

Employment Type: Full-time

Job Function: Capital Markets and Financial Services

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.