Level 3 Support and SRE

ALLTECH CONSULTING SVC INC
Drummondville
CAD 80,000 - 100,000
Job description

Technology at our Company:

Technology is the key differentiator that ensures that we manage our global businesses and serve clients on a market-leading platform that is resilient, safe, efficient, smart, fast, and flexible. The Technology division partners with our business units and leading technology companies to redefine how we do business in ever more global and dynamic financial markets.

Our sizeable investment in technology results in leading-edge tools, software, and systems. Our insights, applications, and infrastructure give a competitive edge to clients’ businesses and to our own.

Position Description:

The Core Services L3 support team is part of the Enterprise Computing Data Services Organization in the Company. The team manages and supports a variety of applications developed in-house for purposes like application management and application coordination using Apache Zookeeper, API Proxy, Automation Platform using Ansible Automation Platform and Infrastructure as Code using Terraform. It serves as the highest level of escalation, and actively engages engineering teams who develop the products and tooling to maintain service stability.

This position is a Level 3 support and SRE role with global responsibility for managing and providing support for these middleware products with on-call coverage to handle production escalations.

The successful candidate will be involved in day-to-day management of the infrastructure environment, troubleshooting with users, handling of changes, incidents, escalations, and problem management. The person would also be routinely working with engineering teams who developed these products to resolve problems and proactively automate operational and user processes to reduce toil and time to market.

Required Skills:

  1. 8+ years of overall IT experience.
  2. Advanced Linux / Unix support experience required.
  3. Strong shell scripting and python programming skills for SRE related activities required.
  4. Experience on using Splunk OR Grafana/Prometheus/Loki stack required, preferably both.
  5. General understanding of Veritas Cluster Service, Load Balancers, and VMWare required.
  6. Knowledge of ITIL principles required.
  7. Effective oral and written communication skills, and interpersonal skills to work well in a team environment required.
  8. Strong organizational and coordination skills with the ability to manage multiple tasks and high-pressure situations for outage handling, management, or resolution.
  9. Be available for weekend work.

Desired Skills:

  1. Experience in application support, code release and liaison with development teams highly desired.
  2. Experience on automation with Ansible playbooks highly desired.
  3. Experience on Ansible Automation Platform administration highly desired.
  4. Experience on Terraform, especially Terraform Enterprise highly desired.
  5. Knowledge of Dockers, Kubernetes/OpenShift highly desired.
  6. Experience in development tool chain such as git, bitbucket and CI/CD tools preferred.
  7. Experience in Agile methodologies preferred.
  8. Good knowledge of JVMs and its garbage collection mechanisms preferred.
  9. Experience on relational databases and webservers / application servers preferred.
Get a free, confidential resume review.
Select file or drag and drop it
Avatar
Free online coaching
Improve your chances of getting that interview invitation!
Be the first to explore new Level 3 Support and SRE jobs in Drummondville