Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia
Site Reliability Engineers (Middle & Senior)
Job Description:
Job Ref: QV795WR6
People Profilers is hiring on behalf of a leading global consulting firm for Site Reliability Engineers (Consultant & Senior Consultant Level), based in KL, Malaysia.
Responsibilities:
You will perform primary and secondary research, conduct analyses, and carry out appropriate modeling tasks that feed directly into the development of technology-enabled solutions for tackling our clients' complex business problems.
You will leverage your training in technology, analytical abilities, and communication skills to support the project teams in delivering our digital solution architectures and in developing work products that address our clients' business needs and help achieve their strategic goals.
You will support the project teams in developing presentation materials and in coordinating communications with the client.
You will assist the project teams in delivering business-driven, technology-enabled solutions to help our clients meet pressing challenges and seize opportunities in their respective markets.
You will work with diverse and talented project team members to solve problems, improve performance, and generate value for our clients across all industries.
You will uphold the firm's standards and ethos in working with fellow team members and in your interactions with the clients.
You will support business development efforts by contributing directly to the preparation and development of proposals, presentations, and publications.
Demonstrate a strong commitment to personal learning and development.
Understand how our daily work contributes to the priorities of the team and business.
Understand the set expectations and demonstrate accountability in keeping personal performance on track.
Actively focus on developing effective communication and relationship-building skills with stakeholders, clients, and the team.
Demonstrate an appreciation for working with others.
Understand what is fundamental to Deloitte's success as a business.
Demonstrate integrity and an awareness of strengths, differences, and personal impact.
Develop your understanding of Deloitte and offer a fresh perspective.
Possess strong strategic and analytical thinking skills.
Ability to identify and mitigate risks to the product.
Ability to provide oral and written discussion of analytical findings using narrative and graphic forms.
Ability to explain complex, technical concepts in digestible ways.
Ability to prioritize tasks according to their importance.
Requirements:
You will be responsible for managing and delivering one or more systems within a platform using agile practices, drawing on your existing experience of working in an agile environment.
The right person will have 3 to 6 years of relevant experience in DevOps or SRE.
Should be well versed in the concepts of DevOps and have a full understanding of Site Reliability Engineering (SRE) principles.
Knowledge of the relationship between service level indicators (SLIs) and service level objectives (SLOs) when measuring service reliability.
Must be familiar with well-known system monitoring and configuration management tools such as Elasticsearch, Grafana, Prometheus, Ansible, and SaltStack.
Must be familiar with Linux system administration and Linux shell programming (Bash).
Possesses programming skills in one or more of these languages: Java, Python.
Experience addressing production issues with effective solutions, with a demonstrated ability to debug and troubleshoot issues at the application, infrastructure, and operating system levels.
Experience in administering, deploying, configuring, managing, and troubleshooting Kubernetes clusters and related applications (e.g., Istio, Consul).
Experience in automating the deployment, configuration, management, and troubleshooting of containerized, cloud-native applications running on Kubernetes.
Experience in coordinating with development teams to streamline code deployment with CI/CD and IaC pipelines, and the ability to build automated solutions through code.
Familiar with message queue systems (e.g., Kafka, RabbitMQ) and other distributed systems (e.g., Consul, ZooKeeper, MongoDB, Redis).
Experience in conducting system tests for security, performance, availability, and reliability.
Demonstrated skills in communication (oral, written, presentation), analysis, problem solving, and short-term and long-term planning.
Demonstrated portfolio of work showcasing technical competence.
An appreciation of the consulting lifestyle and the ability to travel (both locally and abroad) are prerequisites for our short-term and long-term project assignments.