Senior Logging, Monitoring and Alerting Engineer (IT)
We are seeking a Monitoring Systems Engineer to design, implement, and maintain our logging, monitoring, and alerting infrastructure. In this role, you'll be the key player in ensuring our systems are running smoothly, with a strong focus on synthetic monitoring, application performance monitoring, and full-stack/infrastructure monitoring. Your work will provide real-time insights into system health, helping us detect and resolve issues before they affect our users. If you have a passion for building robust monitoring solutions and enjoy solving complex problems, we want you on our team!
In this Role, Your Responsibilities Will Be:
- Design, implement, and maintain a scalable logging infrastructure that collects and stores application and system logs.
- Set up and apply monitoring tools to track critical metrics and identify potential performance bottlenecks.
- Develop and implement alerting systems that advise the appropriate teams of critical events and potential issues.
- Analyse log data and identify trends and patterns to proactively identify and prevent problems.
- Collaborate with other teams/departments to define and implement monitoring and alerting requirements for new applications and features.
Who You Are:
- Responsible for crafting, implementing, and maintaining logging, monitoring, and alerting systems.
- Have a good understanding of synthetic monitoring, application performance monitoring, and full-stack/infrastructure monitoring.
- Use financial analysis to generate, evaluate, and act on strategic options and opportunities.
- Marshal resources (people, funding, material, support) to get things done.
- Readily action new challenges, without unnecessary planning.
For This Role, You Will Need:
- Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent experience).
- 5+ years experience with any logging tools such as the ELK Stack, Splunk, and Cribl.
- Experience with monitoring tools and technologies such as Nagios and Dynatrace.
- Proficient in using Infrastructure as Code tools like Terraform and Configuration as Code tools like Ansible.
- Experience in containerization technologies, particularly Docker.
- Familiarity with cloud platforms such as AWS, Azure, and GCP.
- Experience in managing Unix, Linux, and Windows operating systems.
- Good grasp of Active Directory and Single Sign-On (SSO) concepts.
- Excellent analytical and problem-solving skills.
- Strong communication and collaboration skills.
Preferred Qualifications that Set You Apart:
- Knowledge of Kubernetes is a plus.
Please Note:
Malaysian citizenship, Permanent Residency, or ability to secure the appropriate work permit is required to work in this position at NI Penang.
US Customs Law forbids the export of certain technologies to certain countries. Due to licensing requirements which National Instruments does not presently pursue for this position, this regulation effectively prevents National Instruments from hiring candidates for this job (as it would require access to such technology) whose current country of citizenship or permanent residence are Cuba, Sudan, North Korea, Iran, and Syria.