Enable job alerts via email!
Boost your interview chances
Create a job specific, tailored resume for higher success rate.
An established industry player is seeking a proactive Platform Engineer specializing in MLOps to enhance AI/ML operations. In this dynamic role, you will collaborate with engineers and researchers to design and implement robust CI/CD pipelines, ensuring the reliability and efficiency of extensive training environments. Your expertise in tools like Docker and Kubernetes will be crucial for managing large-scale infrastructure and optimizing system performance. If you thrive in fast-paced environments and are driven by innovation, this opportunity offers a chance to make a significant impact in the AI/ML landscape.
About this role
As a Platform engineer, MLOps, you will be critical to deploying and managing cutting-edge infrastructure crucial for AI/ML operations, and you will collaborate with AI/ML engineers and researchers to develop a robust CI/CD pipeline that supports safe and reproducible experiments. Your expertise will also extend to setting up and maintaining monitoring, logging, and alerting systems to oversee extensive training runs and client-facing APIs. You will ensure that training environments are optimally available and efficiently managed across multiple clusters, enhancing our containerization and orchestration systems with advanced tools like Docker and Kubernetes.
This role demands a proactive approach to maintaining large Kubernetes clusters, optimizing system performance, and providing operational support for our suite of software solutions. If you are driven by challenges and motivated by the continuous pursuit of innovation, this role offers the opportunity to make a significant impact in a dynamic, fast-paced environment.
????️ Your responsibilities:
️ Is this you?
Preferred skills and experience: