Enable job alerts via email!
Boost your interview chances
Create a job specific, tailored resume for higher success rate.
An established industry player is seeking a passionate engineer to join their dynamic team focused on high-performance machine learning workloads. This role involves rapid prototyping of optimized CPU kernels to enhance model performance and accuracy, directly influencing future CPU architecture development. With a commitment to fostering a diverse and inclusive workplace, this innovative company offers an attractive relocation package and encourages talent to thrive. If you have a strong background in CPU architecture and kernel implementation, this is an exciting opportunity to contribute to groundbreaking projects in a collaborative environment.
High-performance ML workloads on Arm CPUs requires the co-development of algorithms and highly optimized CPU kernels. In CT-ML (Central Technology, Machine Learning), rapid kernel prototyping is crucial for exploring algorithms and assessing trade-offs between model accuracy and performance. Successful prototypes are essential to drive future CPU architecture development and also deliverables to Central Engineering for final production.
This position is part of a dedicated team within the CT-ML group to focus on analyzing ML workload, rapid prototyping of highly optimized CPU kernels to drive model performance and accuracies.
Arm is committed to global talent acquisition, offering an attractive relocation package. With offices around the world, Arm is a diverse organization of dedicated, creative and highly talented engineers. By enabling a dynamic, inclusive, meritocratic, and open workplace, where all our people can grow and succeed, we encourage our people to share their unrivaled contributions to Arm's success in the global marketplace.