The Apple Services Engineering - Experimentation data engineering team is looking for a Data Pipeline Engineer to automate and build data pipelines for search and recommendations features.
Description: This team designs and runs online and offline experiments, and builds the tools that support them, to enhance data-driven features. The role involves automating data delivery, collaborating with Data Scientists to understand critical metrics, and writing code for data ingestion, computation, and organization.
Minimum Qualifications:
Bachelor's degree in Computer Science/Engineering or a related field
At least 3 years of experience with Big Data systems, ETL, data processing, and analytics tools
Proficiency in Scala, Spark, or Python
Experience with Hadoop, Spark, and maintaining large software systems
Ability to write and maintain a test suite
Preferred Qualifications:
Proficiency in SQL, Hive, and SparkSQL
Understanding of entity-relationship modeling and normalization
Experience with sessionization of clickstream and time-series data
Familiarity with dimensional modeling
Knowledge of continuous integration, version control (e.g., Git), and data visualization tools
Deep understanding of data structures and data transformation methods
Responsibilities:
Stay current with technology trends
Collaborate with software engineering teams to enhance data collection
Process and cleanse data, and validate its integrity
Engineer durable and reliable code
Optimize code performance as data scales
Generate automated reports that present key insights