Data Engineer 4 months - with extensions Remote Active SC clearance required £640 per day inside ir35
REQUIRED
Strong understanding of data concepts - data types, data structures, schemas (both JSON and Spark), schema management etc.
Strong understanding of complex JSON manipulation.
Experience working with Data Pipelines using a custom Python/PySpark frameworks.
Strong understanding of the 4 core Data categories (Reference, Master, Transactional, Freeform) and the implications of each, particularly managing/handling Reference Data.
Strong understanding of Data Security principles - data owners, access controls - row and column level, GDPR etc including experience of handling sensitive datasets.
Strong problem solving and analytical skills, particularly able to demonstrate these intuitively (able to work a problem out, not follow a work instruction to resolve).
Experience working in a support role would be beneficial, particularly able to demonstrate incident triage and handling skills/knowledge (SLAs etc).
Fundamental linux system administration knowledge - ssh keys and config etc, Bash CLI and scripting, Environment variables.
Experience using browser based IDEs (Jupyter Notebooks, RStudio etc).
Experience working in a dynamic Agile environment (SAFE, scrum, sprints, JIRA etc).
Python (as a programming language, not just being able to write basic scripts).
LANGUAGES / FRAMEWORKS
JSON
YAML
Python (as a programming language, not just able to write basic scripts)
Pydantic experience DESIRABLE
SQL
PySpark
Delta Lake
Bash (both CLI usage and scripting)
Git
Markdown
Scala DESIRABLE
Azure SQL Server as a HIVE Metastore DESIRABLE
TECHNOLOGIES
Azure Databricks
Apache Spark
Delta Tables
Data processing with Python
PowerBI (Integration / Data Ingestion)
JIRA
If you meet the above requirements, please apply for the vacancy to be contacted by an Experis Consultant. If you haven't been contacted within 2 weeks of application, please consider the vacancy closed.