Senior Data Engineer
Job Description
Job Title: Senior Data Engineer
Job ID: 2022-11155
Job Location: Remote
Employment Type: W2
Candidate Constraints:
Duration: Long term
Work Eligibility: All Work Authorizations are Permitted – No Visa Transfers
Key Technology: Python, Scala, Pandas, Unix/Linux, Azure or AWS, RDBMS, SQL
Job Responsibilities:
- Applying new algorithms to enable fuzzy matching
- Scaling the application of the algorithm using dataframe libraries such as Pandas, and GPUs
- Applying new algorithms to introduce new linkages from the data
- Segmenting data using diverse intuitive and non-intuitive data points and, potentially, fuzzy-matched data sources
- Working to design and build new approaches to measuring quality and improving algorithms and matching processes
- Working with the data platform to build solutions
- Preparing reports and presenting analytic results in non-technical language, with an orientation toward answering questions and addressing issues
Skills and Experience Required:
- Experience building production data pipelines with distributed data processing technologies.
- Hands-on and deep experience with schema design and data modeling.
- Proficiency in Python and a passion for writing clean, supportable code.
- Advocacy for data quality. You have a strong opinion on when data audits, unit tests, and documentation can be most effective.
- Technical thought leadership
- Business skills sufficient to understand problems and build the algorithms needed to find answers that fit the business needs
- Experience mentoring others, and enjoyment in doing so.
Ideal Candidate:
- 5+ years of experience as a Data Engineer/Developer
- Python
- Experience with Scala, Pandas
- Experience debugging complex data pipelines
- Experience with Unix/Linux
- Azure or AWS experience
- Experience with databases
- RDBMS/SQL
- Experience with data normalization
Desired Skills:
- Experience with Big Data fundamentals
- Machine learning libraries in Python (including unsupervised learning models)
- Hadoop, Spark, Parquet
- Experience with AWS
- Experience with APIs
- Experience with Java
- Experience with streaming technologies
- Kafka, Flink, Spark Streaming
- Experience with NoSQL databases
- Experience with monitoring tools
- Knowledge of K8s/Docker