(Senior) Data Engineer - Applied ML & Distributed Compute (m/f/d)
ECDB GmbH
Hamburg
Hybrid
EUR 60,000 - 80,000
Full-time
Posted yesterday
Summary
A data-driven eCommerce company in Hamburg is seeking a skilled Data Engineer to own and optimize data processing pipelines. This role involves working with large-scale data and implementing machine learning models. Candidates should have over 4 years of experience in Python and distributed computing frameworks. The company offers attractive benefits including flexible working hours and opportunities for personal growth. The modern office in Hamburg’s historic Speicherstadt fosters a unique working atmosphere.
Benefits
Attractive career opportunities
Flexible working hours
Continuous learning and development
Modern office ambiance
Qualifications
4+ years of relevant professional experience.
Proven track record in Python-heavy data processing.
Experience with distributed compute frameworks on object-storage datasets.
Practical ML experience including training and deployment.
Able to handle messy, large-scale data.
Tasks
Own large-scale data processing pipelines and batch processing.
Design and optimize distributed compute workloads.
Train, deploy and monitor ML models at scale.
Productionize models with batch inference and retraining.
Implement AI-assisted pipelines for classification or extraction.
Skills
Python
Machine Learning
Data Processing
Distributed Computing
Data Analysis
Tools
Spark
Dask
Ray
Job description
About us
ECDB – Shaping the Future of eCommerce with Data! At ECDB, we firmly believe that data determines success in eCommerce. That’s why we provide leading companies like Amazon, Google, and PayPal with the most precise analyses and market insights. With billions of transactions as our foundation, we are developing one of the most comprehensive eCommerce data platforms worldwide. Our team of over 50 experts combines cutting-edge technology with deep industry knowledge – and this is where you come in! If you're eager to shape the future of eCommerce through data-driven insights, ECDB is the perfect place for you.
Tasks
Own large-scale data processing pipelines, including batch processing of raw, unstructured data
Design and optimize distributed compute workloads to transform large-scale web and natural language data into structured, production-ready datasets
Train, deploy and monitor ML models at scale (e.g., NLP models, classifiers and enrichment use cases)
* The salary benchmark is calculated from target salaries at leading companies in the relevant industry. It serves premium users as a guideline for evaluating open positions and as a reference point in salary negotiations. The salary benchmark is not provided directly by the company. Actual pay may be significantly above or below this value.