Senior ML Data Engineer

Dow Jones & Company, Inc.
Barcelona
EUR 30.000 - 50.000
Descripción del empleo

About Our Organization:

Dow Jones is a global provider of news and business information, delivering content to consumers and organizations around the world across multiple formats, including print, digital, mobile and live events. Dow Jones has produced unrivaled quality content for more than 130 years and today has one of the world’s largest news-gathering operations globally. It is home to leading publications and products including the flagship Wall Street Journal, America’s largest newspaper by paid circulation; Barron’s, MarketWatch, Mansion Global, Financial News, Investor’s Business Daily, Factiva, Dow Jones Risk & Compliance, Dow Jones Newswires, OPIS and Chemical Market Analytics. Dow Jones is a division of News Corp (Nasdaq: NWS, NWSA; ASX: NWS, NWSLV).

About the Team:

Our Technology team drives the evolution of our Technology, Engineering, Data, Product and User Experience functions. With a keen focus on delivering cutting-edge solutions, we shape the digital landscape for our customers, readers, and users. From revolutionizing visuals to optimizing tools and harnessing the power of data, mobile, video, and social platforms, our team is committed to providing a seamless and immersive experience across all touchpoints. Collaborating closely with our newsrooms and strategic partners, we spearhead the development of groundbreaking products and technologies.

About the Role:

Dow Jones is seeking an experienced Data Engineer to join our AI Engineering Team. You will be responsible for designing, developing, and maintaining robust data pipelines for data scraping, processing, extraction, transformation, loading, and storage. You will collaborate within our team to ensure the efficient and reliable retrieval of data, enabling seamless integration with downstream systems for analysis and decision-making.

You Will:

  1. Collaborate with data scientists and ML engineers to design, develop, and maintain end-to-end data pipelines for extraction, transformation, loading (ETL), and storage.
  2. Clean, transform, and structure data using industry-standard techniques, ensuring quality and consistency.
  3. Work with APIs to retrieve data from external sources or integrate with third-party services, adhering to best practices.
  4. Manage and optimize SQL and NoSQL database systems for data storage, ensuring integrity and performance.
  5. Automate data fetching, processing, and storage by implementing data pipelines for ML/AI use cases, leveraging ETL principles.
  6. Identify and troubleshoot issues related to data quality and pipeline performance, applying problem-solving skills.
  7. Communicate effectively with stakeholders and data providers to gather requirements and ensure project alignment.
  8. Stay updated with industry trends, emerging technologies, and best practices in data engineering for Machine Learning.

You Have:

  1. At least 3 years of industrial experience in a data engineering role.
  2. Experience with cloud-based infrastructure and services (AWS, GCP preferred).
  3. Experience in designing and implementing end-to-end data pipelines for ML/AI use cases. Preferably in Airflow or GCP Cloud Composer.
  4. Ability to work with APIs to retrieve data from external sources or integrate with third-party services.
  5. Familiarity with NLP and Machine Learning frameworks and libraries (e.g., PyTorch, HuggingFace, LangChain, spaCy, NLTK, scikit-learn, etc.).
  6. Experience with working on Jenkins and/or Docker.
  7. Familiarity with database systems, including SQL and NoSQL databases.
  8. Bachelor's Degree or higher in Computer Science, Computer Engineering, Data Science or related STEM field preferred.
  9. Strong communication and collaboration skills to work effectively with team members and stakeholders.
  10. Continuous learning mindset with a willingness to stay updated with industry trends and best practices.

Reasonable accommodation: Dow Jones, Making Careers Newsworthy - We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity or expression, pregnancy, age, national origin, disability status, genetic information, protected veteran status, or any other characteristic protected by law. EEO/AA/M/F/Disabled/Vets. We strongly encourage applications from all qualified individuals, including women, people with disabilities, and those from underrepresented groups.

Obtenga la revisión gratuita y confidencial de su currículum.
Selecciona un archivo o arrástralo y suéltalo
Avatar
Asesoramiento online gratuito
¡Mejora tus posibilidades de entrevistarte para ese puesto!
Adelántate y explora vacantes nuevas de Senior ML Data Engineer en