We are looking for a passionate, talented, and inventive Data Engineer to lead the development of industry-leading data solutions to support conversational technology with multimodal systems.
As a Data Engineer with the GENIE ML Engineering team, you will be responsible for leading the development of novel data solutions, data modeling techniques, and data pipelines to advance our data analysis and business reporting for Customer Telemetry and related model training/analysis for state of the art conversational multimodal systems. Your work will directly impact our customers in the form of products and services that make use of vision and language technology e.g., data pipelines that feed into solutions to deliver customer features. You will leverage Amazon’s heterogeneous data sources and large-scale computing resources to accelerate development with multimodal Large Language Models (LLMs) and Generative Artificial Intelligence (Gen AI). You will have significant influence on our overall strategy by helping define data, enrichment, model optimizations and evaluation. You will drive the system architecture, and spearhead the best practices that enable a quality infrastructure.
As a Data Engineer, you will be working in one of the world's largest and most complex data warehouse environments using the latest set of tools. You will help GENIE teams and senior leadership within Alexa Devices understand our customers by providing data and metrics that provide insight into user experience. Our team is responsible for AI model analysis and/or training as well as analytical reports and metrics that are viewed at the highest levels in the organization. We are also working on near-real-time analytics using the latest set of tools for data visualization and investing in Big Data technologies. You should have deep expertise in the design, creation, management, and business use of extremely large datasets, as well as data security and privacy standards and best practices.
- 3+ years of data engineering experience
- Experience with data modeling, warehousing and building ETL pipelines
- Experience in at least one modern scripting or programming language, such as Python, Java, Scala, or NodeJS
- Experience with AWS technologies like Redshift, S3, AWS Glue, EMR, Kinesis, FireHose, Lambda, and IAM roles and permissions
- Experience with non-relational databases / data stores (object storage, document or key-value stores, graph databases, column-family databases)
- Experience working on and delivering end to end projects independently
Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, disability, age, or other legally protected status.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit this link for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.