Data Scientist - Vertex AI/Llama/Edge AI
Capgemini Engineering, a global leader in engineering services, brings together engineering, science and architecture teams to help the world’s most innovative companies unlock their potential and contribute to a better future. From self-driving cars to life-saving robots, our digital and software experts go beyond the conventional, providing unique R&D and engineering services across all business sectors.
Responsibilities:
- Collect and preprocess data from various sources to ensure it is clean, accurate, and ready for analysis.
- Explore and understand the characteristics of the data to inform modeling decisions.
- Apply statistical methods to analyze data trends and patterns.
- Conduct hypothesis testing to validate findings and draw meaningful conclusions.
- Develop, implement, and fine-tune machine learning models to solve specific business problems.
- Select appropriate algorithms based on the nature of the data and the problem at hand.
- Identify and create relevant features from raw data to enhance model performance.
- Consider domain knowledge to engineer features that capture meaningful information.
- Assess the performance of machine learning models using appropriate metrics.
- Implement cross-validation techniques to ensure model generalizability.
- Work closely with domain experts, engineers, and other stakeholders to understand business requirements and incorporate domain knowledge into data science solutions.
- Present findings and insights through compelling data visualizations.
- Communicate complex technical concepts to non-technical stakeholders effectively.
- Stay updated on the latest advancements in data science, machine learning, and relevant technologies.
- Experiment with new tools and techniques to improve model performance.
- Ensure the ethical use of data, considering issues related to bias, fairness, and privacy.
Qualifications:
- BS in Computer Science, Statistics, Mathematics, or a related discipline with 5+ years of experience.
- Experience with Llama/Vertex AI/Edge AI.
- An advanced degree (Master's or Ph.D.) is preferred.
- Proficiency in Python.
- Familiarity with libraries and frameworks like NumPy, pandas, scikit-learn, TensorFlow, or PyTorch.
- Strong understanding of statistical concepts and techniques, including regression, hypothesis testing, and probability.
- Experience with a variety of machine learning algorithms and techniques for classification, regression, clustering, and feature selection.
- Skills in data manipulation and analysis using SQL and data libraries in Python or R.
- Familiarity with big data and distributed computing technologies such as Hadoop and Spark.
- Proficiency in data visualization tools such as Matplotlib, Seaborn, Tableau, or Power BI.
- Domain-specific knowledge relevant to the industry is highly beneficial.
- Strong communication skills to convey complex findings to both technical and non-technical stakeholders.
- Critical thinking and problem-solving abilities to address complex business challenges using data-driven approaches.
ABOUT CAPGEMINI:
Capgemini is a global leader in transforming clients' businesses by harnessing the full power of technology. We are guided by a purpose to achieve an inclusive and sustainable future through technology and the energy of those who make it. We are a responsible and diverse company, a leading international IT and engineering services company with more than 360,000 professionals in over 50 countries. With a strong 55-year heritage and deep industry expertise, clients trust Capgemini to address their total business needs, from strategy and design to operations, driven by the fast-paced world of cloud, data, AI, connectivity, software, digital platforms and engineering.