Large hedge fund is a private institutional investment management complex consisting of an international team of researchers and technologists who constantly work toward ever-greater quantification and automation in the development of processes. We develop and deploy systematic financial strategies across a variety of asset classes in global markets, utilizing a proprietary research platform and risk management process. We are seeking an engineer to analyze, extract, transform and onboard vendor's data to Data Lake with the support of our data system.
Responsibilities
In this role, you will analyze complex datasets to derive insights that influence business strategies. You will collaborate with cross-functional teams to design and implement analytical solutions, ensuring data accuracy and integrity. Strong problem-solving skills and the ability to clearly communicate your findings to stakeholders are crucial for success in this role.
SKILLS
Must have
Proficient in Python and Logical Programming: Extensive experience with core Python libraries (requests, pandas, numpy, json, pyplot, etc.) and frameworks, coupled with a strong ability to apply logical reasoning and develop efficient algorithms to solve complex programming problems.
PySpark: Experience with distributed data processing using PySpark.
Strong Bash/Linux Skills: Comfortable navigating the command line, writing shell scripts, and managing files/permissions.
Experience working with APIs: Familiarity with RESTful APIs, API authentication, and data serialization formats (e.g., JSON, XML).
Proficient with Git: Experience with branching, merging, pull requests, and conflict resolution.
CI/CD Proficiency: Hands-on experience implementing and managing CI/CD pipelines. Experience with GitLab Runner is a plus.
Proactive Problem Solver: Demonstrates initiative and takes ownership of challenges.
Excellent Communication Skills: Communicates effectively both verbally and in writing.
Strong Technical Writing Skills: Emphasis on writing clear and concise documentation for both the code itself and the overall implementation process.
Research Mindset: Embraces working with unknowns and proactively explores solutions.
Nice to have
SQL and Database Experience: Familiarity with relational databases and writing SQL queries.
Mathematical/Statistical Skills: Ability to analyze data, identify trends, and draw meaningful conclusions.
Airflow: Experience with workflow orchestration and scheduling using Airflow.
AI/LLM for Automation: Interest in and experience with applying AI and Large Language Models to automate processes.