Senior Data Engineer (AI / ML)

Description

Our client, a multinational pharmaceutical company, is looking for a Data Engineer.

 

Mission

Advanced analytics and AI are high on the agenda at our client, which wants to strengthen its internal team of AI experts with a particular focus on sales & marketing. In this context, the client is looking for an outstanding data engineer with strong Python and Spark skills to contribute to the development of analytics workflows focused on insight generation, prescriptive analytics and decision-support apps.

Responsibilities

  • Develop and operate data pipelines processing large, complex datasets as input for analytics and machine learning;

  • Help to define the analytical scope and data for projects, including investigating data sources, designing new features and data integration flows;

  • Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.;

  • Create data tools for analytics and data scientist team members that assist them in building and optimizing their results;

  • Use a diverse array of technologies and data science toolsets as needed, primarily Python, Spark and Pandas, but also Jupyter, Denodo, Azure ML, Azure DevOps, Docker, Databricks, Git, SQL, etc.;

  • Communicate ideas, approaches and results with peers and stakeholders.

Skills / Requirements

  • Mastery of Python, Spark and Pandas to create ETL pipelines for data scientists to use; knowledge of one or more data pipeline frameworks is a plus;

  • At least 3 years of intensive hands-on experience as a full-stack Python data engineer: Python, Spark, Pandas, NumPy, SciPy, visualization (matplotlib), machine learning (scikit-learn), data pipeline orchestration (e.g. Kedro);

  • Good knowledge of and experience with version control systems (Git);

  • Good knowledge of and experience with databases;

  • Advanced degree in a relevant discipline such as Statistics, Applied Mathematics, Operations Research/Optimization, Computer Science, Computational/Theoretical Physics, Data Science/Visualization, Machine Learning, Electrical/Computer Engineering or Health Sciences (e.g. Bioengineering / Bioinformatics);

  • Experience in extracting, cleaning, preparing and modeling data. Experience with command-line scripting, data structures, and algorithms;

  • Ability to work across structured, semi-structured, and unstructured data;

  • Strong presentation and communication skills towards peer data scientists and non-technical stakeholders;

  • Ability to work individually and in teams (agile);

  • Experience with the healthcare / pharmaceutical industry is a plus;

  • Experience with sales & marketing analytics is a plus.

 

Additional Information

  • Hours per week: Full time

  • Duration of the contract: 6 months (followed by extensions)

  • Start date: ASAP

  • Location: fully remote

 

Do you want to apply for this job? Let us know by sending your CV to hello@akindra.ro

