- Professional data scientist
- Focuses on three areas of the AI lifecycle: data engineering in the curated and cleaned data layer, modelling & business analysis.
- Acts as supporting role for the professionals in charge of model and infrastructure deployments.
- Is responsible to translate questions from the business into viable AI/ML solutions
- Works with different tool sets most of them requiring a code-first approach.
- Breaks the question into a process flow that always includes an understanding of the business problem, an understanding of the data required, and the types of AI/ML techniques that can solve the problem
Required Skills - Sound foundations in mathematics, data science, machine learning and they often have business acumen. They are pretty proficient at coding in Python and sometimes R.
- Citizen data scientist
- Focuses mainly on the light data engineering with curated data, modelling, and business analysis.
- Often prefers low to no-code options such as automated machine learning, GUI-based AI designers or pre-trained models available in APIs like Microsoft cognitive services.
- Solid on analytics but does not normally have deep algorithmic coding skills.
- Works heavily with data visualization tools like PowerBI, for instance.
- Often works in the business teams and normally is the point of contact for data science projects.
Required Skills - Solid business domain expertise, strong in analytics, good statistical knowledge and acquainted with hypothesis testing and validation.
ML engineer
Core responsibilities of the ML Engineer:
- Involved in three stages of the lifecycle: data development (pre-processing), model development and production.
- Mainly responsible for productionizing a model, with a strong focus on software development practices such as DevOps, CI/CD, monitoring and the right AI infrastructure for scaling the solution
- This role requires strong coding skills, background in software development, has to be able to communicate on multiple different levels in order for the right implementation in the three different environments (dev/test/prod) of pipeline execution.
- Focuses more on pipelines focused on ML, together with the data engineer who is responsible for the data pipelines.
Required Skills - Strong in mathematics, software engineering & machine learning. The ML engineer is very strong in coding specifically focused on object-oriented programs such as Python, Java, C++. Work with regular software development tooling such as Azure DevOps, Git, CLI, Visual Studio Code.
2021-2023, Microsoft Revision
5b03f29 Azure ML-Ops (Accelerator)
main