Challenge 02 - Standardizing on Silver


Introduction

In this challenge we will begin to transform the data we loaded “as is” into the Bronze layer into more “standardized” and “integrated” datasets, which are then loaded into the Silver layer. The goal of this challenge is to merge the tables and bring in only a subset of the fields. This does not need to be production-level quality, and you do not need to determine every field that could be a match. Rather, we want to bring over just enough fields to show an intelligent report. The definition of “intelligent” is at your discretion. Duplicate names may occur, but duplicate IDs must not: each Delta table needs a unique key.
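
The merge described above can be sketched in plain Python, independent of any Spark or Delta Lake API. This is only an illustration of the logic, not the expected solution: the two source systems, their column names (`CustomerID`, `cust_id`, `full_name`, and so on), and the `customer_key` format are all hypothetical. In Azure Databricks or Azure Synapse the same steps would normally be written against DataFrames and saved as Delta tables.

```python
# Hypothetical Bronze rows, loaded "as is" from two source systems.
# All table contents and column names here are assumptions for illustration.
BRONZE_SYSTEM_A = [
    {"CustomerID": 1, "FirstName": "Ana", "LastName": "Silva", "Fax": None},
    {"CustomerID": 2, "FirstName": "Ben", "LastName": "Ito", "Fax": "555-0100"},
]
BRONZE_SYSTEM_B = [
    {"cust_id": 1, "full_name": "Ana Silva", "loyalty_tier": "gold"},
    {"cust_id": 7, "full_name": "Cy Okoro", "loyalty_tier": "none"},
]


def standardize_a(row):
    """Map a system A row onto the shared Silver columns (subset of fields)."""
    return {
        "source": "A",
        "source_id": row["CustomerID"],
        "name": f'{row["FirstName"]} {row["LastName"]}',
    }


def standardize_b(row):
    """Map a system B row onto the same Silver columns."""
    return {
        "source": "B",
        "source_id": row["cust_id"],
        "name": row["full_name"],
    }


def build_silver(rows_a, rows_b):
    """Merge both sources and attach a key that is unique across the table."""
    merged = [standardize_a(r) for r in rows_a] + [standardize_b(r) for r in rows_b]

    silver = []
    for row in merged:
        # Source IDs can collide between systems (both have an ID of 1),
        # so the key combines the source name with the source ID.
        key = f'{row["source"]}-{row["source_id"]}'
        silver.append({"customer_key": key, **row})

    # Duplicate names are acceptable; duplicate keys are not.
    keys = [r["customer_key"] for r in silver]
    if len(keys) != len(set(keys)):
        raise ValueError("Silver table contains duplicate keys")
    return silver


if __name__ == "__main__":
    for record in build_silver(BRONZE_SYSTEM_A, BRONZE_SYSTEM_B):
        print(record)
```

Prefixing the key with the source system is one simple way to keep IDs unique when two systems reuse the same numbers; a generated surrogate key would work equally well.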

Description

Some of the things we try to achieve within the Silver layer are

The above steps can be achieved using Azure Databricks or Azure Synapse. A visual guide to the process is shown below.

[Image: visual guide to the Silver layer process]

Business Case

Now that we know what we need to do, it’s important to understand why we are doing this.
From an organizational standpoint, the Silver layer serves three main purposes:

Success Criteria

To complete this challenge successfully, you should be able to:

Learning Resources

The following links may be useful in achieving the success criteria listed above.

Bonus Challenges