Azure HDInsight Readiness Resources
Azure HDInsight is a managed analytics service that runs open-source frameworks such as Apache Spark, Apache Hive, Apache Kafka, and Apache Hadoop. You choose a cluster type based on the enterprise’s analytical needs (streaming, analytics, warehousing, and so on); the available types include a basic Hadoop cluster, a Spark cluster, and an HBase cluster, among others.
Keeping Up
See the latest updates on Azure HDInsight.
Fundamentals
The best place to start with HDInsight is the HDInsight documentation page on Microsoft Learn. This page covers the basics, including cluster types, how-to guides, and migration and security topics.
- Intro to HDInsight
- Interested in learning what HDInsight is, how it works, and when to use it? This MS Learn module walks through the basics.
- MS Learn HDInsight Modules
- There are several learning guides geared toward typical implementations, ranging from streaming with Spark and Kafka to using Interactive Query and HBase.
Advanced
- Pluralsight HDInsight Course
- This content requires a Pluralsight subscription.
- Migrating Big Data Workloads to Azure HDInsight
- This comprehensive guide walks through many scenarios and considerations, and includes samples.
Certifications
- DP-900 Azure Data Fundamentals
- A broad exam that tests knowledge of core data concepts related to Microsoft Azure data services, including HDInsight.
- DP-203 Data Engineering on Microsoft Azure
- Passing DP-203 is required for the Microsoft Certified: Azure Data Engineer Associate certification.