0% found this document useful (0 votes)
2 views9 pages

MS Azure+Azure Data Engineering-Syllabus

The document outlines a comprehensive course curriculum for MS Azure, SQL, and Azure Data Engineering, covering topics such as cloud computing, Microsoft Azure features, Azure Resource Manager, SQL database provisioning, and Azure Data Factory. It includes detailed modules on storage services, migration strategies, SQL queries, and data flow transformations using Azure Data Factory and Apache Spark. The curriculum emphasizes practical implementation and real-time scenarios to enhance learning outcomes.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2 views9 pages

MS Azure+Azure Data Engineering-Syllabus

The document outlines a comprehensive course curriculum for MS Azure, SQL, and Azure Data Engineering, covering topics such as cloud computing, Microsoft Azure features, Azure Resource Manager, SQL database provisioning, and Azure Data Factory. It includes detailed modules on storage services, migration strategies, SQL queries, and data flow transformations using Azure Data Factory and Apache Spark. The curriculum emphasizes practical implementation and real-time scenarios to enhance learning outcomes.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 9

1

Course Curriculum for MS Azure + SQL + Azure Data


Engineering

Introduction to Cloud Computing:

• Understanding different Cloud Models


• Advantages of Cloud Computing
• Different Cloud Services
• Different Cloud vendors in the market

Microsoft Azure Platform:

• Introduction to Azure
• Azure cloud computing features
• Azure Services
• Introduction of Azure Resources/Services with examples
• Azure management portal
• Advantage of Azure Cloud Computing
• Managing Azure resources with the Azure portal
• Overview of Azure Resource Manager
• Azure management services.

• What is Azure Resource Groups


• Configuration and management of Azure Resource groups for
hosting Azure services

Introduction to Azure Resource Manager & Cloud Storage Services

• Completed walkthrough of the Azure Portal with all the features.


• What is Resource Groups and why we need RG’s in Azure cloud
computing platform to host resources??
• Different types of Storage Accounts provisioning in Cloud
computing with different storage services
• (i)Container/Blob storage service,
• (ii)File share storage service,
• (iii)Table storage service &
• (iv)Queue storage service
• Details explanation & understanding of different Blob/container
storage services…
2

• (i)Page Blob.
• (ii)Append Blob &
• (iii)Block Blob
• What is Standard Storage Account and Premium Storage account
and which to use accordingly as per the real time scenarios.
• Detail explanation and implementation of Data Lake storage Gen2
Storage Account to store the unstructured data in cloud storage
services.
• All the features/properties(Overview, activity log, Tags, Access
control(IAM), Storage browser…etc) of Azure Storage Accounts.
• Maintenance and management of Storage keys and connection
string for Azure Storage services.
• Implementing different levels of access(Reader, contributor,
owners…etc) to the Azure Storage accounts

Migration of storage contents across Public & Private Clouds

• Moving the storage account with storage content across different


Resources Groups based on real time scenarios.
• Migrating the data from On-prem(Private cloud) to Azure Storage
account (Public cloud) using Az copy(forward migration).
• Migrating the data from public cloud to Private cloud(revers
migration).
• Implementing the Az copy commands to migrate the data.
• (i)On-prem to Azure cloud storage services
• (ii)cloud storage services to On-prem
• (iii)Cloud to Cloud
• Moving the SA & its content from one Resource Group to another.

Replication of Storage Accounts Authentication & Authorization of


Storage Accounts & Azure Storage Explorer

• Azure Storage explorer for creating, managing, and maintaining


the Azure storage services data.
• Installation of Azure Storage Explorer and what is the purpose of
this tool for Azure Storage accounts(its Purpose & benefits with
real time scenarios)
3

• Generate Shared Access Signature(SAS) in Azure Storage


Explorer(ASE) for security implementation of Storage account
content.
• Managing of Access keys & connection strings of SA with Azure
Storage Explorer
• Implementation of Azure Storage Account Replication from one
region(as primary) to another region(Secondary) in cases of
Disaster recovery(RPO & RTO)
• Configuration of Authentication and Authorization for Storage
Account via Azure Active Directory.

Hosting of File share Servers, Implementation of CDN for Blob


Storage Services:

• Hosting Fileshare Storage services to On prem servers or Cloud


Servers as shared drive for File share servers.
• Implementation and detail explanation of Content Delivery
Network(CDN)/Front door for Blob Storage Services

Provisioning of SQL DB’s in Private & Public cloud computing:


• Introduction to SQL DB’s
• Creation of new SQL DB’s & Sample SQL DB’s both in On-prem
and Cloud computing.
• Planning and deploying Azure SQL Database
• Implementing and managing Azure SQL Database
• Managing Azure SQL Database security
• Planning and deployment of SQL DB’s in Azure cloud computing
with real time scenarios.
• Different DB’s Deployment options.
• Databases purchasing models.(VCore & DTU’s)
• Visualization of cloud DB server, Database, and validation of data
from on-prem(private cloud)
• Implementation of Firewall security rules on Azure DB servers to
access and connect from on-prem SSMS.
• Creation of Database in on-premises and synch with azure cloud
4

SQL DB Migrations:
• Migrating SQL DB’s from On-premises to Azure cloud computing
using Microsoft Data migration assistant.
• Restoring SQL DB’s from On-prem to cloud computing.
• Migration of Specific DB objects from on-prem to cloud based
upon base upon project requirements.
• Implementation of RSV and scheduling the backups of SQL DB’s
and Azure Storage Account file share services on schedule, on
demand based upon real time scenarios.
Introduction to SQL Server & SQL Queries from basics to
Advance(till ADE Services):
• Introduction to SQL DB Queries
• Below SQL queries detail explanations, syntax & execution based
upon real time scenarios.
➢ Select queries.
➢ Distinct queries
➢ Where queries
➢ And or not queries.
➢ Order By queries
➢ Insert into queries.
➢ Null values queries
➢ Update queries
➢ Delete queries.
➢ Select Top queries.
➢ Min & Max queries
➢ Count, Avg, Sum queries.
➢ Like queries.
➢ Wildcards queries.
➢ In queries
➢ Between queries.
➢ Aliases queries.
➢ Joins(Inner join, Left join, Right join, Full join, Self-join…etc)
➢ Union queries.
➢ Group By queries.
➢ Having queries.
➢ Exists queries.
➢ Any All queries.
➢ Select into queries.
5

➢ Insert into select queries.


➢ Store procedures queries.
What is Azure Data Factory(ADF):
➢ Deep understanding and implementation of concepts/Components
of ADF
o Pipelines
o Activities
o Datasets
o Linked Services
➢ Building blocks of Azure Data Factory
o Triggers
o Integration runtime
o Dataflow
➢ Complete features and walk through of Azure Data factory studio.
➢ Different triggers and their implementation in ADF
o Scheduled trigger
o Tumbling window trigger
o Event trigger
➢ What is integration run time and different types of integration run
time in ADF.
o Azure
o Azure – SSIS
o Self-hosted
➢ When to use ADF.
➢ Why to use ADF.
➢ Different types of ADF pipelines
o Dynamic pipelines
o Parameterized pipelines
o Automated pipelines
➢ Pipelines in ADF
➢ Different types of Activities in ADF
o (i)Data movement activities
o (ii)Data transformation activities
o (iii)Data control activities.
➢ Datasets in Azure Data factory
➢ Linked services in ADF.
Controls/Activities of Azure Data Factory(ADF) for copying the
DATA across various sources to Azure IAAS & PAAS Services:
6

➢ Copying the data from Blb Storage account to ADL’s Gen2 Storage
account.
➢ Copying of zip files(.csv) from Blob SA to ADL’s Gen2 SA using ADF
➢ Implementation and explanation of Metadata control in ADF to find
the structure before copying the data.
➢ Implementation and explanation of Validation and If Condition
➢ Implementation of Get Metadata control, filter control & For Each
Control or activities in ADF.
➢ Implementation & execution to copy the data from GitHub platform to
Azure Storage services with variables and parameters.
➢ Implementation of Foreach control, copy data control and Set
variable to dynamically load the data from source to target using
ADF.
➢ Creating Dynamic pipelines with lookup activity to copy multiple .csv
files data picking form Json format data in Azure Storage services.
➢ Copying the files from GitHub Dynamically with the use of Dynamic
parameters allocation-AUTOMATION PROCESS:
➢ Copying the data from different files formats(.csv, .xlsx, .txt,
.Parquet, .Json, .SQL…etc) using suitable ADF controls/activities.
➢ Implementation and execution of Loading the data from Blb SA to
SQL DB single table & multiple tables using copy data activity,
ForEach activity,
➢ Executing multiple pipelines in parallel with Execute pipeline activity.
Scheduling Triggers for automation of Dataflow/Datacopy to
various sources and destinations in ADF:
➢ Implementation of Schedule based triggers for different ADF
pipeline containing different activities.
➢ Implementation of Event based triggers for different ADF pipeline
containing different activities.
➢ Implementation of Thumbling window-based triggers for different
ADF pipeline containing different activities.
➢ Implementation and execution of storage and Event based
triggers.

What is Azure Keyvault, purpose of using Keyvault, Storing the SA


keys, connection string in Azure KV with Access policies:
➢ Detail explanation & implementation of Azure Keyvaults,
7

➢ Making the SQL DB connection string to store in Keyvault to


enhance the security for SA content and SQL DB
➢ Generating the secrets inside the Azure keyvault and granting
access by implementing the access policies for different users.
Integrating Azure Data Factory with GitHub Portal:
➢ Detail walk through of GitHub portal
➢ Creating an account, repo’s, in GitHub portal
➢ Integrating Azure Data Factory with GitHub Portal as per project
requirements.
➢ Placing, maintaining and executing the source code via GitHub
portal for Azure Data Factory.
➢ Creating master branch, practice branches in GitHub portal to
merge the newly created code via Pull Requests.
➢ Setting up the Repo for ADF pipelines and converting to live mode
from GitHub portal covering with real time scenarios.
Data Flows Transformations in Azure Data Factory:
➢ Designing new Data flows
➢ Designing and implementing transformations like
➢ 1)Source transformation
➢ 2)Join transformations
➢ Inline Datasets in data flow source control
➢ Designing and implementing of Data flow with Source
transformations, Filter transformations & Sink transformations in
ADF with inline Datasets
➢ Implementation of Select transformations with Data flows for
various source controls.
➢ Implementation of Dataflows using Aggregate & Sink
transformation:
➢ Implementation of Dataflow with conditional split & Sink
transformation with copy data activity:
➢ Implementation of Dataflow with Exists & Sink transformation:
➢ Implementation of Azure Dataflows for Derived column
transformation with Source & Sink transformation:
➢ Implementation of Azure Dataflows to connect to SQL DB with
Source & Sink transformation:
➢ Union & Union flow transformation implementation with ADF Data
flows
8

➢ Implementation of Azure Dataflows to connect to SQL DB with


Source & Sink transformation.
➢ Implementation of windows functions…like Rank() function,
Dense_Rank() function, Row_Number() function…etc.
Azure Data Bricks & Apache Spark:
➢ What is Apache Spark, details explanation and implementation of
Apache Spark.
➢ Illustration and Elaboration of Apache Spark Architecture.
➢ Explanation of

o Resilient Distributed Dataset (RDD)


o Directed Acyclic Graph (DAG)

➢ Understanding of different Apache Spark components


o Spark Core
o Spark SQL
o Spark Streaming
o MLlib
o Graph-X

➢ What are worker nodes and slaves nodes in Azure Data Bricks
clusters
➢ Implementation of Azure Databricks cluster by considering different
worker nodes and slave nodes.
➢ Different features and properties of Azure Data Bricks clusters

o Single node
o Multi node
o Photon acceleration
o Auto turn off Azure Data bricks cluster after a defined time.
o Autoscaling of cluster
o Configuration provisioning of Azure Data Bricks clusters

Azure Data Bricks & Apache Spark clusters features:

o Creating single node and multi nodes clusters


9

o Creation of Pyspark notebooks in Databricks cluster to fulfil different


business requirements.

You might also like