How To Build Data Pipelines On AWS - Reference Workflow
How To Build Data Pipelines On AWS - Reference Workflow
@aTechGuide
How to build
Data Pipelines
in AWS
using Let’s look into
7 Simple
services
[2023 Edition]
atech.guide
Buckle Up,
to build end to end
Data Pipeline
atech.guide
First Question …
atech.guide
How to Load Data in
AWS?
atech.guide
1. AWS Database Migration Service
atech.guide
ff
We have a
mechanism to Pull
Data.
But …
atech.guide
Where to Save
data?
atech.guide
2. Amazon S3
2. It is an Distributed Object
store
atech.guide
Can we build Tables
on top of Raw Data?
atech.guide
A Schema on Read
will be helpful.
How to do that?
atech.guide
3. AWS Glue Data Catalog
atech.guide
fi
How to Query the
Tables?
atech.guide
4. Amazon Athena
1. Amazon Athena is an
Interactive Query service to
analyse data in S3
atech.guide
Now, we need to
3. Aggregate data
4. …
atech.guide
How to do
compute?
atech.guide
5. Amazon EMR
atech.guide
fi
We have all the
pieces but how to
run everything
Automatically and in
Sequence
atech.guide
How to Automate
the Work ow?
atech.guide
fl
6. Amazon MWAA
atech.guide
fl
fl
fl
fl
Also, we need to
build Graph on top
of aggregated
Metrics
And
Dashboard
containing those
Graphs
atech.guide
How to build
visualisations?
atech.guide
7. Amazon QuickSight
atech.guide
We have an End to
End Pipeline 🥳
atech.guide
Over to You,
If we are using
Tableau, How shall
our pipeline
change?
atech.guide
Want to be a part of
Community?
https://discord.gg/Pc9ed8krYK
atech.guide
Kamran Ali
@aTechGuide
That’s a Wrap
atech.guide
fi
fi