Intro to ETL
Step 1: Extract
• The extraction process copies or exports raw
data from multiple source locations and stores
it in a staging location for further processing.
• Source locations can hold any type of data,
including SQL or NoSQL servers, flat files,
emails, logs, web pages, CRM and ERP systems,
and spreadsheets.
• Common data extraction methods are:
• Partial extraction with update notification
• Partial extraction without update notification
• Full extraction
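The three extraction methods above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `orders` table, its `updated_at` column, and all function names are hypothetical, and an in-memory SQLite database stands in for a real source system. It also assumes the usual reading of the terms: "with update notification" means the source reports which records changed, "without update notification" means the extractor must find changed records itself (here by comparing against a stored watermark), and "full extraction" copies everything.

```python
import sqlite3


def make_source():
    """Build a toy source system (hypothetical schema, for illustration only)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE orders (id INTEGER PRIMARY KEY, amount REAL, updated_at INTEGER)"
    )
    conn.executemany(
        "INSERT INTO orders VALUES (?, ?, ?)",
        [(1, 10.0, 100), (2, 20.0, 200), (3, 30.0, 300)],
    )
    return conn


def full_extraction(conn):
    """Full extraction: copy every row, changed or not."""
    return conn.execute(
        "SELECT id, amount, updated_at FROM orders ORDER BY id"
    ).fetchall()


def partial_with_notification(conn, changed_ids):
    """Partial extraction with update notification.

    Assumes the source has already told us which ids changed.
    """
    if not changed_ids:
        return []
    placeholders = ",".join("?" for _ in changed_ids)
    query = (
        "SELECT id, amount, updated_at FROM orders "
        f"WHERE id IN ({placeholders}) ORDER BY id"
    )
    return conn.execute(query, list(changed_ids)).fetchall()


def partial_without_notification(conn, last_watermark):
    """Partial extraction without update notification.

    No notification arrives, so rows are selected by comparing updated_at
    with the watermark saved after the previous run (an assumed convention).
    """
    return conn.execute(
        "SELECT id, amount, updated_at FROM orders "
        "WHERE updated_at > ? ORDER BY id",
        (last_watermark,),
    ).fetchall()


if __name__ == "__main__":
    source = make_source()
    staging = {
        "full": full_extraction(source),
        "notified": partial_with_notification(source, [2]),
        "watermark": partial_without_notification(source, 200),
    }
    for method, rows in staging.items():
        print(method, rows)
```

Here the `staging` dictionary plays the role of the staging location. Full extraction is the simplest to implement but moves the most data, which is why the partial methods are preferred when the source can identify its changed records.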
Step 2: Transform
ETL Tools
• Integrate.io
• Talend
• MarkLogic
• Oracle Data Integrator
• Amazon Redshift
• AWS Glue
• Matillion
• Azure Data Factory
• FlyData
ETL Challenges