Data Wrangling Python - Suwarti
DATA SCIENCE
#MulaiBelajarData
Mini Bootcamp Data Science
Speaker Profile
SUWARTI, M.Si
Education:
Master of Actuarial Mathematics
Institut Teknologi Bandung (ITB)
Occupation:
Data Scientist at Astra Graphia Information Technology (AGIT)
Speaker Contact
LinkedIn: https://www.linkedin.com/in/suwarti/
Learning Objectives
In this course you will learn to understand:
Data cleansing mechanisms
Missing value checking and handling concepts
Anomaly and outlier detection concepts
Data type checking and correction mechanisms
Data Cleansing
Missing Values Checking and Handling
Duplicates Checking
Anomaly and Outlier Detection
Data Type Checking
Data Type Correction
Feature Extraction
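Several of the steps listed above (duplicates checking, data type checking and correction, feature extraction) can be sketched with pandas. This is a minimal illustration, not the course's own notebook: the DataFrame, its column names (`order_id`, `amount`, `order_date`) and its values are hypothetical, and coercing unparseable entries to NaN is only one possible choice.

```python
import pandas as pd

# Hypothetical toy data: one repeated row, with numbers and dates stored as text.
df = pd.DataFrame({
    "order_id": ["A1", "A2", "A2", "A3"],
    "amount": ["100", "250", "250", "n/a"],
    "order_date": ["2023-01-05", "2023-02-10", "2023-02-10", "2023-03-15"],
})

# Duplicates checking: count fully repeated rows, then drop them.
n_duplicates = int(df.duplicated().sum())
df = df.drop_duplicates().reset_index(drop=True)

# Data type checking: every column is still stored as text at this point.
print(df.dtypes)

# Data type correction: entries that cannot be parsed (such as "n/a") become NaN.
df["amount"] = pd.to_numeric(df["amount"], errors="coerce")
df["order_date"] = pd.to_datetime(df["order_date"], format="%Y-%m-%d")

# Feature extraction: derive the month from the corrected date column.
df["order_month"] = df["order_date"].dt.month
print(df)
```

Coercing with `errors="coerce"` turns bad entries into missing values, so this step is usually followed by a missing values check.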
Missing values
Why do missing values exist?
Values are missed during the data acquisition process
Values are deleted accidentally
The data is corrupt
Row and column positions are mismatched
The real value is not available
If we fill in missing values with the wrong data, we add bias.
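Checking and handling missing values can be sketched with pandas as follows. The data and column names (`age`, `city`) are hypothetical, and filling with the median or the mode is only an assumed strategy for illustration. Any imputation can add bias, so the choice should follow from why the values are missing.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data with one missing value in each column.
df = pd.DataFrame({
    "age": [25.0, np.nan, 31.0, 40.0],
    "city": ["Jakarta", "Bandung", None, "Jakarta"],
})

# Checking: count the missing entries in each column.
missing_counts = df.isna().sum()
print(missing_counts)

# Handling, option 1: drop every row that has at least one missing value.
dropped = df.dropna()

# Handling, option 2: impute. Median for the numeric column,
# most frequent value (mode) for the categorical column.
filled = df.copy()
filled["age"] = filled["age"].fillna(filled["age"].median())
filled["city"] = filled["city"].fillna(filled["city"].mode()[0])
print(filled)
```

Dropping rows keeps only observed values but shrinks the dataset. Imputing keeps every row but replaces unknown values with guesses.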
Anomalies
Outliers
An outlier is a data point that differs significantly from other
observations. Outliers are not necessarily a form of error.
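One common way to flag such points is the interquartile range (IQR) rule. The slides do not name a detection method, so the rule, the 1.5 multiplier and the sample values below are assumptions chosen for illustration.

```python
import pandas as pd

# Hypothetical measurements; the last value sits far from the rest.
values = pd.Series([10, 12, 11, 13, 12, 11, 95])

# Interquartile range: the spread of the middle 50% of the data.
q1 = values.quantile(0.25)
q3 = values.quantile(0.75)
iqr = q3 - q1

# Assumed convention: flag points more than 1.5 * IQR outside the quartiles.
lower = q1 - 1.5 * iqr
upper = q3 + 1.5 * iqr
is_outlier = (values < lower) | (values > upper)

outliers = values[is_outlier]
print(outliers)
```

A flagged point is a candidate for review, not an automatic deletion, since a genuine observation can also be extreme.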
THANK YOU