Data - Science - Methodology - and - Use - Case
Data - Science - Methodology - and - Use - Case
Internal
LEARNING GOALS
#RintisKarirImpian
Internal
CRISP-DM
• Breaks down the life cycle of a data mining project into 6 phases
• DM Vendors - SPSS, NCR, IBM, SAS, SGI, Data Distilleries, Syllogic, Magnify, ..
• System Suppliers / consultants - Cap Gemini, ICL Retail, Deloitte & Touche, …
#RintisKarirImpian
Internal
WHY CRISP – DM ?
• Guidelines
• Experience documentation
#RintisKarirImpian
Internal
CRISP–DM : 6 PHASES
Phases … Explanation …
#RintisKarirImpian
Internal
CRISP-DM : 6 PHASES AND TASKS
Business Data
Data
Understandin Understandin Modeling Evaluation Deployment
Preparation
g g
Determine Select
Collect Evaluate Plan
Business Select Data Modeling
Initial Data Results Deployment
Objectives Technique
Plan
Monitoring
Assess Describe Generate Review
Clean Data &
Situation Data Test Design Process
Maintenanc
e
Determine
Explore Construct Determine Produce
Data Mining Build Model
Data Data Next Steps Final Report
Goals
Format
Data
#RintisKarirImpian
Internal
BUSINESS UNDERSTANDING
Task … Description …
#RintisKarirImpian
Internal
BUSINESS UNDERSTANDING
• Difficult
non-experts
• No, performance will not be 100%
model
#RintisKarirImpian
• For tomorrow, it is impossible
Internal
BUSINESS UNDERSTANDING – PROJECT PLAN
Data
Understanding
Data
Preparation
Modelin
g
Evaluatio
n
Deployme
nt
10/15/202
0
#RintisKarirImpian
Internal
BUSINESS UNDERSTANDING – DO AND DON’TS
Task … Description …
• Initial data collection report
Collect initial data • Acquire the data (of access to the data) listed
in the project resources.
#RintisKarirImpian
Internal
DATA UNDERSTANDING – SPOT OUTLIER
#RintisKarirImpian
Internal
DATA UNDERSTANDING – DATA EXPLORATION
#RintisKarirImpian
Internal
DATA UNDERSTANDING – DO AND DON’TS
#RintisKarirImpian
Internal
DATA PREPARATION
Task … Description …
• Reconsider data selection criteria and
decide which dataset will be used
Select data
• Explain why certain data was included or
excluded
• Derived attributes.
Construct data • How can missing attributes be
constructed or imputed?
• Rearranging attributes
Format Data • Reordering records
• Reformatted within-value
#RintisKarirImpian
Internal
DATA PREPARATION – REMOVE OUTLIER
Data understanding and preparation will usually consume half of more of your
project time !
#RintisKarirImpian
Internal
DATA PREPARATION – DO AND DON’TS
#RintisKarirImpian
Internal
MODELING
Task … Description …
Generate test • Describe the intended plan for training, testing, and
design evaluating the models.
#RintisKarirImpian
Internal
MODELLING – TOOLING SELECTION
•
#RintisKarirImpian
Can I afford a prototype ?
Internal
MODELLING – DO AND DON’TS
•When possible, peak inside your model and consult it with domain expert
• assess feature importance
Task … Description …
#RintisKarirImpian
Internal
EVALUATION – PERFORMANCE OF MODEL
#RintisKarirImpian
Internal
EVALUATION – DO AND DON’TS
Task … Description …
#RintisKarirImpian
Internal
TOP 5 BUSINESS CASE IN MARKETING
Business Case What is …
Product • A filtering system that seeks to predict and show that items that
Recommendation
a user would like to purchase
• Help you learn more about who your customers are and how
Sentiment Analysis
you can better engage with them.
#RintisKarirImpian
Internal
CUSTOMER SEGMENTATION
• Segmentation based on :
production.
#RintisKarirImpian
Internal
PRODUCT RECOMMENDATION
• With the increasing demand for more personalized playlists, Spotify launched
• According to data, Discover Weekly listeners stream music on Spotify more than
#RintisKarirImpian
Internal
MARKET BASKET ANALYSIS
#RintisKarirImpian
Internal
SENTIMENT ANALYTICS
#RintisKarirImpian
Internal