Basic concepts of linear algebra
Various Distributions
[Figure: Normal and Uniform distributions]
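The two distributions named above can be compared by drawing samples from each. The following is a minimal sketch using only the Python standard library; the function names, the sample size, the seed, and the parameter choices (standard normal, uniform on [0, 1]) are illustrative assumptions and do not come from the slides.

```python
import random
import statistics


def sample_normal(n, mu=0.0, sigma=1.0, seed=0):
    """Draw n values from a normal distribution with mean mu and std sigma."""
    rng = random.Random(seed)  # fixed seed so the run is reproducible
    return [rng.gauss(mu, sigma) for _ in range(n)]


def sample_uniform(n, low=0.0, high=1.0, seed=0):
    """Draw n values from a uniform distribution on [low, high]."""
    rng = random.Random(seed)
    return [rng.uniform(low, high) for _ in range(n)]


normal_draws = sample_normal(10_000)
uniform_draws = sample_uniform(10_000)

# Sample mean and standard deviation of each set of draws.
print("normal :", statistics.mean(normal_draws), statistics.stdev(normal_draws))
print("uniform:", statistics.mean(uniform_draws), statistics.stdev(uniform_draws))
```

With these parameters the sample statistics should land close to the theoretical values: mean 0 and standard deviation 1 for the normal draws, mean 0.5 and standard deviation sqrt(1/12) ≈ 0.289 for the uniform draws.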
Analytics Methodology CRISP-DM: Phases and Tasks
• Business Understanding: Determine Business Objectives; Assess Situation; Determine Data Mining Goals; Produce Project Plan
• Data Understanding: Collect Initial Data; Describe Data; Explore Data; Verify Data Quality
• Data Preparation: Select Data; Clean Data; Construct Data; Integrate Data; Format Data
• Modeling: Select Modeling Technique; Generate Test Design; Build Model; Assess Model
• Evaluation: Evaluate Results; Review Process; Determine Next Steps
• Deployment: Plan Deployment; Plan Monitoring & Maintenance; Produce Final Report; Review Project
CRISP-DM: Cross-Industry Standard Process for Data Mining
Type of Data/Variables
• Numeric data
  • Continuous (measurements)
    • Interval (difference between two values is meaningful)
    • Ratio (interval + clear definition of 0.0)
  • Discrete (counts)
• Categorical
  • Nominal
  • Ordinal (Likert scale)
  • Dichotomous
• Independent Variables (experimental or predictor)
• Dependent (outcome)

Operation                 Nominal   Ordinal   Interval   Ratio
Frequency distribution    Yes       Yes       Yes        Yes
Median and percentiles    No        Yes       Yes        Yes
Add or subtract           No        No        Yes        Yes
Mean, std                 No        No        Yes        Yes
Ratio                     No        No        No         Yes
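The Yes/No grid for the four measurement scales follows one rule: each operation becomes meaningful at some scale and stays meaningful at every higher one. A minimal sketch of that rule is below; the names `SCALES`, `MIN_SCALE`, and `is_permitted` are hypothetical and chosen only for illustration.

```python
# Measurement scales ordered from least to most informative.
SCALES = ("nominal", "ordinal", "interval", "ratio")

# Lowest scale at which each operation is meaningful,
# transcribed from the slide's Yes/No grid.
MIN_SCALE = {
    "frequency distribution": "nominal",
    "median and percentiles": "ordinal",
    "add or subtract": "interval",
    "mean, std": "interval",
    "ratio": "ratio",
}


def is_permitted(operation, scale):
    """Return True if `operation` is meaningful for data measured on `scale`."""
    return SCALES.index(scale) >= SCALES.index(MIN_SCALE[operation])


# Rebuild the grid row by row.
for operation in MIN_SCALE:
    row = ["Yes" if is_permitted(operation, s) else "No" for s in SCALES]
    print(f"{operation:<24}", *row)
```

For example, `is_permitted("mean, std", "ordinal")` is `False`: under this scheme a Likert-scale item supports a median but not a mean.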