BA_CH01
BA_CH01
1-2
2
Why Data Analytics
1-3
¨ Interdisciplinary
¤ Statistics + Computer Science + Information Systems
1-6
Three types of Analytics
1-7
1-8
Descriptive Analytics
Descriptive Analytics
1-10
Predictive Analytics
1-11
Prescriptive Analytics
1-12
Data
Sample Data
1-14
Types of Data
¨ Cross-sectional data
¤ Collected by recording a characteristic of many subjects at the same
point in time
¤ Recording a characteristic of many subjects at the same point in time
¨ Time series data
¤ Collected over several time periods focusing on certain groups of
people, specific events, or objects
¤ Hourly, daily, weekly, monthly, quarterly, or annual observations
1-15
Cross-Sectional Data
1-16
Time Series Data
1-17
Structured Data
¨ Structured data
¤ Reside in a pre-defined, row-column format
¤ Spreadsheet or database applications
¨ Unstructured data
¤ Do not conform to a pre-defined, row-column format
¤ Textual
¤ Multimedia content
¤ Do not conform to database structures
¨ Human- or machine-generated
¤ Structured human: price, income, retail sales
¤ Structured machine: sensors, speed cameras, web server logs
¤ Unstructured human: email, text, social media, presentations
¤ Unstructured machine: satellite images, video data, camera images
1-19
Big Data
1-20
Vs of Big Data
1-23
3. Interval
¤ Numerical
¤ Categorize and rank, differences are meaningful
¤ Zero value is arbitrary and does not reflect absence of characteristic
¤ Ratios are not meaningful
¤ Example: temperature
4. Ratio (numerical)
¤ Numerical
¤ Most sophisticated
¤ A true zero point, reflects absence of characteristic
¤ Ratios are meaningful
¤ Example: profits
1-24
Types of Variable - Example
¨ Music: nominal
¨ Food quality: ordinal
¨ Closing time: interval
¨ Own money spent: ratio
1-25
Data Sources
¨ 90% of the data in the world today was created in the last
two years.
¨ Data sources for this book mostly come from Google.
¤ Bureau of Economic Analysis
¤ Bureau of Labor Statistics
¤ Federal Research Economic Data
¤ U.S. Census Bureau
¤ National Climatic Data Center
¤ Yahoo Finance
¤ Zillow
1-26
Data File Formats
1-27