CH01

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 54

Statistics for Business and Economics

Ninth Edition, Global Edition

Chapter 1
Describing Data:
Graphical

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 1


Chapter Goals (1 of 3)
After completing this chapter, you should be able to:
• Explain how decisions are often based on limited/sample
information
• Explain key definitions:
– Population vs. Sample
– Parameter vs. Statistic
– Sampling vs. Nonsampling Error
– Descriptive vs. Inferential Statistics
• Describe random sampling & systematic sampling
• Explain the difference between Descriptive and Inferential
statistics

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 2


Chapter Goals (2 of 3)
After completing this chapter, you should be able to:
• Identify types of data and levels of measurement
• Create and interpret graphs to describe categorical
variables:
– frequency distribution, bar chart, pie chart, Pareto diagram
• Create a line chart to describe time-series data
• Create and interpret graphs to describe numerical
variables:
– frequency distribution, histogram, ogive

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 3


Chapter Goals (3 of 3)
After completing this chapter, you should be able to:

• Construct and interpret graphs to describe relationships


between variables:
– Scatter plot, cross table
• Describe appropriate and inappropriate ways to display
data graphically

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 4


Section 1.1 Decision Making in an
Uncertain Environment (1 of 2)
Everyday decisions are based on limited/sample
information
Examples:
• Will the job market be strong when I graduate?
• Will the price of Yahoo stock be higher in six months than it
is now?
• Will profit rates remain low for the rest of the year if the
federal budget deficit is as high as predicted?

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 5


Section 1.1 Decision Making in an
Uncertain Environment (2 of 2)
Data are used to assist decision making
• Statistics is a tool to help process, summarize, analyze,
and interpret data.
• Statistics is a subject which deals with data in order to
make a good/correct decision.

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 6


Key Definitions
• A population is the collection of all items of interest or
under investigation
– N represents the population size

• A sample is an observed subset of the population


– n represents the sample size

• A parameter is a specific characteristic of a population


• A statistic is a specific characteristic of a sample

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 7


Population vs. Sample
Population Sample

Values calculated using Values computed from


population data are sample data are called
called parameters statistics

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 8


Examples of Populations
• Names of all registered voters in the United States
• Incomes of all families living in Daytona Beach
• Annual returns of all stocks traded on the New
York Stock Exchange
• Grade point averages of all the students in your
university

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 9


Random Sampling
Simple random sampling is a procedure in which
• each member of the population is chosen strictly by
chance,
• each member of the population is equally likely to be
chosen,
• every possible sample of n objects is equally likely to be
chosen
The resulting sample is called a random sample

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 10


Descriptive and Inferential Statistics
Two branches of statistics:
• Descriptive statistics
– Graphical, tabular and numerical procedures to
summarize and process data

• Inferential statistics
– Using data to make predictions, forecasts, and
estimates to assist decision making

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 11


Descriptive Statistics
• Collect data
– e.g., Survey

• Present data
– e.g., Tables and graphs

• Summarize data
– e.g., Sample mean   X i

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 12


Inferential Statistics
• Estimation
– e.g., Estimate the population
mean weight using the
sample mean weight
• Hypothesis testing
– e.g., Test the claim that the
population mean weight is
140 pounds

Inference is the process of drawing conclusions or


making decisions about a population based on sample
results
Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 13
Section 1.2 Classification of Variables

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 14


Measurement Levels

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 15


Section 1.3-1.5 Graphical
Presentation of Data (1 of 2)
• Data in raw form are usually not easy to use for
decision making. Some type of organization is
needed
– Table
– Graph & Numerical
• The type of graph to use depends on the variable
being summarized

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 16


Section 1.3-1.5 Graphical
Presentation of Data (2 of 2)
• Techniques reviewed in this chapter:

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 17


Section 1.3 Tables and Graphs for
Categorical Variables

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 18


The Frequency Distribution Table
Summarize data by category
Example: Hospital Patients by Unit

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 19


Graph of Frequency Distribution
• Bar chart of patient data

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 20


Cross Tables
• Cross Tables (or contingency tables) list the
number of observations for every combination of
values for two categorical or ordinal variables
• If there are r categories for the first variable (rows)
and c categories for the second variable (columns),
the table is called an r  c cross table

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 21


Cross Table Example
• 3  3 Cross Table for Investment Choices by Investor
(values in $1000’s)

Investment Investor A Investor B Investor C Total


Category
Stocks 46 55 27 128
Bonds 32 44 19 95
Cash 15 20 33 68
Total 93 119 79 291

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 22


Graphing Multivariate Categorical
Data (1 of 2)
• Side by side horizontal bar chart

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 23


Graphing Multivariate Categorical
Data (2 of 2)
• Stacked bar chart

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 24


Vertical Side-by-Side Chart Example
• Sales by quarter for three sales territories:

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 25


Bar and Pie Charts
• Bar charts and Pie charts are often used for
qualitative (categorical) data
• Height of bar or size of pie slice shows the
frequency or percentage for each category

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 26


Bar Chart Example

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 27


Pie Chart Example

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 28


Pareto Diagram
• Used to portray categorical data
• A bar chart, where categories are shown in
descending order of frequency
• A cumulative polygon is often shown in the same
graph
• Used to separate the “vital few” from the “trivial
many”

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 29


Pareto Diagram Example (1 of 3)
Example: 400 defective items are examined for
cause of defect:
Source of
Manufacturing Error Number of defects
Bad Weld 34
Poor Alignment 223
Missing Part 25
Paint Flaw 78
Electrical Short 19
Cracked case 21
Total 400

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 30


Pareto Diagram Example (2 of 3)
Step 1: Sort by defect cause, in descending order
Step 2: Determine % in each category
Source of
Manufacturing Error Number of defects % of Total Defects
Poor Alignment 223 55.75
Paint Flaw 78 19.50
Bad Weld 34 8.50
Missing Part 25 6.25
Cracked case 21 5.25
Electrical Short 19 4.75
Total 400 100%

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 31


Pareto Diagram Example (3 of 3)
Step 3: Show results graphically

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 32


Section 1.4 Graphs to Describe Time-
Series Data
• A line chart (time-series plot) is used to show the
values of a variable over time
• Time is measured on the horizontal axis
• The variable of interest is measured on the
vertical axis

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 33


Line Chart Example

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 34


Section 1.5 Graphs to Describe
Numerical Variables

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 35


Frequency Distributions
What is a Frequency Distribution?
• A frequency distribution is a list or a table…
• containing class groupings (categories or ranges
within which the data fall)...
• and the corresponding frequencies with which
data fall within each class or category

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 36


Why Use Frequency Distributions?
• A frequency distribution is a way to summarize
data
• The distribution condenses the raw data into a
more useful form...
• and allows for a quick visual interpretation of the
data

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 37


Class Intervals and Class Boundaries
• Each class grouping has the same width
• Determine the width of each interval by
largest number  smallest number
w  interval width 
number of desired intervals
• Use at least 5 but no more than 15-20 intervals
• Intervals never overlap
• Round up the interval width to get desirable
interval endpoints

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 38


Frequency Distribution Example (1 of 3)
Example: A manufacturer of insulation randomly
selects 20 winter days and records the daily high
temperature
data:

24, 35, 17, 21, 24, 37, 26, 46, 58, 30,
32, 13, 12, 38, 41, 43, 44, 27, 53, 27

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 39


Frequency Distribution Example (2 of 3)
• Sort raw data in ascending order:
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58
• Find range: 58  12 = 46
• Select number of classes: 5 (usually between 5 and 15)
• Compute interval width: 10  46 then round up 
 5 
• Determine interval boundaries: 10 but less than 20, 20 but
less than 30, , 60 but less than 70

• Count observations & assign to classes

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 40


Frequency Distribution Example (3 of 3)
Data in ordered array:
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58

Interval Frequency Relative Percentage


Frequency
10 but less than 20 3 .15 15

20 but less than 30 6 .30 30

30 but less than 40 5 .25 25

40 but less than 50 4 .20 20

50 but less than 60 2 .10 10

Total 20 1.00 100

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 41


Histogram
• A graph of the data in a frequency distribution is
called a histogram
• The interval endpoints are shown on the
horizontal axis
• the vertical axis is either frequency, relative
frequency, or percentage
• Bars of the appropriate heights are used to
represent the number of observations within each
class
Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 42
Histogram Example

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 43


Histograms in Excel (1 of 2)

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 44


Histograms in Excel (2 of 2)

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 45


Questions for Grouping Data into
Intervals
• How wide should each interval be?
(How many classes should be used?)
• How should the endpoints of the intervals be
determined?
– Often answered by trial and error, subject to user
judgment
– The goal is to create a distribution that is neither too
"jagged" nor too "blocky”
– Goal is to appropriately show the pattern of variation in
the data

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 46


How Many Class Intervals?
• Many (Narrow class intervals) 3.5

– may yield a very jagged distribution 3


2.5

with gaps from empty classes

Frequency
2
1.5
– Can give a poor indication of how 1
0.5
frequency varies across classes 0

4
8
12
16
20
24
28
32
36
40
44
48
52
56
60
More
Temperature

• Few (Wide class intervals)


– may compress variation too much
and yield a blocky distribution
– can obscure important patterns of
variation.

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 47


The Cumulative Frequency
Distribution
Data in ordered array:
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58

Cumulative Cumulative
Class Frequency Percentage
Frequency Percentage
10 but less than 20 3 15 3 15

20 but less than 30 6 30 9 45

30 but less than 40 5 25 14 70

40 but less than 50 4 20 18 90

50 but less than 60 2 10 20 100

Total 20 100 blank blank

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 48


The Ogive Graphing Cumulative
Frequencies

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 49


Scatter Diagrams
• Scatter Diagrams are used for paired
observations taken from two numerical
variables
• The Scatter Diagram:
– one variable is measured on the vertical axis
and the other variable is measured on the
horizontal axis

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 50


Scatter Diagram Example

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 51


Scatter Diagrams in Excel

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 52


Chapter Summary (1 of 2)
• Reviewed incomplete information in decision
making
• Introduced key definitions:
– Population vs. Sample
– Parameter vs. Statistic
– Descriptive vs. Inferential statistics
• Described random sampling
• Examined the decision making process

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 53


Chapter Summary (2 of 2)
• Reviewed types of data and measurement levels
• Data in raw form are usually not easy to use for decision
making -- Some type of organization is needed:
– Table
– Graph
• Techniques reviewed in this chapter:
– Frequency distribution – Line chart
– Cross tables – Frequency distribution
– Bar chart – Histogram and ogive
– Pie chart – Stem-and-leaf display
– Pareto diagram – Scatter plot

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. Slide - 54

You might also like