Module 20. Data Collection
Module 20. Data Collection
National Guard
Black Belt Training
Module 20
Data Collection
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
TOOLS
•Process Mapping
ACTIVITIES
• Map Current Process / Go & See •Process Cycle Efficiency/TOC
• Identify Key Input, Process, Output Metrics •Little’s Law
• Develop Operational Definitions •Operational Definitions
• Develop Data Collection Plan •Data Collection Plan
• Validate Measurement System •Statistical Sampling
• Collect Baseline Data •Measurement System Analysis
• Identify Performance Gaps •TPM
• Estimate Financial/Operational Benefits •Generic Pull
• Determine Process Stability/Capability •Setup Reduction
• Complete Measure Tollgate •Control Charts
•Histograms
•Constraint Identification
•Process Capability
Note: Activities and tools vary by project. Lists provided here are not necessarily all-inclusive. UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
Learning Objectives
Determine what to measure and why
Prepare plans to collect output, process and/or input
data
Apply sampling techniques, as needed
Construct forms and test data collection procedures
Refine data collection
Implement data collection plan
UNCLASSIFIED / FOUO 3
UNCLASSIFIED / FOUO
What Is a Measure?
A quantified evaluation of characteristics
and/or level of performance based on
observable data
Examples include:
Length of time (speed, age)
Size (length, height, weight)
Dollars (costs, sales revenue, profits)
Counts of characteristics or “attributes” (types of
customer, property size, gender)
Counts of defects (number of errors, late checkouts,
complaints)
UNCLASSIFIED / FOUO 4
UNCLASSIFIED / FOUO
Why Measure?
UNCLASSIFIED / FOUO 5
UNCLASSIFIED / FOUO
National Guard
Black Belt Training
Determine
What to Measure
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 7
UNCLASSIFIED / FOUO
Process
Input Output
UNCLASSIFIED / FOUO 8
UNCLASSIFIED / FOUO
Loan Process Application Credit & Risk Review & Loan Service
Cycle Time = Data Entry + Collateral + Assessment +
Approval Time
+
Time
Time Check Time Time
Generally, you can influence some of the Xs but not all. CPI
projects will generally address those Xs which can be influenced
and which have the greatest impact on Y.
UNCLASSIFIED / FOUO 9
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 11
UNCLASSIFIED / FOUO
Output Measures
Referred to as “Y” data. Output Metrics quantify the
overall performance of the process, including:
How well customer needs and requirements were
met (typically Quality & Speed requirements), and
How well business needs and requirements were met
(typically Cost & Speed requirements)
Output measures provide the best overall barometer of
process performance
Focus on one Primary Output (Y) metric at a time. Use
Secondary Y metrics to “keep you honest”
Example: If the Primary Y is to improve cycle time, the Secondary Y could
monitor defects to make sure they also improve or at least don’t get worse!
UNCLASSIFIED / FOUO 12
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 13
UNCLASSIFIED / FOUO
X and Y Metrics
Suppliers Inputs Process Outputs Customers
• Billing Dept. staff Billing Process • Delivered
• Customer Invoice
database
• Shipping
information
• Order information
UNCLASSIFIED / FOUO 14
UNCLASSIFIED / FOUO
National Guard
Black Belt Training
Develop Data
Collection Plan
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
Types of Data
Continuous / Variable – Any variable measured on a continuum or
scale that can be infinitely divided into recognizable parts. Primary
types include time, dollars, size, weight, temperature, and speed. Any
metric that can be continuously divided by 2 and the metric still makes
sense is a continuous metric. Continuous Data is always
preferred over Discrete or Attribute Data.
Discrete / Attribute – A count, proportion, or percentage of a
characteristic or category. Service process data is often discrete.
Continuous/Variable Discrete/Attribute
• Cycle time • Late delivery
• Cost or price • Gender
• Length of call • Region/location
• Temperature of rooms • Room type
UNCLASSIFIED / FOUO 17
UNCLASSIFIED / FOUO
Developed
earlier 2 3 4 5 6 1
Examples: Examples:
Identification of Largest Contributors Pareto Chart
UNCLASSIFIED / FOUO 18
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 19
UNCLASSIFIED / FOUO
Stratification Factors
Factors Examples
What Complaints, Defects
When Month, Day
Where Region, City
Department,
Who
Individual
UNCLASSIFIED / FOUO 20
UNCLASSIFIED / FOUO
Stratification Matrix
Key Steps
Fill in the Output measure Y
Fill in the key stratification questions you have about the process in
relationship to the Y
List out all the levels and ways you can look at the data in order to
determine specific areas of concern
Discuss with the team whether or not these measurements will help to
predict the output Y, if not, think of where to apply the measures so
that they will help you to predict Y
UNCLASSIFIED / FOUO 21
UNCLASSIFIED / FOUO
Stratification Matrix
2 3 4
Questions About Process Stratification factors Measurements
X Variables Does data exist
to support
these
measurements
?
(Y/N)
5
Will these
measurements
(Output Y) help to predict
Y? (Y/N)
1
6
UNCLASSIFIED / FOUO 22
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 23
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 24
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 25
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 26
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 27
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 29
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 30
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 31
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 32
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 33
UNCLASSIFIED / FOUO
items or occurrences
1
2 X X X X X X X
3 X X X X X
UNCLASSIFIED / FOUO 34
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 35
UNCLASSIFIED / FOUO
Awardee: __________________________________________________
Recommender’s division:
□ G-1 □ G-2 □ G-3 □ G-4 □ Other __________
Time begun; Time
Process step Defects found
completed
Approve
recommendation
Schedule presentation
UNCLASSIFIED / FOUO 36
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 37
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 38
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 39
UNCLASSIFIED / FOUO
Preparing Collectors
Be sure collectors:
Give input on the check sheet design
Understand operational definitions (!)
Understand how data will be tabulated
Helps them see the consequences of changing
Have been trained and allowed to practice
Have knowledge and are unbiased
UNCLASSIFIED / FOUO 41
UNCLASSIFIED / FOUO
Step 6. Sampling
Sampling is using a smaller group to represent the
whole population (the foundation of “statistics”)
Benefits:
Saves time and money
Allows for more meaningful data
Simplifies measurement over time
Can improve accuracy
UNCLASSIFIED / FOUO 42
UNCLASSIFIED / FOUO
Sampling Considerations
Time
Cost
Accuracy
UNCLASSIFIED / FOUO 43
UNCLASSIFIED / FOUO
Sampling Types
Population – Drawing from a fixed group with
definable boundaries. No time element.
Customers
Complaints
Items in Warehouse
UNCLASSIFIED / FOUO 44
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 45
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 46
UNCLASSIFIED / FOUO
Sampling Biases
Self-selection
Self-exclusion
Missing key representatives
Ignoring “non-conformances”
Grouping
UNCLASSIFIED / FOUO 47
UNCLASSIFIED / FOUO
Sampling Methods/Strategies
The big pitfall in sampling is “bias” – i.e., select a sample that does
NOT really represent the whole. The sampling plan needs to guard
against bias. Different methods of sampling have different advantages
and disadvantages in managing bias.
Judgment
As it sounds – selecting a sample based on someone’s knowledge of
the process, assuming that it will be “representative.” Judgment
guarantees a bias, and should be avoided.
Convenience
Also just like it sounds – sampling those items or at those times
when it is easier to gather the data. (For example, taking data
from people you know, or when you go for coffee.) This is another
common (but ill-advised) approach.
UNCLASSIFIED / FOUO 48
UNCLASSIFIED / FOUO
Sampling Strategies
Best Methods:
Random
Best approach for population situations. Use a random
number table or random function in Excel or other software,
or draw numbers from a hat.
Systematic
Most practical and unbiased in a process situation.
“Systematic” means that we select every nth unit, or take
samples at specific times of the day. The risk of bias comes
when the timing of the sample matches a pattern in the
process.
UNCLASSIFIED / FOUO 49
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 50
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 51
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 52
UNCLASSIFIED / FOUO
Stratification factors
If you elect to conduct a stratified sample, you
need to know the size of each subset or stratum
UNCLASSIFIED / FOUO 53
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 54
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 55
UNCLASSIFIED / FOUO
Sampling
Some Final Tips ...
When you want to ensure representation from different
groups or strata, prepare a separate sampling plan for
each group
Be sure to maintain the time order of your
samples/subgroups so you can see changes over time
Common sense is a useful tool in sampling
Help is available if you need it!
UNCLASSIFIED / FOUO 56
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 57
UNCLASSIFIED / FOUO
Step 5 Step 4
Speed
Cost
Required Deliverable
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
Include other unique terms that apply to your project that require clear operational definitions
for those collecting the data and for those interpreting the data.
Required
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
- Example -
2
Ability to update X – Tollgate In DEPMS By determining % of Name ASAP 40 None To determine
projects and template slides activity steps identified in consistency with
build tollgate that match POI “Introduction to _____” POI
reviews modules in POI that are
adequately addressed in
templates
3
Easy Access to X – Availability of In DEPMS By determining the Name ASAP 63 None To determine
LSS tools and LSS tools and percentage of tools, with availability of tools
references references their references, listed on and references
DMAIC Road Map slides that
can be found in PS
4
Easy Access to X – Steps In DEPMS By counting # steps Name ASAP 37 None To find VA, BNVA,
LSS tools and required to find required to find the tools NVA
references tools and and their references
references
Required Deliverable
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
Time = 30 Minutes
UNCLASSIFIED / FOUO 61
UNCLASSIFIED / FOUO
Takeaways
Know what to measure and why
Create a plan to collect output, process and/or input
data
Construct forms and test data collection procedures
using appropriate data sampling methods
Refine data collection
Collect the data
Analyze the data
UNCLASSIFIED / FOUO 62
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
National Guard
Black Belt Training
Appendix
Sample Size Calculations
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO
Risk
Variation Difference
UNCLASSIFIED / FOUO 66
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 67
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 68
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 70
UNCLASSIFIED / FOUO
n 289
n finite = = = 118.2 or 119 - minimum sample size required
n 289
1+ 1+
N 200
UNCLASSIFIED / FOUO 71
UNCLASSIFIED / FOUO
1.96 4
2
n= = 62 contracts
1
UNCLASSIFIED / FOUO 72
UNCLASSIFIED / FOUO
Another Lean Six Sigma team determines the minimum sample size
required for the proportion of DPW, Department of Public Works,
service contracts that require rework at the approval meeting. From
interviews, the team has concluded that approximately 25% of the
contracts contain errors and require rework. They wish to determine
the % requiring rework within 5%.
2
1.96
=
n .25(1 .25)
.05
n =(1536.64)(.1875) = 289 contracts
UNCLASSIFIED / FOUO 73
UNCLASSIFIED / FOUO
Exercise:
Sample Size
Objective:
Determine the appropriate sample size
Instructions:
Use the pizza delivery example. The pizza is scheduled for
the time the customer requests delivery.
The customer requirement is +/- 10 minutes from the
scheduled delivery time
Estimated s = 7.16 minutes and D = 2 minutes
Estimated number of defects is 30% ( P = 0.30; D =5%)
Determine the minimum sample size for both continuous
and discrete data
UNCLASSIFIED / FOUO 74
UNCLASSIFIED / FOUO
Exercise:
Sample Size Answers
Continuous
2 2 2
1.96s 1.96 * 7.16 14.03
n= = = = 49.24 50
D 2 2
Discrete
2 2
1.96 1.96
n= P(1 P) = 0.30(0.70) = 39.2 * 0.21 = 322.69 323
2
D 0.05
UNCLASSIFIED / FOUO 75
UNCLASSIFIED / FOUO
Exercise:
Sample Size
Objective:
Determine the appropriate sample size
Instructions:
Select one output indicator for your process
Determine the type of data (continuous / discrete)
Continuous - estimate “s” and D
Discrete - estimate D and P
Determine the minimum sample size required
UNCLASSIFIED / FOUO 76
UNCLASSIFIED / FOUO
Exercise:
Sample Size Formula
Objective:
Determine the appropriate sample size formula to use
Instructions:
At your tables determine the right formula (proportion/discrete or continuous)
to use and calculate the sample size for each situation
1.Estimate the average cycle time within 2 hours. The estimated standard
deviation is 8 hours. What is the minimum number to sample?
2.A team collected 100 observations to determine the proportion defective.
They found 20% to be defective. How accurately can they estimate the
proportion defective?
3.You have a customer survey with 2 categorical questions and 8 interval
statements. You estimate that at least one option of a categorical
question will be answered by approximately 50% of the respondents and
you want to be able to detect a difference within ± 5%. For the
continuous statements you want to be able to detect a difference of at
least ½ a point. The highest estimated standard deviation for any of the
statements is 1.2. You expect the response rate to be 25%. How many
surveys do you have to send out and how many completed surveys do
you need returned?
UNCLASSIFIED / FOUO 77
UNCLASSIFIED / FOUO
UNCLASSIFIED / FOUO 78