Exploring Displaying
Exploring Displaying
Exploring Displaying
DISPLAYING, AND
EXAMINING DATA
1
Types of Data Analysis
• Exploratory data analysis
• the data guide the choice of analysis--or a
revision of the planned analysis
• Confirmatory data analysis
• closer to classical statistical inference in its
use of significance and confidence
• may use information from a closely related
data set or by validating findings through the
gathering and analyzing of new data
2
Techniques to Display
and Examine Distributions
Frequency Table
Visual Displays
• Histograms
• Stem-and-leaf display
• Box-plot
Crosstabulation of Variables
3
Techniques to Display
and Examine Distributions
Histograms
4
Techniques to Display
and Examine Distributions (cont.)
5
Techniques to Display
and Examine Distributions (cont.)
Transformation
6
Improvement & Control Analysis
Statistical process control
• Uses statistical tools to analyze, monitor, and
improve process performance
• Total Quality Management
• Control chart
• Displays sequential measurements of a process
together with a center line and control limits
• Upper control limit
• Lower control limit
7
Types of Control Charts
Variables data
(ratio or interval measurements)
• X-bar
• R-charts
• s-charts
• Pareto Diagrams
• Bar chart whose percentages sum to 100 percent
8
Geographic Information Systems
Systems of hardware, software, and
procedures that capture, store,
manipulate, integrate, and display
spatially-referenced data
9
Geographic Information Systems
Minimum four components
• Integrating information from various sources
• Capturing data
• Projection and restructuring
• Modeling
10
Crosstabulation
A technique for comparing two
classification variables
–Cells
–Marginals
–Contingency tables
11
Percentaging Errors
Averaging percentages without weighting
Using too-large percentages (>100%)
12
Other Table-based Analysis
Automatic Interaction Detection (AID)
• Sequential partitioning procedure that uses a
dependent variable and set of predictors
• Searches among up to 300 variables for the
best single division of data into subsets
according to each predictor variable,
• Chooses one division approach
• Splits the sample using chi-square tests to
create multi-way splits.
13