1. The document discusses methods for summarizing and visualizing univariate and bivariate data, including measures of center and dispersion, dot plots, boxplots, and scatterplots. 2. It also covers topics in multivariate analysis such as correlation, regression models, vectors, matrices, the normal distribution, transformations of multivariate normal distributions, and assessing normality. 3. Standardizing data, computing distances from the center, and plotting these distances are described as a three-step approach for assessing normality in multivariate data.

Uploaded by

wj228368867

1) Common Univariate Summaries

I) Measures of the Centre i) Mean ii) Median iii) Mode

II) Measures of Dispersion i) Standard Deviation ii) Range iii) Interquartile Range
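As a small illustration of the summaries listed above, the following sketch computes each of them with Python's standard `statistics` module. The data values are made up for the example, and the quartile convention is the module's default, which may differ slightly from the hinges used in a boxplot.

```python
import statistics

# Made-up sample, for illustration only.
data = [2, 4, 4, 5, 7, 9, 11]

# Measures of the centre
mean = statistics.mean(data)
median = statistics.median(data)
mode = statistics.mode(data)

# Measures of dispersion
sd = statistics.stdev(data)          # sample standard deviation (divides by n - 1)
data_range = max(data) - min(data)
q1, _, q3 = statistics.quantiles(data, n=4)  # quartiles (default "exclusive" method)
iqr = q3 - q1

print("mean", mean, "median", median, "mode", mode)
print("sd", round(sd, 3), "range", data_range, "IQR", iqr)
```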
1.1) Common Univariate Displays i) Dot Plots ii) Boxplots
2) Bivariate Summaries
i) Usual univariate summaries ii) Plus correlation and regression equation
2.1) Bivariate Displays
i) Scatterplot
A scatterplot shows how two variables vary together. The strength and direction of their
linear relationship is summarized by their correlation.
Possibly including
Regression line, smoothed curve, or...?
A central point (point of averages or medians)
Some indication of the dispersion?
ii) Bivariate boxplot? (used to check assumptions of bivariate normality)
An extension of univariate boxplots indicating
i) A centre point (replacing the median)
ii) An elliptical region containing roughly the middle half of the data (replacing the hinges or box)
iii) An outer elliptical region containing almost all the data (replacing the fences or whiskers)
The boxplot is based on medians and quartiles.
The bivariate version is based on similarly robust estimates.

3) The regression model: $Y_i = \alpha + \beta x_i + \varepsilon_i$
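The correlation and the regression equation mentioned among the bivariate summaries can be sketched as follows. This is a minimal least-squares example on made-up data; the names `alpha_hat` and `beta_hat` are illustrative labels for the fitted intercept and slope, not notation from the notes.

```python
import math

# Made-up paired observations, for illustration only.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

# Sums of squares and cross-products about the means.
sxx = sum((xi - x_bar) ** 2 for xi in x)
syy = sum((yi - y_bar) ** 2 for yi in y)
sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))

r = sxy / math.sqrt(sxx * syy)        # sample correlation
beta_hat = sxy / sxx                  # least-squares slope
alpha_hat = y_bar - beta_hat * x_bar  # least-squares intercept

# Residuals estimate the error terms; they sum to zero for a least-squares fit.
residuals = [yi - (alpha_hat + beta_hat * xi) for xi, yi in zip(x, y)]

print("r =", round(r, 4))
print("fitted line: y =", round(alpha_hat, 3), "+", round(beta_hat, 3), "* x")
```

The fitted line passes through the point of averages (x_bar, y_bar), which is the central point one might mark on a scatterplot.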

4) Vector
An ordered list of numbers, e.g. i) Heights ii) Team statistics
Can be used to define a point in a graph, or an arrow from the origin to that point
(the usual geometric interpretation of a vector).

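5) The Dot Product
Write x and y in polar form, with lengths r and s and angles $\theta$ and $\phi$:
$x \cdot y = (x_1, x_2)^T \cdot (y_1, y_2)^T$
$= (r\cos\theta, r\sin\theta)^T \cdot (s\cos\phi, s\sin\phi)^T$
$= rs(\cos\theta\cos\phi + \sin\theta\sin\phi)$
$= \|x\| \|y\| \cos(\theta - \phi)$
Hence, $(x \cdot y) / (\|x\| \|y\|)$ = the cosine of the angle between x and y.
x and y are orthogonal to each other iff $x \cdot y = 0$.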
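A quick numerical check of the identity above may help. The sketch below builds two vectors from assumed lengths and angles (the specific values are arbitrary) and compares the componentwise dot product with the product of lengths times the cosine of the angle between them.

```python
import math

def dot(x, y):
    """Componentwise dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(x, y))

def norm(x):
    """Euclidean length of a vector."""
    return math.sqrt(dot(x, x))

def angle_between(x, y):
    """Angle in radians between two nonzero vectors."""
    cosine = dot(x, y) / (norm(x) * norm(y))
    return math.acos(max(-1.0, min(1.0, cosine)))  # clamp against rounding error

# Arbitrary lengths and angles, chosen only for illustration.
r, theta = 2.0, math.radians(60)
s, phi = 3.0, math.radians(15)
x = (r * math.cos(theta), r * math.sin(theta))
y = (s * math.cos(phi), s * math.sin(phi))

lhs = dot(x, y)
rhs = norm(x) * norm(y) * math.cos(theta - phi)
print(lhs, rhs, math.degrees(angle_between(x, y)))

# Orthogonal vectors have a zero dot product.
print(dot((1, 2), (-2, 1)))
```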

6) Orthogonal Matrices
i) Columns have unit length, and are orthogonal to each other.
ii) Corresponds to rotations and/or reflections of axes (or sets of vectors).
iii) $U^T U = I$ (Taking the transpose turns columns into rows. The entries of $U^T U$ are therefore dot
products of columns of U with each other. These are all 0 except for the entries on the diagonal, which are all 1.)
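The defining property can be checked numerically. The sketch below uses a rotation and a reflection as examples of orthogonal matrices and a shear as a counterexample; the helper names are illustrative.

```python
import math

def transpose(a):
    """Transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*a)]

def matmul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]

def is_orthogonal(u, tol=1e-12):
    """True if U^T U equals the identity matrix up to rounding error."""
    product = matmul(transpose(u), u)
    size = len(product)
    return all(abs(product[i][j] - (1.0 if i == j else 0.0)) <= tol
               for i in range(size) for j in range(size))

t = math.radians(30)
rotation = [[math.cos(t), -math.sin(t)],
            [math.sin(t), math.cos(t)]]
reflection = [[1.0, 0.0],
              [0.0, -1.0]]
shear = [[1.0, 1.0],
         [0.0, 1.0]]  # columns are not orthogonal to each other

print(is_orthogonal(rotation), is_orthogonal(reflection), is_orthogonal(shear))
```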

7) Identity Matrix
i) Has no impact on a vector ii) the matrix equivalent of the number 1

8) Transpose of a Product: $(AB)^T = B^T A^T$

9) Inverse Matrix: $M^{-1} M = M M^{-1} = I$ (though it may not always exist)
The inverse of an orthogonal matrix is its transpose.

10) Eigenvalues and Eigenvectors
If $Mv = cv$ for some scalar c and nonzero vector v, then:
i) c is called an eigenvalue of the matrix M and
ii) v is called an eigenvector of the matrix M.
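The definition can be tested directly: multiply the matrix by a candidate vector and see whether the result is a scalar multiple of that vector. The matrix and candidates below are chosen only for illustration.

```python
def matvec(m, v):
    """Matrix-vector product for a matrix stored as a list of rows."""
    return [sum(mij * vj for mij, vj in zip(row, v)) for row in m]

def is_eigenpair(m, c, v, tol=1e-12):
    """True if v is nonzero and M v equals c v up to rounding error."""
    if all(abs(vi) <= tol for vi in v):
        return False  # the zero vector is never an eigenvector
    return all(abs(a - c * b) <= tol for a, b in zip(matvec(m, v), v))

M = [[2.0, 1.0],
     [1.0, 2.0]]

print(is_eigenpair(M, 3.0, [1.0, 1.0]))   # M v = (3, 3) = 3 v
print(is_eigenpair(M, 1.0, [1.0, -1.0]))  # M v = (1, -1) = 1 v
print(is_eigenpair(M, 2.0, [1.0, 0.0]))   # M v = (2, 1), not a multiple of v
```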

11) The Bivariate Standard Normal Probability Density Function
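Each component is standard normal AND they are independent, so the joint density is the
product of the two univariate densities:
$f(z_1, z_2) = \left( \frac{e^{-z_1^2/2}}{\sqrt{2\pi}} \right) \left( \frac{e^{-z_2^2/2}}{\sqrt{2\pi}} \right) = \frac{1}{2\pi} e^{-(z_1^2 + z_2^2)/2}$
From this we derive the value of the normalization constant, $1/(2\pi)$.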

12) The General Bivariate Normal Distribution: Formula
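i) Depends on the means and standard deviations of the two variables and their
correlation, and nothing else.
Formula:
$f(x_1, x_2) = \frac{1}{2\pi \sigma_1 \sigma_2 \sqrt{1-\rho^2}} \exp\left\{ -\frac{1}{2(1-\rho^2)} \left[ \left(\frac{x_1-\mu_1}{\sigma_1}\right)^2 - 2\rho \left(\frac{x_1-\mu_1}{\sigma_1}\right)\left(\frac{x_2-\mu_2}{\sigma_2}\right) + \left(\frac{x_2-\mu_2}{\sigma_2}\right)^2 \right] \right\}$
ii) Note that the distribution is more concentrated as either of the standard deviations
becomes smaller, or as the correlation gets closer to 1 or -1.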

13) Matrix-Vector Formulation for Bivariate Normal Probability Density Function
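$f(x) = \frac{1}{2\pi |\Sigma_x|^{1/2}} \exp\left\{ -\frac{1}{2} (x - \mu_x)^T \Sigma_x^{-1} (x - \mu_x) \right\}$
where $\mu_x = E(x)$; $\Sigma_x$ = the variance-covariance matrix for x; $|\Sigma_x|$ = the determinant of the matrix $\Sigma_x$.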
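The scalar form of the bivariate normal density (written in terms of the two means, two standard deviations, and the correlation) and the matrix-vector form should agree at every point. The sketch below evaluates both standard formulas for an arbitrary choice of parameters and compares them; the function names and parameter values are illustrative.

```python
import math

def density_scalar(x1, x2, mu1, mu2, s1, s2, rho):
    """Bivariate normal density in terms of means, SDs, and correlation."""
    u = (x1 - mu1) / s1
    v = (x2 - mu2) / s2
    q = (u * u - 2.0 * rho * u * v + v * v) / (1.0 - rho * rho)
    return math.exp(-0.5 * q) / (2.0 * math.pi * s1 * s2 * math.sqrt(1.0 - rho * rho))

def density_matrix(x, mu, sigma):
    """Bivariate normal density in terms of the mean vector and covariance matrix."""
    (a, b), (c, d) = sigma
    det = a * d - b * c
    inv = [[d / det, -b / det], [-c / det, a / det]]  # inverse of a 2x2 matrix
    w = [x[0] - mu[0], x[1] - mu[1]]
    # Quadratic form w^T Sigma^{-1} w
    q = (w[0] * (inv[0][0] * w[0] + inv[0][1] * w[1])
         + w[1] * (inv[1][0] * w[0] + inv[1][1] * w[1]))
    return math.exp(-0.5 * q) / (2.0 * math.pi * math.sqrt(det))

# Arbitrary parameters, for illustration only.
mu1, mu2, s1, s2, rho = 1.0, 2.0, 2.0, 3.0, 0.4
sigma = [[s1 * s1, rho * s1 * s2],
         [rho * s1 * s2, s2 * s2]]

p_scalar = density_scalar(3.0, 1.0, mu1, mu2, s1, s2, rho)
p_matrix = density_matrix([3.0, 1.0], [mu1, mu2], sigma)
print(p_scalar, p_matrix)

# Standard case: zero means, unit SDs, zero correlation, evaluated at the origin.
p_standard = density_scalar(0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0)
print(p_standard, 1.0 / (2.0 * math.pi))
```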

14) Conditional Distributions
i) Slices correspond to conditional distributions.
ii) The conditional distribution of any multivariate normal random vector given the value
of any linear combination of its elements is again multivariate normal.
15) Marginal Distributions
The marginal distribution of any subset of the components of a multivariate normal random
vector is again multivariate normal.

16) Transformations
If x is multivariate normal, then so is Mx + b.
Expectation E{Mx + b} = M E{x} + b.
Variance-covariance matrix: $\Sigma_{Mx+b} = M \Sigma_x M^T$
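As a worked instance of these rules, the sketch below takes M to be the matrix that forms the sum and the difference of two components. The mean vector, covariance matrix, and shift b are made-up values.

```python
def transpose(a):
    """Transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*a)]

def matmul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]

def matvec(m, v):
    """Matrix-vector product."""
    return [sum(mij * vj for mij, vj in zip(row, v)) for row in m]

# Made-up mean vector and covariance matrix for x = (x1, x2).
mu_x = [1.0, 2.0]
sigma_x = [[4.0, 1.0],
           [1.0, 9.0]]

# y = M x + b, where the rows of M give x1 + x2 and x1 - x2.
M = [[1.0, 1.0],
     [1.0, -1.0]]
b = [10.0, 0.0]

mu_y = [m + bi for m, bi in zip(matvec(M, mu_x), b)]     # M E{x} + b
sigma_y = matmul(matmul(M, sigma_x), transpose(M))       # M Sigma_x M^T

print(mu_y)
print(sigma_y)
```

The diagonal of the result matches the familiar rules Var(x1 + x2) = 4 + 9 + 2(1) and Var(x1 - x2) = 4 + 9 - 2(1), and the shift b affects only the mean.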

17) Central Limit Theorem: Intuitive Interpretation
Random variables whose values are largely determined by a large number of roughly
independent and comparably influential factors will likely be close to normally
distributed.
Common examples where the theorem is typically relevant:
i) Sums or averages of observations that are generated from a distribution that is
not severely skewed.
ii) Linear measurements on individuals sampled from a homogeneous collection of
biological organisms.
iii) Linear measurements on output from a carefully controlled industrial process.
Common examples where the theorem is typically not relevant and the
distribution tends to be positively skewed:
i) Sizes of items that can grow or shrink (like cities) where larger items have a
competitive advantage.
ii) Numbers of parasites, pathogens, etc., on hosts (e.g., numbers of mountain pine
beetles caught in pheromone traps near Prince George, BC).
iii) Measurements of mass, volume, or area when the Central Limit Theorem applies
to linear measurements.
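A small simulation can make the first example concrete. The sketch below draws from an exponential distribution (positively skewed) and compares the sample skewness of the raw draws with that of averages of 30 draws each. The distribution, sample sizes, and seed are arbitrary choices for illustration.

```python
import random
import statistics

def skewness(values):
    """Sample skewness: the average cubed standardized deviation."""
    m = statistics.fmean(values)
    s = statistics.pstdev(values)
    return sum(((v - m) / s) ** 3 for v in values) / len(values)

rng = random.Random(2024)  # fixed seed so the run is repeatable

# Raw draws from a positively skewed distribution.
raw = [rng.expovariate(1.0) for _ in range(2000)]

# Averages of 30 independent draws each.
averages = [statistics.fmean([rng.expovariate(1.0) for _ in range(30)])
            for _ in range(2000)]

skew_raw = skewness(raw)
skew_avg = skewness(averages)
print("skewness of raw draws:", round(skew_raw, 2))
print("skewness of averages: ", round(skew_avg, 2))
```

The averages are much less skewed than the raw draws, which is the behaviour the intuitive statement of the theorem describes.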
Multivariate Central Limit Theorem: Intuitive Interpretation
Random vectors whose values are largely determined by a large number of roughly
independent and comparably influential factors will likely be close to normally
distributed.
Prototypical Examples of Normally Distributed Variables
i) Averages of values obtained by random sampling from a population distribution that
is not severely skewed.
ii) Measurement errors.
iii) Sizes of linear measurements (e.g., length or girth, but not area, volume or weight)
of, e.g., body parts in homogeneous groups of biological organisms, or parts produced by a
well-controlled industrial process.
Prototypical Examples of Normally Distributed Vectors
i) Averages of multivariate sets of values obtained by random sampling from a
population for which the distribution of each component value is not severely skewed.
ii) Multivariate measurement errors.
iii) Sizes of multivariate sets of linear measurements (e.g., length or girth, but not area,
volume or weight) of, e.g., body parts in homogeneous groups of biological organisms, or
parts produced by a well-controlled industrial process.

18) Typical Problems to Watch for
i) Component variables that tend to be skewed (like incomes, heights of mountains
within 200 km of Vancouver, and daily rainfall amounts in the fall semester at the top of
Burnaby Mountain).
ii) Component variables that are related nonlinearly, e.g., baby lengths and weights.

19) Distribution Shape May Vary for Different Components
Examples:
i) Samples of persons: IQ scores, height, income, net wealth, RRSP savings.
ii) Samples of viral strains: Number of deleted nucleotides in a key segment, virulence,
prevalence (proportion of infected individuals), intensity (average numbers in infected
individuals), abundance (average numbers in entire host population).

20) Some Components May Not Even Be Quantitative
1. Examples: i) samples of persons: Sex, Religion. ii) Samples of viral strains: Names of
particular deletions present.
2. Often deleted, e.g., in a principal component analysis.
3. Need special consideration in, e.g., cluster analysis.

21) Standardizing a Normal Random Vector
Univariate Case: $Z = (X - \mu)/\sigma$ or, more commonly, $Z_j = (X_j - \bar{X})/s$
Multivariate Case
i) Option 1: Standardize each component: $Z_i = (X_i - \mu_i)/\sigma_i$ or $Z_{ij} = (X_{ij} - \bar{x}_i)/s_i$

Standardizing a Normal Random Vector
i) Option 1 does not make z standard multivariate normal unless the components of x
are independent. ii) To make z standard multivariate normal, use something like the
following (after centring x): $z = x / \sqrt{\Sigma_x}$.

Quantile-Quantile Plots
Step 1 i) Order the values. ii) Raw data: $X_1, X_2, \ldots, X_n$ iii) Ordered values: $X_{(1)} < \cdots < X_{(n)}$
Step 2 i) Compare these ordered values, $X_{(1)} < \cdots < X_{(n)}$, to what you would "expect" to
observe if the data were generated by a normal distribution. ii) A simple way to do this,
starting with the sample median:
What might you expect the sample median to be close to in a random sample
from a normal distribution? The simplest guess is the median of the normal
distribution (which equals the mean).
What might you expect the lower quartile to be close to in a random sample
from a normal distribution? The simplest guess is the lower quartile of the normal
distribution.
e.g. with n = 99: the median is the ordered value with k = (1+99)/2 = (1+n)/2 = 50,
and the lower quartile is the ordered value with k = (1+99)/4 = (1+n)/4 = 25.
Continuing the pattern, $X_{(k)}$ is compared with the normal quantile at probability $k/(n+1)$.
Step 3 i) Plot the observed values vs. the expected values.
ii) If the data are close to normal, they should follow a straight line.
Step 4 i) You can add such a line to help you to assess the plot.
ii) The R function qqline(x) draws such a line.
iii) Sample applications in R are provided separately.
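The steps above can be sketched in Python as follows. The example assumes plotting positions k/(n+1), which is one common convention and is consistent with the n = 99 example (k = 50 maps to the median, k = 25 to the lower quartile). The function names are illustrative, and the sketch returns the plotting coordinates rather than drawing the plot.

```python
from statistics import NormalDist

def expected_normal_scores(n):
    """Standard normal quantiles at probabilities k/(n+1), k = 1..n."""
    std_normal = NormalDist()  # mean 0, standard deviation 1
    return [std_normal.inv_cdf(k / (n + 1)) for k in range(1, n + 1)]

def qq_points(values):
    """Pairs (expected score, ordered observation) for a normal Q-Q plot."""
    ordered = sorted(values)                         # Step 1: order the values
    expected = expected_normal_scores(len(ordered))  # Step 2: expected values
    return list(zip(expected, ordered))              # Step 3: points to plot

scores = expected_normal_scores(99)
print("k = 50:", scores[49])            # expected score for the sample median
print("k = 25:", round(scores[24], 4))  # expected score for the lower quartile

# Made-up observations, for illustration only.
points = qq_points([5.1, 4.7, 6.3, 5.5, 4.9, 5.8, 5.2])
for expected, observed in points:
    print(round(expected, 3), observed)
```

If the observations are roughly normal, the printed pairs fall near a straight line whose intercept and slope estimate the mean and standard deviation.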

Three-Step Solution for Assessing Normality
1. Standardize the data.
2. Measure the distance of each point from the centre. (This is just the length of
standardized vectors, Z)
3. Plot these distances in a way that highlights potential outliers.

Standardizing Multivariate Data:
Step 1 Subtract off the mean: $W = X - \mu_x$. Then $\mu_w = 0$, but $\Sigma_w = \Sigma_x$.
Step 2 Find some orthogonal matrix U that makes the components of $Y = UW$ independent.
Then $\mu_y = 0$ and $\Sigma_y = U \Sigma_w U^T = D$, with D diagonal.
Step 3 Rescale Y to $Z = D^{-1/2} Y$. Then $\Sigma_z = \Sigma_{D^{-1/2}Y} = D^{-1/2} D D^{-1/2} = I$.
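For the bivariate case, the three standardizing steps and the resulting distance from the centre can be sketched as below. This is one possible construction under stated assumptions: the rows of U are taken to be unit eigenvectors of the covariance matrix (found in closed form for a symmetric 2x2 matrix), the mean and covariance are treated as known, and all numbers are made up.

```python
import math

def eigen_sym2(sigma):
    """Orthogonal U (rows are unit eigenvectors) and eigenvalues of a symmetric 2x2 matrix."""
    (a, b), (_, d) = sigma
    centre = (a + d) / 2.0
    spread = math.hypot((a - d) / 2.0, b)
    theta = 0.5 * math.atan2(2.0 * b, a - d)  # direction of the largest eigenvalue
    u = [[math.cos(theta), math.sin(theta)],
         [-math.sin(theta), math.cos(theta)]]
    return u, (centre + spread, centre - spread)

def standardize(x, mu, sigma):
    """Return z = D^(-1/2) U (x - mu); assumes sigma is positive definite."""
    u, (d1, d2) = eigen_sym2(sigma)
    w = [x[0] - mu[0], x[1] - mu[1]]              # Step 1: subtract the mean
    y = [u[0][0] * w[0] + u[0][1] * w[1],         # Step 2: rotate so that the
         u[1][0] * w[0] + u[1][1] * w[1]]         #         components are uncorrelated
    return [y[0] / math.sqrt(d1), y[1] / math.sqrt(d2)]  # Step 3: rescale

# Made-up mean vector, covariance matrix, and observation.
mu = [1.0, 2.0]
sigma = [[4.0, 2.4],
         [2.4, 9.0]]
x = [3.0, 1.0]

z = standardize(x, mu, sigma)
distance = math.hypot(z[0], z[1])  # length of the standardized vector
print(z, distance)
```

The length of z is the distance of the point from the centre in standardized units, which is the quantity plotted in the last step of the three-step solution for assessing normality.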

Underlying Assumptions
1. Formal statistical inference: i) Observations generated independently ii) With the
same multivariate normal distribution.
2. Exploratory analyses: No formal requirements, but watch out for influential outliers.
