IL2-Describing Variation in Data

This document defines and describes various measurement variables and methods for presenting continuous data. It discusses variables that can be measured along a numerical continuum, such as height, weight, and blood pressure. It also covers topics like presenting continuous data through histograms and descriptive statistics, measures of central tendency like mean, median and mode, measures of spread such as range, quartiles and interquartile range, and the normal distribution. Graphic illustrations of concepts like box plots and distributions are also mentioned.

Uploaded by

Vanessa Hermione

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

43 views

IL2-Describing Variation in Data

Uploaded by

Vanessa Hermione

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Measurement variables

Describing (continuous variables)

Variation in Data
z Variables with an infinite number of values
A/P Koh Woon Puay that are equally spaced
MBBS, PhD z Can be measured along a numerical continuum
Email: ephkwp@nus.edu.sg
EPH office,, MD3,, Level 3 z Eg: height, weight, temperature, blood pressure
Tel: 6516 4975

Long ordinal data Presenting continuous data

z Ordinal data which are graded on a long scale, z Graphically: histogram
especially if numerically represented, may
sometimes be treated as continuous data z Descriptive:
z Eg: depression or anxiety on a scale of 1 to 10 z Summarize data with a single value
z But not true continuous data because (Measure central tendency)
z They have a finite number of distinct values
z There are gaps in the continuum z Measure absolute spread (dispersion)
z Spacing between categories is not numerically equivalent
(this may limit the interpretation of the results in analysis)

1
Distribution of age among diabetic Measures of central tendency
patients in the polyclinic
z Summarizes the set with a single value
z mean median,
mean, median and mode
z The mean is the average value of all the data
in the set.
z The median is the value that has exactly half
the data above it and half below itit.
z The mode is the value that occurs most
frequently in the set (rarely used)

Advantages and
Example Disadvantages
Systolic blood pressure
z Mean
130 145
130, 145, 150
150, 160
160, 165
z Widely used, easy to understand, measures
Mean: (130+145+150+160+165)/5 central location
Median: 150
z Overly sensitive to extreme values
130, 145, 150, 160, 165, 170

Mean: (130+145+150+160+165+170)/6
z Median
Median: (150+160)/2 z Insensitive to very large or very small values
z Determined by the middle points and less
sensitive to the actual numerical values of the
other data points

2
Mean = median A normal distribution
z Bell-curve or bell-shaped
histogram.
histogram

Mean > median

z Most of the values
accumulate around the
Median Mean middle. The mean,
median and mode are all
equal, and the scores at
Mean < median
either end of the
distribution occur less
Mean Median often

Skewness Measures of spread of continuous

or measurement data
z Skewed to right: if the
scores tend to cluster •Skewed to left: most of
toward the lower end of the scores tend to occur z Range
toward the upper end of the
the scale
scale while increasingly
Sex-partners fewer scores occur toward z Quartiles
the lower end.
z Variance and standard deviation

Sex-partners

3
Range Median and quartiles
z Range = difference between highest and z The median divides the data into two equal
lowest observed values sets (Q2).
z Greatly influenced by the presence of just z The lower quartile (Q1) is where 25% of the
one unusually large or small value (outlier). values are smaller than Q1 and 75% are
larger.
z Can be expressed as an interval such as 3-8,
or as an interval width, as a range of 5. z The upper quartile (Q3) is the value where
75% of the values are smaller than Q3 and
25% are larger

Example 1 – Upper and lower quartiles Interquartile Range

z Data:
z Interquartile range =difference between
z 8 49
8, 49, 51
51, 17
17, 45
45, 43
43, 9
9, 41
41, 45
45, 43
43, 38 upper quartile (Q3) and lower quartile (Q1)

z Ordered data z Interquartile range spans 50% of a data set,

and eliminates the influence of outliers
z 8, 9, 17, 38, 41, 43, 43, 45, 45, 49, 51
z Lower quartile: 17
z Median: 43;
z Upper quartile: 45;

4
Graphic illustrations Percentile rank
z Box-plots z Divide all values into 100 parts (percentile)
z Error-bars
z The proportion of values in a distribution that
Upper quartile a specific score is greater than or equal to.
z Eg. if you received a score of 75 on a math
Lower quartile
test and this score was greater than or equal
t the
to th scores off 85% off the
th students
t d t taking
t ki
the test, then your percentile rank would be
85 (85th percentile)

Advantages and disadvantages Variance

z Variance combines all the values in a data set to
z Range produce a measure of spread.
z Very easy to compute
z Very sensitive to extreme observations z The variance (symbolized by s2) is the sum of the
z Poor indication of distribution of points in between squared deviations from the mean, divided by the
number of observations minus 1 (degree of
z Quartiles freedom))
z Less sensitive to outliers
z Some of the observations are not used

n-1

5
Standard Deviation
Degree of freedom z Standard deviation (s) = square root of the variance
(give back the original scale)
zThe number of variables whose values can z Properties
p of standard deviation
be altered without affecting the mean, once it z measure spread or dispersion around the mean of a
is known. data set.
z never negative.
z sensitive to outliers.
zEg. 80, 85, 90, 105, X z for data with approximately the same mean, the
If mean is 95
95, X=115
X 115. Hence only 4 out of 5 values greater the spread,
g p , the greater
g the standard deviation.
can be changed to get back mean = 95.

n-1

More about a normal

Normal distribution distribution
z Many kinds of physiological z If the mean and standard deviation of a normal
data are approximated well by distribution are known, it is relatively easy to figure
the normal distribution.
out the percentile rank.
z Many statistical tests assume
a normal distribution.
z In a normal distribution, about 68% of the scores
z Most of these tests work well
even if the distribution is only are within one standard deviation of the mean,
approximately normal and in about 95% of the scores are within two standard
many cases as long as it does deviations, and about 99% of the scores are
not deviate greatly from within three standard deviations
normality.

6
zEg. 47,000 babies born in a

hospital
z 1,000 babies sampled, 1,000
weights obtained
z M
Mean = 3.25
3 25 kkg, Counts
SD=0.3 kg
95% of all the 1,000 babies
lie within 3.25 +/- (2x0.3) kg.
95% of all the 1,000 babies
lie within 2
2.65
65 and 3
3.85
85 kg
kg.
2.5% weigh less than 2.65 kg
and 2.5 % weigh more than 2.0 2.5 3.0 3.5 4.0
3.85 kg.

ENT Ear Examination Script
No ratings yet
ENT Ear Examination Script
2 pages
PSYCH Autism OSCE Script
No ratings yet
PSYCH Autism OSCE Script
1 page
Positive Psychology and Science of Happiness
100% (1)
Positive Psychology and Science of Happiness
4 pages
Introduction To Biostatistics
No ratings yet
Introduction To Biostatistics
53 pages
Topic 1 Describing Data II
No ratings yet
Topic 1 Describing Data II
68 pages
Descriptive Statistics 1
No ratings yet
Descriptive Statistics 1
63 pages
Lecture 3 - Stat HO
No ratings yet
Lecture 3 - Stat HO
21 pages
Variability Final
No ratings yet
Variability Final
53 pages
Measures of Central Tendency and Spread: Chapter 1, Section 2
No ratings yet
Measures of Central Tendency and Spread: Chapter 1, Section 2
36 pages
Dsbda Unit 2
No ratings yet
Dsbda Unit 2
155 pages
Lec5&6 02sep2016
No ratings yet
Lec5&6 02sep2016
32 pages
Stat Chapter 5-9
No ratings yet
Stat Chapter 5-9
32 pages
Stat 1101 4 7
No ratings yet
Stat 1101 4 7
18 pages
Quantitative Data Analysis
No ratings yet
Quantitative Data Analysis
31 pages
Introduction To Descriptive Statistics 2014
67% (3)
Introduction To Descriptive Statistics 2014
72 pages
Statistics Part 1 and 2
No ratings yet
Statistics Part 1 and 2
53 pages
Measures of Central Tendency and Disperssion (1)
No ratings yet
Measures of Central Tendency and Disperssion (1)
33 pages
03 - BIOE 211 - Basic Demog and Health Indicator Formula
No ratings yet
03 - BIOE 211 - Basic Demog and Health Indicator Formula
29 pages
Lesson-3.2-Measures-of-Central-Tendency-Position-and-Variation
No ratings yet
Lesson-3.2-Measures-of-Central-Tendency-Position-and-Variation
62 pages
03 Numerical Description
No ratings yet
03 Numerical Description
52 pages
slides_week2
No ratings yet
slides_week2
43 pages
Lec3&4 02sep2016
No ratings yet
Lec3&4 02sep2016
43 pages
المحاضرة رقم 3
No ratings yet
المحاضرة رقم 3
44 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
59 pages
Bio Statistics 3
No ratings yet
Bio Statistics 3
13 pages
ch03 Ver3
No ratings yet
ch03 Ver3
25 pages
Bus. Statt. Chapter-Lecture 2+3
No ratings yet
Bus. Statt. Chapter-Lecture 2+3
43 pages
FDSA unit 2
No ratings yet
FDSA unit 2
44 pages
AGA 3842-2022-2023. Descriptive Statistics
No ratings yet
AGA 3842-2022-2023. Descriptive Statistics
101 pages
Measusres of Locations
No ratings yet
Measusres of Locations
52 pages
Chapter 2
No ratings yet
Chapter 2
19 pages
Introductory of Statistics - Chapter 3
No ratings yet
Introductory of Statistics - Chapter 3
7 pages
Lecture 3
No ratings yet
Lecture 3
10 pages
Spring Semester, 2020-2021
No ratings yet
Spring Semester, 2020-2021
40 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
38 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
41 pages
Measures-of-Centrality-and-Variability
No ratings yet
Measures-of-Centrality-and-Variability
42 pages
Group-1 Module-1 PPT
No ratings yet
Group-1 Module-1 PPT
100 pages
Biostatistics (Descriptive Statistics)
No ratings yet
Biostatistics (Descriptive Statistics)
30 pages
02 Measures of Central Tendency
No ratings yet
02 Measures of Central Tendency
41 pages
Measures
No ratings yet
Measures
8 pages
Statistical Organization of Scores
No ratings yet
Statistical Organization of Scores
109 pages
Measures of Central Tendency Position and Dispersion 1.Pptx 20241015 145631 0000
No ratings yet
Measures of Central Tendency Position and Dispersion 1.Pptx 20241015 145631 0000
44 pages
Click To Add Text Dr. Cemre Erciyes
No ratings yet
Click To Add Text Dr. Cemre Erciyes
69 pages
01 Data
No ratings yet
01 Data
100 pages
Measure of Variation
No ratings yet
Measure of Variation
50 pages
Lecture 1
No ratings yet
Lecture 1
89 pages
Measures of Central Tendency to Z Score
No ratings yet
Measures of Central Tendency to Z Score
33 pages
Lesson 4: Statistics/Data Management Unit 1 - Measures of Central Tendency
No ratings yet
Lesson 4: Statistics/Data Management Unit 1 - Measures of Central Tendency
26 pages
Gtu 302 Biostatistics: Descriptive Statistics
100% (1)
Gtu 302 Biostatistics: Descriptive Statistics
57 pages
1.2 Mathematical Presentation of Data
No ratings yet
1.2 Mathematical Presentation of Data
28 pages
mathematics mean and mode
No ratings yet
mathematics mean and mode
37 pages
Chap 4
No ratings yet
Chap 4
126 pages
3-Measures of Central Tendency
No ratings yet
3-Measures of Central Tendency
59 pages
2nd Unit - Statistics
No ratings yet
2nd Unit - Statistics
15 pages
Dtatistical Measures
No ratings yet
Dtatistical Measures
54 pages
2.data Description
No ratings yet
2.data Description
57 pages
01_Scales of mesurement_Sumarising numeric data
No ratings yet
01_Scales of mesurement_Sumarising numeric data
26 pages
GE MODMAT Unit 4 Statistics 1
No ratings yet
GE MODMAT Unit 4 Statistics 1
14 pages
Descreptive Statistics 1
No ratings yet
Descreptive Statistics 1
74 pages
Descriptive Statistics (31-1) Biostatistics
No ratings yet
Descriptive Statistics (31-1) Biostatistics
40 pages
Unit-3 DS Students
No ratings yet
Unit-3 DS Students
35 pages
Statistical Foundations for Psychology
From Everand
Statistical Foundations for Psychology
James C. Ware
No ratings yet
Cancer Registry Annual Report 2015 - Web
No ratings yet
Cancer Registry Annual Report 2015 - Web
43 pages
Disease of Mesenteric Arteries and Veins 2017
No ratings yet
Disease of Mesenteric Arteries and Veins 2017
51 pages
IL1-Measures of Disease Frequency
No ratings yet
IL1-Measures of Disease Frequency
5 pages
Managemen of Vascular Graft and Endograft Infections Mar 2020
No ratings yet
Managemen of Vascular Graft and Endograft Infections Mar 2020
46 pages
ENT Approach To Thyroid Masses
No ratings yet
ENT Approach To Thyroid Masses
2 pages
ACLS Simulation Scenarios
No ratings yet
ACLS Simulation Scenarios
14 pages
Anatomy of The Mediastinum and Imaging Modalities
No ratings yet
Anatomy of The Mediastinum and Imaging Modalities
11 pages
PSYCH Conduct Disorder OSCE Script
0% (1)
PSYCH Conduct Disorder OSCE Script
1 page
Not Consistent With ACS ? Acs Very Consistent With ACS (Based On Clinical Features and ECG)
No ratings yet
Not Consistent With ACS ? Acs Very Consistent With ACS (Based On Clinical Features and ECG)
1 page
Bronchospasm During Anaesthesia Update 2011
No ratings yet
Bronchospasm During Anaesthesia Update 2011
5 pages
Protocol For Ordering Treadmill Before Cardiology Consult
No ratings yet
Protocol For Ordering Treadmill Before Cardiology Consult
2 pages
Protocol 13: Chest Pain: A&E Doctor
No ratings yet
Protocol 13: Chest Pain: A&E Doctor
4 pages
DuarteEtAl NailMelanomaInSitu
No ratings yet
DuarteEtAl NailMelanomaInSitu
10 pages
What Is ECMO?: Patient Education
No ratings yet
What Is ECMO?: Patient Education
2 pages
Examination of The CNS in Children: Professor Low Poh Sim Department of Paediatrics Ucmi
No ratings yet
Examination of The CNS in Children: Professor Low Poh Sim Department of Paediatrics Ucmi
40 pages
Paul and Maiti (2008)
No ratings yet
Paul and Maiti (2008)
32 pages
Sustainability 10 02579
No ratings yet
Sustainability 10 02579
17 pages
Probabilistic Methods in Engineering: Lecture 3: Counting/Conditional
No ratings yet
Probabilistic Methods in Engineering: Lecture 3: Counting/Conditional
44 pages
Developing Critical and Creative Thinking Through Chess
No ratings yet
Developing Critical and Creative Thinking Through Chess
7 pages
Practicalresearch1 q4 Mod4 Collectingdatathroughobservationsandinterviews Final
No ratings yet
Practicalresearch1 q4 Mod4 Collectingdatathroughobservationsandinterviews Final
20 pages
Body Ink Tattooing Among Young Adults: Relationship With Self-Esteem, Need For Uniqueness and Social Physique Anxiety
No ratings yet
Body Ink Tattooing Among Young Adults: Relationship With Self-Esteem, Need For Uniqueness and Social Physique Anxiety
7 pages
Organization and Management
No ratings yet
Organization and Management
8 pages
Tor Consultancy Parcellary Edited
No ratings yet
Tor Consultancy Parcellary Edited
7 pages
Handouts 3is PDF
No ratings yet
Handouts 3is PDF
4 pages
11.2 - Notation and Analysis: Assessment Statement Notes 11.2.1
No ratings yet
11.2 - Notation and Analysis: Assessment Statement Notes 11.2.1
5 pages
Research Gaps in Adolescent Sexual and Reproductive Health
No ratings yet
Research Gaps in Adolescent Sexual and Reproductive Health
7 pages
Chapter 11: Quantitative Data Analysis
No ratings yet
Chapter 11: Quantitative Data Analysis
4 pages
Script CFA Model
No ratings yet
Script CFA Model
3 pages
Research Proposal Assignment 08
No ratings yet
Research Proposal Assignment 08
3 pages
Socio Economic Impact of Financial Inclusion: Summer Internship Report On
No ratings yet
Socio Economic Impact of Financial Inclusion: Summer Internship Report On
45 pages
Full Chapter Categorical and Nonparametric Data Analysis E Michael Nussbaum PDF
100% (8)
Full Chapter Categorical and Nonparametric Data Analysis E Michael Nussbaum PDF
53 pages
Lopez 2021
No ratings yet
Lopez 2021
9 pages
Blessing Project
No ratings yet
Blessing Project
44 pages
Discriminant Analysis: 5.1 The Maximum Likelihood (ML) Rule
No ratings yet
Discriminant Analysis: 5.1 The Maximum Likelihood (ML) Rule
6 pages
Population Living in Different Types of Houses in Bhutan (Urban)
No ratings yet
Population Living in Different Types of Houses in Bhutan (Urban)
6 pages
System Analysis CH 5
100% (1)
System Analysis CH 5
13 pages
The Impact of User-Generated Content Social Intera
No ratings yet
The Impact of User-Generated Content Social Intera
11 pages
AEFL Quarterly Volume 18 Issue 1 March 2016
No ratings yet
AEFL Quarterly Volume 18 Issue 1 March 2016
171 pages
Thesis Qualitative Data Analysis
100% (3)
Thesis Qualitative Data Analysis
8 pages
CHAPTER 7 Academic Writing and Referencing
No ratings yet
CHAPTER 7 Academic Writing and Referencing
32 pages
Quants Intern - JD
No ratings yet
Quants Intern - JD
3 pages
Problem and Background of The Study
No ratings yet
Problem and Background of The Study
23 pages
Information Use, User, User Needs and Seeking Behaviour: A Review
No ratings yet
Information Use, User, User Needs and Seeking Behaviour: A Review
6 pages
Mobile Phone Use
100% (1)
Mobile Phone Use
6 pages