MBA - Quantitative Methods
MBA - Quantitative Methods
1
How to Use Self-Learning Material?
The pedagogy used to design this course is to enable the student to assimilate the concepts
with ease. The course is divided into modules. Each module is categorically divided into units or
chapters. Each unit has the following elements:
Table of Contents: Each unit has a well-defined table of contents. For example: “1.1.1.
(a)” should be read as “Module 1. Unit 1. Topic 1. (Sub-topic a)” and 1.2.3. (iii) should
be read as “Module 1. Unit 2. Topic 3. (Sub-topic iii).
Aim: It refers to the overall goal that can be achieved by going through the unit.
Learning Outcomes: These are demonstrations of the learner’s skills and experience
sequences in learning, and refer to what you will be able to accomplish after going
through the unit.
Did You Know?: You will learn some interesting facts about a topic that will help you
improve your knowledge. A unit can also contain Quiz, Case Study, Critical Learning
Exercises, etc., as metacognitive scaffold for learning.
Summary: This includes brief statements or restatements of the main points of unit and
summing up of the knowledge chunks in the unit.
Activity: It actively involves you through various assignments related to direct application
of the knowledge gained from the unit. Activities can be both online and offline.
e-References: This is a list of online resources, including academic e-Books and journal
articles that provide reliable and accurate information on any topic.
Video Links: It has links to online videos that help you understand concepts from a
variety of online resources.
Quantitative Methods
LEADERSHIP KLEF
Quantitative Methods
CREDITS
Author
Dr. J. Venkata Ramana
Director CDOE
C. Shanath Kumar
Instructional Designer
Nabina Das
Content Editor
M. Mounika Supriya
Project Manager
K. D. N. Lakshmi
Graphic Designer
B. V. Satyanarayana
Quantitative Methods
First Edition, 2023.
Quantitative Methods
Author Profile
He did M.Sc, M.M.M, Ph.D in the area of Management. He specialised in Marketing Management
and Business Analytics with 16 years of academic experience. He published 4 Scopus-Indexed
and 10 UGC Care-Indexed research papers in reputed Journals.
He is a fellow member of the World Economics Association (WEA), Green ThinkerZ, Indian
Academicians and Researchers Association (IARA) and Institute of Supply Management
(ISM). He is a reviewer for various international journals and guides research scholars in the
Management domain. He authored two books in the areas of Management
Quantitative Methods
Quantitative Methods
Course Description
Quantitative Methods is all about the decision-making process. This course begins with
fundamental principles and progresses to diverse decision-making processes. Quantitative
methods are defined as procedures that give the decision maker a systematic, and powerful
means of analysis, and assistance in developing policies for reaching pre-determined goals
based on measurable facts.
By the end of the course, students will be able to describe how to use various statistical methods
to solve problems and make decisions. Students will also examine uncertainty scenarios and
various forms of data, including uni-variate, bi-variate, and multi-variate data.
This course is designed to serve as a steppingstone for the students to analyse the managerial
decisions. The Quantitative Methods course contains Four Modules.
Quantitative Methods
1
MODULE 1
INTRODUCTION TO PROBABILITY
Concept of Probability: Definitions and rules for probability, Conditional probability, Independence
of events, Bayes’ theorem. Probability distributions: Random variables, Binomial Distribution,
Poisson Distribution, Normal Distribution. Introduction to R Programming: Evolution and features
of R Programming, Operators in R Programming, Data Structures in R Programming.
MODULE 2
OVERVIEW OF SAMPLING
Basic Concepts: Types of Sampling, Sampling distributions, Sampling distribution of mean
and proportion, Application of Central Limit Theorem, Determining the sample size. Estimation
and Testing of Hypothesis: Introduction to Estimation, Point Estimation, Interval Estimation,
Introduction to Hypothesis, one simple and two sample tests for means and proportions of large
samples (z-test), one sample and two sample tests for means of small samples (t-test), F-test
for two sample standard deviations, ANOVA (ANALYSIS OF VARIANCE) one- and two-way, Chi-
square tests for independence of attributes and goodness of fit, Sign test and Rank Test.
MODULE 3
CORRELATION AND REGRESSION
Introduction to Correlation: Meaning, measurement, graphic and algebraic, Scatter Plot, Pearson
Correlation Coefficient, Spearman’s Rank Correlation, Testing the significance of regression
coefficients, Regression: Meaning, Types, Estimating the regression coefficients.
MODULE 4
INDEX NUMBERS AND TIME SERIES ANALYSIS
Time series analysis: Meaning and Components of Time Series, Variations in time series,
Smoothing Methods, trend analysis, cyclical variations, Seasonal variations, and irregular
variations. Index Numbers: Unweight Index numbers, Weighted Index numbers.
Quantitative Methods
2
Table of Contents
Module 1
INTRODUCTION TO PROBABILITY
Unit 1.1 Concept of Probability
Unit 1.2 Probability Distributions
Unit 1.3 Introduction to R Programming
Module 2
OVERVIEW OF SAMPLING
Unit 2.1 Introduction to Sampling
Unit 2.2 Estimation and Testing of Hypothesis
Unit 2.3 ANOVA & Non-parametric Testing of Hypothesis
Module 3
CORRELATION AND REGRESSION
Unit 3.1 Introduction to Correlation
Unit 3.2 Regression
Module 4
INDEX NUMBERS AND TIME SERIES ANALYSIS
Unit 4.1 Time Series Analysis
Unit 4.2 Index Numbers
Quantitative Methods
3
QUANTITATIVE METHODS
Module - 1
INTRODUCTION TO PROBABILITY
Quantitative Methods
4
Module Description
We use the words ‘probability’ and ‘chance’ frequently in our daily conversations, yet most
people have only a hazy understanding of what they represent. For example, we might hear
in a weather report, “There’s a chance of heavy rain tomorrow”, or “There’s a chance that both
teams A and B win tomorrow’s match”, or “Probably you’re right”, or “It’s likely that I won’t be able
to come to your house for the get-together”. It has been observed that keywords like likelihood,
chances, possible, likely, and others appear in the above sentences and communicate the same
meaning, i.e., the event is not certain to occur, or, in other words, there is uncertainty in the
event’s occurrence. In plain language, the term “probability” denotes that there is a degree of
uncertainty regarding the event’s outcome.
Quantitative Methods
5
QUANTITATIVE METHODS
Module - 1
Unit - 1
CONCEPT OF PROBABILITY
Quantitative Methods
6
Unit Table of Contents
Unit 1.1 Concept of Probability
Aim -------------------------------------------------------------------------------------------------------------- 08
Instructional Objectives ------------------------------------------------------------------------------------ 08
Learning Outcomes ----------------------------------------------------------------------------------------- 08
Introduction ---------------------------------------------------------------------------------------------------- 09
1.1.1 Definitions and Rules of Probability ----------------------------------------------------------- 09
Self-Assessment Questions --------------------------------------------------------------------- 26
1.1.2 Conditional Probability and Independence of Events ------------------------------------- 27
Self-Assessment Questions --------------------------------------------------------------------- 30
1.1.3 Bayes’ Theorem ------------------------------------------------------------------------------------ 31
Self-Assessment Questions --------------------------------------------------------------------- 34
Summary ------------------------------------------------------------------------------------------------------- 35
Terminal Questions ------------------------------------------------------------------------------------------ 35
Answer Keys -------------------------------------------------------------------------------------------------- 36
Glossary -------------------------------------------------------------------------------------------------------- 37
Bibliography --------------------------------------------------------------------------------------------------- 37
External Resources ----------------------------------------------------------------------------------------- 37
e-References ------------------------------------------------------------------------------------------------- 37
Image Credits ------------------------------------------------------------------------------------------------- 38
Video Links ---------------------------------------------------------------------------------------------------- 38
Keywords ------------------------------------------------------------------------------------------------------ 38
Quantitative Methods
7
Aim
This unit aims to explain the basic concepts of probability and its applications in the
field of management.
Instructional Objectives
This unit intends to:
● Explain the basic concepts of probability
● Discuss the concepts of probability in various management applications
Learning Outcomes
At the end of this unit, you are expected to:
● Demonstrate conditional probabilities and various theorems of probability
● Apply the concepts of probability in marketing, HRM, finance, and other
functional areas
Quantitative Methods
8
Introduction
We use the words ‘probability’ and ‘chance’ frequently in our daily conversations, yet most people
have only a hazy understanding of what they represent. For example, we might hear in a weather
report, “There is a chance of heavy rain tomorrow”. “There are chances that both teams and
win tomorrow’s match” and “It is likely that I will not be able to come to your house for the get-
together,” are also some other examples. It has been observed that keywords like likelihood,
chances, possible, likely, and others appear in the above sentences and communicate the same
meaning, i.e., the event is not certain to occur, or, in other words, there is uncertainty in the event’s
occurrence. In plain language, the term “probability” denotes that there is a degree of uncertainty
regarding the event’s outcome. However, in mathematics and statistics, we attempt to describe
conditions under which we can make meaningful numerical assertions about uncertainty and use
specific methods for computing numerical probabilities and expectations. The term probability is
thus defined in the statistical sense and is unrelated to beliefs or any type of dreaming.
The concept of probability deals with uncertainty and randomness. The literal meaning of probability
is ‘Chance’. Galileo – an Italian Mathematician made inventions in quantitative probability.
Quantitative Methods
9
1.1.1.1 Basic Terminology in Probability
Random Experiment
Sample space
● The set of all possible outcomes of random experiment is called sample space and is
denoted by S
● Any performance of a random experiment is called Trial and its outcomes are called
Events.
E.g.: Tossing a coin for one time is called a trial and its outcomes { H , T } are events.
Exhausting Situations
● In a random experiment, ‘exhaustive events’ refers to the entire number of all conceivable
elementary outcomes. In other words, when there are no alternative options, a set is said
to be exhaustive.
Favourable Events
● Favourable events are the basic results that imply or favour the occurrence of an event,
i.e., the outcomes that aid in the occurrence of that event.
● An event is ‘mutually exclusive’ if it completely prevents all other events in a trial from
occurring. In other words, two occurrences and cannot happen at the same time.
● If there is no reason to expect one event over another, the outcome is said to be ‘equally
likely’, i.e., each of the exhaustive outcomes has an equal chance of occurring.
Complementary Events
● Let E denote the event’s occurrence. The absence of event E is denoted by the complement
of E , E ' s complement is indicated by the symbol.
Quantitative Methods
10
P ( Not1) =
1 − p ( 2,3, 4,5, 6, ) =
1− 5 / 6 =
1/ 6
P (1) = 1/ 6 P ( Not1) =
1 − p ( 2,3, 4,5, 6, ) =
1− 5 / 6 =
1/ 6
Independent Events
● In a sequence of trials, two or more events are ‘independent’ if the outcome of one does
not affect the outcome of the other or vice versa.
There are four approaches to construct a measure of probability of occurrence of an event. They
are:
● Mathematical or Classical Approach
● Statistical or Empirical Approach
● Axiomatic Approach.
● Subjective Approach
Classical or Mathematical Approach
● If a trial results in ‘n’ exhaustive, mutually exclusive, equally likely and independent
outcomes, and if ‘m’(m<=n) of them is favourable for the happening of the event E, then
the probability ‘P’ of occurrence of the event ‘E’ is given by
Example: P ( H ) = ½
Then P ( E ) = lim M / N
N →∞
Quantitative Methods
11
Example: P ( One head = ) m= / n 2= / 4 1/ 2or 0.5 A die is rolled 100 times. The
number 3 is rolled 12 times. The relative frequency of rolling a 3 is 12/100.
Axiomatic Approach
P ( S ) = 1 . (Axiom of Certainty)
P ( AUB
= ) P ( A) + P ( B ) , when event A and B are mutually exclusive. (Axiom of ad-
ditively)
● The probability that determined based on the human tendencies like experiences, belief,
etc., is called subjective probability.
● These are values (between 0 and 1 or 0 and 100%) assigned by individuals based on
how likely they think events are to occur.
● Example: The probability that a student is gains 90 marks in an exam is 60%
P ( A) ≤ 1
0 ≤ P ( A) ≤ 1
Quantitative Methods
12
Self-Assessment Questions
A) Event
B) Chance
C) Relative
D) Case
A) Galileo
B) Fisher
C) Pearson
D) Bowley
A) Sample
B) Sample Space
C) Space
D) Event
Quantitative Methods
13
1.1.1.4 Solved examples
Solution:
=m {=
TH , HT , TT } 3 S = { HH , HT , TH , TT } n = 4
head m
a) Favourable cases of getting one= {=
HT , TH } 2
=
b) Favourable cases of two heads m {=
HH } 1
P (Two heads
= ) m=
/n ¼
=
c) Favourable cases of no heads m {=
TT } 1
P ( No heads ) = ¼
=
d) Favourable cases of at least one tail m {=
TH , HT , TT } 3
a) an odd face;
b) a prime face;
c) a face multiple of 3;
d) a face of 7;
e) any number between 1 and 6 (both inclusive).
Solution:
=
Exhaustive cases n {1,=
2,3, 4,5, 6} 6
=
a) Favourable cases of odd face= m {1,3,5
= } 3
p ( odd face
= ) m=
/ n 3=
/ 6 1/ 2
Quantitative Methods
14
=
b) Favourable cases of Prime face= m {=
2,3,5} 3
p ( prime
= ) 3=
/ 6 1/ 2
=m
c) Favourable cases of multiple of 3= {=
3, 6} 2
p ( multiple of 3) = 2 / 6
=
e) Favourable cases of getting 1to 6= m {1,=
2,3, 4,5, 6} 6
p (1 − 6=
) 6 / 6= 1
1. P ( a ) ≥ 0 (Axiom of Non-negativity)
2. P ( s ) = 1 (Axiom of Certainty)
3. P ( a ∪ b=
) P ( a ) + P ( b ) (a, b Mutually exclusive) (Axiom of additivity)
Example:
If a coin is tossed, then define the probability of getting head using axiomatic definition of
probability.
1) P ( a
= ) ½ ( > 0)
2) P ( S ) = P ( H , T ) = ½ + ½ = 1
3) P ( HUT
= ) P ( H ) + P (T )
P ( S=
) ½ +½
1=1
Quantitative Methods
15
3. If a coin is tossed for 3 times, then find the probability for
a) two heads;
b) Alternative tails;
c) At least two tails
d) No tails.
Solution:
=
a) the favourable cases of getting two heads m {HHT , HTH , THH }
= 3
(E)
P= m
= /n 3/8
=
b) the favourable cases of getting alternative tails m {=
THT } 1
(E)
P= m
= /n 1/ 8
P (=
E ) m=
/n 4/8
tails m
d) the favourable cases of getting no= {=
HHH } 1
(E)
P= m
= /n 1/ 8
4. If a die thrown for two times then find the probability of a) The total on both the dice is
9;b) The total on both the dice is more than 10; c)First die shows face 3; d) second die
shows face 5; e) both the dice have the same face; f ) first die an even and second die
is an odd; g) the total on both the dice is 13; h) the total on both the dice is any number
between 2 to 12 (Both Inclusive).
Solution:
If a die thrown for two times, then the exhaustive cases are 62 = 36.
S = {(1,1) , (1, 2 )(1,3) , (1, 4 ) , (1,5 ) , (1, 6 ) , ( 2,1) , ( 2, 2 )( 2,3) , ( 2, 4 ) , ( 2,5 ) , ( 2, 6 ) , ( 3,1) , ( 3, 2 )( 3,3) , ( 3, 4 ) , ( 3,5 ) , ( 3, 6 ) ,
m {( 3, 6 ) , ( 6,3) , ( 4,5) , ( 5, 4 )}
= 4 P (=
E ) m=
/ n 4 / 36
Quantitative Methods
16
b) The favourable cases of getting total on both the dice is more than 10. (i.e., 11 or 12)
m {( 5, 6 ) , ( 6,5) , ( 6, 6 )} 3 P ( E ) = 3 / 36
=
e) The favourable cases of getting both the dice have the same face.
f) The favourable cases of getting first die an even and second die is an odd.
P (=
E ) m=
/n 9 / 36
m
g) The favourable cases of getting the total on both the dice is 13,= {}
= 0
P (=
E ) m=
/n 0 /=
36 0 the total on both the dice is 13 is called Impossible event.
h) The favourable cases of getting total on both the dice is any number between 2 to 12
(Both Inclusive)
m = {(1,1) , (1, 2 )(1,3) , (1, 4 ) , (1,5 ) , (1, 6 ) , ( 2,1) , ( 2, 2 )( 2,3) , ( 2, 4 ) , ( 2,5 ) , ( 2, 6 ) , ( 3,1) , ( 3, 2 ) , ( 3,3) ,
Quantitative Methods
17
Permutation:
npr = n ! / ( n − r ) ! ; where=
n! n ( n − 1) ( n − 2 ) ……3 2 1
3 p=
2 3! / ( 3 − 2 )=
! 3.2.1/1= 6
Combination:
ncr = n ! / ( n − r )! r ! ; where=
n! n ( n − 1) ( n − 2 ) ……3 2 1
5. If 2 cards are selected from a pack of 52 cards, then find the probability for a) a spade and
a heart card; b) a number and a face card; c) both the cards are black.
Solution:
Selection of 2 cards from a pack of 52 cards can be done in n = 52c2
( e ) m=
p= /n 13c1 x 13c1 / 52c2
Quantitative Methods
18
b) favourable cases of getting a number and a face card m = 36c1 x16c1
( e ) m=
p= /n 36c1 x16c1 / 52c2
( e ) m=
p= /n 26c2 / 52c2
6. If 4 cards are selected from a pack of 52 cards, then find the probability for a) one card
from each suit;
b) two of the selected are diamonds; c) at least one card is a spade card; d) all are num-
ber cards.
Solution:
a) favourable cases of getting one card from each suit , m = 13c1 x13c1 x13c1 x13c1
( e ) m=
p= /n 13c1 x13c1 x13c1 x13c1 / 52c4
b) favourable cases of getting two of the selected are diamonds, m = 13c2 x39c2
( e ) m=
p= / n 13c2 x39c2 / 52c4
( e ) m=
p= / n 36c4 / 52c4
Quantitative Methods
19
7. a basket contains 3 white, 4 blue and 5 red balls. if 3 balls are selected at random then
find the probability for
Solution:
a) the favourable cases for one ball from each colour, m = 3c1.4c1.5c1
(e)
p= m
= /n 3c1.4c1.5c1 /12c3
b) the favourable cases for two of the drawn balls are white, m = 3c 2.9c1
(e)
p= m
= /n 3c2 .9c1 /12c3
(e)
p= m
= /n 5c3 /12c3
d) the favourable cases for at least one is blue ball, m = 4c1.8c2 + 4c2 .8c1 + 4c3
(e)
p= m
= /n 9c3 /12c3
Quantitative Methods
20
8. Find the probability of getting 53 Sundays in a randomly selected leap year?
Solution:
It required to find the probability for 53rd Sunday in the remaining 366-364 = 2 days
n {=
( sun, mon ) , ( mon, tue ) , ( tue, wed ) , ( wed , thu ) , ( thu, fri ) , ( fri, sat ) , ( sat , sun )} 7
of a Sunday will be m
The favourable cases = {=
( sun, mon ) , ( sat , sun )} 2
(e)
p= m
= /n 2/7
● For Two Events: If A and B are any two events then the probability of happening of at
P ( A ∪ B ∪ C=
) P ( A) + P ( B ) + P ( C ) − P ( A ∩ B ) − P ( B ∩ C ) − P ( C ∩ A) + P ( A ∩ B ∩ C )
1. A card is drawn at random from a pack of 52 cards. Find the probability that the drawn
card is either a spade or a king.
Solution:
Let A : Event of drawing a card of spade and B : Event of drawing a king card
Quantitative Methods
21
13
The probability of drawing a card of spade P ( A) =
52
4
The probability of drawing a king card P ( B ) =
52
Because one of the kings is a spade card also therefore, these events are not mutually
1
exclusive. The probability of drawing a king of spade is P ( A ∩ B ) =
52
So, the probability of the drawing a spade or king card is:
13 4 1 16 4
P ( A ∪ B ) = P( A) + P ( B ) − P ( A ∩ B ) = + − = =
52 52 52 52 13
2. A herd contains 30 cows numbered from 1 to 30. One cow is selected at random. Find
the probability that number of the selected cow is a multiple of 5 or 8.
Solution:
Let A be the event of number being a multiple of 5 within 30 and B be the event of number
being a multiple of 8 within 30.
Favourable cases for event A are {5, 10, 15, 20, 25, 30} = 6
6
The probability of the number being a multiple of 5 within 30 is P ( A ) =
30
3
The probability of the number being a multiple of 8 within 30 is P ( B ) =
30
Since A and B are mutually exclusive, the probability that number of the cow is a multiple of
6 3 9 3
5 or 8 is: P ( A ∪ B ) = P ( A) + P ( B ) = + = =
30 30 30 10
3. A card is drawn at random from a pack of 52 cards. Find the probability that the drawn
card is either a club or an ace of diamond.
Quantitative Methods
22
Solution:
Let A : Event of drawing a card of club; and B : Event of drawing an ace of diamond
13
The probability of drawing a card of club P ( A ) =
52
1
The probability of drawing an ace of diamond P ( B ) =
52
Since the events are mutually exclusive, the probability of the drawn card being a club, or an ace
13 1 14 7
of diamond is: P ( A ∪ B ) = P ( A) + P ( B ) = + = =
52 52 52 26
4. A herd contains 30 cows numbered from 1 to 30. One cow is selected at random. Find the
probability that the number of the selected cow is a multiple of 5 or 6.
Solution:
Let A be the event of number being a multiple of 5 within 30 and B be the event of number being
a multiple of 6 within 30.
Favourable cases for event A are {5, 10, 15, 20, 25, 30} = 6
Similarly favourable cases for event B are {6, 12, 18, 24, 30} = 5
6
The probability of the number being a multiple of 5 within 30 is P ( A ) =
30
5
The probability of the number being a multiple of 6within 30 is P ( B ) =
30
Since 30 is a multiple of 5 as well as 6, therefore the events are not mutually exclusive
1
( A ∩ B ) P ( A=
P= and B )
30
The probability that the number of the selected cow is a multiple of 5 or 6 is:
6 5 1 10 1
P ( A ∪ B ) = P( A) + P ( B ) − P ( A ∩ B ) = + − = =
30 30 30 30 10
Quantitative Methods
23
5. A herd contains 30 cows numbered from 1 to 30. One cow is selected at random. Find the
probability that the number of the selected cow is a multiple of 5 or 6 or 8.
Solution:
C - Multiple of 8, (8,16,24) = 3
P ( A ) = 6 / 30
P ( B ) = 5 / 30
P ( C ) = 3 / 30
P( A ∩ B) =
1/ 30
P( B ∩ C ) =
1/ 30
P( A ∩ C ) = 0 / 30= 0
P ( A ∩ B ∩ C=
) 0 / 30
= 0
P ( AUBUC ) = P ( A) + P ( B ) + P ( C ) − P ( A ∩ B ) − P ( B ∩ C ) − P ( C ∩ A) + P ( A ∩ B ∩ C )
= 6 / 30 + 5 / 30 + 3 / 30 − 1/ 30 − 1/ 30 − 0 + 0
= 12
= / 30 2 / 5
6. If a card is selected at random from a pack of 52 cards, then find the probability that the
selected card is a Red or a Heart or a Jack card.
Solution:
Quantitative Methods
24
Let C - Jack card, P ( C ) = 4 / 52
P ( A ∪ B ∪ C=
) P ( A) + P ( B ) + P ( C ) − P ( A ∩ B ) − P ( B ∩ C ) − P ( C ∩ A) + P ( A ∩ B ∩ C )
= 26 / 52 + 13 / 52 + 4 / 52 − 13 / 52 − 1/ 52 − 2 / 52 + 1/ 52
= 28 / 52
Quantitative Methods
25
Self-Assessment Questions
A) 3
B) 2
C) 4
D) 5
A) And
B) Or
C) Not
D) No
A) No
B) Or
C) Not
D) And
Quantitative Methods
26
1.1.2 Conditional Probability and Independence of Events
If A and B are any two events in the sample space S . Then the probability of happening of
P ( A / B) =
P( A ∩ B) / P ( B ) where P ( B ) > 0
Similarly, P ( B / A ) =
P( A ∩ B) / P ( A ) where P ( A ) > 0
If A and B are any two events in the sample space S . Then the probability of happening of
P ( A ) P ( B / A ) ;where P ( A ) > 0
event A and B is given by P ( A ∩ B ) =
= P ( B ) P ( A / B ) ;where P ( B ) > 0
If A , B and C are any three events in the sample space S . Then the probability of happening
of event A , and B , and C is given by
P( A ∩ B ∩ C )
= P ( A) P ( B / A) P ( C / A ∩ B )
Independence of events:
P ( A) P ( B )
Two events A and B are said to be independent if P ( A ∩ B ) =
1. Box A contains 5 red and 3 white balls and Box B Contains 2 red and 6 white balls. If
a ball is drawn from each box, what is the probability that they are both of same colour?
Quantitative Methods
27
Solution:
P ( E1 ) 1/=
Let E1 = The ball drawn from box A and a red ball, = 2.5 / 8 5 /16
P ( E2 ) 1/=
Let E2 = The ball drawn from box B and a red ball, = 2.2 / 8 2 /1 6
= 5 /16.2
= /16 10
= / 256 5 /128 .
P ( E3 ) 1/=
Let E3 = The ball drawn from box A and a WHITE ball, = 2.3 / 8 3 /16
P ( E2 ) 1/=
Let E4 = The ball drawn from box B and a WHITE ball, = 2.6 / 8 6 /1 6
= 3 /16.6
= /16 18
= / 256 9 /128 .
2. The probabilities of 3 students solve a problem in statistics are 1/2,1/3 &1/4 respectively.
Then find the probability that that the problem will be solved.
Solution:
P ( A ∪ B ∪ C=
) P ( A) + P ( B ) + P ( C ) − P ( A ∩ B ) − ( B ∩ C ) − ( C ∩ A) + P( A∩ B ∩C)
= P ( A) + P ( B ) + P ( C ) − P ( A) . P ( B ) − P ( B ) . P ( C ) − P ( C ) . P ( A) + P ( A) . P ( B ) . P ( C )
Quantitative Methods
28
= 1/2 + 1/ 3 + 1/ 4 − 1/ 2.1/ 3 − 1/ 3.1/ 4 − 1.4.1/ 2 + 1/ 2.1/ 3.1/ 4
= 3/ 4
3. If the probability that a communication system has high fidelity is 0.81 and the probability
that it has high fidelity and selectivity is 0.18. What is the probability that the system with
high selectivity given that it has high fidelity?
Solution:
Probability that the system with high selectivity given that it has high fidelity.
P ( B / A=
) P( A ∩ B) / P ( A)
= ( 0.18) / ( 0.81)
= 2/9
Quantitative Methods
29
Self-Assessment Questions
A) A or B
B) A and B
C) A by B
D) A given B
A) Intersection
B) Union
C) Given
D) Not
A) Intersection
B) Union
C) Given
D) Not
A) 1
B) 2
C) 0
D) 3
A) 1
B) 2
C) 0
D) 0<p<1
Quantitative Methods
30
1.1.3 Baye’s Theorem
The concept of Baye’s theorem is applied to convert the Prior probabilities into Posterior
(Conditional) probabilities.
Let A1 , A2… An are n mutually exclusive events in the sample space S such that P ( Ai ) > 0
and B be another event in the sample space S such that P ( B ) > 0 and B €UAi . Then
the conditional probability of Aj given B is
P ( A j ) .P ( B / A j ) .
P ( Aj / B ) = n
∑ P( A ).P ( B / A ) .
j =1
j j
1. In a bolt factory, machines A , B and C produces 20%, 30% and 50% of the total out-
put and 6%, 3% and 2% are defectives produced by A , B and C . If a bolt is drawn at
random and found to be defective, then find the probability that it is manufactured by i)
Machine A ii) Machine B iii) Machine C iv) Machines A or B .
Solution:
P ( A1) 20
Let A1 – Production by Machine A , = = /100 0.2
P ( B=
/ A1) 6=
/100 0.06
P ( B=
/ A2 ) 3=
/100 0.03
P ( B=
/ A3) 2 /100
= 0.02
Quantitative Methods
31
= ( 0.2.0.06 ) / ( 0.2.0.06 ) + ( 0.3.0.03) + ( 0.5.0.02 )
= 0.012 / 0.031
= 0.38
= 38%
ii) The probability for the defective both produced by machine B
P ( ( A1 ∪ A2 ) / B ) = P ( A1/ B ) + P ( A2 / B )
= 0.38 + 0.29
= 0.67
= 67%
v) The probability for the defective both produced by machine A or B or C
P ( ( A1 ∪ A2 ∪ A3) =
/ B ) P ( A1/ B ) + P ( A2 / B ) + P ( A3 / B )
=0.38 + 0.29 + 0.32 =0.99 =1 =100%
2. Of all staff in a university, 40% are women and 60% are men, among which 3% and 5%
are smoke cigarettes. If an employee is selected at random and found to be a smoker,
then find the probability that a) a man who is a smoker; b) a woman who is a smoker; c)
a smoker.
Solution:
P ( A1)
= 40 /100
= 0.4
P ( A2 )
= 60
= /100 0.6
Quantitative Methods
32
Then we have
P ( B=
/ A1) 3=
/100 0.03
P ( B /=
A2 ) 5=
/100 0.05
P ( A2 / B ) P ( A2 ) P ( B / A2 ) / P ( A1) P ( B / A1) + P ( A2 ) P ( B / A2 )
= 0.012 / 0.042
= 0.28
= 28%
P ( A1 ∪ A2 ) / B =
P ( A1/ B ) + P ( A2 / B )
Quantitative Methods
33
Self-Assessment Questions
A) Posterior
B) Union
C) Intersection
D) Cost
A) Intersection
B) Prior
C) Given
D) Posterior
A) Intersection
B) Prior
C) Given
D) Posterior
A) 1
B) 2
C) 0
D) 3
A) Even chance
B) Impossible
C) Sure
D) Certain
Quantitative Methods
34
Summary
Terminal Questions
Quantitative Methods
35
Answer Keys
Self-Assessment Questions
Question No Answers
1 C
2 A
3 B
4 C
5 B
6 D
7 D
8 B
9 A
10 C
11 D
12 A
13 B
14 D
15 A
16 B
Quantitative Methods
36
Glossary
• Conditional probability: Defined as the likelihood of an event or outcome
occurring, based on the occurrence of a previous event or outcome.
• Probability: A probability is a number that reflects the chance or likelihood that
a particular event will occur.
• Event: The outcomes of an experiment.
BIBLIOGRAPHY
1. Chung, K. L. (2000). A Course in Probability Theory (3rd ed.). San Diego, CA:
Academic Press.
2. Feller, W. (1968). An introduction to Probability Theory and its applications: Vol-
ume I. John Wiley & Sons.
3. Gupta, S. C., & Kapoor, V. K. Fundamentals of Mathematical Statistics. S. Chand
Publications.
External Resources
1. Rukmangadachari, E. Probability and Statistics. Person Education.
2. Rohatgi, V. K., & Ehsanes Saleh, A. K. (2015). An introduction to probability and
statistics (3rd ed.). doi:10.1002/9781118799635
e-References
• Bayes Theorem: https://www.cuemath.com/data/bayes-theorem/
Quantitative Methods
37
Image Credits
Representation of Chance
Fig. 1 https://www.advancedhighermaths.
co.uk/probability-2/
Video Links
Topic Link
Probability https://www.youtube.com/
watch?v=lZSL7Tm5ViA
Introduction to Probability, Basic Overview https://www.youtube.com/
- Sample Space, & Tree Diagrams watch?v=SkidyDQuupA
Probability Equation Questions https://www.youtube.com/
watch?v=76-cMKLmWJY
Keywords
• Random experiment
• Sample space
• Equally likely events
• Bayes theorem
Quantitative Methods
38
QUANTITATIVE METHODS
Module - 1
Unit - 2
PROBABILITY DISTRIBUTIONS
Quantitative Methods
39
Unit Table of Contents
Unit 1.2 Probability Distributions
Aim -------------------------------------------------------------------------------------------------------------- 41
Instructional Objectives ------------------------------------------------------------------------------------ 41
Learning Outcomes ----------------------------------------------------------------------------------------- 41
Introduction ---------------------------------------------------------------------------------------------------- 42
1.2.1 Random Variable ----------------------------------------------------------------------------------- 42
Self-Assessment Questions --------------------------------------------------------------------- 44
1.2.2 Binomial Distribution ------------------------------------------------------------------------------ 45
Self-Assessment Questions --------------------------------------------------------------------- 50
1.2.3 Poisson Distribution ------------------------------------------------------------------------------- 51
Self-Assessment Questions --------------------------------------------------------------------- 55
1.2.4 Normal Distribution -------------------------------------------------------------------------------- 56
Self-Assessment Questions --------------------------------------------------------------------- 58
Summary ------------------------------------------------------------------------------------------------------- 59
Terminal Questions ------------------------------------------------------------------------------------------ 59
Answer Keys -------------------------------------------------------------------------------------------------- 60
Glossary -------------------------------------------------------------------------------------------------------- 61
Bibliography --------------------------------------------------------------------------------------------------- 61
External Resources ----------------------------------------------------------------------------------------- 61
e-References ------------------------------------------------------------------------------------------------- 61
Image Credits ------------------------------------------------------------------------------------------------- 62
Video Links ---------------------------------------------------------------------------------------------------- 62
Keywords ------------------------------------------------------------------------------------------------------ 62
Quantitative Methods
40
Aim
To explain basic concepts of probability distributions, random variables, and their
types.
Instructional Objectives
This unit intends to:
● Discuss the basic concepts of probability distributions
● Distinguish between continuous and discrete random variables
● Describe the properties of binomial distribution, poisson distribution, and
nominal distribution
Learning Outcomes
At the end of this unit, you are expected to:
● Distinguish between various types of random variables
● Calculate probabilities of random variables
● Identify continuous random variable and discrete random variable, and their
properties
● Apply probability distributions in marketing, HRM, finance, and other
functional areas
Quantitative Methods
41
Introduction
Random variable concept is an extension to the concept of probability. It describes the probability
distribution between different points of a random variable. Binomial, Poisson, and Normal Distri-
butions have a number of applications in management. The probability distribution of a random
variable describes how the probability is distributed over the values of the random variable.
X ( S ) : SàR
For example, we flipped a fair coin three times and recorded whether it showed heads or tails.
The result or sample space is S = { HHH , HHT , HTH , THH , TTT , TTH , THT , HTT } . There are
eight possible outcomes, and each outcome is equal. Now, we flip a fair coin four times. How
many possible outcomes are there? There are 24 =16. How about 8 times? 256 possible out-
comes! Instead of considering all possible outcomes, we can consider assigning the variable
X to be, say, the number of heads in n flips of a fair coin. If we flipped the coin n = 3 times (as
above), then X can take on possible values of 0,1,2, or 3. By defining the variable X , as we
have, we created a random variable.
E.g.: A coin is tossed twice and getting the head is a random variable.
S = {HH , HT , TH , TT }
R = {2,1, 0}
Random variables are classified as discrete and continuous variables. The main difference be-
tween them is the type of possible values each variable can take. Let us understand these two
variables in detail below.
Eg :=
X 6,=
Y 1, 2,3, 4,5.
Quantitative Methods
42
b. Continuous Random variable
It is a random variable which assumes an infinite number of values between certain
intervals. Simply put, a random variable is called continuous if its possible values
have an entire interval of numbers.
Eg : X € [ 2,3] , Y € ( 5, 6 )
It is the distribution of the probability among the various sample points in the sample space.
E.g.: A coin is tossed two times and getting a head is a random variable.
S = {HH , HT , TH , TT }
R = {2,1, 0}
1. P ( Xi ) ≥ 0
1. Binomial Distribution
2. Poisson Distribution
1. Normal Distribution
Quantitative Methods
43
Self-Assessment Questions
A). Nominal
B). Real
C). Ordinal
D). Case
A). 2
B). 3
C). 4
D). 1
A). 2
B). 3
C). 4
D). 1
DID
YOU Binomial Distribution (BD) was discovered by James Bernoulli in 1700
KNOW as an extension to Bernoulli Distribution.
Quantitative Methods
44
1.2.2 Binomial Distribution
An experiment is said to follow the BD under the following conditions.
1. The experiment should be repeated a finite number of times. Say ' n ' times.
3. p ( s ) = p , p ( f ) 1
= q ; where p + q =
If a trail repeated for n times and the probability of getting x successes can be represented by
the following Probability Mass Function:
P (= )
X x= ncx. p x q ( n − x ) ;=
x 0,1, 2,3……. n
= 0 ; otherwise
Where n , p are called parameters of B.D
1. A fair coin is tossed 6 times then, find the probability of getting 4 heads.
Answer:
Quantitative Methods
45
P (= )
X x= ncx. p x q ( n − x ) ; =
x 0,1, 2,3……. n
P ( X= x=
) 6cx (1/ 2 ) (1/ 2 )
x 6− x
P (= ) 6cx (1/ 2 )
6
X x=
P (= ) 6C4 (1/ 2 )
6
X 4=
= 0.234
2. A die is rolled 10 times. What is the chance of getting exactly 2 times the face 1?
Answer:
n = 10
Success? face 1 p = 1/ 6
( x 2=
) 10c 2 (1/ 6 ) ( 5 / 6 )
2 (10 − 2 )
p=
= 0.27
3. A discrete variable X has the mean 6 and variance 2. If it is assumed that the distribution is
Answer:
Given that x ~ B ( 6, 2 )
Substitute 1 in 2
− > 6q = 2− > q = 2 / 6 = 1/ 3
Quantitative Methods
46
− > P = 1 − q = 1 − 1/ 3 = 2/3
=n 9,
= p 2/=
3, q 1/ 3
p (= ) 9cx ( 2 / 3) (1/ 3)
x 9− x
X x= is the required PMF of B.D
P (5 ≤ x ≤ 7) =P ( X =+
5 ) P ( X =+
6) P ( X =
7)
4. If four coins are tossed 160 times and the number of times x heads occur is given below.
x 0 1 2 3 4
No. of times 8 34 69 43 6
Answer:
p = 0.5
q =1 − 0.5 =0.5
p (= )
X x= 4cx 0.5 x 0.5( 4− x )
p (= ) 4cx 0.54
X x=
Quantitative Methods
47
The Expected frequencies
X = x P ( X = x) E(X )
= N
= P ( X x)
0 p ( X= 0=
) 4c0. 0.5=4 0.0625
= E ( 0 ) 160
= ( 0.0625) 10
1 p ( X= 1=
) 4c1. 0.54= 0.25 =
E (1) 160
= ( 0.25) 40
2 p ( X= 2=
) 4c2. 0.5=4 0.375=
E ( 2 ) 160
= ( 0.375) 60
3 p ( X= 3=
) 4c3. 0.5=4 0.25 =
E ( 3) 160
= ( 0.25) 40
4 p ( X= 4=
) 4c4. 0.5=4 0.0625
= E ( 4 ) 160
= ( 0.0625) 10
x 0 1 2 3 4
8 34 69 43 6
Oi
10 40 60 40 10
Ei
5. Fit a binomial distribution and obtain the expected frequencies for the following data.
x 0 1 2 3 4 5 6
6 28 56 60 36 12 2
f
Answer:
We don’t know the experiment, so that calculate the mean of given data and equate to the
mean of B.D
x 0 1 2 3 4 5 6
6 28 56 60 36 12 2
f
0 28 112 180 144 60 12
x* f
Quantitative Methods
48
Sum ( f ) = 200
Sum ( x * f ) = 536
= 536
= / 200 2.68
q = 1 − p = 1 − 0.44 = 0.56
p (= )
X x= ncx p x q n − x
C X ( 0.44 ) X ( 0.56 )
6− X
= 6
= X 0,1, 2,3, 4,5, 6
X =x P ( X = x) E(X )
= N
= P ( X x)
0 p (= )
X 0= 6C0 ( 0.45 ) ( 0.55 ) =
0 6−0
=
0.03 E ( 0) 200 ( 0.03)
= 6
1 p(= )
X 1= 6C1( 0.44 ) ( 0.56 ) =
1 6 −1
=
0.145 E (1) 200 ( 0.145 )
= 28
2 p (= )
X 2= 6C 2 ( 0.44 ) ( 0.56 ) =
2 6− 2
=
0.28 E ( 2) 200 ( 0.28 )
= 56
3 p (= )
X 3= 6C 3 ( 0.44 ) ( 0.56 ) =
3 6 −3
E ( 3)
0.3 = 200 ( 0.3)
= 60
4 p (= )
X 4= 6C 4 ( 0.44 ) ( 0.56 ) =
4 6− 4
=
0.17 E ( 4) 200 ( 0.17 )
= 34
5 p (= )
X 5= 6C 5 ( 0.44 ) ( 0.56 ) =
5 6 −5
=
0.05 E ( 5) 200 ( 0.05 )
= 10
6 p (= )
X 6= 6C 6 ( 0.44 ) ( 0.56 ) =
6 6−6
=
0.007 E ( 6) 200 ( 0.007 )
= 2
Quantitative Methods
49
Self-Assessment Questions
A). p
B). np
C). npq
D). pq
A). p
B). np
C). npq
D). pq
DID
YOU The Poisson Distribution was formulated by Siemon D Poisson as a
KNOW limiting case of Binomial Distribution.
Quantitative Methods
50
1.2.3 Poisson Distribution
The Binomial distribution will be converted as a poisson distribution under the following cases:
e−λ λ x
P (=
X = x) = ; x 0,1, 2,......., ∞
x!
=0 ; otherwise
1). The probability that an individual suffers from a bad reaction of certain drug is 0.001, deter-
mine the probability that out of 2000 individuals
(i) exactly 3
(ii) more than 2 suffers from the bad reaction.
Answer:
p = 0.001
e−λ λ x
P (=
X = x) = ; x 0,1, 2,......., ∞
x!
Quantitative Methods
51
λ
where = np
= 2000 ( 0.001
=) 2
i). The probability that exactly 3 members suffering from the bad reaction is
( x 3=
p= ) e −2 23 / 3!
= ( 0.135) 8 / 6
= 0.18
ii). The probability that more than 2 members suffering from the bad reactions
p ( x > 2=
) p (=
x 3) + p (=
x 4 ) + ………… + p (=
x 2000 )
= 1 − p ( x ≤ 2 )
=1 − p ( x =0) + p ( x =
1) + p ( x =2 )
= 1 − e −2 (1 + 2 + 2 ) p ( x ≥ 2 )
= 1 − 0.675
= 0.325
= 32.5%
2). 2% of the products produced by a machine are defective. Then what is the probability that out
of 100 products (i) 3 defectives; (ii) defectives between 3 to 5?
Answer:
that n
Given = 100,
= p 0.02
λ
= = 100 ( 0.02=
np ) 2
( x 3=
p= ) e −2 23 / 3!
Quantitative Methods
52
= ( 0.135) 8 / 6
= 0.18
ii). p ( 3 ≤ x ≤ 5 ) =p ( x =3) + p ( x =4 ) + p ( x =5 )
= e −2 23 / 3! + e −2 24 / 4! + e −2 25 / 5!
= ( 0.135) ( 2.256 )
= 0.304
= 30.4%
3). Fit a Poisson distribution for the following data and also calculate the expected frequencies.
x 0 1 2 3 4
f 109 65 22 3 1
Answer:
Since the experiment is unknown, calculate the mean of the given data and itself is the mean of
Poisson Distribution.
x 0 1 2 3 4
f 109 65 22 3 1 200
x* f 0 65 44 9 4 122
X bar
= 122
= / 200 0.61
Therefore, λ = 0.61
The PMF of PD
e−λ λ x
P (=
X = x) = ; x 0,1, 2,......., ∞
x!
e −0.61 0.61x
P ( X = x) =
= ; x 0,1, 2,......., ∞ is the required PMF of PD
x!
Quantitative Methods
53
The expected frequencies
X =x P ( X = x) E(X )
= N
= P ( X x)
0
= −.061
e= 0.610 / 0! 0.543 E
= ( 0) 200 ( 0.543
= ) 108.6
= 109
=
1
−.061
e= 0.611 /1! 0.543
= ( 0.61) 0.331 (1)
E= 200 ( 0.331
= ) 66.2
= 66
=
2
−.061
e= 0.612 / 2! 0.543
= ( 0.018) 0.10
= E ( 2) 200 ( 0.10 )
= 20
3
= −.061
e= 0.613 / 3! 0.020 E ( 3)
= 200 ( 0.020 )
= 4
4
= −.061
e= 0.614 / 4! 0.0029 E (=
4) 200 ( 0.0029
= ) 0.5
= 1
e −0.61 0.61x
P ( X = x) =
= ; x 0,1, 2,......., ∞
x!
x 0 1 2 3 4
Observed Frequencies 109 65 22 3 1
Quantitative Methods
54
Self-Assessment Questions
A). p
B). np
C). npq
D). λ
A). λ
B). np
C). npq
D). pq
Quantitative Methods
55
1.2.4 Normal Distribution
The Normal Distribution is a continuous probability distribution Developed by Gauss. Therefore,
this distribution is also known as Gaussian Distribution. For increased trails, all other distributions
tend to Normal Distribution.
1 ( x − µ )2
f ( x; µ , σ )
= exp{− }
2πσ 2σ 2
Quantitative Methods
56
1.2.4.3 Examples of Normal Distribution
1). Let X is a normal variate with mean 30 and S.D 5, then find the probabilities that
i). p ( 26 ≤ x ≤ 40 ) ii ) p ( x ≥ 45 ) .
Answer:
Given that mean, µ = 30
Standard Deviation, σ = 5 and x is normal variate
i). p ( 26 ≤ x ≤ 40 ) =p (( 26 − µ ) / σ ≤ ( x − µ) / σ ≤ ( ( 40 − µ ) / σ )
= p (( 26 − 30 ) / 5 ≤ z ≤ ( ( 40 − 30 ) / 5 )
= p ( −0.8 ≤ z ≤ 2 )
= p ( 0 ≤ z ≤ 0.8 ) + p ( 0 ≤ z ≤ 2 )
= 0.28814 + 0.47725 = 0.756
ii). p ( x ≥ 45 ) = p ( ( x − µ ) / σ ≥ ( 45 − µ ) / σ )
= p(z ≥ ( 45 − 30 ) / 5) = p ( z ≥ 3) = 0.5 − P ( 0 ≤ Z ≤ 3)
=0.5 − 0.4987 =0.0013
2). For a Normally distributed variate with mean 1 and Standard Deviation 3, then find
i) p ( 3.43 ≤ x ≤ 6.19
= ) p ( ( 3.43 − µ ) / σ ≤ ( x − µ ) / σ ≤ ( 6.19 − µ ) / σ )
= p ( ( 3.43 − 1) / 3 ≤ z ≤ ( 6.19 − 1) / 3)
= p ( 0.81 ≤ z ≤ 1.73)
= p ( 0 ≤ z ≤ 1.73) − p ( ≤ z ≤ 0.81)
= 0.45818 − 0.29130
= 0.1671
Quantitative Methods
57
Self-Assessment Questions
A) Binomial
B) Gaussian
C) Poisson
D) Geometric
A) p
B) np
C) µ
D) λ
A) σ2
B) np
C) µ
D) λ
Quantitative Methods
58
Summary
Terminal Questions
x 0 1 2 3 4
No. of times 8 34 69 43 6
5. Fit a Poisson distribution and also find out the expected frequencies.
6. If four coins are tossed 160 times and the number of times x heads occurs is given
below.
x 0 1 2 3 4
No. of times 8 34 69 43 6
7. Fit a binomial distribution and also find out the expected frequencies.
8. Fit a Binomial distribution for the following data and also calculate the expected
frequencies.
x 0 1 2 3 4
f 109 65 22 3 1
9. If four coins are tossed 160 times and the number of times x heads occurs is given
below.
x 0 1 2 3 4
No. of times 8 34 69 43 6
10. For a Normally distributed variate with mean 1 and Standard Deviation 3, then find
Quantitative Methods
59
Answer Keys
Self-Assessment Questions
Question No Answers
1 B
2 A
3 D
4 A
5 B
6 C
7 C
8 D
9 A
10 B
11 C
12 A
Quantitative Methods
60
Glossary
• Random Variable: A random variable is a real valued function, which assigns
every element in the sample space to a real number.
• Discrete and continuous random variables: These are different types of
random variables.
• Binomial Distribution: The B.D was discovered by James Bernoulli in 1700 as
an extension to Bernoulli Distribution. An experiment is said to follow the B.D
under the following condition that the experiment should be repeated for a finite
number of times (say n times).
• Poisson Distribution: The Poisson Distribution was developed by Siemon D
Poisson as a limiting case of Binomial Distribution.
• Normal Distribution: A continuous probability distribution is called Normal
Distribution.
BIBLIOGRAPHY
1. Chung, K. L. (2000). A Course in Probability Theory (3rd ed.). San Diego, CA:
Academic Press.
2. Feller, W. (1968). An introduction to Probability Theory and its applications: Vol-
ume I. John Wiley & Sons.
3. Gupta, S. C., & Kapoor, V. K. Fundamentals of Mathematical Statistics. S. Chand
Publications.
External Resources
1. Rukmangadachari, E. Probability and Statistics. Person Education.
2. Rohatgi, V. K., & Ehsanes Saleh, A. K. (2015). An introduction to probability and
statistics (3rd ed.). doi:10.1002/9781118799635
e-References
• Normal Distribution: What It Is, Properties, Uses, and Formula: https://www.investo-
pedia.com/terms/n/normaldistribution.asp
Quantitative Methods
61
Image Credits
Video Links
Topic Link
Binomial distribution https://www.youtube.com/
watch?v=WWv0RUxDfbs
An Introduction to the Poisson Distribution https://www.youtube.com/watch?v=-
jmqZG6roVqU
Normal Distribution, Clearly Explained!!! https://www.youtube.com/
watch?v=rzFX5NWojp0
Keywords
• Binomial distribution
• Poisson distribution
• Normal distribution
• Discrete random variable
• Gaussian distribution
Quantitative Methods
62
QUANTITATIVE METHODS
Module - 1
Unit - 3
INTRODUCTION TO
R PROGRAMMING
Quantitative Methods
63
Unit Table of Contents
Unit 1.3 Introduction to R Programming
Aim -------------------------------------------------------------------------------------------------------------- 65
Instructional Objectives ------------------------------------------------------------------------------------ 65
Learning Outcomes ----------------------------------------------------------------------------------------- 65
Introduction ---------------------------------------------------------------------------------------------------- 66
1.3.1 Evolution and Features of R Programming -------------------------------------------------- 66
Self-Assessment Questions --------------------------------------------------------------------- 68
1.3.2 Operators in R Programming ------------------------------------------------------------------- 69
Self-Assessment Questions --------------------------------------------------------------------- 73
1.3.3 Data Structures in R Programming ------------------------------------------------------------ 74
Self-Assessment Questions --------------------------------------------------------------------- 79
Summary ------------------------------------------------------------------------------------------------------- 80
Terminal Questions ------------------------------------------------------------------------------------------ 80
Answer Keys -------------------------------------------------------------------------------------------------- 81
Glossary -------------------------------------------------------------------------------------------------------- 82
Bibliography --------------------------------------------------------------------------------------------------- 82
External Resources ----------------------------------------------------------------------------------------- 82
e-References ------------------------------------------------------------------------------------------------- 82
Video Links ---------------------------------------------------------------------------------------------------- 83
Keywords ------------------------------------------------------------------------------------------------------ 83
Quantitative Methods
64
Aim
This unit aims to explain the basic concepts of R Programming and its applications
in Management.
Instructional Objectives
This unit intends to:
● Explain the basic concepts of R Programming
● Analyse the application of R Programming in management
Learning Outcomes
At the end of this unit, you are expected to:
● Describe the basic concepts of R Programming
● Apply the concepts of R Programming in Marketing, HRM, Finance and other
functional areas
Quantitative Methods
65
Introduction
R Programming is a concept that illustrates the necessity of utilising programming approaches to
apply statistical tools. R is a widely used statistical software and data analysis tool that is written
in an open-source programming language. R comes with a command-line interface by default. R
is available on a variety of platforms, including Windows, Linux, and Mac OS X. In addition, the
R programming language is the most up-to-date tool.
• R is an open source (freely downloaded and updatable) programming language that was
created in 1992 by Ross Ihaka and Robert Gentleman at Auckland University in New
Zealand.
• It was formerly known as the R&R Language and is now known as the R Programming
Language.
• The Comprehensive R Archive Network created R, a programming language and free
software environment for statistical computing and graphics (CRAN).
Quantitative Methods
66
R-Console is a command-line interface for the R-Console (R-Gui)
• R-Script is a programming language that allows you to create your own scripts (Input)
• R-Controller (Output)
• R-Script is a programming language that allows you to create your own scripts (Input)
• R-Controller (Output)
• R-Environment is an acronym for “Responsible Environment” (Saving Variables)
• R-Files & Folders (R-Files & Folders) (R-Files & Folders (Packages)
1.3.1.3 Variables
• E.g.: x = 15
Quantitative Methods
67
Self-Assessment Questions
A) Yes
B) No
C) May be
D) Can’t say
A) 2
B) 3
C) 4
D) 1
A) 5
B) 3
C) 2
D) 1
A) CLASS
B) CALM
C) COMB
D) CRAN
Quantitative Methods
68
1.3.2 Operators in R Programming
An operator is a symbol that tells the compiler to perform specific mathematical or logical manip-
ulations. R language is rich in built-in operators and provides following types of operators:
Assignment Operator
Arithmetic Operator
Combining Operator
Sequence or Colon Operator
Relational Operator
Logical Operator
+ - Addition
– - Subtraction
* - Multiplication
/ - Division
^ - Exponent
%% - Modulus (Remainder from division)
% / % - Integer Division
Quantitative Methods
69
E.g.: > x < − 5
> y < − 16
> x + y [1] 21
> x − y [1] − 11
> x * y [1] 80
> y % / % x [1] 3
> y %% x [1] 1
> y x [1]1048576
: - Sequence Operator
E.g.: ü
Output: 1 2 3 4 5
Quantitative Methods
70
> x > y [1] FALSE
>y =
=16 [1] TRUE
5 [1] FALSE
> x !=
! - Logical NOT
& - Element-wise logical AND
&& - Logical AND
| - Element-wise logical OR
|| - Logical OR
• Note: Operators & and | perform element-wise operation producing result
having length of the longer operand.
But && and || examines only the first element of the operands resulting into a
single length logical vector.
Zero is considered FALSE and non-zero numbers are taken as TRUE. An exam-
ple follows.
Everything in R is an object.
Quantitative Methods
71
R has 5 basic data types or Objects
• Character
" a " , " swc "
• Numeric (real or decimal)
2 , 15.5
• Integer
2L (the L tells R to store this as an integer)
• Logical
TRUE , FALSE
• Complex
1 + 4i (Complex numbers with real and imaginary parts)
Note: there are three functions in R to check the data type- typeof ( ) , class ( ) , mode ( )
R provides many functions to examine features of vectors and other objects, for example
class ( ) - what kind of object is it (high-level)?
x < − " ramana "
> class ( x )
> typeof ( y )
[1] 1 2 3 4 5
> length ( y )
[1] 5
Quantitative Methods
72
Self-Assessment Questions
A) Percentage
B) Modulus
C) Addition
D) Subtraction
A) 2
B) 3
C) 4
D) 6
A) 5
B) 3
C) 2
D) 5
A) f
B) a
C) c
D) d
Quantitative Methods
73
1.3.3 Data Structures in R Programming
R has many data structures. These include:
Vector
Matrix
Array
List
Data frame
Factors
1.3.3.1 Vector
length ( x ) 5
Length ( y ) 3
x [ 4] 4
1.3.3.2 Matrix
[,1] [, 2] [,3]
[1,] 1 4 7
[ 2,] 2 5 8
[3,] 3 6 9
Quantitative Methods
74
> m < −matrix ( a, nrow
= 3,= = T)
ncol 3, byrow
> m
[,1] [, 2] [,3]
[1,] 1 2 3
[ 2,] 4 5 6
[3,] 7 8 9
1.3.3.3 List
[,1] [, 2]
[1,] 1 4
[ 2,] 5 6
> list ( a, f , m )
[1]
[1] 1 2 3 4 5 6 7 8 9
[ 2]
[1] 1 2
[3]
[,1] [, 2]
[1,] 1 4
[ 2,] 5 6
Quantitative Methods
75
1.3.3.4 Array
[1] 1 2 3 4 5 6
> ar < −array (ab, dim =c ( 2,3,1) , dimnames = list ( c ( ' m ', ' f ') , c ( ' a ', ' b ', ' c ') ))
> ar
abc
m 1 3 5
f 246
Output: > df
Quantitative Methods
76
Note: The function cbind( ) is used to add a column and rbind( ) is used to add a row to
the existing data frame.
1 Raja 12 Vij
2 Ramu 12 Nell
3 Krish 14 Ong
4 Hema 11 Hyd
5 Ravi 15 Kdp
> df
S .no name marks region
1 Raja 12 Vij
2 Ramu 12 Nell
3 Krish 14 Ong
4 Hema 11 Hyd
5 Ravi 15 Kdp
6 Seeta 15 Gun
7 Geeta 14 Gun
Quantitative Methods
77
1.3.3.6 Factors
[1] 1 1 1 1 2 2 2 2 2 1 1
> gd < −as. factor ( gd )
> gd
[1] 1 1 1 1 2 2 2 2 2 1 1
Levels : 1 2
Quantitative Methods
78
Self-Assessment Questions
A) Array
B) Modulus
C) Factor
D) Subtraction
A) 2
B) 3
C) 4
D) 6
A) List
B) Factor
C) Array
D) Matrix
A) List
B) Factor
C) Array
D) Data frame
Quantitative Methods
79
Summary
Terminal Questions
Quantitative Methods
80
Answer Keys
Self-Assessment Questions
Question No Answers
1 A
2 C
3 C
4 D
5 B
6 D
7 D
8 C
9 C
10 D
11 A
12 D
Quantitative Methods
81
Glossary
• Data frame: Is a data structure that organises data into a 2-dimensional table
of rows and columns, much like a spreadsheet.
• List: An ordered data structure with elements separated by a comma and en-
closed within square brackets.
• Array: A data structure consisting of a collection of elements (values or vari-
ables), of same memory size, each identified by at least one array index or key.
• Modulus: A modulus function is a function which gives the absolute value of a
number or variable.
BIBLIOGRAPHY
1. Chung, K. L. (2000). A Course in Probability Theory (3rd ed.). San Diego, CA:
Academic Press.
2. Feller, W. (1968). An introduction to Probability Theory and its applications:
Volume I. John Wiley & Sons.
3. Gupta, S. C., & Kapoor, V. K. Fundamentals of Mathematical Statistics. S. Chand
Publications.
External Resources
1. Rukmangadachari, E. Probability and Statistics. Person Education.
2. Rohatgi, V. K., & Ehsanes Saleh, A. K. (2015). An introduction to probability and
statistics (3rd ed.). doi:10.1002/9781118799635
e-References
• Operators in R programming: https://intellipaat.com/blog/tutorial/r-programming/
operators/
Quantitative Methods
82
Video Links
Topic Link
R Programming Tutorial - Learn the Basics https://www.youtube.com/watch?v=_
of Statistical Computing V8eKsto3Ug
R Programming for Beginners https://www.youtube.com/
watch?v=BvKETZ6kr9Q
Introduction to R Programming for Excel https://www.youtube.com/
Users watch?v=Ekp2mfxQSzw
Keywords
• CRAN
• Console
• Studio
• Big data
• Matrix
Quantitative Methods
83
QUANTITATIVE METHODS
Module - 2
OVERVIEW OF SAMPLING
Quantitative Methods
84
Module Description
The method of sampling depends on the type of analysis being performed, although it may
include simple random sampling or systematic sampling. Sampling is a process in statistical
analysis when researchers take a specific number of observations from a larger population.
In most biomedical studies, researchers form hypotheses on the connections between different
variables, gather data to verify those connections, and then attempt to infer connections from
the data gathered. Investigators frequently compare the average amount of a factor between two
groups or between one group and a standard reference to examine relationships.
A method for determining how well one can extrapolate observed results in a study sample to
the larger population from which the sample was drawn, hypothesis testing is a process used
to assess the strength of the evidence from the sample and provides a framework for making
decisions related to the population.
These presumptions can scarcely be met, nevertheless. Non-Parametric Tests are either
distribution-free or have a given distribution but do not specify the distribution’s parameters. They
also have significantly more loose assumptions.
Quantitative Methods
85
QUANTITATIVE METHODS
Module - 2
Unit - 1
INTRODUCTION TO SAMPLING
Quantitative Methods
86
Unit Table of Contents
Unit 2.1 Concept of Probability
Aim -------------------------------------------------------------------------------------------------------------- 88
Instructional Objectives ------------------------------------------------------------------------------------ 88
Learning Outcomes ----------------------------------------------------------------------------------------- 88
Introduction ---------------------------------------------------------------------------------------------------- 89
2.1.1 Basic Concept of Sampling ---------------------------------------------------------------------- 89
Self-Assessment Questions --------------------------------------------------------------------- 91
2.1.2 Types of Sampling --------------------------------------------------------------------------------- 92
Self-Assessment Questions --------------------------------------------------------------------- 95
2.1.3 Sampling distributions ---------------------------------------------------------------------------- 96
Self-Assessment Questions --------------------------------------------------------------------- 97
2.1.4 Application of Central Limit Theorem --------------------------------------------------------- 98
Self-Assessment Questions --------------------------------------------------------------------- 99
Summary ------------------------------------------------------------------------------------------------------- 100
Terminal Questions ------------------------------------------------------------------------------------------ 100
Answer Keys -------------------------------------------------------------------------------------------------- 101
Glossary -------------------------------------------------------------------------------------------------------- 102
Bibliography --------------------------------------------------------------------------------------------------- 102
External Resources ----------------------------------------------------------------------------------------- 102
e-References ------------------------------------------------------------------------------------------------- 102
Video Links ---------------------------------------------------------------------------------------------------- 103
Keywords ------------------------------------------------------------------------------------------------------ 103
Quantitative Methods
87
Aim
This unit aims to explain the basic concepts of probability sampling and its
applications in Management.
Instructional Objectives
This unit intends to:
● Explain the concepts of Sampling and sampling procedures
● Apply the sampling methods in various applications of management
Learning Outcomes
At the end of this unit, you are expected to:
● Utilise the concepts of Sampling in various management applications
● Compare and contrast different sampling methods to select sample from the
population
Quantitative Methods
88
Introduction
The process of selecting a group of people from a population to research and characterise
the population is known as sampling. All members of a given group, as well as all conceivable
outcomes or measurements, are included in the population. The specific population will be
determined by the study’s scope.
When conducting research on a group of people, it’s uncommon that you’ll be able to collect data
from every single one of them. Rather, you choose a sample. The sample is the group of people
that will take part in the study. You must carefully consider how you will select a sample that is
representative of the entire group to make accurate conclusions from your findings.
Population:
A population is the total number of animates or in-animates included in a study. The number of
units in a population is referred to as Population Size, and it is represented by the letter N.
Sample:
Sample is a term that refers to any finite subset of a population. Sample Size is the number of
units in a sample and is indicated by n.
E.g.: A sample is, for example, the number of students in a college class.
Parameter:
Quantitative Methods
89
For example, Population Mean (), Sample Variance (2) and Population Proportion (P) are all
examples of population statistics.
Statistic:
For instance, Sample Mean (), Sample Variance (s2), Population Proportion (p), and so on.
Sampling:
Sampling is the process of picking a representative sample from a larger population. That is, the
sample should include all of the population’s characteristics.
Quantitative Methods
90
Self-Assessment Questions
A) Population
B) Sample
C) Parameter
D) Statistic
A) Population
B) Sample
C) Parameter
D) Statistic
A) Population
B) Sample
C) Parameter
D) Statistic
A) Population
B) Sample
C) Parameter
D) Statistic
Quantitative Methods
91
2.1.2 Types of Sampling
Different types of sampling samples are used depending on the population. There are two types
of approaches for picking a representative sample from a population:
The method in which each population unit have the same chance to be appear in the Sample,
which can be classified as:
The method in which each population unit not have the same chance to be appear in the Sample,
which can be classified as:
● Purposive Sampling
● Judgement Sampling
● Convenience sampling
● Quota Sampling
● Panel Sampling
● Snowball Sampling
The sample units are chosen at random in random sampling. Following the definition of the
‘parent population,’ each item in that population has an equal chance of being included in any
sample. This method necessitates extreme caution to guarantee that samples are chosen at
random. A completely random option may not always be possible. However, the investigator
should strive to achieve as close to the ideal of random selection as possible.
A stratified sampling strategy can be used when the population is heterogeneous in terms of the
variables under investigation and can be separated into reasonably homogeneous groups and
subgroups. When there are large groups of known size within the ‘parent population,’ this sort of
sampling is used to ensure that each subgroup is fairly represented within the entire sample. For
example, imagine a village has a population of 10,000 people and is separated into ten groups
Quantitative Methods
92
based on economic characteristics. A random sample representing the income of each subgroup
will be collected.
The fundamental benefit of stratified sampling is that it is simple to administer, and each stratum
is represented in the sample (which may not be the case with random and purposive selection),
allowing for distinct estimates for stratum means if needed. In agricultural, industrial, and applied
geographical research, stratified random sampling is commonly utilised.
Systematic Sampling
Instead of selecting everyone individually, this method uses a regular pattern of selection. This
technique is also referred to as quasi-random. For example, if a crop combination study is to be
conducted in 2,000 villages in an aerial unit and 20 sample villages are to be chosen, the villages
should be numbered from 1 to 2,000.
After serially arranging the villages, the hundredth village on the list is selected. The sample
villages that are required will be reached fast. When utilised correctly, systematic sampling can
be more convenient and effective than genuine random sampling. This method, while useful for
quick and effective sampling, has the drawback of subjectivity because not every village in the
area has an equal probability of being included in the sample.
Cluster sampling
Cluster sampling is a probability sampling approach that divides the population into different
groups (clusters) for research purposes. For data collection and analysis, researchers use a
basic random or systematic random sampling technique to pick random groups.
Area Sampling
When there isn’t a complete frame of reference available, area sampling is used. The entire area
under inquiry is divided into small sub-areas that are sampled at random or according to a set of
rules (stratification of sampling).
Multistage sampling
Multistage sampling is a sample strategy for doing research that splits the population into groups
(or clusters). Significant clusters of the selected people are broken into sub-groups at various
points during this sampling process to make primary data collecting easier.
Purposive sampling
Judgment sampling
Quantitative Methods
93
Convenience sampling
Quota sampling
Panel sampling
Panel sampling is selecting a set of people at random to be part of a panel that participates in a
study multiple times over a period of time. In a longitudinal survey, for example, the same group
of people may be polled periodically throughout time.
Snowball sampling
Snowball sampling (also known as chain sampling, chain-referral sampling, or referral sampling)
is a non-probability sampling approach used in sociology and statistics research, in which
current study subjects recruit prospective study subjects from among their friends. As a result,
the sample group is described as growing like a snowball.
Quantitative Methods
94
Self-Assessment Questions
A) Population
B) Random
C) Parameter
D) Statistic
A) 3
B) 4
C) 2
D) 5
7. Relatively homogeneous groups and subgroups are the base for ____ Sampling.
A) Sample random
B) Quota
C) Space
D) Stratified
A) Random
B) Non-random
C) Space
D) Event
Quantitative Methods
95
2.1.3 Sampling distributions
A sampling distribution is a probability distribution of a statistic derived from a large number
of samples gathered from a particular population. The sampling distribution of a population is
the frequency distribution of a range of alternative outcomes that could occur for a population
statistic.
Academicians, statisticians, researchers, marketers, analysts, and others draw and employ a lot
of samples, not populations. A subset of a population is referred to as a sample. For example,
a medical researcher who wanted to compare the average weight of all babies born in North
America from 1995 to 2005 to those born in South America during the same time period couldn’t
draw the data for the entire population of over a million childbirths over the ten-year period in a
reasonable amount of time. Instead, he’ll base his judgement on the weight of, say, 100 babies
on each continent. The sample consists of 200 new-borns, with the average weight determined
as the sample mean.
What most laypeople call an ‘average’ and statisticians call the arithmetic mean is the most
popular and widely used metric of describing the total data by one value. Its value is calculated by
adding all of the things together and dividing the sum by the number of items. Simple arithmetic
mean or Weighted arithmetic mean are two types of arithmetic mean.
It should be emphasised that statisticians dislike the term “average” because it connotes a too
ambiguous meaning. It has a variety of connotations. For instance, an average person, average
pay, average height, and so on. It can refer to any average, including mean, median, mode,
geometric mean, harmonic mean, and so on. In fact, the arithmetic mean is so widely used that
the term ‘mean or average’ is often used without qualifier to refer to this sort of average. That is,
unless otherwise mentioned, it is reasonable to presume that when someone says “the mean” or
“the average” of a set of observations, they are referring to the arithmetic mean.
Quantitative Methods
96
Self-Assessment Questions
A) Population
B) Random
C) Parameter
D) Statistic
A) North America
B) India
C) UK
D) Germany
A) Population
B) Random
C) Average
D) Statistic
A) 3
B) 2
C) 4
D) 5
Quantitative Methods
97
2.1.4 Application of Central Limit Theorem
The central limit theorem asserts that if you collect sufficiently enough random samples with
replacement from a population with a mean and standard deviation, the distribution of the
sample means will be nearly normally distributed. This holds true whether the source population
is normal or skewed, as long as the sample size is large enough (often 30). The theory is
valid even for samples less than 30 if the population is normal. This is true even if the population
is binomial, as long as min (np, n(1,p))>5 where n is the sample size and p is the population’s
probability of success. This means that when generating conclusions about a population mean
based on the sample mean, we can utilise the normal probability model to measure uncertainty.
Consequential research necessitates a grasp of the statistics that underlie the various sample
size decisions that must be made. A simple equation will allow you to set the migraine tablets
down and confidently sample, knowing that your survey will be statistically accurate with the
proper sample size.
Before you can calculate a sample size, you must first figure out a few details about the target
demographic and the sample size you require:
1. Size of the Population – How many people fall into your demography as a whole? If you
want to know about moms in the United States, for example, your population size would be
the total number of mothers in the United States. Not every population needs to be this big.
Even if your population is small, you should be aware of who falls within your demographics.
If you’re unsure about the actual amount, don’t worry. The population is frequently unknown
or approximated using two educated guesses.
2. Margin of Error (Confidence Interval) – Because no sample is perfect, you must choose
how much error to allow. The confidence interval specifies how much higher or lower you
are ready to allow your sample mean to deviate from the population mean. You’ve probably
seen a confidence interval in a political survey on the news. It will look something like this, for
example: “With a margin of error of +/- 5%, 68 percent of voters responded yes to Proposition
Z.”
3. Level of Confidence — How confident do you want to be that the actual mean falls inside
your confidence interval? 90 percent confident, 95 percent confident, and 99 percent
confident are the most common confidence intervals.
Quantitative Methods
98
Self-Assessment Questions
A) .
B)
C)
D)
A) 30
B) 25
C) 42
D) 51
A) 2
B) 3
C) 4
D) 1
A) 5%
B) 10%
C) 1%
D) 20%
A) Error
B) Statistic
C) Parameter
D) Population
Quantitative Methods
99
Summary
• The unit introduction to sampling helps in understanding the basic concepts of sam-
pling such as population, sample, parameter and statistic.
• Various sampling techniques such as simple random, stratified, systematic, cluster
sampling, etc., will provide the representative samples from the population.
• Along with random sampling techniques, the non-random sampling methods also
provide representative samples from the population.
Terminal Questions
Quantitative Methods
100
Answer Keys
Self-Assessment Questions
Question No Answers
1 A
2 B
3 C
4 D
5 B
6 C
7 D
8 B
9 D
10 A
11 C
12 B
13 D
14 A
15 C
16 A
17 D
Quantitative Methods
101
Glossary
• Sampling distribution: A probability distribution of a statistic obtained from a
larger number of samples drawn from a specific population.
• Snowball sampling: A recruitment technique in which research participants
are asked to assist researchers in identifying other potential subjects.
• Quota sampling: A non-probability sampling method that relies on the non-ran-
dom selection of a predetermined number or proportion of units.
• Cluster sampling: A probability sampling technique where researchers divide
the population into multiple groups (clusters) for research.
BIBLIOGRAPHY
1. Chung, K. L. (2000). A Course in Probability Theory (3rd ed.). San Diego, CA:
Academic Press.
2. Feller, W. (1968). An introduction to Probability Theory and its applications:
Volume I. John Wiley & Sons.
3. Gupta, S. C., & Kapoor, V. K. Fundamentals of Mathematical Statistics. S. Chand
Publications.
External Resources
1. Rukmangadachari, E. Probability and Statistics. Person Education.
2. Rohatgi, V. K., & Ehsanes Saleh, A. K. (2015). An introduction to probability and
statistics (3rd ed.). doi:10.1002/9781118799635
e-References
• Sample Distribution: Definition, How It’s Used, With an Example:
https://www.investopedia.com/terms/s/sampling-distribution.asp
• Types of Sampling:
https://www.qualitygurus.com/types-of-sampling/
Quantitative Methods
102
Video Links
Topic Link
https://www.youtube.com/watch?v=l-
Sampling Distribution-I
9rfMOZXk0Y
Introduction to sampling distributions | https://www.youtube.com/
Sampling distributions watch?v=z0Ry_3_qhDw
https://www.youtube.com/
Sampling distribution example problem
watch?v=0ZstEh_8bYc
Keywords
• Population
• Sample
• Parameter
• Stratified sampling
Quantitative Methods
103
QUANTITATIVE METHODS
Module - 2
Unit - 2
Quantitative Methods
104
Unit Table of Contents
Unit 2.2 Estimation and Testing of Hypothesis
Quantitative Methods
105
Aim
This unit aims to explain the basic concepts of Estimation, Testing of Hypothesis,
and its applications in Management.
Instructional Objectives
This unit intends to:
● Explain the concepts of estimation and methods of estimation
● Discuss the concepts of testing of hypothesis and its process
● Apply estimation, testing of hypothesis in various applications of management
Learning Outcomes
At the end of this unit, you are expected to:
● Demonstrate the concepts of estimation and its methods
● Examine the concepts of hypothesis testing
● Utilise the concepts of estimation, testing of hypothesis in Marketing, HRM,
Finance, etc.
Quantitative Methods
106
Introduction
Theory of estimation and testing of hypothesis are the statistical tools to analyse various
population parameters with their sample statistics. Both are two different procedures to analyse
the population. Theory of estimation and testing of hypothesis both together called Statistical
Inference.
Estimation theory is a branch of statistics that deals with estimating the values of parameters
based on measured empirical data that has a random component. The parameters describe an
underlying physical setting in such a way that their value affects the distribution of the measured
data.
Objective Function
θ
Z Estimator θ0
Known
Model /
Information set Convergence
Constraints
region
Point estimation
Point Estimation is the process of estimating any population parameter with its sample statistic.
In point estimation, any sample statistic is considered as its point estimator. The sample mean
is the point estimator for the population mean, the sample variance is the point estimator for the
Quantitative Methods
107
population variance etc. The sample mean ( xbar ) is the point estimator for the popula-
( )
tion mean ( µ ) , the sample variance s is the point estimator for the population variance
2
α = 5% or 1% or 10% , etc.
Interval estimation
Point estimation in which the point estimator may not coincide with the true value of the popu-
lation parameter in general. Hence, it is suggested to estimate any population parameter in an
interval. The process of estimating any population parameter in an interval is called Interval
estimation.
The Formula to generate the confidence interval of a population parameter is “point estimate
± ((critical value) (standard error))”
=
E.g.: Confidence interval for population mean µ xbar ± zα σ / sqrt ( n )
Quantitative Methods
108
Self-Assessment Questions
A). Hypothesis
B). Estimation
C). Parameter
D). Statistic
A). 5
B). 4
C). 3
D). 2
A). Population
B). Sample
C). Point
D). Interval
A). Population
B). Sample
C). Point
D). Interval
Quantitative Methods
109
2.2.2 Introduction to Hypothesis
The concept of testing of Hypothesis is used to analyse the population parameter by using the
sample statistic. In hypothesis testing, an assumption on population parameter is taken and
tested.
Hypothesis Testing
HO Rejected
HO Accepted
Rejected Region
Hypothesis:
Types of Hypothesis:
Null Hypothesis:
E.g.:
H 0 : µ = 15
H 0 : σ 2 = 20
H 0 : P = 0.5
Quantitative Methods
110
Alternative Hypothesis:
The Hypothesis against to the Null Hypothesis is called Alternative Hypothesis and is denoted
by H1 .
E.g.: H1 : µ > 15
H1 : σ 2 < 20
H1 : P0.5
Statements of Hypothesis:
Among the above statements, 1 and 2 are True Statements, whereas 3 and 4 are Error State-
ments.
Type I Error:
The statement of “Rejecting The Null Hypothesis H 0 , when H 0 is True” is called Type I Error.
Type II Error:
The statement of “Accepting The Null Hypothesis H 0 , when H 0 is False” is called Type II Error.
Level of Significance:
The probability of committing Type I Error is called Level of Significance and is denoted by α .
i.e., P (Type I Error ) P
= ( Rejecting The Null Hypothesis H 0 , when H 0is True ) α .
Power of the Test:
The probability of committing Type II Error is called β and (1 − β ) is called Power of the Test. i.e.,
1 − P (Type II Error ) =
1 − P ( Accepting The Null Hypothesis H 0 , when H 0is False ) =
1− β
Critical Region:
The region of rejection of Null Hypothesis H 0 is called Critical Region. i.e., the region formed by
the sample points where Null Hypothesis H 0 is rejected is called Critical Region and is denoted
by W .
Quantitative Methods
111
Acceptance Region:
The region of Acceptance of Null Hypothesis H 0 is called Acceptance Region. i.e., the region
formed by the sample points where Null Hypothesis H 0 is Accepted is called Acceptance Region
and is denoted by W .
Two-tailed Test:
If the alternative Hypothesis H1 : θθ 0 , then such a test is called Two-tailed test. For a Two-tailed
test, the critical Value of Zα is focus on both the tails of Normal Probability Curve.
Left-tailed Test:
If the alternative Hypothesis H1 : θ < θ 0 , then such a test is called Left-tailed test. For a Left-
tailed test, the critical Value of Zα is focus on the left tail of Normal Probability Curve.
Right-tailed Test:
If the alternative Hypothesis H1 : θ > θ 0 , then such a test is called Right-tailed test. For a
Right-tailed test, the critical Value of Zα is focus on the right tail of Normal Probability Curve.
Testing of Hypothesis can be classified into Large Sample testing of Hypothesis ( n > 30 ) and
Small Sample Testing of Hypothesis ( n ≤ 30 ) . Can be represented by the following flow chart
Hypothesis
Quantitative Methods
112
Procedure for Testing of Hypothesis
H1 : There is a significant difference between the Population Parameter θ and Sample statistic t.
i.e., H1 : θ ≠ θ 0
Quantitative Methods
113
Self-Assessment Questions
5. Assumption on the population parameter known as_____.
A). Hypothesis
B). Estimation
C). Parameter
D). Statistic
A). 5
B). 4
C). 3
D). 2
A). H1
B). H1
C). H 2
x−y
z=
D). σ 12 σ 22
+
n1 n2
A). 4
B). 5
C). 3
D). 2
Quantitative Methods
114
2.2.3 One sample and two sample tests for means and proportions of large
samples (z-test)
The Z-Test is used to test the hypothesis on population parameters using their sample statis-
tics under large sample case. i.e., n > 30 . Various such tests are
2.2.3.1 Large Sample test for Population Mean or Z-test for Population Mean
H 0 : µ = µ0
H1 : µ ≠ µ0
Example
A sample of 64 students have a mean weight of 70 kgs and with a standard deviation of 25 kgs.
Then test whether the populations mean weight is 56 kgs or not test at 5% level of significance.
Solution
Quantitative Methods
115
Choose an appropriate level of significance α = 5%
Now,=z 4.48
= 4.48
Since 4.48 > 1.96, Reject the Null Hypothesis H 0 . That is accept the alternative hypothesis H1
2.2.3.2 Large Sample test for Population Proportion or Z-test for Population
Proportion
H 0 : P = P0
Quantitative Methods
116
Example
In a sample of 1000 people in Maharashtra, 540 are found to be wheat eaters and rest are rice
eaters. Then test whether both rice and wheat eaters are equally proportional in the state of
Maharashtra, test at 5% level of significance.
Solution
Given that
=
Sample Size, n 1000 ( > 30 )
=
Sample Proportion, p 540
= /1000 0.54
Then the test procedure for population mean consists of the following steps
Null Hypothesis, H 0 : Both wheat and rice eaters are equally proportional in the state of
Maharashtra.
i.e., H 0 : P = 0.5
Alternative Hypothesis H1 : Both wheat and rice eaters are not equally proportional in the state
of Maharashtra.
i.e., H 0 : P ≠ 0.5
p−P
z=
PQ
n
0.54 − 0.5
z=
(0.5)(0.5)
1000
z = 2.53
=
Now, z 2.53
= 2.53
zα at 5% Level of Significance is 1.96
Since 2.53 > 1.96, Reject the Null Hypothesis H 0 . That is Accept the Alternative Hypothesis H1 .
Hence, both wheat and rice eaters are not equally proportional in the state of Maharashtra.
Quantitative Methods
117
2.2.3.3 Large Sample test for Equality of Two Population Means or Z-test for
Equality of Two Population Means
H1 : µ1 ≠ µ2
If Z > Zα , Reject the Null Hypothesis H 0 . i.e., accept the Alternative Hypothesis H1 .
Example
Two horses A and B are tested according to the different time periods to run on a particular track
with the following results
Horse A 28 30 32 33 33 29 34
Horse B 29 30 30 24 27 29
Test whether the average running capacities of both horses are same or not
Solution
Given that
=
Sample Size, n1 7,=
n2 6
Null Hypothesis, H 0 : The average running capacities of both horses are the same.
i.e., H 0 : µ 1 = µ 2
Quantitative Methods
118
Alternative Hypothesis H1: The average running capacities of both horses are not the same
i.e., H 0 : µ 1 ≠µ 2
=
Now, t 2.45
= 2.45
Since 2.45 > 2.20, Reject the Null Hypothesis H 0 . That is Accept the Alternative Hypothesis H1 .
Hence, the average running capacities of both horses are not the same
2.2.3.4 Large Sample test for Equality of Two Population Proportions or Z-test for
Equality of Two Population Proportions
x1 + x 2
where P = and Q = 1− P
n1 + n2
Step V: If Z ≤ Zα , Accept the Null Hypothesis H 0
If Z > Zα , Reject the Null Hypothesis H 0 . i.e., accept the Alternative Hypothesis H1
Quantitative Methods
119
Example
A sample of 600 students at a certain college, 400 are found to boys and 200 are girls. In another
college, in a sample of 900 students, 450 are boys and 450 are girls. Then test whether there is
any significant difference among the proportion of boys are same in both the colleges test at 5%
level of significance.
Answer
=
Then the sample proportions 1 / n1
p1 x= 400 / =
600 0.66
= 2 / n2
P2 x= 200 /=
400 0.5
H 0 : P1 = P 2
H1 : The proportions of boys are not the same in both the colleges
H1 : P1 P 2
Z = 6.11
Zα = 1.96
Hence, the proportions of boys are not same in both the colleges.
Quantitative Methods
120
Self-Assessment Questions
10. ____ test is used to test the large sample test for means.
A). Z
B). t
C). F
D). Q
A). Mean
B). Ratio
C). Variance
D). S.D
A). σ
B). µ
C). Σ
D). χ
A). σ 2
B). µ
C). P
D). χ 2
Quantitative Methods
121
2.2.4 One sample and two sample tests for means of small samples
(t-lest)
The t -Test is used to test the hypothesis on population parameters using their sample statis-
tics under small sample case. i.e., n ≤ 30 . Various such tests are:
t – Table values
Quantitative Methods
122
2.2.4.1 Small Sample test for Population Mean or t-test for Population Mean
If t > tα , Reject the Null Hypothesis H 0 . i.e., accept the Alternative Hypothesis
Problem 1:
A random sample of 10 students had the I .Q’s 70, 120, 110, 101, 88, 83, 95, 98, 107 and 100.
Then test whether the average I .Q levels of students is 100 or not. Test at 5% level of signifi-
cance.
Solution
Given that
n 10 ( < 30 )
Sample Size,=
1
Sample Mean, x =
n
∑ xi
1
= ( 70 + 120 + ........ + 107=) 97.2
10
Quantitative Methods
123
1
=s2
Standard Deviation, n −1
∑ ( xi − x ) 2
s 2 = 203.73
=s 203.73 14.27
=
Then the test procedure for Population Mean consists of the following steps
i.e., H 0 : µ = 100
Alternative Hypothesis H1: The Population Mean I.Q is not100.
i.e., H1 : µ ≠ 100
Choose an appropriate level of Significance α = 5%
2.2.4.2 Small Sample test for Equality of Two Population Means or t-test for Equality of
Two Population Means
Quantitative Methods
124
Step III: Choose an appropriate level of Significance α = 5% or 1% or 10%
1 1
=s1
Where n −1
∑ 2
( xi − x )= s2
n −1
∑ ( yi − y ) 2
If t > tα , Reject the Null Hypothesis H 0 . i.e., accept the Alternative Hypothesis H1
.
Quantitative Methods
125
Self-Assessment Questions
14. ____ test is used to test the small sample test for means.
A). Z
B). t
C). F
D). Q
A). σ
B). µ
C). Σ
D). s
A). n = 30
B). n ≠ 30
C). n > 30
D). n ≤ 30
Quantitative Methods
126
2.2.5 F-test
The F-test is used to test the equality of variances under small sample case.
H 0 : σ 12 = σ 2 2
H1 : σ 12 ≠ σ 2 2
Step III: Choose an appropriate level of Significance α = 5% or 1% or 10%
s12
F= ˜F
s 22
1 1
Where s1
=
n −1
∑ 2
( xi − x )= s2
n −1
∑ ( yi − y ) 2
If F > Fα , Reject the Null Hypothesis H 0 . i.e., accept the Alternative Hypothesis H1
.
F Table Values
0 fa
Quantitative Methods
127
Quantitative Methods
128
Self-Assessment Questions
17. ____ test is used to test the variances.
A). Z
B). t
C). F
D). Q
A). σ 2
B). µ
C). Σ
D). χ2
Quantitative Methods
129
Summary
Terminal Questions
Quantitative Methods
130
Answer Keys
Self-Assessment Questions
Question No Answers
1 B
2 D
3 C
4 D
5 A
6 D
7 A
8 B
9 C
10 A
11 B
12 B
13 C
14 B
15 D
16 D
17 C
18 A
Quantitative Methods
131
Glossary
• Mean: Mean is the average of the given numbers and is calculated by dividing
the sum of given numbers by the total number of numbers.
• Variance: The mean squared difference between each data point and the cen-
tre of the distribution measured by the mean.
• Proportion: Is a mathematical comparison between two numbers
• Z-test: A statistical test to determine whether two population means are differ-
ent when the variances are known and the sample size is large.
BIBLIOGRAPHY
1. Chung, K. L. (2000). A Course in Probability Theory (3rd ed.). San Diego, CA:
Academic Press.
2. Feller, W. (1968). An introduction to Probability Theory and its applications:
Volume I. John Wiley & Sons.
3. Gupta, S. C., & Kapoor, V. K. Fundamentals of Mathematical Statistics. S. Chand
Publications.
External Resources
1. Rukmangadachari, E. Probability and Statistics. Person Education.
2. Rohatgi, V. K., & Ehsanes Saleh, A. K. (2015). An introduction to probability and
statistics (3rd ed.). doi:10.1002/9781118799635
e-References
• The F-Distribution: http://surl.li/gngiy
Quantitative Methods
132
Image Credits
Video Links
Topic Link
https://www.youtube.com/
Introduction To Statistical Inference
watch?v=n3p4PK8kwOU
Introduction Hypothesis Testing in https://www.youtube.com/
Statistics watch?v=VK-rnA3-41c
https://www.youtube.com/
Z- test
watch?v=bB-J6_wcGgE
https://www.youtube.com/
T- test introduction
watch?v=fKZA5waOJ0U
https://www.youtube.com/watch?v=-
F- test
FlIiYdHHpwU
Keywords
● Estimation
● Hypothesis
● Level of Significance
● T-test
● F-test
Quantitative Methods
133
QUANTITATIVE METHODS
Module - 2
Unit - 3
Quantitative Methods
134
Unit Table of Contents
Unit 2.3 ANOVA & Non-Parametric Testing of Hypothesis
Quantitative Methods
135
Aim
This unit aims to explain the basic concepts of Analysis of Variance
(ANOVA), Non-parametric testing of hypothesis and its applications in
Management.
Instructional Objectives
This unit intends to:
• Explain the concepts of Analysis of Variance (ANOVA) and its methods
• Discuss the concepts of non-parametric testing of hypothesis
• Describe ANOVA, Non-parametric testing of hypothesis in management
applications
Learning Outcomes
Quantitative Methods
136
INTRODUCTION
Analysis of variance (ANOVA) is a statistical technique that is used to check if the means of two
or more groups are significantly different from each other. ANOVA checks the impact of one or
more factors by comparing the means of different samples. ANOVA is a statistical method that
analysis variances to determine if the means from more than two populations are the same. In
other words, we have a quantitative response variable and a categorical explanatory variable
with more than two levels. In ANOVA, the categorical explanatory is typically referred to as the
factor.
Non-parametric tests are used if the assumptions for the parametric tests are not met and are
commonly called distribution free tests. The advantage of non-parametric tests is that we do not
assume that the data come from any distribution.
Since non-parametric tests do not estimate population parameters, in general, there are
● no estimates of variance/variability
● no confidence intervals
● fewer measures of effect size
Also, non-parametric tests are generally not as powerful as parametric alternatives when the
assumptions of the parametric tests are met.
Assumptions of ANOVA
● Randomisation
● Replication
● Local Control
ANOVA is classified as
● ANOVA One-way Classification
● ANOVA Two-way Classification
Quantitative Methods
137
2.3.1.1 ANOVA ONE-WAY CLASSIFICATION
Testing the homogeneity of more than two populations means with respect to one Classification
(Treatments). The procedure for ANOVA One way classification consists of the following steps:
i.e. Tk 2 Ho : µ1 = µ 2 = ……… = µ k
th
in i row Error Sum of Squares (S.S.E) H1(tr )
ANOVA TABLE
Source of Sum of Mean sum of
Degree of Freedom F-Ratio
Variation Squares squares
2 S2tr s 2t
Treatments St 2
K −1 s =
t F = 2 ~Fk −1,N − k
k −1 se
Se2
Error Se 2 N −K
2
s =
e
N-k
Total ST2 N −1
Quantitative Methods
138
Example:
Suppose 3 drying formulas for curing a glue are studied and the following times are observed.
Formula A 13 10 8 11 8
Formula B 13 11 14 14
Formula C 4 1 3 4 2 4
Solution
H 0 : There is homogeneity among the means of A, B & C i.e. µ
= A µ=
B µC
H1 : There is no homogeneity among the means of A, B & C i.e. µ A ≠ µ B ≠ µC
Appropriate level of significance is 5% (given)
Ti Ti 2 / ni
Formula A 13 10 8 11 8 --- 50 2500/5 = 500
Formula B 13 11 14 14 --- --- 52 2704/4=676
Formula C 4 1 3 4 2 4 18 324/6 = 54
G =120 Σ Ti 2 / ni = 1230
G 2 1202
Correction factor (C.F) = = = 960
N 15
2
Sum of Squares due to Total (S.S.T) = ST = R.S.S - C.F = 1262 - 960 = 302
Ti 2
Sum of Suares due to Treatments (S.S.S.tr) = Str2 = ∑ n - C.F = 1230 - 960 = 270
i
ANOVA Table
Quantitative Methods
139
Inference
{Fα =
, k −1, N − k F=
0.05,2,12 3.89}
Fcal > Ftab , We reject H 0 . Hence, we conclude that µ A ≠ µ B ≠ µC
ANOVA Two Classification is used to test the homogeneity of more than two population
Means w.r.t Two Classification (Treatments and Blocks). The procedure for ANOVA Two-way
classification consists of the following steps.
● Null Hypothesis
● Alternative Hypothesis
BLOCKS
1 2 . . . h Ti Ti 2
1 X 11 X 12 . . . X 1h T1 T12
2 X 21 X 22 . . . X2h T2 T2 2
Treatments . . . . . . . . .
. . . . . . . . .
k X k1 Xk2 . . . X kh Tk Tk 2
Bj B1 B2 . . . Bh G ΣTi 2
Bj2 B12 B2 2 . . . Bh 2 ΣB j 2
Quantitative Methods
140
∑X
2
Row sum of squares(R.S.S) = ij
, Where X i is the ith observation
G2
Correction factor (C.F) = where G is the Grand total,
N ,
N is no. of observations in expt.
2
Total Sum of Squares (T.S.S) = ST SR.S.S-C.F
1 k 2
2
Total Sum of Squares (t.S.S) = St = ∑ Ti - C.F
h i =1
1 h 2
2
Blocks Sum of Squares (b.S.S) = sb ∑ B j - C.F
k j1
2 2 2 2
Error Sum of Squares (E.S.S) = se S.S.T-S.S.tr-S.S.b ST S r Sb
ANOVA TABLE
S2 s 2t
treatments St 2
k −1 s = t2
F1 = ~Fk −1,(k-1)(h-1)
t
k −1 s e2
S2B s 2b
Blocks Sb 2 h −1 s = 2
F2 = ~Fh −1,(k-1)(h-1)
b
h −1 s e2
Se2
Error Se 2 ( k − 1) ( h − 1) s e2 =
(k − 1)(h-1)
Total ST 2 kh − 1
Example
Blocks
B1 B2 B3 B4
Treatment 1 13 7 9 3
Treatment 2 6 6 3 1
Treatment 3 11 5 15 5
Solution
Null Hypothesis
µ1
H 0(tr ) : There is homogeneity among the treatments i.e. = µ=
2 µ3
Quantitative Methods
141
H 0(b ) : There is homogeneity among the blocks µ1
i.e. = µ=
2 µ=
3 µ4
Alternative Hypothesis
Blocks Ti Ti 2
Treatment 1 13 7 9 3 32 1024
Treatment 2 6 6 3 1 16 256
Treatment 3 11 5 15 5 36 1296
G 2 842
Correction factor (C.F) = = = 588
N 12
2
Sum of Squares due to Total (S.S.T) = ST = R.S.S - C.F = 786 - 588 = 198
1 k 2 1
Sum of Squares due to Treatments (S.S.tr) = S=
t
2
∑
h i =1
Ti − C.F
=
4
2576 - 588 = 56
1 h 2 1
2
Sum of Sqaures due to Blocks (S.S.b) = S=
b ∑
k j =1
B j − C .F
=
3
2034 - 588 = 90
2 2 2
Sum oue to Error (S.S.E) = S.S.T - S.S.tr - S.S.b = ST − Str − Sb = 198 - 56 - 90 = 52
ANOVA Table
Treatments 56 2 56/2 = 28
Ft = 28/8.67 = 3.23
Blocks 90 3 90/3 = 30
Fb = 30/8.67 = 3.46
Error 52 6 52/6 = 8.67
Total 198 11
Quantitative Methods
142
Inference
● {F = F= }
5.14 , Ft < Ftab , we accept H 0(tr )
α , k −1,( k −1) ( h −1)
0.05,2,6
● {F = F=
α , h −1, ,( k −1) ( h −1)
0.05,3,6 4.76 } , Fb < Ftab , we accept H 0(b )
Quantitative Methods
143
Self-Assessment Questions
A). t
B). F
C). Z
D). χ 2
A). ANOVA
B). COVA
C). ANCOVA
D). ANO
A). 3
B). 4
C). 2
D). 5
4. ANOVA is used to test the homogeneity of several population means. What do you
say?
A). No
B). Can’t say
C). May be
D). Yes
Quantitative Methods
144
2.3.2 CHI-SQUARE TEST
The square of a standard normal variate is called Chi-square variate and is denoted by χ 2 .
The Chi-square test is used to test the homogeneity of the given data and can be classified into
two types.
The Chi-square test for goodness of fit is used to test the homogeneity of the given data by
comparing the observed and expected frequencies. The procedure of Chi-square test for good-
ness of fit consists of the following steps:
(Oi − ei ) 2
χ =∑
2
ei
If χ 2 > χ 2α , Reject the Null Hypothesis H 0 . i.e., accept the Alternative Hypothesis H1 .
Example
The number of road accidents per day in a week on a highway are distributed as follows
No. of Accidents 12 8 20 5 14 10 15
Quantitative Methods
145
Use chi-square test and test whether the accidents are uniformly distributed throughout the week.
Test at 5% level of significance
Solution
1
The expected frequency, E =
n
∑ Oi
Ei
= 1/ 7 (12 + 8 + 20 + 5 + 14 + 10 + 15 )
Ei
= 84
= /7 12
Null Hypothesis ( Ho ) : the accidents are uniformly distributed throughout the week.
Ho : Oi = Ei
Alternative Hypothesis ( H 1) : the accidents are not uniformly distributed throughout the week.
H 1: Oi ≠ Ei
The level of significance, α = 5%
(Oi − ei ) 2
The test statistic χ 2 = ∑ ei
To determine χ 2 , the following table is used
( Oi − Ei ) ( Oi − Ei )
2 2
Oi Ei / Ei
12 12 (12-12)2 = 0 0/12 = 0
χ2 12.15
Therefore χ 2 = 12.15
Quantitative Methods
146
χ 2α ( n − 1) df = χ 25% ( 7 − 1) df = χ 25% ( 6 ) df = 12.592
The Chi-square test for the Independence of Attributes is used to test the independence of the
attributes (Factors) with observed and expected frequencies. The procedure of the Chi-square
test for independence of attributes consists of the following steps:
(Oi − ei ) 2
χ2 = ∑
ei
If χ 2 > χ 2α , Reject the Null Hypothesis H 0 . i.e., accept the Alternative Hypothesis H1 .
To test whether the Two Factors ( F1and F2 ) affecting the given data are independent or not.
( r x c Contingency Table)
F2
1 2 . J . c
1 O11 O12 . O1J . O1c
2 O 21 O 22 . O2 j . O 2c
. .
F1 . .
i Oi1 Oi 2 . ü . Oic
. .
r Or1 Or 2 . ü . Orc
Procedure
H 0 : Oij = Eij
Step 2: Alternative Hypothesis ( H1 ) : The two factors F1and F2 are not independent
H 0 : Oij ≠ Eij
Quantitative Methods
147
( Oij − eij )
2
eij
Where χ 2
=
(i th
row total x j th column total )
Grand total ( G )
Example
The following table gives the classification of 100 workers according to their Gender and Nature
of work.
Nature of work
Skilled Unskilled
Gender Male 40 20
Female 10 30
Use Chi-Square test and test whether the Gender is an independent of Nature of work or not.
Test at 5% Level of significance.
Solution
Given That
(
Expected frequency, Eij = i th row total x j th column total ) / Grand total ( G )
Quantitative Methods
148
Null Hypothesis ( H 0 ) : the Gender is an independent of Nature of work
H 0 : Oij = Eij
H 0 : Oij ≠ Eij
The level of Significance, α = 5%
( Oij − eij )
2
Therefore, χ 2 = 16.66
Now χ 2α ( r − 1)( c − =
1) χ 25% ( 2 − 1)( 2 − =
1) χ 25%1 df
= 3.841
Quantitative Methods
149
Self-Assessment Questions
A). t
B). F
C). Z
D). χ 2
A). Eij
B). Bij
C). Oij
D). Aij
7. χ 2 Test is of __Types.
A). 3
B). 4
C). 2
D). 5
A). Eij
B). Bij
C). Oij
D). Aij
{F = F=
α , h −1, ,( k −1) ( h −1)
0.01,2,6 }
10.92
Quantitative Methods
150
2.3.3 Sign test
The sign test is a statistical method to test for consistent differences between pairs of obser-
vations, such as the weight of subjects before and after treatment. Given pairs of observations
(such as weight pre- and post-treatment) for each subject, the sign test determines if one mem-
ber of the pair (such as pre-treatment) tends to be greater than (or less than) the other member
of the pair.
Quantitative Methods
151
Example
Two archaeologists, X and Y worked at an ancient building for 25 days and found the following
artifacts. Use sign test to test the null hypothesis that the two archaeologists are equally good at
finding artifacts against the alternative hypothesis that X is better.
X 1 0 2 3 1 0 2 2 2 3 0 1 1 1 4 1 2 1 3 5 2 1 3 2 2
Y 0 0 1 0 2 0 0 1 1 2 0 1 2 1 1 0 2 2 6 0 2 3 0 2 1
Solution
Given that
X 1 0 2 3 1 0 2 2 2 3 0 1 1 1 4 1 2 1 3 5 2 1 3 2 2
Y 0 0 1 0 2 0 0 1 1 2 0 1 2 1 1 0 2 2 6 0 2 3 0 2 1
Xi − Yi + 0 + + - 0 + + + + 0 0 - 0 + + 0 - - + 0 - + 0 +
Null Hypothesis H 0 : the two archaeologists are equally good at finding artifacts.
H0 : X = Y
H1 : X > Y
That is Accept H1
The archaeologist X is better than Y in finding artifacts.
Quantitative Methods
152
Self-Assessment Questions
9. The Sign test is based on the deviations of observations. What do you say?
A). Yes
B). No
C). May be
D). Can’t say
A). t
B). F
C).
D). χ 2
A). s
B). p
C). r
D). n
Quantitative Methods
153
2.3.4 Wilcoxon Signed Rank
The Wilcoxon signed-rank test is a non-parametric statistical hypothesis test used either to test
the location of a set of samples or to compare the locations of two populations using a set of
matched samples.
● Rank test is a test which is based on ranks
● The Wilcoxon signed-rank test is the non-parametric test equivalent to the
parametric t-test.
● As the Wilcoxon signed-ranks test does not assume normality in the data, it can be
used when this assumption has been violated and the use of the t-test is inappropriate.
● The Wilcoxon signed-rank test is used when comparing two related samples, matched
samples, or repeated measurements on a single sample to assess whether their
population mean ranks differ.
● Find the difference between each pair of values and then take absolute values of the
differences.
● Assign ranks to the absolute value of the differences.
● Re-attached to each rank the positive or negative sign that was removed earlier.
● If ranks are repeated take the average of ranks.
● Calculate the sum of negative ranks and sum of positive ranks.
● Calculate the statistic W , which is equal to the smaller of the two sums
Quantitative Methods
154
Example
The weekly sales revenues of a product in 14 randomly selected retail stores before and after a
new competing product is released in the market is given below. Test whether there is a sig-
nificance of difference in sales of the product after the release of the competing product at 5%
level.
Retail Store 1 2 3 4 5 6 7 8 9 10 11 12 13 14
Before 8 9 6 9 4 8 5 9 2 8 9 10 12 17
After 7 8 8 7 3 5 4 8 4 7 8 7 9 9
Solution
H 0 : There is no significance of difference in sales of the product after the release of the
competing product.
H1 : There is a significance of difference in sales of the product after the release of the compet-
ing product.
The level of Significance α = 5%
To prepare the test statistic W , the given data can be tabulated as follows
1 8 7 1 1 4 +4
2 9 8 1 1 4 +4
3 6 8 -2 2 9.5 -9.5
4 9 7 2 2 9.5 +9.5
5 4 3 1 1 4 +4
6 8 5 3 3 13 +13
7 5 4 1 1 4 +4
8 9 8 1 1 4 +4
9 2 4 -2 2 9.5 -9.5
10 8 7 1 1 4 +4
11 9 8 1 1 4 +4
12 10 7 3 3 13 +13
13 12 9 3 3 13 +13
14 7 9 -2 2 9.5 -9.5
Quantitative Methods
155
Sum of ( + Ranks ) = 4 + 4 + 9.5 + 4 + 13 + 4 + 4 + 4 + 4 + 13 + 13 = 76.5
Hence Accept H1
There is a significance of difference in sales of the product after the release of the competing
product.
Quantitative Methods
156
Self-Assessment Questions
12. Signed Ranks are the base for Wilcoxon Signed Rank test. What do you say?
A). Yes
B). No
C). May be
D). Can’t say
13. The test statistic used in Wilcoxon Signed Rank test is___________.
A). W
B). F
C). X
D). χ 2
A). Parametric
B). Mean
C). Non-parametric
D). variance
Summary
● The unit ANOVA & Non-parametric tests help in understanding the non-
parametric tests.
● ANOVA with two classifications used to analyse the means of more than two
populations with treatments, treatments & blocks.
● The sign test and Wilcoxon Signed Rank test are the significant non-parametric
tests based on the signs and ranks of sample observations.
Quantitative Methods
157
Terminal Questions
Position 1 90 82 79 98 83 91
Position 2 105 89 93 104 89 95 86
Position 3 83 89 80 94
Perform an ANOVA to test at 0.05 L.O.S whether the difference among the
sample means at the 3 positions are significant.
Fα = F= 3.74
Tab: , k −1, N − k 0.05,2,14
2. Explain the computational procedure for Chi-square test for goodness of fit.
3. Look on the detergents as treatments and engines as blocks, obtain the appropriate
t ANOVA table and test at 0.01 level of significance whether there are differences in
the detergents or in the engines.
{F = F=
α , k −1,( k −1) ( h −1)
0.01,3,6 } {
9.78 , Fα ,h −1, ,( k=
−1) ( h −1)
F=
0.01,2,6 }
10.92
4. From a telephone directory, the sample of the100 digits are distributed as follows
Digit 0 1 2 3 4 5 6 7 8 9
Frequency 11 8 12 6 5 13 14 7 8 16
Use Chi-square test and test whether the digits are uniformly distributed or not.
5. Explain the computational procedure for Chi-square test for independence of attri-
butes.
6. Discuss the procedure for Sign Test with an example.
7. Discuss the procedure for Wilcoxon Signed Rank test with an example.
Quantitative Methods
158
Answer Keys
Self-Assessment Questions
Question No Answers
1 B
2 A
3 C
4 D
5 D
6 C
7 C
8 A
9 A
10 C
11 C
12 A
13 A
14 C
Quantitative Methods
159
Glossary
● Absolute value: The actual distance of the integer from zero, in a number line.
● Null hypothesis: A type of statistical hypothesis that proposes that no statisti-
cal significance exists in a set of given observations.
● Homogeneity: The quality of being similar or comparable in kind or nature.
BIBLIOGRAPHY
1. Chung, K. L. (2000). A Course in Probability Theory (3rd ed.). San Diego, CA:
Academic Press.
2. Feller, W. (1968). An introduction to Probability Theory and its applications:
Volume I. John Wiley & Sons.
3. Gupta, S. C., & Kapoor, V. K. Fundamentals of Mathematical Statistics. S. Chand
Publications.
External Resources
1. Rukmangadachari, E. Probability and Statistics. Person Education.
2. Rohatgi, V. K., & Ehsanes Saleh, A. K. (2015). An introduction to probability and
statistics (3rd ed.). doi:10.1002/9781118799635
e-References
● Two-Way ANOVA:
https://www.investopedia.com/terms/t/two-way-anova.asp
Quantitative Methods
160
Video Links
Topic Link
https://www.youtube.com/
One way ANOVA Steps Involved in ANOVA
watch?v=7Nt-PeITLbY
Two Way ANOVA Steps Involved in Analy- https://www.youtube.com/
sis of Variance ANOVA watch?v=xMtmhctKyOU
Pearson's chi square test (goodness of fit) https://www.youtube.com/
Probability and Statistics watch?v=2QeDRsxSF9M
https://www.youtube.com/
Sign Test Concept and Example
watch?v=GV5y5IzhyEU&t=258s
Wilcoxon-Test (Wilcoxon Signed Rank https://www.youtube.com/
Test) watch?v=NZsL2eDQiDQ
Keywords
Quantitative Methods
161
QUANTITATIVE METHODS
Module - 3
Quantitative Methods
162
Module Description
Any statistical association between two random variables or bivariate data, whether causal or
not, is referred to in statistics as correlation or dependency. Although “correlation” can mean
any kind of association in the broadest sense, in statistics it typically refers to the strength of a
pair of variables’ linear relationships. Examples of common dependent phenomena include the
relationship between parent and child height and the relationship between a good’s price and the
number of units buyers are prepared to buy, as shown in the so-called demand curve.
In the common language, the word correlation refers to an association of some kind. We could
claim to have observed a connection between wheezy episodes and foggy days. However,
correlation is the statistical term used to describe the relationship between two quantitative
variables. We also assume that the relationship is linear, meaning that for every unit rise or
reduction in one variable, the other increases or decreases by a constant amount. Regression,
which entails estimating the best straight line to summarise the correlation, is the other method
that is frequently employed in similar situations.
A statistical method called regression links a dependent variable to one or more independent
(explanatory) variables. A regression model can demonstrate whether changes in one or more of
the explanatory variables are related to changes in the dependent variable. This is accomplished
by essentially fitting a best-fit line and observing the distribution of the data around this line.
Financial analysts and economists can use regression to make predictions and value assets,
amongst other things. To interpret regression findings correctly, several presumptions regarding
the data and the model itself must be true.
Quantitative Methods
163
QUANTITATIVE METHODS
Module - 3
Unit - 1
INTRODUCTION TO
CORRELATION
Quantitative Methods
164
Unit Table of Contents
Unit 3.1 Introduction to Correlation
Quantitative Methods
165
Aim
This unit aims to explain the basic concepts of Correlation and its applications in
Management.
Instructional Objectives
This unit intends to:
● Explain the concepts of correlation and types of correlation
● Apply the methods of correlation in various applications of management
Learning Outcomes
At the end of this unit, you are expected to:
● Demonstrate the basic concepts of correlation and different methods
correlation to analyse the data
● Interpret the concepts of correlation in management applications such as
marketing, HRM, finance, etc.
Quantitative Methods
166
INTRODUCTION
Correlation and regression are the Bi-variate Statistical tools used to analyse the Bi-variate data
( Xi, Yi )=i 1, 2……. n . The preamble for correlation is ‘Relation’, which means dependency. Let
Y= a + bX be the form of straight line, in which Y is the dependent variable and is depending
on the variable X is called Independent Variable. Ex: Among the variables Price & Demand,
the Demand (Y ) is the dependent variable and is depending on Price ( X ) is the Independent
Variable. Therefore, the variables X and Y are related.
Correlation: If the change in one variable leads to change in another variable, then the two
variables are said to be Correlated. If the effect of Change of one variable leads to the effect of
change of another variable, the two variables are said to be correlated.
• Positive Correlation
o If the two variables deviate in the same direction, i.e., increase in one variable
leads to increase in another variable then the two variables are said to have
Positive Correlation
o Ex: Income and Expenditure of a group of employees
• Negative Correlation
o If the two variables deviate in the opposite direction, i.e., increase in one variable
leads to decrease in another variable then the two variables are said to have
Negative Correlation
o E.g.: Price and Demand of a product
• Perfectly Positive Correlation
o If the two variables deviate in the same direction with equal ratio and forms a
straight line, then the two variables are said to have Perfectly positive Correlation.
o E.g.: Income and Expenditure of a group of employees with equal ratio
Quantitative Methods
167
• Perfectly Negative Correlation.
o If the two variables deviate in the opposite direction with equal ratio and forms a
straight line, then the two variables are said to have Perfectly negative Correlation.
o E.g.: Price and Demand of a product with equal ratio
• No or Nonsense Correlation
o There is a correlation between two variables, but which are not related then such
a correlation is called Nonsense Correlation
o E.g.: The correlation between hair colours of people in the US, and rainfall in India
is a nonsense correlation
Quantitative Methods
168
Self-Assessment Questions
A). Correlation
B). Relation
C). Parameter
D). Statistic
A). 4
B). 5
C). 3
D). 2
A). Zero
B). Negative
C). Positive
D). non-sense
A). Zero
B). Negative
C). Positive
D). Nonsense
Quantitative Methods
169
3.1.2 MEASUREMENT OF CORRELATION
Measurement of correlation means to find the nature of correlation and the amount of correlation
in Bi-variate data ( Xi, Yi ) =
i 1, 2 …. n . Various methods and techniques are used to measuring
the correlation in Bi-variate data. Following methods are used to determine the nature and the
amount of correlation in Bi-variate data ( Xi, Yi ) =
i 1, 2 …. n .
● Scatter Diagram
● Karl Pearson’s Coefficient of Correlation (K P C C)
● Spearman’s Rank Coefficient of Correlation (S R C C)
Quantitative Methods
170
Self-Assessment Questions
5. Measurement of correlation means to find the nature of correlation and the amount
of correlation in Bi-variate data. What do you say?
A). Yes
B). No
C). May be
D). Can’t say
A). 4
B). 5
C). 3
D). 2
Quantitative Methods
171
3.1.3 SCATTER PLOT
● Scatter Plot or Scatter Diagram is the Diagrammatic or Graphical representation of a
Bi-variate data ( Xi, Yi )=i 1, 2, ….n in a Two-dimensional plane.
● Scatter Diagram explains the Nature of Correlation in a Bi-variate data ( Xi, Yi )=i 1, 2, ….n
● If all the ordered pairs in the Bi-variate data are very close to each other, then a good
amount of correlation is expected and if the points are widely scattered, then a poor
amount of correlation is expected.
Following scatter diagrams explains the nature of correlation among the Bi-variate data
Positive Correlation
Negative Correlation
Quantitative Methods
172
Perfectly Negative Correlation
No Correlation
3.1.3.1 Examples:
1. Draw the Scatter Diagram and comment on the nature of correlation for the following
Bi-variate Data.
Quantitative Methods
173
Solution:
Since the variable X and Y are increasing from left to right. Hence the variables X & Y are
positively correlated.
2. Draw the Scatter Diagram and comment on the nature of correlation for the following Bi-vari-
ate Data.
Solution:
Since the variable X and Y are decreasing from left to right. Hence the variables X & Y are
negatively correlated.
Quantitative Methods
174
3. Draw the Scatter Diagram and comment on the nature of correlation for the following Bi-vari-
ate Data.
X 1 2 3 4 5 6
Y 2 4 6 8 10 12
Solution:
Since the variable X and Y are increasing from left to right with equal ration. Hence the variables
X & Y are perfectly positively correlated.
4. Draw the Scatter Diagram and comment on the nature of correlation for the following Bi-vari-
ate Data.
X 1 2 3 4 5 6
Y 12 10 8 6 4 2
Solution:
Quantitative Methods
175
Since the variable X and Y are decreasing from left to right with equal ration. Hence the vari-
ables X & Y are perfectly negatively correlated.
5. Draw the Scatter Diagram and comment on the nature of correlation for the following Bi-vari-
ate Data.
X 1 4 6 8 10 12
Y 2 7 4 6 10 2
Solution:
Since the variable X and Y neither increasing nor decreasing. Hence the variables X & Y
have no correlation.
Quantitative Methods
176
Self-Assessment Questions
A). Yes
B). No
C). Maybe
D). Can’t say
A). Yes
B). No
C). Maybe
D). Can’t say
9. If all the ordered pairs in the Bi-variate data are very close to each other, then ____
amount of correlation is expected.
A). Worst
B). No
C). Poor
D). Good
10. If all the ordered pairs in the Bi-variate data are widely scattered, then ____amount
of correlation is expected.
A). Worst
B). No
C). Poor
D). Good
Quantitative Methods
177
3.1.4 Pearson Correlation Coefficient
● Scatter Diagram provides only the Nature of Correlation (Positive, Negative, perfectly
Positive etc…) but not the amount of correlation between the two variables in the Bi-vari-
ate data.
● The K P C C is used to determine not only the nature but also the amount of Correlation
in a Bi-variate Data developed by Karl Pearson.
● The K P C C is a measure of Linear Relation between two Variables X & Y in the
Bi-variate Data
● The K P C C is represented by ρ xy or Rxy
cov( x, y )
Rxy =
σ xσ y
1
n
∑ xy − xy
Rxy =
1 1
∑ x 2 − ( x )2 ∑ y 2 − ( y )2
n n
1 1
x=
n
∑ xi y = ∑ yi
n
,
n - Number of pair of observations
Quantitative Methods
178
3.1.4.2 Examples
Solution
Given that
∑ Y = 15 , ∑ X = 15 , ∑ XY =
44 , ∑ X 2 =
49 , ∑ Y 2 =
49 and n = 5 .
cov( x, y )
The KPCC Rxy =
σ xσ y
1
n
∑ xy − xy
Rxy =
1 1
∑ x 2 − ( x )2 ∑ y 2 − ( y )2
n n
1 1
x=
n
∑ xi y = ∑ yi
n
,
= 5 (15 )
X bar 1/= 3
= 5 (15 )
Y bar 1/= 3
2. In a marketing survey, the price of milk and coffee in a town based on quality was found as
shown below. Could you find any relation between milk and coffee price?
Price of Milk 89 90 95 70 60 75 50
Price of Coffee 120 134 150 115 110 140 100
Solution
Given that, the number of pairs of observations, n =
The observations
Quantitative Methods
179
Xi : 89 90 95 70 60 75 50
Yi : 120 134 150 115 110 140 100
To determine the relation between the price of Milk and the price of coffee, we have to use the
Karl Pearson’s Coefficient of Correlation (KPCC)
cov( x, y )
Rxy =
σ xσ y
1
n
∑ xy − xy
Rxy =
1 1
∑ x 2 − ( x )2 ∑ y 2 − ( y )2
n n
The Given data can be tabulated as follows
Xi Yi X2 Y2 XY
89 120 7921 14400 10680
90 134 8100 17956 12060
95 150 9025 22500 14250
70 115 4900 13225 8050
60 110 3600 12100 6600
75 140 5625 19600 10500
50 100 2500 10000 5000
ρ xy =
{1/ 7 ( 67140 ) – ( 75.57 )(124.14 ) }
{Sqrt (1/ 7 ( 41671) − 5710.82}{Sqrt (1/ 7 (109781) − 15410.73}
9591.42 – 9381.25
=
(15.56 ) (16.50 )
210.17
=
256.74
= 0.8186
Since ü = 0.8186
Therefore, the price of milk and the price of Coffee are strongly positively correlated.
Quantitative Methods
180
3. Calculate the coefficient of Correlation between the age of cars and annual maintenance
cost and comment
Solution
Given that
Xi : 2 4 6 7 8 10 12
Yi : 1600 1500 1800 1900 1700 2100 2000
Quantitative Methods
181
13128.57 – 12600
=
Sqrt (10 ) Sqrt ( 40000 )
528.57
=
632
= 0.836
Since Rxy = 0.836 , there exists a strong positive correlation between the Age of cars and its
maintenance of cost.
Quantitative Methods
182
Self-Assessment Questions
A). Zero
B). No
C). Positively
D). Negatively
A). Zero
B). No
C). Positively
D). Negatively
Quantitative Methods
183
3.1.5 Spearman’s Rank Correlation Coefficient (SRCC)
● The S R C C is used to calculate the Correlation for the Qualitative Bi-variate Data.
● Calculate the ranks of Xi and Yi observations
6∑ di2
● The SRCC ρ xy = 1 − , Where di = Rx-Ry (Deviations of Ranks)
n(n 2 − 1)
n = Number of Pairs of Observations
3.1.5.1 Properties of Rank Correlation
Examples
1. Following are the marks obtained by 10 students in a class in two tests. Calculate the Rank
Correlation
Test – I 70 68 67 55 60 60 75 63 60 72
Test – II 65 65 80 60 68 58 75 63 60 70
Solution:
Given that
X 70 68 67 55 60 60 75 63 60 72
Y 65 65 80 60 68 58 75 63 60 70
n = No. of pairs of observations =10
6∑ di2
The rank correlation ρ xy = 1 −
n(n 2 − 1)
di
= Rx − Ry , n = number of pairs of observations
the given data can be tabulated as follows
Quantitative Methods
184
X Y Rx Ry di
= Rx − Ry di 2
70 65 3 5.5 3-5.5=-2.5 6.25
68 65 4 5.5 -1.5 2.25
67 80 5 1 4 16
55 60 10 8.5 1.5 2.25
60 68 8 4 4 16
60 58 8 10 -2 4
75 75 1 2 -1 1
63 63 6 7 -1 1
60 60 8 8.5 -0.5 0.25
72 70 2 3 -1 1
Σdi 2 50
6∑ di2
ρ xy = 1 −
n(n 2 − 1)
= 1 − ( 6*50 ) /10 ( 99 )
= 1 − ( 300 / 990 )
= 1 − 0.303
= 0.696
Therefore, the test I and Test II Marks are positively correlated.
2. Ten competitors in a musical test were ranked by 3 judges in the following order.
Ranks by A 1 6 5 10 3 2 4 9 7 8
Ranks by B 3 5 8 4 7 10 2 1 6 9
Ranks by C 6 4 9 8 1 2 3 10 5 7
Use Rank Correlation and determine which pair of judges have the nearest approach in their
judgment.
Solution
RA 1 6 5 10 3 2 4 9 7 8
RB 3 5 8 4 7 10 2 1 6 9
RC 6 4 9 8 1 2 3 10 5 7
Quantitative Methods
185
To determine the pair of judges having the nearest approach in their judgment, we have to find
=
the rank correlations between ( sno, BC( 6,and
c= 7), AC c (‘Seeta’,’Geeta
namerespectively. = ’) , marks c (15,14
= ) , region c (‘gun’,’
The Spearmans’ Rank Coefficient of Correlation (SRCC)
6∑ di2
ρ xy = 1 −
n(n 2 − 1)
RA RB RC d1
= RA − RB d12 d2
= RB − RC d 22 d3
= RC − RA d32
1 3 6 -2 4 -3 9 5 25
6 5 4 1 1 1 1 -2 4
5 8 9 -3 9 -1 1 4 16
10 4 8 6 36 -4 16 -2 4
3 7 1 -4 16 6 36 -2 4
2 10 2 -8 64 8 64 0 0
4 2 3 5 25 -1 1 -1 1
9 1 10 8 64 -9 81 1 1
7 6 5 -1 1 1 1 -2 4
8 9 7 -1 1 2 4 -1 1
ρ AB = −0.21
ρ BC = −0.29
ρ AC = 0.63
0.63 > −0.21 > −0.29
ρ AC > ρ BC > ρ AB
Therefore, judges A and C are having the nearest approach in their judgment.
Quantitative Methods
186
Self-Assessment Questions
15. Rank Correlation is used to calculate the correlation between ____ Bi-variate data.
A). Quantitative
B). Numeric
C). Qualitative
D). Alphabet
A). Some
B). Not
C). Positively
D). Negatively
A). Zero
B). Moderate
C). c) Low
D). d) Highly
Quantitative Methods
187
Summary
• The unit aims to introduce the concept of correlation and their applications in the
field of management.
• Basically, correlation is of two types of positive correlation and negative correlation.
• Scatter plot, Karl Pearson Correlation Coefficient and Spearman’s Rank Correlation
Coefficient (SRCC) are the methods to determine the nature and amount of correla-
tion of a bi-variate data.
Terminal Questions
Test- 1 70 68 67 55 60 60 75 63 60 72
Test-2 65 65 80 60 68 58 75 63 60 70
4. Following are the ranks given by two judges for 12 Contestants in a singing
competition. Find out whether judges agree or not.
X 1 9 2 10 3 11 8 4 12 7 5 6
Y 2 9 1 7 4 10 8 3 12 6 5 10
Quantitative Methods
188
Answer Keys
Self-Assessment Questions
Question No Answers
1 A
2 B
3 C
4 B
5 A
6 C
7 A
8 A
9 D
10 C
11 A
12 C
13 D
14 A
15 C
16 B
17 D
18 B
Quantitative Methods
189
Glossary
• Correlation: A statistical term describing the degree to which two variables
move in coordination with one another.
• Positive Correlation: A relationship between two variables that tend to move
in the same direction.
• Bi-variate data: Data on each of two variables, where each value of one of the
variables is paired with a value of the other variable.
BIBLIOGRAPHY
1. Chung, K. L. (2000). A Course in Probability Theory (3rd ed.). San Diego, CA:
Academic Press.
2. Feller, W. (1968). An introduction to Probability Theory and its applications:
Volume I. John Wiley & Sons.
3. Gupta, S. C., & Kapoor, V. K. Fundamentals of Mathematical Statistics. S. Chand
Publications.
External Resources
1. Rukmangadachari, E. Probability and Statistics. Person Education.
2. Rohatgi, V. K., & Ehsanes Saleh, A. K. (2015). An introduction to probability and
statistics (3rd ed.). doi:10.1002/9781118799635
e-References
• MEASUREMENT OF CORRELATION: https://theintactone.com/2019/02/10/qt-u2-top-
ic-3-measurement-of-correlation-karl-pearsons-method-spearman-rank-correlation/
Quantitative Methods
190
Image Credits
https://www.embibe.com/exams/correla-
Curves are drawn from:
tion/
Video Links
Topic Link
Types of correlation & what is correlation https://www.youtube.com/
coefficient: Correlation and Regression watch?v=8dPkvu4gAvc&t=85s
Correlation and Regression Analysis: Sim- https://www.youtube.com/
plest Way to Learn with Examples watch?v=xTpHD5WLuoA
Introduction to Correlation & Regression, https://www.youtube.com/
Part 1 watch?v=z7kMeJQWr4Y
Keywords
• Negative Correlation
• Rank Correlation
• No Correlation
• Rank Coefficient
• Perfectly Positive Correlation
Quantitative Methods
191
QUANTITATIVE METHODS
Module - 3
Unit - 2
REGRESSION
Quantitative Methods
192
Unit Table of Contents
Unit 3.2 Regression
Quantitative Methods
193
Aim
This unit aims to explain the basic concepts of Regression and its applications in
Management.
Instructional Objectives
This unit intends to:
● Explain the concepts of Regression and types of correlation
● Apply the methods of Regression in various applications of management
Learning Outcomes
At the end of this unit, you are expected to:
● Elaborate upon the different types of Regression used to analyse data
● Optimise using Regression in Marketing, HRM, finance, etc.
Quantitative Methods
194
INTRODUCTION
After understanding the relationship between the Bi-variate data (Xi, Yi) i=1,2……. n, we may
be interested in estimating or predicting the value of one variable given the value of other. The
variable predicted based on another variable is called the ‘Dependent’ or the ‘Explained’ variable
and the other the ‘Independent’ or ‘Predicting’ Variable.
The Prediction is based on average relationship derived statistically by Regression Analysis. The
Equation, Linear or otherwise, is called the regression equation or explaining equation.
3.2.1 MEANING
● The literal meaning of Regression is ‘stepping back towards average value’. That means,
Regression is used to determine the average relationship between two variables.
● Correlation is used to determine the rate of change in two variables whereas Regression
is used to determine the amount of dependency between the Two Variables.
● Let Y = a + b. X be the form of straight line
Where α =- Dependent
5% or 1% or 10%
Variable
X – Independent Variable
a – Intercept
b – Slope
Quantitative Methods
195
Self-Assessment Questions
A). Correlation
B). Relation
C). Parameter
D). Regression
A). Intercept
B). Slope
C). Correlation
D). Regression
A). Intercept
B). Slope
C). Correlation
D). Regression
4. In Y = a + bX , Y is _____Variable.
A). Intercept
B). Slope
C). Dependent
D). Independent
5. In Y = a + bX , X is _____Variable.
A). Intercept
B). Slope
C). Dependent
D). Independent
Quantitative Methods
196
3.2.2 TYPES OF REGRESSION
Regression is classified into
● Liner Regression
● Multiple Regression
● Non-Linear Regression
The linear relationships are based on straight line trend, the equation of which has no-power higher
than one. A linear relationship can be both simple and multiple. Normally a linear relationship is
taken into account because besides its simplicity, it has a better predictive value. A linear trend
can be easily projected in the future.
LINEAR REGRESSION
DATA
POINT
95% CONFIDENCE BAND
Quantitative Methods
197
● Then the Regression line of Y on X is represented as
y byx( x − x )
y −=
Where byx is called the Regression coefficient of Y on X and is given by
σy
byx = ρ xy
σx
cov( x, y )
byx =
σ x2
• Regression Line of ‘X on Y’
y a0 + a1 x1 + a2 x2 +………………+ an xn
=
Tail length
Mouse weight
Quantitative Methods
198
3.2.2.3 Non-Linear Regression
● (
Exponential Curve y = ae x )
● (
Power Curve y = ax b )
Quantitative Methods
199
Self-Assessment Questions
A). Yes
B). No
C). May be
D). Never
A). 4
B). 5
C). 3
D). 2
A). 2
B). 1
C). 3
D). 4
Quantitative Methods
200
3.2.3 ESTIMATING THE REGRESSION COEFFICIENTS
Regression coefficients can be estimated through the direct method and principle of least squares
in which minimising the sum of squares of deviations of actual values from its estimated values.
Example 1
15
Estimate the regression equation of Y on X and X on Y , given the following data: ∑ Y =
15 , ∑ XY =
, ∑ X = 44 , ∑ X 2 =
49 , ∑ Y 2 =
49 and n §= .
Solution
15 , ∑ X =
Given data ∑ Y = 15 , ∑ XY =
44 , ∑ X 2 =
49 , ∑ Y 2 =
49 and n = 5
.
The regression line of X on Y
x bxy ( y − y )
x −=
cov( x, y )
bxy =
σ y2
σx
bxy = ρ xy
σy
bxy =
1/ nΣxy – ( xbar * ybar )
– ( ybar )
2 2
1/ nΣy
1/ 5 ( 44 ) – ( 3*3)
bxy =
1/ 5 ( 49 ) − 32
8.8 − 9
bxy =
9.8 − 9
−0.2
bxy =
0.8
bxy = − 0.25
x bxy ( y − y )
x −=
Quantitative Methods
201
X − 3 = − 0.25 (Y − 3)
X − 3 = − 0.25Y + 0.75
X + 0.25Y − 3.75 =
0
X + 0.25Y − 3.75 =
0
cov( x, y )
Where byx =
σ x2
byx =
1/ nΣxy – ( xbar * ybar )
1/ nΣx 2 – ( xbar )
2
1/ 5 ( 44 ) – ( 3*3)
byx =
1/ 5 ( 49 ) − 32
8.8 − 9
byx =
9.8 − 9
byx = −0.25
y byx( x − x )
y −=
Y − 3 = − 0.25 ( X − 3)
Y − 3 = − 0.25 X + 0.75
Example 2:
From the following data, obtain the two regression lines
Solution:
Let X be the sales and Y be the Purchases
Quantitative Methods
202
The regression line of Y on X is given by
y byx( x − x )
y −=
cov( x, y )
byx =
σ x2
byx =
1/ nΣxy – ( xbar * ybar )
1/ nΣx 2 – ( xbar )
2
X Y X2 Y2 F
91 71 8281 5041 6461
97 75 9409 5625 7275
108 69 11664 4761 7452
121 97 14641 9409 11737
67 70 4489 4900 4690
124 91 15376 8281 11284
51 39 2601 1521 1989
73 61 5329 3721 4453
111 80 12321 6400 8880
57 47 3249 2209 2679
ΣX = 900 ΣY= 700 2
ΣX = 87360 2
ΣY = 51868 Σ XY = 66900
byx =
1/ nΣxy – ( xbar * ybar )
– ( xbar )
2 2
1/ nΣx
byx = 0.6132
bxy =
1/ nΣxy – ( xbar * ybar )
– ( ybar )
2 2
1/ nΣy
Quantitative Methods
203
1/10 ( 66900 ) – ( 90*70 )
byx =
1/10 ( 51868 ) – ( 70 )
2
bxy = 1.361
x bxy ( y − y )
x −=
90 1.361 (Y − 70 )
X −=
=X 1.361Y − 5.27
Example 3:
The following data on advertisement expenditure (in crores) and sales (in tons) of a detergent.
Fit a regression equation and estimate the likely sales when advertisement expenditure is 100.
Ads ( X ) 20 43 63 26 53 31 58 46 58 70
Sales (Y ) 120 128 141 126 134 128 136 132 140 144
Solution:
The regression line of Sales (Y ) on Ads ( X ) is given by
y byx( x − x )
y −=
cov( x, y )
byx =
σ x2
byx =
1/ nΣxy – ( xbar * ybar )
1/ nΣx 2 – ( xbar )
2
Quantitative Methods
204
Now the given data can tabulate as follows
(Y ) X Y X2 Y2 XY
20 120 400 14400 2400
43 128 1849 16384 5504
63 141 3969 19881 8883
26 126 676 15876 3276
53 134 2809 17956 7102
31 128 961 16384 3968
58 136 3364 18496 7888
46 132 2116 17424 6072
58 140 3364 19600 8120
70 144 4900 20736 10080
ΣX =468 1329
ΣY = Σ X2 = 63293
177137 ΣXY =
24408 Σ Y 2 =
byx =
1/ nΣxy – ( xbar * ybar )
1/ nΣx 2 – ( xbar )
2
6329.3 – 6219.72
byx =
2440.8 – 2190.24
byx = 0.437
y byx( x − x )
y −=
Quantitative Methods
205
Y − 132.9 =
0.437 X – 20.452
43.7 + 112.44 =
Y = 156.14
Y = 156.14
Hence if the ad budget is Rs 100 crores then estimated sales is 156.14 tons.
Quantitative Methods
206
Self-Assessment Questions
A). byx
B). bxy
C). bxx
D). byy
A). byx
B). bxy
C). bxx
D). byy
A). σ x
B). y
C). σ x
2
D). σ y 2
A). σ x
B). σ y
C). σ x 2
D). σ y2
A). Subtraction
B). Division
C). Product
D). Sum
Quantitative Methods
207
Summary
• The unit aims to introduce the concept of Regression and their applications in
the field of management.
• Regression is classified in Linear, Multiple and Non-linear Regression
• Linear regression is of two types i) Regression line of Y on X and b) Regres-
sion line of X on Y
Terminal Questions
Performance 1 2 3 4 5 6 7
Marks by P 46 42 44 40 43 41 45
Marks by Q 40 38 36 35 39 37 41
For the 8th performance, Judge Q is not attended. If Judge P awarded 37 marks to
8th performance. Then estimate the score of Judge Q for 8th performance.
Quantitative Methods
208
Answer Keys
Self-Assessment Questions
Question No Answers
1 D
2 A
3 B
4 C
5 D
6 A
7 D
8 B
9 C
10 B
11 A
12 B
13 A
14 C
15 D
Quantitative Methods
209
Glossary
• Regression: A statistical technique that relates a dependent variable to one or
more independent (explanatory) variables.
• Liner Regression: Used to predict the value of a variable based on the value
of another variable.
• Multiple Regression: Is used to estimate the relationship between two or
more independent variables and one dependent variable.
BIBLIOGRAPHY
1. Chung, K. L. (2000). A Course in Probability Theory (3rd ed.). San Diego, CA:
Academic Press.
2. Feller, W. (1968). An introduction to Probability Theory and its applications:
Volume I. John Wiley & Sons.
3. Gupta, S. C., & Kapoor, V. K. Fundamentals of Mathematical Statistics. S. Chand
Publications.
External Resources
1. Rukmangadachari, E. Probability and Statistics. Person Education.
2. Rohatgi, V. K., & Ehsanes Saleh, A. K. (2015). An introduction to probability and
statistics (3rd ed.). doi:10.1002/9781118799635
e-References
• Regression Coefficients: https://www.cuemath.com/data/regression-coefficients/
Quantitative Methods
210
Image Credits
Linear regression:
Fig. 1 https://towardsdatascience.com/linear-regression-
explained-1b36f97b7572
Multiple regression:
Fig. 2
https://www.youtube.com/watch?v=zITIFTsivN8
Video Links
Topic Link
Regression Analysis, Regression Coeffi- https://www.youtube.com/
cient, Linear Regression watch?v=QAEZOhE13Wg
Regression Analysis, Regression Coeffi- https://www.youtube.com/
cient, Linear Regression Part-II watch?v=ddYNq1TxtM0
Regression Analysis, Angle Between Two https://www.youtube.com/
Regression Lines, Proof watch?v=YciBHHeswBM
Keywords
● Regression line
● Non-Linear Regression
● Intercept
● slope
Quantitative Methods
211
QUANTITATIVE METHODS
Module - 4
Quantitative Methods
212
Module Description
An index number is a statistic used to illustrate changes in variables or a group of related variables
across time, across regions, or in relation to other aspects of the variable being studied. It is
known as a change measure, a change measurement tool, or a change representation series.
The variations in economic activity are indicated using index numbers as a barometer. They also
offer a framework for making decisions and projecting the future. Three different sorts of index
numbers are typically employed. Price index, quantity index, and value index are the three.
A company’s past performance can be compared to the statistics of the present to assess its
performance. Time Series Analysis is the process of comparing data from the past and the
present. Instead of being restricted to a limited period, time series are prolonged throughout
a span of time. Time series analysis is significant because it can aid in future prediction. Time
series can forecast the future based on present and past trends.
Financial planning benefits from time series analysis because it provides insight into future data
based on previous and present performance data. By comparing the data from the present and
the past, one can estimate the data for an anticipated period.
Quantitative Methods
213
QUANTITATIVE METHODS
Module - 4
Unit - 1
Quantitative Methods
214
Unit Table of Contents
Unit 4.1 Time Series Analysis
Quantitative Methods
215
Aim
This unit aims to explain the concept of Time Series and its applications in
Management.
Instructional Objectives
This unit intends to:
● Explain the concepts and components of Time Series
● Describe Time Series in various applications of management
Learning Outcomes
At the end of this unit, you are expected to:
● Elaborate upon the concepts and components of Time Series
● Demonstrate the concepts of Time Series in Marketing, HRM, Finance
Quantitative Methods
216
INTRODUCTION
A time series is a chronologically ordered collection of statistical data. That is in accordance with
its occurrence time. It reflects the dynamic speed with which a phenomenon moves through time.
Most time series in economics, business, and commerce, such as prices, production, consump-
tion, agricultural and industrial production, national income, foreign reserves, investment, sales
and profits of business units, bank deposits and clearings, stock exchange shares, and so on,
are all time series spread over a long period of time. In business and economics, time series are
crucial.
Time series plays a vital role in Business Management for future planning
● E.g.: money in circulation for a decade, bank deposits and clearings for a financial
year, sales and profits of a departmental store in a quarter, agricultural and industrial
production of a calendar year, etc.
A time series is a set of observations taken at specified times, usually at equal intervals.
“A time series may be defined as a collection of reading belonging to different time periods of
some economic or composite variables” - “ Ya-Lun-Chau “
The variable “Time” which is independent variable & and the second is “Data” which is the
dependent variable.
Mathematically, a time series is defined by the functional relationship ü t = ( )
Where U t is the value of the phenomenon (Variable - Dependent)
ΣX 2 = 506 f ( t ) is the function of time (Independent Variable)
● E.g.: population f ( t ) of India in different Years ( t )
The sales (U t ) of a departmental store in different months ( t )
The production (U t ) of a manufacturing unit in different hours ( t )
Quantitative Methods
217
Examples
Table 1
Day No. of Packets of milk sold
Monday 90
Tuesday 88
Wednesday 85
Thursday 75
Friday 72
Saturday 90
Sunday 102
Table 2
Year Population (in Millions)
1921 251
1931 279
1941 319
1951 361
1961 439
1971 548
1981 685
• From Table 1, the sale of milk packets decreases from Monday to Friday then again it
starts to increase.
• Same thing in Table 2, the population is continuously increasing.
Quantitative Methods
218
Self-Assessment Questions
A). Statistic
B). Relation
C). Correlation
D). Time Series
A). Yes
B). No
C). May be
D). Can’t say
A). Ut ≠ f ( t )
B). Ut ≤ f (t )
C). Ut = f (t )
D). Ut ≥ f (t )
A). Yes
B). No
C). Maybe
D). Can’t say
Quantitative Methods
219
4.1.2 Components of Time Series
The changes which are being observed in the time series are affected by economic, social,
natural, industrial & political reasons. These reasons are called components of time series and
are classified into 4 types:
• Secular trend or Long-term Movement
• Periodic Changes or Short-term Movement (which are of 3 kinds)
Seasonal Variation
Resulting from the natural forces
Resulting from Man-made conventions
• Cyclic Variations
• Random or Irregular variations
• The changes in Time series over period of within one year are called Periodic Changes
or Short-term Movement.
E.g.: Sales of a departmental store in every month of the year.
• Periodic Changes are broadly classified as
Seasonal Variations
i. These variations in Time Series are due to the rhythmic forces which
operate in a regular and periodic manner over a span of less than a year.
ii. Seasonal variations can be classified as
1. Resulting from the natural forces
2. Resulting from Man-made conventions
Cyclic Variations
Cyclical variations are recurrent upward or downward movements in a time series,
but the period of cycle is greater than a year. Also, these variations are not regular as
seasonal variation.
Quantitative Methods
220
E.g.: Business Cycle
• Irregular variations are fluctuations in time series that are short in duration due to
Floods, Earthquakes, Wars, etc.
● Additive Model
According to Additive Model, the Time Series can be Expressed as
U t = Tt + St + Ct + R t
Where U t - time series value at time t
Tt – Trend Value
St – Seasonal Variations
Ct – Cyclic Variations
Rt – Random or Irregular Variations
4.1.2.5 Multiplicative Model
Quantitative Methods
221
Self-Assessment Questions
A). Trend
B). Cyclic
C). Seasonal
D). Random
A). 4
B). 5
C). 3
D). 2
A). Trend
B). Cyclic
C). Seasonal
D). Random
A). Trend
B). Cyclic
C). Seasonal
D). Random
A). Trend
B). Cyclic
C). Seasonal
D). Random
Quantitative Methods
222
4.1.3 TREND ANALYSIS
Trend is a long-term moment in time series. Data varied due to long-term changes is denoted
with Trend. Trend can be measured and analysed in different ways.
Various methods used to the measure the secular Trend or long-term moment are:
In this method, the Time Series data is represented in two-dimensional plane (Graph) and draw
a smooth hand curve to understand the tendency of the data. We take “Time” on ‘x’ axis and
“Data” on the ‘y’ axis.
Example
Draw a free-hand curve on the basis of the following data also draw the trend line and estimate
the profit for 1997.
Solution
Plot the years on X axis and Profits on Y axis then the free-hand smooth curve for the given
data is as follows
Quantitative Methods
223
From the above free-hand smooth curve, it was concluded that the profits are in increasing
order from 1989 to 1996. And the Trend line can be represented with a dotted line.
● This method is also similar to the free-hand curve method in which the semi averages are
calculated and represented in the graph.
● In this method the given data are divided into two parts, preferably with the equal number
of years.
● If the data contains even periods, then divide them into two equal parts and find the semi
averages.
● If the data contains odd periods, then ignore the middle period and divide them into two
equal parts and find the semi averages.
● Draw the free-hand curve with the data and trend line using semi averages
Example
Draw the trend line from the following data by Semi-Average Method
Solution
There are total 8 periods in the data which can be distributed in equal parts.
Now we calculated Average mean for every part.
Quantitative Methods
224
From the above graph, it was concluded that production is in increasing order from 1989
to 1996. And the Trend line can be represented with a dotted line.
• It is one of the most popular methods for calculating Long Term Trend. This method is
also used for ‘Seasonal fluctuation’, ‘cyclical fluctuation’ & ‘irregular fluctuation’. In this
method we calculate the ‘Moving Average for certain years.
• For example: If we calculate ‘Three year’s Moving Average’ then according to this method
Example
Year 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996
Price 20 25 33 33 27 35 40 43 35 32 37 48 50 37 45
Solution
Quantitative Methods
225
Example
Solution
Method of Least Squares is an important method to determine the trend in the time series data
by fitting Linear of Non-Linear curves.
Linear Trend
Non-linear Trend
(
Second degree Parabola y = a + bx + cx 2 )
(
y a0 + a1 x + a2 x 2 +………………+ an x n
nth degree Polynomial = )
(
Exponential function y = ae x )
(
Power function y = ax b )
Quantitative Methods
226
4.1.3.5 Fitting of a straight line ( Y= a + bx )
Example
Solution
Quantitative Methods
227
Now we calculate the value of two constant ‘a’ and ‘b’ with the help of two equation:-
∑ Y NabX ∑
=
∑ XYaXbX ∑ + ∑ 2
5.2 + 1.6(6) =
Y1996 = 14.8
Example
Given below, seasonal demand for electricity in Ahmedabad (2001-2011). Forecast the demand
for 2012 by fitting a trend line to the data.
Time 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011
period
Quantity 11 14 16 13 17 20 23 25 27 31 35
Solution
ΣY = na + bΣX
Quantitative Methods
228
ΣXY = aΣX + bΣX 2
n=11
ΣY = na + bΣX
- 110b = -251
→ b =-251/-110
→ b = 2.28
Quantitative Methods
229
→ 11a =81.52
→ a = 81.52 / 11
→ a =7.41
Y2012 =7.41+2.28(12)
Y2012 =34.7
Hence the estimated demand for the year 2012 is 34.7 tons
Quantitative Methods
230
Self-Assessment Questions
A). No
B). Yes
C). Maybe
D). Can’t say
A). 4
B). 5
C). 3
D). 2
14. In which method, Time series is divided into Two equal parts?
A). Y = bx
B). Y=a
C). Y= a + bx
D). Y= a + b
Quantitative Methods
231
4.1.4 SEASONAL, CYCLICAL, AND IRREGULAR VARIATIONS
Apart from trend or long-term variations, the time series data fluctuated due to Seasonal, Cyclic
and Irregular variations.
A seasonal variation in a time series refers to changes caused by forces that work on a regular,
periodic basis over a period of less than a year. A business or sales manager’s understanding
of such changes, which are prevalent in most economic and business time series, is critical for
planning future production and scheduling purchases, inventory control, personal requirements,
and selling and advertising programmes. The following are the aims for investigating seasonal
patterns in a time series:
• To distinguish between seasonal fluctuations. This entails determining the impact of sea-
sonal swings on the value of a specific phenomenon and
• Removing them. That is, if there were no seasonal ups and downs in the series, the value
of the phenomena would be determined. This is referred to as de-seasonalising the data,
and it is required for the analysis of cyclic variations.
Obviously, time series data for sections of a year, such as monthly, quarterly, weekly, and daily of
the year, must be provided for the study of seasonal fluctuations. Seasonal variations are stud-
ied under the assumption that the seasonal pattern is superimposed on the values of a series
independently. The following are the several seasonal variation approaches -- Method of Simple
Averages:
The residual approach, which consists of first estimating trend (T) and seasonal (S) components
and then reducing their effects on the provided time series, is an imprecise or primitive method
of assessing cyclic fluctuations. These components (T and S) are eliminated when the given time
series values are divided by TxS , assuming a multiplicative model of the time series.
(Y / TxS )
= TSCI
= / TS CI
Because of the nature of movements, no formula can be proposed for estimating the irregular
component in a time series, no matter how close it is. In practice, the three components of a
time series, namely Trend (T), Seasonal (S), and Cyclic (C), are obtained, and the irregular
component, which is unaccounted for by these components after eliminating them from the given
Quantitative Methods
232
series, is produced as a residual. The random or irregular component of a time series is calculat-
ed using the multiplicative model.
(Y / TSC )
= TSCI
= / TSC I
Quantitative Methods
233
Self-Assessment Questions
A). Trend
B). Cyclic
C). Seasonal
D). Random
A). Trend
B). Cyclic
C). Seasonal
D). Random
A). Trend
B). Cyclic
C). Seasonal
D). Random
A). Trend
B). Cyclic
C). Seasonal
D). Random
A). Trend
B). Cyclic
C). Seasonal
D). Random
Quantitative Methods
234
Summary
● The unit aims to introduce the concept of Time Series and their applications in
the field of management.
● Any Time series data fluctuate due to Trend, Seasonal, Cyclic and Irregular
variations
● Various methods used to determine the Trend are Free-Hand Curve method,
Semi-Averages method, Moving Averages method and Least Squares meth-
od. Among which Least Squares method is the best method to determine the
Trend.
● Seasonal, Cyclic, and Irregular variations also fluctuate the Time Series apart
from Trend.
Quantitative Methods
235
Terminal Questions
Time
2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011
period
Quantity 11 14 16 13 17 20 23 25 27 31 35
Quantitative Methods
236
Answer Keys
Self-Assessment Questions
Question No Answers
1 A
2 D
3 A
4 C
5 A
6 C
7 A
8 B
9 A
10 D
11 B
12 A
13 D
14 B
15 A
16 A
17 B
18 C
19 D
20 C
Quantitative Methods
237
Glossary
• Time series: A data set that tracks a sample over time.
• Least square method: The process of finding the best-fitting curve or line of
best fit for a set of data points by reducing the sum of the squares of the offsets
(residual part) of the points from the curve.
• Cyclic variations: A type of variation that occurs in a cyclical pattern over a
period of time.
BIBLIOGRAPHY
1. Chung, K. L. (2000). A Course in Probability Theory (3rd ed.). San Diego, CA:
Academic Press.
2. Feller, W. (1968). An introduction to Probability Theory and its applications:
Volume I. John Wiley & Sons.
3. Gupta, S. C., & Kapoor, V. K. Fundamentals of Mathematical Statistics. S. Chand
Publications.
External Resources
1. Rukmangadachari, E. Probability and Statistics. Person Education.
2. Rohatgi, V. K., & Ehsanes Saleh, A. K. (2015). An introduction to probability and
statistics (3rd ed.). doi:10.1002/9781118799635
e-References
• Understanding Trend Analysis and Trend Trading Strategies:
https://www.investopedia.com/terms/t/trendanalysis.asp
Quantitative Methods
238
Image Credits
Business Cycle
Fig. 1: https://corporatefinanceinstitute.com/
resources/economics/business-cycle/
Video Links
Topic Link
https://www.youtube.com/
Time Series Analysis
watch?v=BBoUJYT0jxY
“Semi Averages Method” in Time Series https://www.youtube.com/
form Statistics Subject watch?v=VmOZ7_Fjn-s
“Time Series” Chapter Introduction in Sta- https://www.youtube.com/
tistics watch?v=RxhmWTxrTs0
Introducing Time Series Analysis and fore- https://www.youtube.com/
casting watch?v=GUq_tO2BjaU
“Freehand Smooth Curve” in Time Series https://www.youtube.com/
Chapter from Statistics watch?v=CgebqU_I9tE
Keywords
• Seasonal variations
• Irregular variations
• Moving average
• Semi average
Quantitative Methods
239
QUANTITATIVE METHODS
Module - 4
Unit - 2
INDEX NUMBERS
Quantitative Methods
240
Unit Table of Contents
Unit 4.2 Index Numbers
Quantitative Methods
241
Aim
This unit aims to explain the concept of Index Numbers and their applications in
Management.
Instructional Objectives
This unit intends to:
● Explore the concepts of Index Numbers and types of Index Numbers
● Describe Index Numbers in various applications of management
Learning Outcomes
At the end of this unit, you are expected to:
● Analyse the concepts of Index Numbers
● Compare the different types of Index Numbers
● Apply the concepts of Index Numbers in Marketing, HRM, Finance, etc.
Quantitative Methods
242
INTRODUCTION
Index numbers are indicators that show how the level of a phenomenon has changed over time
in any given period (or over a specified period of time) in comparison to its values in a fixed pe-
riod (called the base period for comparison).
• The price of a particular commodity, such as steel, gold, or leather, or the price of a set
of commodities, such as consumer goods, cereals, milk and dairy products, cosmetics,
and so on.
• Volume of trade, factory output, industrial or agricultural output, imports and exports,
stocks and shares, sales and profits of businesses, and so on.
• A country’s national income, the wage structure of workers in various industries, bank
deposits, foreign exchange reserves, people’s cost of living in a specific community,
class, or profession, and so on.
4.2.1.1 Definition
• Index numbers are statistical devices designed to measure the relative change in the
level of a phenomenon (variable or group of variables) with respect to time, geographical
location or other characteristics such as income, profession, etc.
• “Index Number is a statistical device for indicating the relative movement of the data where
measurement of actual movements is difficult or incapable of being made” – Wheldon.
• “Index Number shows by its variations the changes in a magnitude which is not susceptible
either of accurate measurement in itself or of direct valuation in practice” – F. Y. Edgeworth.
Quantitative Methods
243
4.2.1.3 Classification of Index Numbers
Index Numbers are classified into various types, which can be classified as:
• Price Index
• Quantity Index Number
• Value Index Number
• Composite Index Number
• A price index (PI) is a measurement of how prices vary over time, or in other words, a
means to track inflation and deflation.
• A rise in the price level indicates that a certain economy’s currency is losing buying power
(i.e., less can be bought with the same amount of money)
• A volume index, also known as a quantity index, is a numerical time series measure used
to compare how the output of a certain class of goods and/or services differs over time or
between geographic regions.
• A value index is a statistic (ratio) that shows how a nominal value has evolved over time in
comparison to the base year’s value. For each point in time, the index point figure shows
what percentage of its relevant value at the base point in time a given value is at that time.
• A composite index is a statistical tool that combines a number of different stocks, assets,
or indices to indicate overall market or sector performance. Investment analysis, economic
trends, and market forecasting are all done via composite indices.
Quantitative Methods
244
Simple Aggregative
Unweighted
Simple Average of
Price Relative
Index Numbers
Weighted
Weighted
Weighted
Average of
Price Relatives
Quantitative Methods
245
Self-Assessment Questions
A). current
B). base
C). real
D). nominal
A). current
B). base
C). real
D). nominal
A). 3
B). 2
C). 4
D). 5
A). 3
B). 5
C). 4
D). 2
Quantitative Methods
246
4.2.2 UN-WEIGHTED INDEX NUMBERS
The percentage change in price of a single item or a group of goods between two periods of
time is measured by an unweighted price index Number. In unweighted index numbers, all of the
values studied have equal weight. Unweighted index numbers can be calculated in a variety of
ways.
It is calculated by expressing the current year’s aggregate price of all commodities as a percent-
age of the base year’s aggregate price
P01 =
∑p X 100
∑p 0
Where P01= Index number of the current year.
P1= Total of the current year’s price of all commodities.
P0= Total of the base year’s price of all commodities.
Example
Create the Index number for the year 2008 in Rajasthan based on the data provided.
Solution
Quantitative Methods
247
ΣP0 = 3236 ΣP1 = 4351
=P01
∑=
p 1
x100
4351
= x100 134.45
∑p 0
3236
Hence, the price in 2008 was 34.45% higher than the previous year.
The current year’s price is calculated as a percentage of the previous year’s price. The index
number is calculated by averaging these price relatives. Arithmetic mean, geometric mean, or
even median could be employed as the average.
P1
∑ P x100
P01 = 0
N
Where N is the Number of Items
Example
From the data given below construct the index number for the year 2008 taking 2007 as by using
arithmetic mean.
Solution
Index number using arithmetic mean and the given data can be tabulated as follows
Quantitative Methods
248
P1
∑ P x100 = 868.66
0
P1
∑ P x100 868.66
=P01 0
= = 137.34
N 5
Hence there is an increment of 37.34% in the entire commodity price compared with 2007 and
2008.
Quantitative Methods
249
Self-Assessment Questions
A). Yes
B). No
C). Maybe
D). Can’t say
A). current
B). base
C). real
D). nominal
A). current
B). base
C). real
D). nominal
A). N
B). P
C). Q
D). R
A). 3
B). 5
C). 4
D). 2
Quantitative Methods
250
4.2.3 WEIGHTED INDEX NUMBERS
When all commodities are not of equal worth, we give each one a weight based on their relative
importance, and the result is a weighted index number. This index is also known as the base
year weighted index because the base year quantities are used as weights. There are numerous
methods for calculating weighted index numbers, which can be divided into the following
categories:
• Laspeyres’s method
• Paasche’s method
• Dorbish and Bowley’s method
• Fisher’s ideal method
Laspeyres invented this approach in 1871. The weights in this method are determined by the
base’s amounts
P01 =
∑ PQ
1 0
x100
∑PQ
0 0
Hermann Paasche, a German statistician, created this method in 1874. In order to calculate the
Paasche’s Index number, the current year’s weights are used as the base year.
P01 =
∑ PQ
1 1
x100
∑PQ
0 1
This method combines Laspeyres’s and Paasche’s approaches. The index provided by Dorbish
& Bowley is obtained by taking the arithmetic average of Laspeyres’s and Paasche’s indexes.
∑ PQ
+∑ 1 1
1 1 PQ
P01 = ∑ P0Q1 ∑ P0Q1
x100
2
Quantitative Methods
251
4.2.3.4 Fisher’s Ideal Index
The geometric mean of Laspeyres’s and Paasche’s deal index numbers is Fisher’s deal index
number.
∑ PQ
1 1 ∑ PQ
1 1
=P01 SQRT + x100
∑ P Q ∑ P Q
0 1 0 1
Example
The price quantity data is listed below, with prices in Rs. per kilogram and production in quintals.
Find the following indexes: (1) Laspeyres’s Index (2) Paasche’s Index (3) Fisher’s Ideal Index.
2002 2007
ITEMS PRICE PRODUCTION PRICE PRODUCTION
BEEF 15 500 20 600
MUTTON 18 590 23 640
CHICKEN 22 450 24 500
Solution
To determine the Laspeyres’s Index, Paasche’s Index and Fisher’s Ideal Index, the given data
can be tabulated as follows
2002 2007
Price Production Price Production
ITEMS ( p1q0 ) ( p0q0 ) ( p1q1 ) ( p0q1 )
( p0 ) (q0 ) ( p1 ) ( q1 )
BEEF 15 500 20 600 10000 7500 12000 9000
MUTTON 18 590 23 640 13570 10620 14720 11520
CHICKEN 22 450 24 500 10800 9900 12000 11000
TOTAL 34370 28020 38720 31520
1. Laspeyresindex:
=P01
∑=
pq 1 0
x100
34370
= x100 122.66
∑pq 0 0 28020
2. Paasche’s Index:
=P01
∑=
pq 1 1
x100
38720
= x100 122.84
∑pq 0 1 31520
Quantitative Methods
252
3. Fisher Idealindex:
P01
∑=
pq ∑pq
1 0
x x100 1 0 34370 38720
= x x100 122.69
∑pq ∑pq
0 0 0 1 28020 31520
From the analysis, it was concluded that as per Laspeyres’s Index 22.66% increment, Paasche’s
Index 22.84% increment and Fisher’s Ideal Index 122.69% increment in the data.
Fixed Based Index Numbers in which prices of the subsequent years are expressed as relatives
of the price of the base year.
Find index numbers for the following data taking 1980 as the base year using Fixed Base Index
Numbers by taking 1980 as base year.
Solution
Index nos
Year Price 1980 as bases
Pon=Pn/Po×100
1980 40 40/40×100=100
1981 50 50/40×100=125
1982 60 60/40×100=150
1983 70 70/40×100=175
1984 80 80/40×100=200
1985 100 100/40×100=250
1986 90 90/40×100=225
1987 110 110/40×100=275
Quantitative Methods
253
4.2.3.5 Chain Based Index Numbers
Chain Based Index Numbers in which prices of the subsequent years are expressed as
relatives of the price of the previous year as the base year.
Example
Find index numbers for the following data taking 1980 as the base year using Chain Base Index
Numbers by taking 1980 as base year.
Solution
Since CBI = (Price in the Current Year/Price in the preceding Year) ×100
Pn-1,n=(Pn/Pn-1)*100
Index nos
Year Price 1980 as base
Pn-1,n=Pn/Pn-1×100
1980 40 40/40×100=100
1981 50 50/40×100=125
1982 60 60/50×100=120
1983 70 70/60×100=167
1984 80 80/70×100=114
1985 100 100/80×100=125
1986 90 90/100×100=90
1987 110 110/90×100=122.2
Quantitative Methods
254
Self-Assessment Questions
A). 3
B). 5
C). 4
D). 2
A). Laspeyres’s
B). Paasche’s
C). Dorbish and Bowley’s
D). Fisher’s
A). Laspeyres’s
B). Paasche’s
C). Dorbish and Bowley’s
D). Fisher’s
14. Geometric mean of the Laspeyres’s and Paasche’s index numbers is _____index
number.
A). Laspeyres’s
B). Paasche’s
C). Dorbish and Bowley’s
D). Fisher’s
15. The Laspeyres index number was discovered in the year ________.
A). 1820
B). 1920
C). 1881
D). 1871
Quantitative Methods
255
Summary
● The unit aims to introduce the concept of Index Numbers and their applications
in the field of management.
● Index numbers are indicators that represent the relative changes in the level of
a phenomenon through time in comparison to its values in a previous era.
● Price, Quantity, Value, and Composite Index Numbers are the four types of
index numbers.
● There are two types of index number methods: weighted and unweighted index
numbers.
Terminal Questions
Quantitative Methods
256
Answer Keys
Self-Assessment Questions
Question No Answers
1 B
2 A
3 B
4 C
5 D
6 A
7 B
8 A
9 A
10 D
11 C
12 D
13 C
14 D
15 D
Quantitative Methods
257
Glossary
• The Laspeyres Index: It is a price index used to measure the economy’s gen-
eral price level and cost of living, and to calculate inflation.
• Paasche’s Method: Is a composite index number of price arrived at by the
weighted sum method.
• Index Number: The measurement of any change in a variable or variables
across a determined period.
BIBLIOGRAPHY
1. Chung, K. L. (2000). A Course in Probability Theory (3rd ed.). San Diego, CA:
Academic Press.
2. Feller, W. (1968). An introduction to Probability Theory and its applications:
Volume I. John Wiley & Sons.
3. Gupta, S. C., & Kapoor, V. K. Fundamentals of Mathematical Statistics. S. Chand
Publications.
External Resources
1. Rukmangadachari, E. Probability and Statistics. Person Education.
2. Rohatgi, V. K., & Ehsanes Saleh, A. K. (2015). An introduction to probability and
statistics (3rd ed.). doi:10.1002/9781118799635
e-References
● What Is a Price-Weighted Index, and How Does It Work? :
https://www.investopedia.com/terms/p/priceweightedindex.asp
Quantitative Methods
258
Image Credits
Video Links
Topic Link
https://www.youtube.com/watch?v=-
Index Numbers
dUe3U0BTb4k
https://www.youtube.com/
Electrostatics L5 | Electric Field
watch?v=2_glZ6bMI9w
https://www.youtube.com/
"Index Numbers" Introduction in Statistics
watch?v=cv5iPhRmSAg
STATISTICS | Index Numbers – Introduc- https://www.youtube.com/
tion watch?v=3P62OdhegsI
Keywords
● Price Index
● Quantity Index
● Weighted Index
● Un-weighted Index
● Fisher’s Ideal Index
● Dorbish and Bowley’s Index
Quantitative Methods
259
Quantitative Methods
260