Module 1

Download as pdf or txt
Download as pdf or txt
You are on page 1of 122

bschool.cms.ac.

in
Modules
Module 1: Data collection & visualisation
Module 2: Descriptive Statistics
Module 3: Measures of Association & Sampling techniques
Module 4: Hypothesis testing & Case analysis

bschool.cms.ac.in
Text Books

• Anderson, Sweeny & Williams (2018) Statistics for Business &


Economics, Revised 13th edition, Cengage Learning

• Black Ken, Business Statistics for Contemporary Decision


making, 9th edition, Wiley

bschool.cms.ac.in
Reference Books

• Levin & Rubin (1998), Statistics for Management, 7th edition,


Pearson

• Kothari C R (2019), Research methodology: Methods &


techniques, New Age International

bschool.cms.ac.in
Marks distribution

• Class participation – 4 marks


• LinkedIn Learning – 6 marks
• Assignment 1 – 10 marks
• Assignment 2 – 10 marks
• Assignment 3 – 10 marks
• Block End Test – 10 marks
• Final Examination - 50 marks

bschool.cms.ac.in
MODULE 1

DATA COLLECTION & VISUALIZATION

bschool.cms.ac.in
• The word STATISTICS was derived from…….?

• Latin word “STATUS”


• Italian word “STATISTA”

• Meaning “Political state” or “Government”

bschool.cms.ac.in
INTRODUCTION
• Data visualization is the graphical representation of data.
• It involves producing images that communicate to the viewers.
• This communication is achieved through the use of a systematic
mapping between graphic marks and data values in the creation of the
visualization.
• This mapping establishes how data values will be represented visually.

bschool.cms.ac.in
INTRODUCTION

• Data visualization is both an art and a science.


• Difference between data and information
• Condensing the data
• Articulate the art of presenting the data through a tabular form,
colorful graphs and charts.
• It is all about Story telling with Data.

bschool.cms.ac.in
Some Examples

bschool.cms.ac.in
bschool.cms.ac.in
bschool.cms.ac.in
bschool.cms.ac.in
bschool.cms.ac.in
Some Examples

bschool.cms.ac.in
NEED FOR STATISTICS IN MANAGEMENT AND RESEARCH

a bout
Decision rmat io n Internal &
Info
External
Making in
Environment
a Business And information
comes from DATA
Hence, Business
statistics plays an
important role in the Collect, Analyze,
ongoing saga of Summarize and Business
decision making Present Statistics
within the dynamic
world of business.
bschool.cms.ac.in
SCOPE OF STATISTICS
Accounting
Economics
Auditing Management
Regional, National
Cost Estimation & International HR –
Compensation
Micro & Macro
Finance Quality Management

Investment MIS Production


Portfolio Gather & GANTT Charts
Management Marketing Disseminate
Information Regression Analysis
Analytics
Consumer Research International Business
Market & Demographic
Analysis

bschool.cms.ac.in
FORMAL DEFINITION OF STATISTICS
• The science that deals with the collection, classification, analysis, and
interpretation of numerical facts or data in large quantities and that, by
use of mathematical theories of probability, imposes order and
regularity on aggregates of more or less disparate elements for the
purpose of inferring proportions in a whole from those in a
representative sample is called as Statistics.

bschool.cms.ac.in
FORMAL DEFINITION OF STATISTICS
“Statistics is the aggregate of facts,
affected to a marked extent by
multiplicity of causes, numerically
expressed, enumerated or estimated
according to a reasonable standard of
accuracy, collected in a systematic
manner for the pre – determined
Prof. Horace Secrist purpose and placed in relation to
bschool.cms.ac.in
CHARACTERISTICS OF STATISTICS
• Statistics are:
• Aggregates of facts.
• Numerically expressed.
• Affected to a marked extent by multiplicity of causes.
• Enumerated or estimated according to a reasonable standard of accuracy.

bschool.cms.ac.in
LIMITATIONS OF STATISTICS
• Statistics is unable to explain individual item.
• Statistics is unable to explain qualitative characteristics
• Statistics deals with average
• The results of Statistics are not accurately correct
• Statistics is the only one of the methods of dealing a problem
• Statistics can be misused and mislead

bschool.cms.ac.in
MISUSE OF STATISTICS

• Potential misuse of
statistics in business
dealings

bschool.cms.ac.in
ETHICAL CONSIDERATION FOR STATISTICS
• Data is abundant and proliferation of statistical data is at large

bschool.cms.ac.in
MISUSE OF STATISTICS

• Unethical businesspeople might use


only selective data from studies for
their benefit.
• The results of statistical studies can
be mis-stated or overstated to gain
favor.
bschool.cms.ac.in
PRIMARY & SECONDARY DATA

Nominal
DATA CLASSIFICATION Ordinal
Scales of Measuring
Interval
Primary Data
Ratio

QUANTITATIVE & QUALITATIVE DATA

UNGROUPED DATA DISCRETE DATA GROUPED DATA

COVID-19 cases in Bengaluru in


the past one week – 2821, 2993,
2721, 3357, 3284, 2294, 1918.

Source: bschool.cms.ac.in
Source: https://city.imd.gov.in/citywx/city_weather.php?id=43295 https://www.pwc.in/assets/pdfs/citizens-perception-on-democratic-capital.pdf
Data Sources
Primary Secondary
Data Collection Data Compilation

Print or Electronic
Observation Survey

Experimentation

bschool.cms.ac.in
Types of Data

bschool.cms.ac.in
SOURCES OF DATA : Primary and Secondary
PRIMARY DATA: Primary data means original data that has been collected specially for the
purpose in mind. It means someone collected the data from the original source first hand.
Data collected this way is called primary data.

Primary data has not been published yet and is more reliable, authentic and objective.
Primary data has not been changed or altered by human beings; therefore its validity is
greater than secondary data.

bschool.cms.ac.in
Survey: Survey is most commonly used method in social sciences, management,
marketing and psychology to some extent. Surveys can be conducted in different
methods.
Questionnaire: Questionnaire is the most commonly used method in survey.
Questionnaires are a list of questions either an open-ended or close - ended for which the
respondent give answers. Questionnaire can be conducted via telephone, mail, live in a
public area, or in an institute, through electronic mail or through fax and other methods.
Interview: Interview is a face-to-face conversation with the respondent. It is slow,
expensive, and they take people away from their regular jobs, but they allow in-depth
questioning and follow-up questions.
Observations: Observations can be done while letting the observing person know that he
is being observed or without letting him know. Observations can also be made in natural
settings as well as in artificially created environment.

bschool.cms.ac.in
DISADVANTAGES OF PRIMARY
DATA
• High Cost
• Time Consuming
• Inaccurate Feed-backs
• More number of resources is
required

bschool.cms.ac.in
SECONDARY DATA

• Secondary data is the data that has been already collected by and readily
available from other sources.
• Such data are more quickly obtainable than the primary data. These
secondary data may be obtained from many sources, including literature,
industry surveys, compilations from computerized databases and
information systems, and computerized or mathematical models of
environmental processes.

bschool.cms.ac.in
Published Printed Sources: There are varieties of published printed sources. Their credibility
depends on many factors. For example, on the writer, publishing company and time and date
when published. New sources are preferred and old sources should be avoided as new
technology and researches bring new facts into light.
Books: Books are available today on any topic that you want to research. The uses of books
start before even you have selected the topic. After selection of topics books provide insight
on how much work has already been done on the same topic and you can prepare your
literature review. Books are secondary source but most authentic one in secondary sources.
Journals/periodicals: Journals and periodicals are becoming more important as far as data
collection is concerned. The reason is that journals provide up-to-date information which at
times books cannot and secondly, journals can give information on the very specific topic on
which you are researching rather talking about more general topics.
Magazines/Newspapers: Magazines are also effective but not very reliable. Newspaper on
the other hand is more reliable and in some cases the information can only be obtained from
newspapers as in the case of some political studies.
bschool.cms.ac.in
ADVANTAGES OF SECONDARY DATA
• Inexpensive
• Easily accessible
• Immediately available Will provide essential background and help to clarify or refine
research problem – essential for literature review
• Secondary data sources will provide research method alternatives.
• Will also alert the researcher to any potential difficulties.
DISADVANTAGES OF SECONDARY DATA
• Expensive
• Not immediately available – takes time to define problem, sampling frame, method and
analysis.
• Not as readily accessible
• Incomplete Information
bschool.cms.ac.in
Classification of data
The process of arranging data into homogenous groups
or classes according to some common characteristics
present in the data is called classification.

bschool.cms.ac.in
Basis of classification
• Geographical classification
• Chronological classification
• Qualitative classification
• Quantitative classification

bschool.cms.ac.in
Geographical classification
The classification is termed as geographical classification, if the data are
classified by geographical regions or location, like states, provinces, cities,
countries, etc.

bschool.cms.ac.in
Example for Geographical classification

The population of five important metro cities of India as


per 2019 survey is as follows:

CITIES POPULATION
MUMBAI 1,26,91,836
DELHI 1,09,27,986
BENGALURU 51,04,047
KOLKOTA 46,31,392
CHENNAI 43,28, 063

bschool.cms.ac.in
Chronological classification

The classification is termed as Chronological


classification, if the data are classified or arranged over
a period of time of occurrence, such as years, months,
weeks, days, etc.

bschool.cms.ac.in
Example for Chronological classification

The population of UK in last five years is as follows:

YEAR POPULATION (in millions)


2014 64.35
2015 64.88
2016 65.38
2017 65.81
2018 66.19

bschool.cms.ac.in
Qualitative classification
The classification is termed as qualitative classification,
if the data are classified according to a quality or
attribute such as sex, religion, literacy, intelligence,
graduation etc.,

bschool.cms.ac.in
Example for Qualitative classification

The Graduation base of MBA students in CMSBS in


2019 is as follows:

GRADUATION NUMBER OF STUDENTS


B.COM 325
BBM / BBA 236
B.E / B.TECH 125
B.Sc / BCA 22
BA / BHM 12

bschool.cms.ac.in
Quantitative classification
The classification is termed as quantitative classification, if
the data are classified by quantitative characteristics like
height, weight, age, income, etc.,

bschool.cms.ac.in
Example for Quantitative classification

The monthly income of the employees in CBS company


is as follows:

MONTHLY INCOME (in rupees) NUMBER OF EMPLOYEES


< 50 K 4
50 K – 80 K 11
80 K – 110 K 22
110 K – 140 K 10
> 140 K 9

bschool.cms.ac.in
Data series
• Individual series
• Discrete series
• Continuous series

bschool.cms.ac.in
Individual series

The total number of hours a person worked for 30


consecutive weeks in a company is given below.

94 89 88 89 90 94 92 88 87 85

88 93 94 93 94 93 92 88 94 90

93 84 93 84 91 93 85 91 89 95

bschool.cms.ac.in
Discrete series

The number of defective components in each of the


boxes are as follows:

bschool.cms.ac.in
Continuous series

The monthly income of the employees in CBS company


is as follows:

MONTHLY INCOME (in rupees) NUMBER OF EMPLOYEES


< 50 K 4
50 K – 80 K 11
80 K – 110 K 22
110 K – 140 K 10
> 140 K 9

bschool.cms.ac.in
Data series

Ungrouped data
• Individual series

Grouped data
• Discrete series
• Continuous series

bschool.cms.ac.in
Frequency Distribution Table
• Thirty shortlisted policemen reported in the national capital for
one-month rigorous training. The heights of them were checked and
tabled as given below.
Height (cms) Tally Marks Frequency
150 – 153 3
153 – 156 4
156 – 159 5
159 – 162 8
162 – 165 6
165 – 168 2
168 – 171 2
TOTAL 30 bschool.cms.ac.in
Frequency Distribution Table
• Class Interval – Range of each group of data
• Tally Marks - also called hash marks, are a unary numeral system. They are a form
of numeral used for counting. They are most useful in counting or tallying ongoing
results, such as the score in a game or sport, as no intermediate results need to be
erased or discarded. These tally marks are very useful in converting the individual
series to either discrete series or continuous series.
• Frequency - is how often something occurs.
• Frequency Distribution - a list, table or graph that displays the frequency of
various outcomes in a sample. Each entry in the table contains the frequency or
bschool.cms.ac.in
Frequency Distribution Table

• A relative frequency distribution is the fractional value assigned to each class


proportional to the total data.
• The cumulative frequency of a set of data or class intervals of a frequency table is
the sum of the frequencies of the data up to a required level. It can be used to
determine the number of items that have values below a particular level.

bschool.cms.ac.in
Frequency Distribution Table
• Table below represents the total number of overtime hours worked for
30 consecutive weeks by machinists in a machine shop. Prepare a
frequency distribution table
94 89 88 89 90 94 92 88 87 85

88 93 94 93 94 93 92 88 94 90

93 84 93 84 91 93 85 91 89 95

bschool.cms.ac.in
Frequency Distribution Table
• To determine the number of class intervals, the following procedure
have to be followed.
• Determine the data range of the data set
• Decide the width of the class intervals
• Divide the range by the chosen width of the class

• There is no hard and fast rule to decide the number of class intervals.
It purely depends on the researcher based on the research objective.
However, the number of class intervals is desirable to be between 5 to
10. Efforts have been made to have a rule to determine the number of
bschool.cms.ac.in
Frequency Distribution Table

bschool.cms.ac.in
Frequency Distribution Table

bschool.cms.ac.in
Frequency Distribution Table
Class Interval Tally Frequency

84 – 86

86 – 88

88 – 90

90 – 92

92 – 94

94 – 96

bschool.cms.ac.in
Frequency Distribution Table
Class Interval Tally Frequency

84 – 86 4

86 – 88 1

88 – 90 7

90 – 92 4

92 – 94 8

94 – 96 6

30
bschool.cms.ac.in
Frequency Distribution Table
Cumulative
Class Interval Tally Frequency Relative Frequency
Frequency
84 – 86 4 =4

86 – 88 1 =4+1=5

88 – 90 7 = 5 + 7 = 12

90 – 92 4 = 12 + 4 = 16

92 – 94 8 = 16 + 8 = 24

94 – 96 6 = 24 + 6 = 30

30
bschool.cms.ac.in
Frequency Distribution Table
• A Computer company received a rush order for as many home
computers as could be shipped during a 6-week period. Company
records provide the following daily shipments:
22 65 65 67 55 50 65
77 73 30 62 54 48 65
79 60 63 45 51 68 79
83 33 41 49 28 55 61
65 75 55 75 39 87 45
50 66 65 59 25 35 53
bschool.cms.ac.in
Frequency Distribution Table
• Following are the number of two wheelers sold by a dealer during 8
weeks of 6 working days each.
i. Group these figures into a table having the classes 10-12, 13-15, 16-18….and
28-30.
ii. Convert the distribution of (i) into a corresponding percentage frequency
distribution and also a percentage cumulative frequency distribution

bschool.cms.ac.in
Frequency Distribution Table
13 19 22 14 13 16 19 21

23 11 27 25 17 17 13 20

23 17 26 20 24 15 20 21

23 17 29 17 19 14 20 20

10 22 18 25 16 23 19 20

21 17 18 24 21 20 19 26

bschool.cms.ac.in
PRACTICE
• Calculate the number of class intervals to be taken, If the number of
data under consideration is 25 using conventional method.
• Calculate the number of class intervals to be taken, If the number of
data under consideration is 40 using Sturge’s rule.
• Calculate the number of class intervals to be taken, If the number of
data under consideration is 65 using conventional method.
• Calculate the number of class intervals to be taken, If the number of
data under consideration is 65 using Sturge’s rule.
bschool.cms.ac.in
PRACTICE
• The following data represent the afternoon high temperatures for 50
construction days during a year in St. Louis. (Source: TB2 – P50)
42 70 64 47 66 69 73 38 48 25 55 85 10 24 45 31 62 47 63 84 16 40
81 15 35 17 40 36 44 17 38 79 35 36 23 64 75 53 31 60 31 38 52 16 81
12 61 43 30 33
a. Construct a frequency distribution for the data using five class intervals.
b. Construct a frequency distribution for the data using 10 class intervals.
c. Examine the results of (a) and (b) and comment on the usefulness of the
bschool.cms.ac.in
frequency distribution in terms of temperature summarization capability.
PRACTICE
• Find the number of Indian visitors to various countries in 2017 under
Foreign Travel Statistics using
https://en.wikipedia.org/wiki/Indian_passport#Foreign_travel_statist
ics. Convert the individual series in it to the continuous series. Choose
the number of class intervals as per Sturge’s rule.
• http://www.worldstopexports.com/rice-exports-country/ reveals the
value of rice by each of the rice exporting countries in 2019. Convert
those data given in individual series to the continuous series.
bschool.cms.ac.in
BIVARIATE FREQUENCY DISTRIBUTION

• The frequency distribution of a single variable is called as univariate


distribution.
• The Business problems with two variables is called as Bivariate data
and its distribution as Bivariate Frequency distribution.

bschool.cms.ac.in
BIVARIATE FREQUENCY DISTRIBUTION
Family Finances Throughput
Income Admission

Expenditure Graduates Vehicular Movements


Inflow
Population Business
Outflow
Birth Rate Ice cream Sales
Death Rate Temperature

bschool.cms.ac.in
BIVARIATE FREQUENCY DISTRIBUTION
• In general, these data are depicted as (X, Y) and called as coordinates.
• The range in X and Y is bound to vary and hence the number of class
intervals in each of them need not be the same.
• To find the number of class intervals, we use the same logic as used in
univariate cases. If the number of class intervals in X is m and in Y is n
then there will be m x n cells in the two-way table.

bschool.cms.ac.in
BIVARIATE FREQUENCY DISTRIBUTION : An Example
• Body Mass Index (BMI) is a person's weight in kilograms divided by
the square of height in meters. A high BMI can be an indicator of high
body fatness. BMI can be used to screen for weight categories that may
lead to health problems but it is not diagnostic of the body fatness or
health of an individual. BMI is indicated in the form of Bivariate
Frequency table. Height on one side and the weight on the other. The
table below indicates the BMI and obesity of an individual.

bschool.cms.ac.in
BIVARIATE FREQUENCY DISTRIBUTION: An Example

bschool.cms.ac.in
BIVARIATE FREQUENCY DISTRIBUTION
• Two dice are thrown simultaneously 25 times. The following pairs of
observations are obtained. (2,3) (2,5) (5,5) (4,5) (6,4) (3,2) (5,2) (4,1)
(2,5) (6,1) (3,1) (3,3) (4,3) (4,5) (2,5) (3,4) (2,5) (3,4) (2,5) (4,3) (5,2)
(4,5) (4,3) (2,3) (4,1). Prepare a bivariate frequency distribution table
for the given data.

bschool.cms.ac.in
BIVARIATE FREQUENCY DISTRIBUTION
X
1 2 3 4 5 6
Y
1
2
3
4
5
6
bschool.cms.ac.in
BIVARIATE FREQUENCY DISTRIBUTION
X
1 2 3 4 5 6
Y
1

6 bschool.cms.ac.in
BIVARIATE FREQUENCY DISTRIBUTION
X
1 2 3 4 5 6 Total
Y
1 0
2 (2) (5) 7
3 (1) (1) (1) (2) 5
4 (2) (3) (3) 8

5 (2) (1) 3

6 (1) (1) 2
Total 4 3 6 3 9 0 25
bschool.cms.ac.in
BIVARIATE FREQUENCY DISTRIBUTION
• The scores of Mathematics and Statistics scored by 20 students are as
follows:
• Mathematics: 93, 87, 56, 63, 76, 55, 90, 82, 74, 61, 69, 70, 57, 79, 87,
91, 96, 59, 62, 75
• Statistics: 89, 90, 61, 80, 60, 85, 85, 80, 58, 62, 72, 75, 63, 74, 90, 86,
95, 60, 70, 63
• Construct a Bivariate frequency distribution table using proper class
intervals. Find the marginal distribution for both Mathematics and
bschool.cms.ac.in
Statistics and the conditional distribution of Statistics scores for which
BIVARIATE FREQUENCY DISTRIBUTION
M
50-60 60-70 70-80 80-90 90-100 Total
S
50-60 (1) 1
60-70 (3) (1) (2) 6
70-80 (2) (2) 4
80-90 (1) (1) (1) (3) 6
90-100 (2) (1) 3
Total 4 4 5 3 4 20
bschool.cms.ac.in
BIVARIATE FREQUENCY DISTRIBUTION
Marginal Distribution of Mathematics

C.I 50-60 60-70 70-80 80-90 90-100 Total

F 4 4 5 3 4 20

Marginal Distribution of Statistics

C.I 50-60 60-70 70-80 80-90 90-100 Total

F 1 6 4 6 3 20
bschool.cms.ac.in
BIVARIATE FREQUENCY DISTRIBUTION
M
60-70 70-80 Total
S
Conditional
50-60 (1) 1 Distribution of
60-70 (1) (2) 3
Statistics when
Mathematics scores
70-80 (2) (2) 4 are between 60 & 80.
80-90 (1) 1
90-100 0
Total 4 5 9
bschool.cms.ac.in
BIVARIATE FREQUENCY DISTRIBUTION

• Prepare a table of your class with all your heights in cm and weight in
Kgs and convert them to a Bivariate frequency table. Using the above
table, comment on the average health condition of your class (refer
the BMI table).
• Collect the data of Family income of 25 families and their savings per
month. Construct a bivariate frequency table of Income vs
Expenditure. Impose the conditional frequency and comment on your
bschool.cms.ac.in
observation.
BIVARIATE FREQUENCY DISTRIBUTION
• The all-round capabilities of a cricketer are measured in different
ways. A hundred and five dismissals in an innings is one of the ways.
Construct a bivariate frequency table by taking suitable captions and
stubs for the following data.

bschool.cms.ac.in
BIVARIATE FREQUENCY DISTRIBUTION
Player Bat Field Team Opposition Ground Match Date Scorecard
DT Lindsay 182 6ct/0st South Africa v Australia Johannesburg 23 Dec 1966 Test # 611

IDS Smith 113* 4ct/1st New Zealand v England Auckland 10 Feb 1984 Test # 977
SAR Silva 111 5ct/0st Sri Lanka v India Colombo (PSS) 6 Sep 1985 Test # 1024

AC Gilchrist 133 4ct/1st Australia v England Sydney 2 Jan 2003 Test # 1636
MJ Prior 118 5ct/0st England v Australia Sydney 3 Jan 2011 Test # 1989
AB de Villiers 103* 6ct/0st, South Africa v Pakistan Johannesburg 1 Feb 2013 Test # 2072
5ct/0st
MJ Prior 110* 5ct/0st England v New Zealand Auckland 22 Mar 2013 Test # 2084

BJ Watling 124 5ct/0st New Zealand v India Wellington 14 Feb 2014 Test # 2120
BJ Watling 142* 4ct/1st New Zealand v Sri Lanka Wellington 3 Jan 2015 Test # 2155
JM Bairstow 140 5ct/0st England v Sri Lanka Leeds 19 May 2016 Test # 2203
JM Bairstow 101 5ct/0st England v New Zealand Christchurch 30 Mar 2018 Test # 2301

BJ Watling 105* 5ct/0st New Zealand v Sri Lanka Colombo (PSS) 22 Aug 2019 Test # 2356
bschool.cms.ac.in
Source: https://stats.espncricinfo.com/ci/content/records/284278.html
MULTIVARIATE FREQUENCY DISTRIBUTION
• When the data involves three or more variables, it is categorized under
multivariate.
• It is similar to bivariate but contains more than one dependent
variable.

bschool.cms.ac.in
MULTIVARIATE FREQUENCY DISTRIBUTION
• As per the latest report on the aviation industry, following data is
provided to you. The aviation sector is composed of aircraft
movement, passenger traffic and freight traffic. It is observed that the
% shares of each of them in the month of January 2020 is given in
terms of domestic and international as (83.03, 16.97), (80.08, 19.92),
(40.03, 59.97). Tabulate the given data for better understanding.
Data source: https://www.ibef.org/industry/indian-aviation/

bschool.cms.ac.in
MULTIVARIATE FREQUENCY DISTRIBUTION
Route Domestic International
Sector (% Share) (% Share)

Aircraft Movement 83.03 16.97

Passenger Traffic 80.08 19.92

Freight Traffic 40.03 59.97


bschool.cms.ac.in
Parts of a table
• Table number
• Title of the table
• Caption and stubs
• Body
• Prefatory or head note
• Footnotes

bschool.cms.ac.in
Numericals
120 candidates were interviewed in a company for
employment. 45 were male, out of which 35 were
experienced. There were totally 70 inexperienced
candidates for the interview.
Tabulate the above data.

bschool.cms.ac.in
Interview details
Male Female Total
Experienced
Inexperienced
Total

bschool.cms.ac.in
Interview details
Male Female Total
Experienced
Inexperienced
Total 120

bschool.cms.ac.in
Interview details
Male Female Total
Experienced
Inexperienced
Total 45 120

bschool.cms.ac.in
Interview details
Male Female Total
Experienced 35
Inexperienced
Total 45 120

bschool.cms.ac.in
Interview details
Male Female Total
Experienced 35
Inexperienced 70
Total 45 120

bschool.cms.ac.in
Interview details
Male Female Total
Experienced 35 15 50
Inexperienced 10 60 70
Total 45 75 120

bschool.cms.ac.in
Numericals

There are 1235 employees in a trade union. 914


were male, out of which 319 are at least five years
experienced. There were totally 804 employees with
less than five years experience.
Tabulate the above data.

bschool.cms.ac.in
Numericals
In the year 2015, the total strength of three colleges X, Y and Z in a city
were in the ratio 4:2:5. The strength of college Y was 2000. The
proportion of girls and boys in all colleges was in the ratio 2:3. The
faculty wise distribution of boys and girls in the faculties of Arts,
Science and Commerce was in the ratio 1:2:2 in all the three colleges.
Suitably tabulate the above data. Show all the relevant calculations in
detail.

bschool.cms.ac.in
Numericals
The Population of three cities X, Y and Z is 4.5 lakhs. They were
distributed in the ratio 4:3:2 The proportion of male and female in all
these three cities were in the ratio 2:3. In an average, each of these
cities have the population from lower middle class, middle class and
rich class in the ratio 1:4:3.
Suitably tabulate the above data. Show all the relevant calculations in
detail.

bschool.cms.ac.in
Numericals

In a particular year of MBA batch, the total strength of three campuses


Global, Knowledge and Elite were in the ratio 5:2:3. The strength of the
Knowledge campus was 2500. The proportion of boys and girls in all
colleges were in the ratio 3:2. The specialization wise distribution of
boys and girls in HR, Finance and Marketing was in the ratio 1:2:2 in all
the three campuses. Suitably tabulate the above data by showing the
relevant calculations.

bschool.cms.ac.in
MULTIVARIATE FREQUENCY DISTRIBUTION
• Growth of index of industrial production (in %) for three industries,
Mining, Manufacturing and Electricity, for the financial years 2018,
2019 and 2020 (till Jan 2020) is given as (2.30, 4.5, 5.4), (2.90, 3.50,
5.20), (1.00, 0.30, 0.80). Tabulate the given data.
Data source: https://www.ibef.org/industry/manufacturing-sector-india/

bschool.cms.ac.in
MULTIVARIATE FREQUENCY DISTRIBUTION
Year
2018 2019 2020 (till Jan)
Industry

Mining 2.3 4.5 5.4

Manufacturing 2.9 3.5 5.2

Electricity 1.0 0.3 0.8

Growth of index of industrial production (in %)

bschool.cms.ac.in
MULTIVARIATE FREQUENCY DISTRIBUTION
• Sector-wise breakup of growth in export revenue (USD Billion) of
three major IT sectors - Software Products & engineering services,
Business Process Management and IT Services - for the financial years
2016, 2017, 2018 and 2019 is as below:
(22.4, 24.4, 61.0), (25.0, 26.0, 66.0), (28.0, 28, 70), (31.0, 31.0, 74.0).
Tabulate the given data.
Data source: https://www.ibef.org/industry/information-technology-india/

bschool.cms.ac.in
MULTIVARIATE FREQUENCY DISTRIBUTION
Year
2016 2017 2018 2019
IT Sectors

SP & ES 22.4 25.0 28.0 31.0

BPM 24.4 26.0 28.0 31.0

ITS 61.0 66.0 70.0 74.0

Growth in export revenue (USD Billion) of three major IT sectors


bschool.cms.ac.in
MULTIVARIATE FREQUENCY DISTRIBUTION
• About 70 percent of berths and seats added in the trains across the
country in the last five years are in the air-conditioned classes, reveals
‘Express’ through RTI. The Indian Railways has increased the total
number of seats and berths from 49.02 crore to 54.37 crore between
April 2014 and March 2019. However, a whopping 3.73 crore berths
(70 %) were added in various air-conditioned classes. Only 1.61 crore
(30%) is in the non-AC category.
Tabulate the following growth trends of berths/seats in Indian trains
bschool.cms.ac.in
(2015-2019) data
MULTIVARIATE FREQUENCY DISTRIBUTION
1. An analysis shows that the number of berths in three tier AC class
increased from 7.64 crore to 10.39 crore between 2014-15 and
2018-19 registering 26 percent increase. The successive years
2015-16 to 2017-18 data are 8.08, 8.86, and 9.46.
2. However, sleeper berths rose from 26.68 crore to 27.51 crore for the
same period showing just 3 percent rise.
3. The number of berths in two tier AC class increased from 2.63 crore to
3.03 crore between 2014-15 and 2018-19. The successive years
bschool.cms.ac.in
2015-16 to 2017-18 data are 2.7, 2.86, and 2.95.
MULTIVARIATE FREQUENCY DISTRIBUTION
Year 2014-15 2015-16 2016-17 2017-18 2018-19 % Growth
Coach (in Crores) (in Crores) (in Crores) (in Crores) (in Crores)

2AC 2.63 2.7 2.86 2.9 3.03 13.2

3AC 7.64 8.08 8.86 9.46 10.39 26

Sleeper 26.68 - - - 27.51 3

Growth trends of berths/seats in Indian trains (2015-2019)


bschool.cms.ac.in
MULTIVARIATE FREQUENCY DISTRIBUTION
• The trading holidays for share market is found in the following links.
Prepare a single table representing the trading holidays of all the
segments.
Trading Holidays for 2020 - Equity Segment, Equity Derivative
Segment and SLB Segment.
Trading Holidays for 2020 - Currency Derivatives segment, Interest
Rate Derivatives segment
Trading Holidays for 2020 - New Debt Segment.
bschool.cms.ac.in
Trading Holidays for 2020 - Commodity Derivatives Segment.
MULTIVARIATE FREQUENCY DISTRIBUTION

• Mental health and substance use challenges can take many forms.
There’s depression, anxiety, schizophrenia, addiction and the list go
on. Some of these challenges are more visible and you might recognize
them immediately. Others can be harder to see when you’re not
looking for them. But they are still there.
These statistics provide a look at how many people face a mental
health or substance use challenge, whether we see it or not
bschool.cms.ac.in
MULTIVARIATE FREQUENCY DISTRIBUTION
• In the united states, almost half of adults (46.4%) will experience a mental illness
during their lifetime.
• 5 percent of adults (18 or older) experience a mental illness in any one year,
equivalent to 43.8 million people.
• Of adults in the United States with any mental disorder in a one-year period, 14.4
percent have one disorder, 5.8 percent have two disorders and 6 percent have
three or more.
• Half of all mental disorders begin by age 14 and three-quarters by age 24.
• In the United States, only 41 percent of the people who had a mental disorder in
the past year received professional health care or other services.

Tabulate the above data and process it to information.


Source: https://www.mentalhealthfirstaid.org/2019/02/5-surprising-mental-health-statistics/
bschool.cms.ac.in
GRAPHICAL PRESENTATION OF DATA
Statistical Graphics
Effective and Clear
Communication
Plots
through Data
Visualization uses
Information Graphics

Encoded using.. To visually


Numerical Data Dots, Lines, Bars, etc. Quantitative Message
communicate

bschool.cms.ac.in
GRAPHICAL PRESENTATION OF DATA

• Stem & Leaf Display


Tables
• Line Chart
Types of Visualization
• Bar Graph
• Sub-divided Bar Graph
Charts
• Multiple Bar Graph
• Histogram
• Pie Chart
bschool.cms.ac.in
TYPES OF TABLES

bschool.cms.ac.in
GRAPHICAL PRESENTATION OF DATA
STEM & LEAF DISPLAY
A Stem and Leaf Plot is a special table where
each data value is split into a "stem" (the first
digit or digits) and a "leaf" (usually the last
digit).
The "stem" values are listed down, and the
"leaf" values go right (or left) from the stem
values.
The "stem" is used to group the scores and
each "leaf" shows the individual scores within
each group.
bschool.cms.ac.in
GRAPHICAL PRESENTATION OF DATA
A psychologist developed a new test of adult intelligence. The test was administered to 20 individuals,
and the following data were obtained.
114, 99, 131, 124, 117, 102, 106, 127, 119, 115, 98, 104, 144, 151, 132, 106, 125, 122, 118,118
Construct a stem-and-leaf display for the data.

bschool.cms.ac.in
GRAPHICAL PRESENTATION OF DATA

Stem Leaf
9 8, 9
10 2, 4, 6, 6
11 4, 5, 7, 8, 8, 9
12 2, 4, 5, 7
13 1, 2
14 4
15 1

bschool.cms.ac.in
GRAPHICAL PRESENTATION OF DATA
Consider the data result from a 150-question aptitude test given to 50 individuals interviewed for a
position at Haskens Manufacturing.
112, 72, 69, 97, 107, 73, 92, 76, 86, 73, 126, 128, 118, 127, 124, 82, 104, 132, 134, 83, 92, 108, 96, 100,
92, 115, 76, 91, 102, 81, 95, 141, 81, 80, 106, 84, 119, 113, 98, 75, 68, 98, 115, 106, 95, 100, 85, 94, 106,
119
Construct a stem-and-leaf display for the data.

bschool.cms.ac.in
GRAPHICAL PRESENTATION OF DATA
Stem Leaf
6 8, 9
7 2, 3, 3, 5, 6, 6
8 0, 1, 1, 2, 3, 4, 5, 6
9 1, 2, 2, 2, 4, 5, 5, 6, 7, 8,
8
10 0, 0, 2, 4, 6, 6, 6, 7, 8
11 2, 3, 5, 5, 8, 9, 9
12 4, 6, 7, 8
13 2, 4
14 1 bschool.cms.ac.in
LINE CHARTS GRAPHICAL PRESENTATION OF DATA
A line chart or line plot or line graph or
curve chart is a type of chart which displays
information as a series of data points called
'markers' connected by straight line
segments.
Finance department uses Line graphs to plot
the change in the amount of cash the company
has on hand over time. The typical line chart is
depicted below for reference.

bschool.cms.ac.in
GRAPHICAL PRESENTATION OF DATA
BAR GRAPH

A bar graph, or bar chart, is a graphical device for


depicting qualitative data summarized in a
frequency, relative frequency, or percent
frequency distribution.
For qualitative data, the bar should be separated to
emphasize the fact that each class is separate. The
typical bar chart for reference drawn below.

bschool.cms.ac.in
GRAPHICAL PRESENTATION OF DATA
SUB-DIVIDED BAR GRAPH

Sub-divided Bar Diagram is a way of


representation of data in which the total length of
the bar is divided into different parts/components
in particular ratios depending upon the
contributions of various components.
This is also known as stacked bar diagram.

bschool.cms.ac.in
GRAPHICAL PRESENTATION OF DATA

MULTIPLE BAR GRAPH

A multiple bar graph shows the


relationship between different values of
data.
Each data value is represented by a column
in the graph.

bschool.cms.ac.in
GRAPHICAL PRESENTATION OF DATA
PIE CHART
The pie chart provides another graphical device for
presenting relative frequency and percent frequency
distributions for qualitative data.
To construct a pie chart, we first draw a circle to represent
all of the data.
Then we use the relative frequencies to subdivide the
circle into sectors, or parts, that correspond to the relative
frequency for each class.
The pie chart is also known as circular graph or sector
graph.
bschool.cms.ac.in
GRAPHICAL PRESENTATION OF DATA
The top six pharmaceutical companies in the United States and their sales figures ($ millions) for a
recent year is given below. Use this information to construct a pie chart and a bar graph to represent
these six companies and their sales. Pharmaceutical Company Sales Pfizer 52,921 Johnson & Johnson
47,348; Merck 22,939; Bristol-Myers Squibb 21,886; Abbott Laboratories 20,473; Wyeth 17,358.

bschool.cms.ac.in
Graphs and Charts

bschool.cms.ac.in
Graphs and Charts
The number of people confirmed with coronavirus in India reaches 5820 as on April 8th, 2020. State
wise data is as follows. Draw a suitable diagram.
Kerala: 345 Punjab: 106 Delhi: 669 Jammu and Kashmir: 158 Ladakh: 14 Rajasthan: 383 Uttar Pradesh:
361 Maharashtra: 1135 Karnataka: 181 Tamil Nadu: 738 Telangana: 364 Haryana: 147 Andhra Pradesh:
329 Himachal Pradesh: 28 Gujarat: 186 Uttarakhand: 33 ...
Read more at:
https://www.deccanherald.com/national/coronavirus-india-update-state-wise-total-number-of-confir
med-cases-deaths-on-april-8-822699.html

bschool.cms.ac.in
Graphs and Charts
Discuss the graph given below on the front sheet of the daily newspaper on 10th April 2020 in ‘The
Hindu’

bschool.cms.ac.in

You might also like