Computer and statistics
Computer and statistics
Website: www.jptsonline.org
TABLE OF CONTENTS
CHAPTER ONE: INTRODUCTION TO COMPUTER
CHAPTER TWO: CLASSIFICATION OF COMPUTER
CHAPTER THREE: APPLICATION OF COMPUTER
CHAPTER FOUR: ADVANTAGES OF COMPUTER
CHAPTER FIVE: LMITATIONS OF COMPUTER
CHAPTER SIX: COMPONENTS OF A COMPUTER SYSTEM
CHAPTER SEVEN: UNITS OF MEASUREMENT
2
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
CHAPTER ONE
INTRODUCTION TO COMPUTER
Today, almost all of us in the world make use of computers in one way or the
other. It finds applications in various fields of engineering, medicine, commercial,
research and others. Not only in these sophisticated areas, but also in our daily
lives, computers have become indispensable. They are present everywhere, in all
the devices that we use daily like cars, games, washing machines, microwaves etc.
and in day to day computations like banking, reservations, electronic mails,
internet and many more.
The word computer is derived from the word compute. Compute means to
calculate. The computer was originally defined as a superfast calculator. It had the
capacity to solve complex arithmetic and scientific problems at very high speed.
But nowadays, in addition to handling complex arithmetic computations,
computers perform many other tasks like accepting, sorting, selecting, moving,
comparing various types of information. They also perform arithmetic and logical
operations on alphabetic, numeric and other types of information. This
information provided by the user to the computer is data. The information in one
form which is presented to the computer is the input information or input data.
Therefore, a computer can now be defined as a fast and accurate data processing
system that accepts data, performs various operations on the data, has the
capability to store the data and produce the results on the basis of detailed step
by step instructions given to it.
The terms hardware and software are almost always used in connection with the
computer.
3
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
• The Hardware:
The hardware is the machinery itself. It is made up of the physical parts or devices
of the computer system like the electronic Integrated Circuits (ICs), magnetic
storage media and other mechanical devices like input devices, output devices
etc. All these various hardware are linked together to form an effective functional
unit. The various types of hardware used in the computers, has evolved from
vacuum tubes of the first generation to Ultra Large Scale Integrated Circuits of the
present generation.
• The Software:
The computer hardware itself is not capable of doing anything on its own; it has
to be given explicit instructions to perform the specific task. The computer
program is the one which controls the processing activities of the computer. The
computer thus functions according to the instructions written in the program.
4
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
Software Types
5
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
6
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
The computers of the first generation were very bulky and emitted large amount
of heat which required air conditioning. They were large in size and cumbersome
to handle. They had to be manually assembled and had limited commercial use.
The concept of operating systems was not known at that time. Each computer
had a different binary coded program called a machine language that told it how
to operate.
The Abacus, which emerged about 5000 years ago in Asia Minor and is still in use
today, allows users to make computations using a system of sliding beads
arranged on a rack. Early merchants used Abacus to keep trading transactions.
7
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
electronic relay computer. Electromagnetic signals were used for the movement
of mechanical parts. Mark I could perform the basic arithmetic and complex
equations. Although this machine was extremely reliable, it was very slow (it took
about 3-5 seconds per calculation) and was complex in design and large in size.
EDVAC –
In the mid 1940’s Dr. John von Neumann designed the Electronic Discrete
Variable Automatic Computer with a memory to store both program and data.
This was the first machine which used the stored program concept. It had five
distinct units - arithmetic, central control, memory, input and output. The key
element was the central control. All the functions of the computer were co-
ordinate through this single source, the central control. The programming of the
computers was done in machine language
UNIVAC –
Remington Rand designed this computer specifically for business data processing
applications. The Universal Automatic Computer was the first general purpose
commercially available computer.
8
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
IBM 1401 was universally accepted throughout the industry and most large
businesses routinely processed financial information using second generation
computers. The machine language was replaced by assembly language. Thus the
long and difficult binary code was replaced with abbreviated programming code
which was relatively easy to understand.
The stored program concept and programming languages gave the computers
flexi bility to finally be cost effective and productive for business use. The stored
program concept implied that the instructions to run a computer for a specific
task were held inside the computer’s memory and could quickly be modified or
replaced by a different set of instructions for a different function. High level
languages like COBOL, FORTRAN and AL- GOL were dev eloped. Computers
started finding vast and varied applications. The entire software industry began
with the second generation computers.
9
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
The third generation computers made use of ‘Integrated Circuits that had 10-20
components on each chip, this was Small Scale Integration (SSI).
The Fourth Generation realized Large Scale Integration (LSI) which could fit
hundreds of components on one chip and Very Large Scale integration (VLSI)
which squeezed thousands of components on one chip. The Intel 4004 chip,
located all the components of a computer (central processing unit, memory, input
and output controls) on a single chip and microcomputers were introduced.
Higher capacity storage media like magnetic disks were developed. Fourth
generation languages emerged and applications softwares started becoming
popular.
10
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
As the computers started becoming more and more powerful, they could be
linked together or networked to share not only data but also memory space and
software. The networks could reach enormous proportions with local area
networks. A global web of computer circuitry, the Internet, links the computers
worldwide into a single network of information.
Defining the fifth generation computers is somewhat difficult because the field is
still in its infancy. The computers of tomorrow would be characterized by Artificial
Intelligence (AI). An example of Al is Expert Systems. Computers could be
developed which could think and reason in much the same way as humans.
Computers would be able to accept spoken words as input (voice recognition).
Many advances in the science of computer design and technology are coming
together to enable the creation of fifth generation computers. Two such advances
are parallel processing where many CPUs work as one and advance in
superconductor technology which allows the flow of electricity with little or no
resistance, greatly improving the speed of information flow.
11
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
CHAPTER TWO
CLASSIFICATION OF COMPUTERS
Computers are broadly classified into two categories depending upon the logic
used in their design as:
Analog computers:
In analog computers, data is recognized as a continuous measurement of a
physical property like voltage, speed, pressure etc. Readings on a dial or graphs
are obtained as the output, ex. Voltage, temperature; pressure can be measured
in this way.
Digital Computers:
These are high speed electronic devices. These devices are programmable. They
process data by way of mathematical calculations, comparison, sorting etc. They
accept input and produce output as discrete signals representing high (on) or low
(off) voltage state of electricity. Numbers, alphabets, symbols are all represented
as a series of 1s and Os.
Digital Computers are further classified as General Purpose Digital Computers and
Special Purpose Digital Computers. General Purpose computers can be used for
any applications like accounts, payroll, data processing etc. Special purpose
computers are used for a specific job like those used in automobiles, microwaves
etc.
Another classification of digital computers is done on the basis of their capacity to
access memory and size like:
• Small Computers:
I) Microcomputers: Microcomputers are generally referred to as Personal
Computers (PCs). They have smallest memory and less power. They are
widely used in day to day applications like office automation, and
professional applications, ex. PCAT, Pentium etc.
II) Note Book and Laptop Computers: These are portable in nature and are
battery operated. Storage devices like CDs, floppies etc. and output devices
like printers can be connected to these computers. Notebook computers
12
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
are smaller in physical size than laptop computers. However, both have
powerful processors, support graphics, and can accept mouse driven input.
III) Hand Held Computers
These types of computers are mainly used in applications like collection of
field data. They are even smaller than the note book computers.
13
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
CHAPTER THREE
APPLICATIONS OF COMPUTERS
Today computers find widespread applications in all activities of the modern
world. Some of the major application areas include:
Business:
Record keeping, budgets, reports, inventory, payroll, invoicing, accounts are all
the areas of business and industry where computers are used to a great extent.
Database management is one of the major area where computers are used on a
large scale. The areas of application here include banking, airline reservations,
etc. where large amounts of data need to be updated, edited, sorted, searched
from large databases.
Medicine:
Computerized systems are now in widespread use in monitoring patient data like
pulse rate, blood pressure etc. resulting in faster and accurate diagnosis. Modern
day medical equipment are highly computerized today. Computers are also widely
used in medical research.
Information:
This is the age of information. Television, Satellite communication, Internet,
networks are all based on computers.
Education:
The use of computers in education is increasing day by day. The students develop
the habit of thinking more logically and are able to formulate problem solving
techniques. CDs on a variety of subjects are available to impart education. On line
training programs for students are also becoming popular day by day. All the
major encyclopedias, dictionaries and books are now available in the digital form
14
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
and therefore are easily accessible to the student of today. Creativity in drawing,
painting, designing, decoration, music etc. can be well developed with computers.
15
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
CHAPTER FOUR
ADVANTAGES OF COMPUTERS
Speed:
Accuracy:
Computers are very accurate. They are capable of executing hundreds of
instructions without any errors. They do not make mistakes in their computations.
They perform each and every calculation with the same accuracy.
Efficiency
The efficiency of computers does not decrease with age. The computers can
perform repeated tasks with the same efficiency any number of times without
exhausting there selves. Even if they are instructed to execute millions of
instructions, they are capable of executing them all with the same speed and
efficiency without exhaustion.
Storage Capability
Computers are capable of storing large amounts of data in their storage devices.
These dev ices occupy very less space and can store millions of characters in
condensed forms. These storage devices typically include floppy disks, tapes, hard
disks, CDs etc, the data stored on these devices can be retrieved and reused
whenever it is required in future
Versatility
Computers are very versatile. They are capable not only of performing complex
mathematical tasks of science and engineering, but also other non-numerical
operations fielding air-line reservation, electricity bills, data base management
etc.
16
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
CAHPTER FIVE
LIMITATIONS OF COMPUTERS
Although the computers of today are highly intelligent and sophisticated they
have their own limitations. The computer cannot think on its own, since it does
not have its own brain. It can only do what is has been programmed to do. It can
execute only those jobs that can be expressed as a finite set of instructions to
achieve a specific goal. Each of the steps has to be clearly defined. The computers
do not learn from previous experience nor can they arrive at a conclusion without
going through all the intermediate steps. However the impact of computers on
today’s society in phenomenal and they are today an important part of the
society.
A COMPUTER SYSTEM
Any system is defined as a group of integrated parts which are designed to
achieve a common objective. Thus, a system is made up of more than one
element or part, where each element performs a specific function and where all
the elements (parts) are logically related and are controlled in such a way that the
goal (purpose) of the system is achieved.
Each of these units performs a specific task. However, none of them can function
independently on their own. They are logically related and controlled to achieve a
specific goal. When they are thus integrated they form a fully-fledged computer
system.
17
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
CHAPTER SIX
COMPONENTS OF A COMPUTER SYSTEM
The basic parts of computer system are:
Input Unit
The Central Processing Unit
Output Unit
18
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
19
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
Output Unit:
The output devices give the results of the process and computations to the
outside world. The output units accept the results produced by the computer,
convert them into a human readable form and supply them to the users. The
more common output devices are printers, plotters, display screens, magnetic
tape drives etc.
20
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
CHAPTER SEVEN
UNITS OF MEASUREMENT
Unit of Measurements
Storage Measurements: The basic unit used in computer data storage is called a
bit (binary digit). Computers use these little bits, which are composed of ones and
zeros, to do things and talk to other computers. All your files, for instance, are
kept in the computer as binary files and translated into words and pictures by the
software (which is also ones and zeros). This ‘two-number’ system is called a
“binary number system” since it has only two numbers in it. The decimal number
system in contrast has ten unique digits, zero through nine.
Bit BIT 0 or 1
Kilobyte KB 1024 Byte
Megabyte MB 1024 Kilobyte
Gigabyte GB 1024 Megabyte
Terabyte TB 1024 Gigabyte
Size example
• 1 bit - answer to an yes/no question
• 1 byte - a number from 0 to 255.
• 90 bytes: enough to store a typical line of text from a book.
• 4 KB: about one page of text.
• 120 KB: the text of a typical pocket book.
• 3 MB - a three minute song (128k bitrate)
• 650-900 MB - an CD-ROM
• 1 GB -114 minutes of uncompressed CD-quality audio at 1.4 Mbit/s
• 8-16 GB - size of a normal flash drive
21
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
22
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
STATISTICS
23
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
TABLE OF CONTENTS
CHAPTER ONE: DEFINITION, SCOPE AND LIMITATIONS
CHAPTER TWO: INTRODUCTION TO SAMPLING METHODS
CHAPTER THREE: COLLECTION OF DATA: CLASSIFICATION AND TABULATION
CHAPTER FOUR: FREQUENCY DISTRIBUTION
CHAPTER FIVE: DIAGRAMMATIC AND GRAPHICAL REPRESENTATION
CHAPTER SIX: MEASURE OF CENTRAL TENDENCY
CHAPTER SEVEN: MEASURE OF DISPERSION: SKEWNESS AND KURTOSIS
CHAPTER EIGHT: CORRELATION
CHAPTER NINE: REGRESSION
CHAPTER TEN: INDEX NUMBERS
24
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
CHAPTER ONE
DEFINITIONS, SCOPE AND LIMITATIONS
1.1 Introduction
In the modern world of computers and information technology, the importance of
statistics is very well-recognized by all the disciplines. Statistics has originated as a
science of statehood and found applications slowly and steadily in Agriculture,
Economics, Commerce, Biology, Medicine, Industry, planning, education and so
on. As of date, there is no other human walk of life, where statistics cannot be
applied.
Meaning of Statistics
Statistics is concerned with scientific methods for collecting, organizing,
summarizing, presenting and analyzing data as well as deriving valid conclusions
and making reasonable decisions on the basis of this analysis. Statistics is
concerned with the systematic collection of numerical data and its interpretation.
Definitions
Statistics is defined differently by different authors over a period of time. In the
olden days statistics was confined to only state affairs but in modern days it
embraces almost every sphere of human activity. Therefore, a number of old
definitions, which was confined to narrow field of enquiry were replaced by more
25
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
Statistics may be called the science of counting in one of the departments due to
Bowley, obviously this is an incomplete definition as it takes into account only the
aspect of collection and ignores other aspects such as analysis, presentation and
interpretation.
Bowley gives another definition for statistics, which states ‘statistics may be
rightly called the scheme of averages’. This definition is also incomplete, as
averages play an important role in understanding and comparing data and
statistics provide more measures.
1. Collection of Data: It is the first step and this is the foundation upon which
the entire data set. Careful planning is essential before collecting the data. There
26
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
Functions of Statistics
There are many functions of statistics. Let us consider the following five
important functions.
Condensation:
Generally speaking, by the word ‘to condense’, we mean to reduce or to lessen.
Condensation is mainly applied at embracing the understanding of a huge mass of
data by providing only few observations. If in a particular class in Chennai School,
only marks in an examination are given, no purpose will be served. Instead if we
are given the average mark in that particular examination, definitely it serves the
better purpose. Similarly, the range of marks is also another measure of the data.
Thus, Statistical measures help to reduce the complexity of the data and
consequently to understand any huge mass of data.
27
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
Comparison
Classification and tabulation are the two methods that are used to condense the
data. They help us to compare data collected from different sources. Grand totals,
measures of central tendency measures of dispersion, graphs and diagrams,
coefficient of correlation etc. provide ample scope for comparison.
If we have one group of data, we can compare within itself. If the rice production
(in Tonnes) in Tanjore district is known, then we can compare one region with
another region within the district. Or if the rice production (in Tonnes) of two
different districts within Tamilnadu is known, then also a comparative study can
be made. As statistics is an aggregate of facts and figures, comparison is always
possible and in fact comparison helps us to understand the data in a better way.
Forecasting:
By the word forecasting, we mean to predict or to estimate beforehand. Given
the data of the last ten years connected to rainfall of a particular district in
Tamilnadu, it is possible to predict or forecast the rainfall for the near future. In
business also forecasting plays a dominant role in connection with production,
sales, profits etc. The analysis of time series and regression analysis plays an
important role in forecasting.
Estimation:
One of the main objectives of statistics is drawn inference about a population
from the analysis for the sample drawn from that population. The four major
branches of statistical inference are
1. Estimation theory
2. Tests of Hypothesis
3. Non-Parametric tests
4. Sequential analysis
28
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
Tests of Hypothesis:
A statistical hypothesis is some statement about the probability distribution,
characterizing a population on the basis of the information available from the
sample observations. In the formulation and testing of hypothesis, statistical
methods are extremely useful. Whether crop yield has increased because of the
use of new fertilizer or whether the new medicine is effective in eliminating a
particular disease are some examples of statements of hypothesis and these are
tested by proper statistical tools.
Scope of Statistics:
Statistics is not a mere device for collecting numerical data, but as a means of
developing sound techniques for their handling, analysing and drawing valid
inferences from them. Statistics is applied in every sphere of human activity –
social as well as physical – like Biology, Commerce, Education, Planning, Business
Management, Information Technology, etc. It is almost impossible to find a single
department of human activity where statistics cannot be applied. We now discuss
briefly the applications of statistics in other disciplines.
29
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
As so many multinational companies have invaded into our Indian economy, the
size and volume of business is increasing. On one side the stiff competition is
increasing whereas on the other side the tastes are changing and new fashions
are emerging. In this connection, market survey plays an important role to exhibit
the present conditions and to forecast the likely changes in future.
For an example, five fertilizers are applied to five plots each of wheat and the
yields of wheat on each of the plots are given. In such a situation, we are
interested in finding out whether the effect of these fertilizers on the yield is
significantly different or not. In other words, whether the samples are drawn from
the same normal population or not. The answer to this problem is provided by the
technique of ANOVA and it is used to test the homogeneity of several population
means.
Alfred Marshall said, “Statistics are the straw only which I like every other
economist have to make the bricks”. It may also be noted that statistical data and
techniques of statistical tools are immensely useful in solving many economic
problems such as wages, prices, production, distribution of income and wealth
and so on. Statistical tools like Index numbers, time series Analysis, Estimation
theory, Testing Statistical Hypothesis are extensively used in economics.
30
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
In order to achieve the above goals, the statistical data relating to production,
consumption, demand, supply, prices, investments, income expenditure etc and
various advanced statistical techniques for processing, analysing and interpreting
such complex data are of importance. In India statistics, play an important role in
planning, commissioning both at the central and state government levels.
SYSTAT, a software package offers mere scientific and technical graphing options
than any other desktop statistics package. SYSTAT supports all types of scientific
and technical research in various diversified fields as follows
31
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
Limitations of statistics:
Statistics with all its wide application in every sphere of human activity has its
own limitations. Some of them are given below.
32
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
33
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
CHAPTER TWO
INTRODUCTION TO SAMPLING METHODS
Introduction
Sampling is very often used in our daily life. For example, while purchasing food
grains from a shop we usually examine a handful from the bag to assess the
quality of the commodity. A doctor examines a few drops of blood as sample and
draws conclusion about the blood constitution of the whole body. Thus, most of
our investigations are based on samples. In this chapter, let us see the importance
of sampling and the various methods of sample selections from the population.
Population
In a statistical enquiry, all the items, which fall within the purview of enquiry, are
known as Population or Universe. In other words, the population is a complete
set of all possible observations of the type which is to be investigated. Total
numbers of students studying in a school or college, total number of books in a
library, total number of houses in a village or town are some examples of
population.
34
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
Census Method
Information on population can be collected in two ways – census method and
sample method. In census method, every element of the population is included in
the investigation. For example, if we study the average annual income of the
families of a particular village or area, and if there are 1000 families in that area,
we must study the income of all 1000 families. In this method, no family is left
out, as each family is a unit.
Limitations
1. It requires a large number of enumerators and it is a costly method
2. It requires more money, labour, time energy etc.
3. It is not possible in some circumstances where the universe is infinite.
Sampling
The theory of sampling has been developed recently but this is not new. In our
everyday life we have been using sampling theory as we have discussed in
introduction. In all those cases we believe that the samples give a correct idea
about the population. Most of our decisions are based on the examination of a
few items that is sample studies.
35
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
Sample
Statisticians use the word sample to describe a portion chosen from the
population. A finite subset of statistical individuals defined in a population is
called a sample. The number of units in a sample is called the sample size.
Sampling unit
The constituents of a population which are individuals to be sampled from the
population and cannot be further subdivided for the purpose of the sampling at a
time are called sampling units. For example, to know the average income per
family, the head of the family is a sampling unit. To know the average yield of rice,
each farm owner’s yield of rice is a sampling unit.
Sampling frame
For adopting any sampling procedure it is essential to have a list identifying each
sampling unit by a number. Such a list or map is called sampling frame. A list of
voters, a list of house holders, a list of villages in a district, a list of farmers etc. are
a few examples of sampling frame.
Reasons for selecting a sample
Sampling is inevitable in the following situations:
36
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
In general, we use Greek or capital letters for population parameters and lower
case Roman letters to denote sample statistics. [N, µ, σ, are the standard symbols
for the size, mean, S.D, of population. n , x , s, are the standard symbol for the
size, mean, s.d of sample respectively].
Principles of Sampling
Samples have to provide good estimates. The following principle tell us that the
sample methods provide such good estimates
Other things being equal, as the sample size increases, the results tend to be
more accurate and reliable.
3. Principle of Validity
This states that the sampling methods provide valid estimates about the
population units (parameters).
4. Principle of Optimization
This principle takes into account the desirability of obtaining a sampling design
which gives optimum results. This minimizes the risk or loss of the sampling
design.
37
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
1. Sampling errors
Although a sample is a part of population, it cannot be expected generally to
supply full information about population. So, there may be in most cases
difference between statistics and parameters. The discrepancy between a
parameter and its estimate due to sampling process is known as sampling error.
2. Non-sampling errors:
In all surveys, some errors may occur during collection of actual information.
These errors are called Non-sampling errors.
38
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
1. Probability sampling.
2. Non-probability sampling.
3. Mixed sampling.
Probability sampling (Random Sampling)
A probability sample is one where the selection of units from the population is
made according to known probabilities. (eg.) Simple random sample, probability
proportional to sample size etc.
Non-Probability Sampling
It is the one where discretion is used to select ‘representative’ units from the
population (or) to infer that a sample is ‘representative’ of the population. This
method is called judgement or purposive sampling. This method is mainly used
for opinion surveys; A common type of judgement sample used in surveys is
quota sample. This method is not used in general because of prejudice and bias of
the enumerator. However, if the enumerator is experienced and expert, this
method may yield valuable results. For example, in the market research survey of
the performance of their new car, the sample was all new car purchasers.
Mixed Sampling
Here samples are selected partly according to some probability and partly
according to a fixed sampling rule; they are termed as mixed samples and the
technique of selecting such samples is known as mixed sampling.
Methods of selection of samples
Here we shall consider the following three methods:
1. Simple random sampling.
2. Stratified random sampling.
3. Systematic random sampling.
1. Simple random sampling
A simple random sample from finite population is a sample selected such that
each possible sample combination has equal probability of being chosen. It is also
called unrestricted random sampling.
2. Simple random sampling without replacement
In this method, the population elements can enter the sample only once (i.e.) the
unit once selected is not returned to the population before the next draw.
39
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
a) Lottery Method:
This is the most popular and simplest method. In this method, all the items of the
population are numbered on separate slips of paper of same size, shape and
colour. They are folded and mixed up in a container. The required numbers of
slips are selected at random for the desire sample size. For example, if we want to
select 5 students, out of 50 students, then we must write their names or their roll
numbers of all the 50 students on slips and mix them. Then we make a random
selection of 5 students.
This method is mostly used in lottery draws. If the universe is infinite this method
is inapplicable.
1. Tippett’s table
2. Fisher and Yates’ table
3. Kendall and Smith’s table are the three tables among them.
A random number table is so constructed that all digits 0 to 9 appear independent
of each other with equal frequency. If we have to select a sample from population
of size N = 100, then the numbers can be combined three by three to give the
numbers from 001 to 100.
40
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
digit number 000,001,002, ….. 999 are assigned. We may start at any place and
may go on in any direction such as column wise or row- wise in a random number
table. But consecutive numbers are to be used.
On the basis of the size of the population and the random number table available
with us, we proceed according to our convenience. If any random number is
greater than the population size N, then N can be subtracted from the random
number drawn.
203 023 277 353 100 294 109 179 272 284 450 141 148
408 280
41
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
42
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
The population size is denoted by N and the sample size is denoted by ‘n’ the
sample size is allocated to each stratum in such a way that the sample fractions is
a constant for each stratum. That is given by n/N = c. So in this method each
stratum is represented according to its size.
n1 = n × N1 = 50 × 200 = 20
N 500
n2 = n × N2 = 50 × 300 = 30
N 500
The sample sizes are 20 from A and 30 from B. Then the units from each
institution are to be selected by simple random sampling.
43
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
Limitations
1. To divide the population into homogeneous strata, it requires more
money, time and statistical experience which is a difficult one.
2. Improper stratification leads to bias, if the different strata overlap such a
sample will not be a representative one.
Systematic Sampling:
This method is widely employed because of its ease and convenience. A
frequently used method of sampling when a complete list of the population is
available is systematic sampling.
It is also called Quasi-random sampling.
Selection Procedure
The whole sample selection is based on just a random start. The first unit is
selected with the help of random numbers and the rest get selected automatically
according to some pre designed pattern is known as systematic sampling. With
systematic random sampling every Kth element in the frame is selected for the
sample, with the starting point among the first K elements determined at random.
44
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o
Limitations
1. Systematic sampling may not represent the whole population.
45
JOINT PROFESSIONALS TRAINING AND SUPPORT www.jptsonline.o