Chapter11 Sampling Systematic Sampling

Chapter 11
Systematic Sampling
The systematic sampling technique is operationally more convenient than simple random sampling. It
also ensures, at the same time that each unit has an equal probability of inclusion in the sample. In this
method of sampling, the first unit is selected with the help of random numbers, and the remaining units
are selected automatically according to a predetermined pattern. This method is known as systematic
sampling.
Suppose the N units in the population are numbered 1 to N in some order. Suppose further that N is
expressible as a product of two integers n and k , so that N  nk .
To draw a sample of size n ,

- select a random number between 1 and k .
- Suppose it is i .
- Select the first unit, whose serial number is i .
- Select every k th unit after i th unit.
- The sample will contain i, i  k ,1  2k ,..., i  (n  1)k serial number units.
So the first unit is selected at random and other units are selected systematically. This systematic sample
is called kth systematic sample and k is termed as a sampling interval. This is also known as linear
systematic sampling.
The observations in the systematic sampling are arranged as in the following table:
Systematic sample 1 2 3  i  k
number
Sample 1 y1 y2 y3  yi  yk
composition
2 yk 1 yk  2 yk  3  yk  i  y2k
      
n
y( n 1) k 1 y( n 1) k  2 y( n 1) k 3 y( n 1) k i ynk
 
Probability 1 1 1  1  1
k k k k k
Sample mean y1 y2 y3  yi  yk
Sampling Theory| Chapter 11 | Systematic Sampling | Shalabh, IIT Kanpur

Page 1
Example: Let N  50 and n  5. So k  10. Suppose first selected number between 1 and 10 is 3. Then
systematic sample consists of units with following serial number 3, 13, 23, 33, 43.
Systematic sampling in two dimensions:
Assume that the units in a population are arranged in the form of m rows, and each row contains nk
units. A sample of size mn is required. Then
- select a pair of random numbers (i, j ) such that i   and j  k .
- Select the (i, j )th unit, i.e., j th unit in i th row as the first unit.
- Then the rows to be selected are
i, i  , i  2,..., i  (m  1)
and columns to be selected are
j , j  k , j  2k ,..., j  (n  1)k .
- The points at which the m selected rows and n selected columns intersect determine the position
of mn selected units in the sample.
Such a sample is called an aligned sample.
An alternative approach to select the sample is

- independently select n random integers i1 , i2 ,..., in such that each of them is less than or equal to
.
- Independently select m random integers j1 , j2 ,..., jm such that each of them is less than or equal
to k .
- The units selected in the sample will have the following coordinates:
(i1  r , jr 1 ), (i2  r , jr 1  k ), (i3  r , jr 1  2k ),..., (in  r , jr 1  (n  1)k ) .
Such a sample is called an unaligned sample.
Under certain conditions, an unaligned sample is often superior to an aligned sample as well as a
stratified random sample.
Advantages of systematic sampling:

1. It is easier to draw a sample and often easier to execute it without mistakes. This is more
advantageous when the drawing is done in fields and offices as there may be substantial saving in
time.
2. The cost is low, and the selection of units is simple. Much less training is needed for surveyors to
collect units through systematic sampling.
3. The systematic sample is spread more evenly over the population. So no large part will fail to be
represented in the sample. The sample is evenly spread and cross-section is better. Systematic
sampling fails in case of too many blanks.
Page 2
Relation to the cluster sampling
The systematic sample can be viewed from the cluster sampling point of view. With n  nk , there are k
possible systematic samples. The same population can be viewed as if divided into k large sampling
units, each of which contains n of the original units. The operation of choosing a systematic sample is
equivalent to choosing one of the large sampling units at random, which constitutes the whole sample. A
systematic sample is thus a simple random sample of one cluster unit from a population of k cluster
units.
Estimation of population mean : When N = nk:

Let
yij : observation on the unit bearing the serial number i  ( j  1)k in the population,
i  1, 2,..., k , j  1, 2,..., n.
Suppose the drawn random number is i  k .
Sample consists of i th column (in the earlier table).
Consider the sample mean given by
1 n
ysy  yi   yij
n j 1
as an estimator of the population mean given by
1 k n
Y   yij
nk i 1 j 1
1 k
  yi .
k i 1
1
Probability of selecting i th column as systematic sample  .
k
So
1 k
E ( ysy )   yi  Y .
k i 1
Thus ysy is an unbiased estimator of Y .
Further,
1 k
Var ( ysy )   ( yi  Y )2 .
k i 1

Page 3
Consider
k n
( N  1) S 2   ( yij  Y ) 2
i 1 j 1
k n 2
   ( yij  yi )  ( yi  Y ) 
i 1 j 1
k n k
  ( yij  yi ) 2  n ( yi  Y ) 2
i 1 j 1 i 1
k
 k (n  1) S wsy
2
 n ( yi  Y ) 2
i 1
where
1 k n
S 2
wsy  
k (n  1) i 1 j 1
( yij  yi ) 2
is the variation among the units that lies within the same systematic sample . Thus
N  1 2 k (n  1) 2
Var ( ysy )  S  S wsy
N N
N  1 2 (n  1) 2
 S  S wsy
N n
 
Variation Pooled within
as a variation of the
whole k systematic sample
with N  nk . This expression indicates that when the within variation is large, then Var ( yi ) becomes
smaller. Thus higher heterogeneity makes the estimator more efficient and higher heterogeneity is well
expected in a systematic sample.
Alternative form of variance:

1 k
Var ( ysy )   ( yi  Y )2
k i 1
2
1 k 1 n 
    yij  Y 
k i 1  n j 1 
1 k  n 

kn 2
  ( y ij  Y )
 j 1
i 1 
1 k  n n n

 2    ( yij  Y ) 2    ( yij  Y )( yi  Y ) 
kn i 1  j 1 j (   ) 1  1 
1  k n n 
 2 (nk  1) S 2     ( yij  Y )( yi  Y )  .
kn  i 1 j (   ) 1  1 

Page 4
The intraclass correlation between the pairs of units that are in the same systematic sample is
E ( yij  Y )( yi  Y ) 1
w  ;   w  1
E ( yij  Y ) 2
nk  1
1 k n n
   ( yij  Y )( yi  Y )
nk (n  1) i 1 j (   ) 1  1
 .
 nk  1  2
 S
 nk 
So substituting
k n n
  (y
i 1 j (   ) 1  1
ij  Y )( yi  Y )  (n  1)(nk  1)  w S 2
in Var ( yi ) gives
nk  1 S 2
Var ( ysy )  1   w (n  1)
nk n
N 1 S 2
 1   w (n  1).
N n
Comparison with SRSWOR:

For a SRSWOR sample of size n ,
N n 2
Var ( ySRS )  S
Nn
nk  n 2
 S
Nn
k 1 2
 S .
N
Since
N 1 2 n 1 2
Var ( ysy )  S  S wsy
N n
N  nk
 k 1 N 1  2 n 1 2
Var ( ySRS )  Var ( ysy )    S  S wsy
 N N  n
n 1 2
 ( S wsy  S 2 ).
n
Thus ysy is
- 2
more efficient than ySRS when S wsy  S2 .
- 2
less efficient than ySRS when S wsy S2.
- 2
equally efficient as ySRS when S wsy  S 2.

Page 5
Also, the relative efficiency of ysy relative to ySRS is
Var ( ySRS )
RE 
Var ( ysy )
N n 2
S
 Nn
N 1 2
S 1   w (n  1) 
Nn
N n  1 
  
N  1 1   w (n  1) 
n(k  1)  1  1
  ;     1.
(nk  1) 1   w (n  1)  nk  1
Thus ysy is
1
- more efficient than ySRS when  w  
nk  1
1
- less efficient than ySRS when  w  
nk  1
1
- equally efficient as ySRS when  w   .
nk  1
Comparison with stratified sampling:

The systematic sample can also be viewed as if arising as a stratified sample. If the population of N  nk
units is divided into n strata and suppose one unit is randomly drawn from each of the strata. Then we
get a stratified sample of size n . In doing so, just consider each row of the following arrangement as a
stratum.
Systematic sample 1 2 3  i  k
number
Sample 1 y1 y2 y3  yi  yk
composition
2 yk 1 yk  2 yk  3  yk  i  y2 k
      
n
y( n 1) k 1 y( n 1) k  2 y( n 1) k 3 y( n 1) k i ynk
 
Probability 1 1 1  1  1
k k k k k
Sample mean y1 y2 y3  yi  yk

Page 6
Recall that in the case of stratified sampling with k strata, the stratum mean
1 k
yst 
N
N
j 1
j yj
is an unbiased estimator of the population mean.
Considering the set up of stratified sample in the set up of a systematic sample, we have
- Number of strata = n
- Size of strata = k (row size)
- Sample size to be drawn from each stratum = 1
and yst becomes
1 n
yst   ky j
nk j 1
1 n
  yj
n j 1
1 n
Var ( yst ) 
n2
Var ( y )
j 1
j
1 n
k 1 2  N n 2

n2
 j 1 k .1
S j  using Var ( ySRS ) 
 Nn
S 

k 1 n 2
 Sj
kn 2 j 1
k 1 2
 S wst
nk
N n 2
 S wst
Nn
where
1 k
S 2j  
k  1 i 1
( yij  y j ) 2
is the mean sum of squares of units in the j th stratum.
1 n 2 1 k n
2
S wst   j n(k  1) 
n j 1
S 
i 1 j 1
( yij  y j ) 2
is the mean sum of squares within strata (or rows).

Page 7
The variance of the systematic sample mean is
1 k
Var ( ysy )  
k i 1
( yi  Y ) 2
2
1 k 1 n 1 n 
    yij   y j 
k i 1  n j 1 n j 1 
2
1 k  n 
 2    ( yij  y j ) 
n k i 1  j 1 
1  k n k n n

2   
 ( yij  y j ) 2
 ( yij  y j )( yi  y )  .
n k  i 1 j 1 i 1 j   1 
Now we simplify and express this expression in terms of the intraclass correlation coefficient. The
intraclass correlation coefficient between the pairs of deviations of units which lie along the same row
measured from their stratum means is defined as
E ( yij  Y )( yi  Y )
 wst 
E ( yij  Y ) 2
1 k n n

nk (n  1) i 1 j   1
( yij  y j )( yi  y )
=
1 k n
 ( yij  y j )2
nk i 1 j 1
k n n

i 1 j   1
( yij  y j )( yi  y )

( N  1)(n  1) S wst
2
So
1
Var ( ysy )  ( N  n) S wst
2
 ( N  n)(n  1)  wst S wst
2

n2k 
N n 2
 S wst 1  (n  1)  wst . (using N  nk )
Nn
Thus
N n
Var ( ysy )  Var ( yst )  (n  1)  wst S wst
2
Nn
and the relative efficiency of systematic sampling relative to equivalent stratified sampling is given by
Var ( yst ) 1
RE   .
Var ( ysy ) 1  (n  1)  wst
So the systematic sampling is

- more efficient than the corresponding equivalent stratified sample when  wst  0 .
- less efficient than the corresponding equivalent stratified sample when  wst  0
- equally efficient than the corresponding equivalent stratified sample when  wst  0.

Page 8
Comparison of systematic sampling, stratified sampling and SRS with population
with the linear trend:
We assume that the values of units in the population increase according to the linear trend.
So the values of successive units in the population increase in accordance with a linear model so that
yi  a  bi, i  1, 2,..., N .
Now we determine the variances of ySRS , ysy and yst under this linear trend.
Under SRSWOR
N n 2
V ( ySRS )  S .
Nn
Here N  nk
1 N
Y  ab i
N i 1
1 N ( N  1)
 ab
N 2
N 1
 ab
2
1 N
S2  
N  1 i 1
( yi  Y ) 2
2
1 N  N  1
  
N  1 i 1 
a  bi  a  b
2 
2
b2 N  N  1 
   i  2 
N  1 i 1 
b2  N 2  N 1  
2
  i  N   
N  1  i 1  2  
b 2  N ( N  1)(2 N  1) N ( N  1) 2 
 
N  1  6 4 

N ( N  1)
 b2
12
nk  n 2 nk (nk  1)
Var ( ySRS )  b
nk .n 12
2
b
 (k  1)(nk  1).
12
Page 9
Under systematic sampling
Earlier yij denoted the value of study variable with the j th unit in the i th systematic sample. Now yij
represents the value of i  ( j  1)k  unit of the population, so

th
yij  a  b i  ( j  1)k  , i  1, 2,..., k ; j  1, 2,..., n.

ysy  yi
1 k
Var ( ysy )   ( yi  Y )2
k i 1
1 n
yi   yij
n j 1
1 n
  a  b i  ( j  1)k
n j 1 
 n 1 
 a  bi  k
 2 
2
k k
  n 1  nk  1 

i 1
( yi  Y )    a  b  i 
2
i 1  
k a b
2  2 
2
k
 k 1 
 b i 
2

i 1  2 
 k 2  k 1 
2
k 1 k 
 b  i  k 
2
 2  i
 i 1  2  2 i 1 
 k (k  1)(2k  1)  k  1  2 k ( k  1) 
b  2
   (k  1) 
 6  2  2 
b2
 k (k 2  1)
12
1 b2
Var ( ysy )  k (k 2  1)
k 12
b2
 (k 2  1).
12

Page 10
Under stratified sampling
yij  a  b i  ( j  1)k  , i  1, 2,..., k , j  1, 2,..., n
1 k
yst 
N
N y
i 1
i i
N n 2 k 1 2
Var ( yst )  S wst  S wst
Nn nk
1 n 2
2
where S wst  Sj
n j 1
1 k n
 
n(k  1) i 1 j 1
( yij  y j ) 2
2
1 k n
  k 1 
  
n(k  1) i 1 j 1 
a  b i  ( j  1)k   a  b 
 2
 ( j  1)k 

2
b2 k n
 k 1 
  i 
n(k  1) i 1 j 1 

2 
b 2 nk (k 2  1)

n(k  1) 12
k (k  1)
 b2
12
k  1 2 k (k  1)
Var ( yst )  b
nk 12
b  k 1 
2 2
  
12  n 
1
If k is large, so that is negligible, then comparing Var ( yst ),Var ( ysy ) and V ( ySRS ),
k
Var ( yst ) : Var ( ysy ) : Var ( ySRS )
k 2 1
or : k 2  1 : (k  1)(1  nk )
n
k 1
or : k  1 : nk  1
n
k 1 k  1 nk  1
or : :
n(k  1) k 1 k 1
1
 1 : n
n
Thus
1
Var ( yst ) : Var ( ysy ) : Var ( ySRS ) :: : 1 : n
n
So stratified sampling is best for linearly trended population. Next best is systematic sampling.

Page 11
Estimation of variance:
As such, there is only one cluster, so the variance in principle, cannot be estimated.
Some approximations have been suggested.
1. Treat the systematic sample as if it were a random sample. In this case, an estimate of variance is
 ( y )   1  1  s2
Var sy   wc
 n nk 
1 n 1
2
where swc  
n  1 j 0
( yi  jk  yi ) 2 .
This estimator under-estimates the true variance.
2. Use of successive differences of the values gives the estimate of variance as

n 1 2
(y )   1  1  1
Var sy     yi jk  yi( j 1) k  .
 n nk  2(n  1) j 0
This estimator is a biased estimator of true variance.
3. Use the balanced difference of y1 , y2 ,..., yn to get the estimate of variance as

n2 2
(y )   1  1  1  yi y 
Var sy    
 n nk  5(n  2) i  2
 yi 1  i  2 
2 
or
n4 2
(y )   1  1  1  yi y 
Var sy    
 n nk  15(n  4) i  2
 yi 1  yi  2  yi 3  i  4  .
2 
4. The interpenetrating subsamples can be utilized by dividing the sample into C groups each of
n
size . Then the group means are y1 , y2 ,..., yc . Now find
c
1 c
y  yt
c t 1
1 c
( y ) 
Var sy  ( yt  y )2 .
c(c  1) t 1

Page 12
Systematic sampling when N  nk .
When N is not expressible as nk then suppose N can be expressed as
N  nk  p; p  k .
Then consider the following sample mean as an estimator of the population mean
 1 n 1
 n  1  yij if i  p
ysy  yi   n
j 1
1
  yij if i  p.
 n j 1
In this case
1  p  1 n 1  n 
1 n 
E ( yi )      yij      yij  
k  i 1  n  1 j 1  i  p 1  n j 1  
 Y.
So ysy is a biased estimator of Y .
An unbiased estimator of Y is
k
ysy* 
N
y
j
ij
k
 Ci
N
where Ci  nyi is the total of values of the i th column.
k
E ( ysy* )  E (Ci )
N
k 1 k
 .  Ci
N k i 1
Y
k 2  k  1  *2
Var ( ysy* )    Sc
N2  k 
2
1 k  NY 
where S *2
c   nyi 
k  1 i 1 
 .
k 

Page 13
Now we consider another procedure which is opted when N  nk .
[Reference: Theory of Sample Surveys, A.K. Gupta, D.G. Kabe, 2011, World Scientific Publishing Co.]
When population size N is not expressible as the product of n and k , then let
N  nq  r.
Then take the sampling interval as
 n
q if r 
k 2.
q  1 n
if r 
 2
M  M
Let   denotes the largest integer contained in .
g g
If k  q* ( q or q  1) , then the
 N  N  N
  *  with probability  *   1   * 
 q  q  q 
number of units expected in sample  
  N   1 with probability  N    N  .
  q*   *   *
  q  q 
If q  q* , then we get
 r  r r 
n    with probability    1   
 q q q
n*   .
n   r   r   r 
 1 with probability     
  q   q  q

Similarly, if q*  q  1, then
 nr   (n  r )  nr 
n    with probability   1  
  q 1   (q  1)   q 1 
n 
*
n   n  r   1 with probability  n  r    (n  r )  .
  q  1     
 q  1   (q  1) 

  

Page 14
n
Example: Let N  17 and n  5. Then q  3 and r  2 . Since r  , k  q  3.
2
Then sample sizes would be
 r  r  r 1
n     5 with probability    1    
 q q q 3
n*  
n   r   1  6 with probability  r    r   2 .
  q     
  q  q 3
This can be verified from the following example:

Systematic sample number Systematic sample Probability
1 Y1 , Y4 , Y7 , Y10 , Y13 , Y16 1/3
2 Y4 , Y5 , Y8 , Y11 , Y14 , Y17 1/3
3 Y3 , Y6 , Y9 , Y12 , Y15 1/3
We now prove the following theorem which shows how to obtain an unbiased estimator of the population
mean when N  nk .
Theorem: In systematic sampling with sampling interval k from a population with size N  nk , an
unbiased estimator of the population mean Y is given by

k  n' 
Yˆ    y 
N i
where i stands for the i th systematic sample, i  1, 2,..., k and n' denotes the size of i th systematic
sample.
1
Proof. Each systematic sample has a probability . Hence
k
k
1 k  n' 
E (Yˆ )   .   y 
i 1 k N  i
1 k
 n'


N
   y  .
i 1 i
Now, each unit occurs in only one of the k possible systematic samples. Hence
k
 n'
 N
   y    Y ,
i 1 i 1
i
i
which on substitution in E (Yˆ ) proves the theorem.

Page 15
When N  nk , the systematic samples are not of the same size and the sample mean is not an unbiased
estimator of the population mean. To overcome these disadvantages of systematic sampling when
N  nk circular systematic sampling is proposed. Circular systematic sampling consists of selecting a
random number from 1 to N and then selecting the unit corresponding to this random number. After that,
every k th unit in a cyclical manner is selected until a sample of n units is obtained, k being the nearest
N
integer to .
n
In other words, if i is a number selected at random from 1 to N , then the circular systematic sample
consists of units with serial numbers
i  jk , if i  jk  N 
 j  0,1, 2,..., (n  1).
i  jk  N , if i  jk  N 
This sampling scheme ensures an equal probability of inclusion in the sample for every unit.
Example:
14
Let N  14 and n  5. Then, k  nearest integer to  3. Let the first number selected at random from
5
1 to 14 be 7. Then, the circular systematic sample consists of units with serial numbers
7,10,13, 16-14=2, 19-14=5.
This procedure is illustrated diagrammatically in the following figure.
1
12
2
13
3
12
4
11
5
10 6
7
9
8

Page 16
Theorem: In circular systematic sampling, the sample mean is an unbiased estimator of the population
mean.
Proof: If i is the number selected at random, then the circular systematic sample mean is
1 n 
y   y ,
n i
 n 
where   y  denotes the total of y values in the i th circular systematic sample, i  1, 2,..., N . We note
 i
here that in circular systematic sampling, there are N circular systematic samples, each having
1
probability of its selection. Hence,
N
N
1 n  1 1 N  n 
E( y )     y       y 
i 1 n  i N Nn i 1  i
Clearly, each unit of the population occurs in n of the N possible circular systematic sample means.
Hence,
N
 n  N
  
i 1 
y
i
 n 
i 1
Yi ,
which on substitution in E ( y ) proves the theorem.
What to do when N  nk
One of the following possible procedures may be adopted when N  nk .
(i) Drop one unit at random if the sample has (n  1) units.
(ii) Eliminate some units so that N  nk .
(iii) Adopt circular systematic sampling scheme.
(iv) Round off the fractional interval k .

Page 17

Chapter11 Sampling Systematic Sampling

Uploaded by

Copyright:

Available Formats

Chapter11 Sampling Systematic Sampling

Uploaded by

Document Information

Original Description:

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Chapter11 Sampling Systematic Sampling

Uploaded by

Copyright:

Available Formats

Chapter 11

To draw a sample of size n ,

Sampling Theory| Chapter 11 | Systematic Sampling | Shalabh, IIT Kanpur

An alternative approach to select the sample is

Advantages of systematic sampling:

Estimation of population mean : When N = nk:

Thus ysy is an unbiased estimator of Y .

Sampling Theory| Chapter 11 | Systematic Sampling | Shalabh, IIT Kanpur

Alternative form of variance:

Sampling Theory| Chapter 11 | Systematic Sampling | Shalabh, IIT Kanpur

Comparison with SRSWOR:

Sampling Theory| Chapter 11 | Systematic Sampling | Shalabh, IIT Kanpur

Comparison with stratified sampling:

Sampling Theory| Chapter 11 | Systematic Sampling | Shalabh, IIT Kanpur

is an unbiased estimator of the population mean.

and yst becomes

is the mean sum of squares of units in the j th stratum.

is the mean sum of squares within strata (or rows).

Sampling Theory| Chapter 11 | Systematic Sampling | Shalabh, IIT Kanpur

So the systematic sampling is

Sampling Theory| Chapter 11 | Systematic Sampling | Shalabh, IIT Kanpur

represents the value of i  ( j  1)k  unit of the population, so

yij  a  b i  ( j  1)k  , i  1, 2,..., k ; j  1, 2,..., n.

Sampling Theory| Chapter 11 | Systematic Sampling | Shalabh, IIT Kanpur

Sampling Theory| Chapter 11 | Systematic Sampling | Shalabh, IIT Kanpur

This estimator under-estimates the true variance.

2. Use of successive differences of the values gives the estimate of variance as

3. Use the balanced difference of y1 , y2 ,..., yn to get the estimate of variance as

Sampling Theory| Chapter 11 | Systematic Sampling | Shalabh, IIT Kanpur

So ysy is a biased estimator of Y .

Sampling Theory| Chapter 11 | Systematic Sampling | Shalabh, IIT Kanpur

Sampling Theory| Chapter 11 | Systematic Sampling | Shalabh, IIT Kanpur

This can be verified from the following example:

2 Y4 , Y5 , Y8 , Y11 , Y14 , Y17 1/3

3 Y3 , Y6 , Y9 , Y12 , Y15 1/3

unbiased estimator of the population mean Y is given by

which on substitution in E (Yˆ ) proves the theorem.

Sampling Theory| Chapter 11 | Systematic Sampling | Shalabh, IIT Kanpur

Sampling Theory| Chapter 11 | Systematic Sampling | Shalabh, IIT Kanpur

which on substitution in E ( y ) proves the theorem.

Sampling Theory| Chapter 11 | Systematic Sampling | Shalabh, IIT Kanpur

You might also like