Unit 2 Ipr
Unit 2 Ipr
Unit 2 Ipr
Religious Affiliation
Catholic= 1
Protestant= 2
Jewish= 3
Muslim= 4
Other= 5
Categorical data are measured on nominal scales which merely assign labels to distinguish
categories
Ordinal Scale
•But, there is no information regarding the differences (intervals) between points on the scale
Interval Scale
Fahrenheit Scale
Ratio Scale
It isn’t so straight-forward??
•The underlying constructor variable being measured defines the scale of measurement, not the
numbers themselves (Why?)
•Statistical procedures use numbers without considering the underlying constructs that are
measured
•Measurement is the foundation, but whether or not statistics can be interpreted depends on
research design issues
•Independent variables:
manipulated by the experimenter
under the control of the experimenter
•Dependent variables:
not under the experimenter’s control
usually the outcome to be measured
•Typically, we are interested in measuring the effects of independent variables on dependent
variables
An experimenter measures blood pressure response in male and female hypertensive subjects,
randomly assigned to two drug treatment conditions: standard medication A, and improved
medication B
Statistical Notation
10 12 25 7 40
the set of scores is referred to as X
X1=10 X2=12 X3=25 X4=7 X5=40
To refer to a single score, without specifying which one, we will use Xi, where i can take on any
value from 1 to 5, or 1 to N.
Summation Notation: One of the most common symbols in statistics is the uppercase Greek letter
sigma (Σ)Σ means sum up, or add upΣNiXi= 10 + 12 + 25 + 7 + 40 = 94
ΣNiXi=ΣX
ΣX2= 102+ 122+ 252+ 72+ 402= 2518
A researcher should try his best to keep the number of questions as small as possible,
keeping in view the nature, objectives and scope of the enquiry. Respondent‘s
time should not be wasted by asking irrelevant and unimportant questions. A large
number of questions would involve more work for the investigator and thus result
in delay on his part in collecting and submitting the information. A large number of
unnecessary questions may annoy the respondent and he may refuse to cooperate.
A reasonable questionnaire should contain from 15 to 25 questions at large. If a
still larger number of questions are a must in any enquiry, then the questionnaire
should be divided into various sections or parts.
Logical arrangement of questions reduces lot of unnecessary work on the part of the
researcher because it not only facilitates the tabulation work but also does not
leave any chance for omissions or commissions. For example, to find if a person
owns a television, the logical order of questions would be: Do you own a
television? When did you buy it? What is its make? How much did it cost you? Is
its performance satisfactory? Have you ever got it serviced?
4. questions should be simple to understand:
The vague words like good, bad, efficient, sufficient, prosperity, rarely,
frequently,reasonable, poor, rich etc., should not be used since these may be
interpreted differently by different persons and as such might give unreliable and
misleading information. Similarly, the use of words having double meaning like
price, assets, capital income etc., should also be avoided.
Questions should be designed in such a way that they are readily comprehensible and
easy to answer for the respondents. They should not be tedious nor should they tax
the respondents‘ memory. At the same time questions involving mathematical
calculations like percentages, ratios etc., should not be asked.
There are some questions which disturb the respondents and he/she may be shy or
irritated by hearing such questions. Therefore, every effort should be made to
avoid such questions. For example, ‗do you cook yourself or your wife cooks?‘ ‗Or
do you drink?‘ Such questions will
certainlytherespondentsandthusbeavoidedatanycost.Ifunavoidable then highest
amount of politeness should beused.
7. Types of questions:
Under this head, the questions in the questionnaire may be classified as follows:
Shut questions are those where possible answers are suggested by the framers of the
questionnaire and the respondent is required to tick one of them. Shut questions
can further be subdivided into the following forms:
• Simple alternate questions:
In this type of questions the respondent has to choose from the two
clearcutalternativeslike‗Yes‘or‗No‘,‗RightorWrong‘etc.Suchquestionsare also
called as dichotomous questions. This technique can be applied with elegance to
situations where two clear cut alternatives exist.
• Multiple choice questions:
Do you smoke?
Multiple choice questions are very easy and convenient for therespondents to
answer. Such questions save time and also facilitate tabulation. This method
should be used if only a selected few alternative answers exist to a particular
question.
Questions like ‗why do you use a particular type of car, say Maruti car‘ should
preferably be framed into two questions-
(i) Which car do you use? (ii) why do you prefer it?
9 Crosschecks:
It would be practical in every sense to try out the questionnaire on a small scale
before using it for the given enquiry on a large scale. This has been found extremely
useful in practice. The given questionnaire can be improved or modified in the
light of the drawbacks, shortcomings and problems faced by the investigator in the
pretest.
A covering letter from the organizers of the enquiry should be enclosed along with the
questionnaire for the purposes regarding definitions, units, concepts used in the
questionnaire, for taking the respondent‘s confidence, self-addressed envelope in
case of mailed questionnaire, mention about award or incentives for the quick
response, a promise to send a copy of the survey report etc.
Sampling Methods | Types, Techniques & Examples
When you conduct research about a group of people, it’s rarely possible to collect data from every
person in that group. Instead, you select a sample. The sample is the group of individuals who will
actually participate in the research.
To draw valid conclusions from your results, you have to carefully decide how you will select a
sample that is representative of the group as a whole. This is called a sampling method. There are
two primary types of sampling methods that you can use in your research:
• Probability sampling involves random selection, allowing you to make strong statistical
inferences about the whole group.
• Non-probability sampling involves non-random selection based on convenience or other
criteria, allowing you to easily collect data.
You should clearly explain how you selected your sample in the methodology section of your paper
or thesis, as well as how you approached minimizing research bias in your work.
The population can be defined in terms of geographical location, age, income, or many other
characteristics.
It can be very broad or quite narrow: maybe you want to make inferences about the whole adult
population of your country; maybe your research focuses on customers of a certain company,
patients with a specific health condition, or students in a single school.
It is important to carefully define your target population according to the purpose and practicalities of
your project.
If the population is very large, demographically mixed, and geographically dispersed, it might be
difficult to gain access to a representative sample. A lack of a representative sample affects
the validity of your results, and can lead to several research biases, particularly sampling bias.
Sampling frame
The sampling frame is the actual list of individuals that the sample will be drawn from. Ideally, it
should include the entire target population (and nobody who is not part of that population).
Example: Sampling frameYou are doing research on working conditions at a social media marketing
company. Your population is all 1000 employees of the company. Your sampling frame is the
company’s HR database, which lists the names and contact details of every employee.
Sample size
The number of individuals you should include in your sample depends on various factors, including
the size and variability of the population and your research design. There are different sample size
calculators and formulas depending on what you want to achieve with statistical analysis.
To conduct this type of sampling, you can use tools like random number generators or other
techniques that are based entirely on chance.
Example: Simple random samplingYou want to select a simple random sample of 1000 employees
of a social media marketing company. You assign a number to every employee in the company
database from 1 to 1000, and use a random number generator to select 100 numbers.
2. Systematic sampling
Systematic sampling is similar to simple random sampling, but it is usually slightly easier to conduct.
Every member of the population is listed with a number, but instead of randomly generating
numbers, individuals are chosen at regular intervals.
Example: Systematic samplingAll employees of the company are listed in alphabetical order. From
the first 10 numbers, you randomly select a starting point: number 6. From number 6 onwards, every
10th person on the list is selected (6, 16, 26, 36, and so on), and you end up with a sample of 100
people.
If you use this technique, it is important to make sure that there is no hidden pattern in the list that
might skew the sample. For example, if the HR database groups employees by team, and team
members are listed in order of seniority, there is a risk that your interval might skip over people in
junior roles, resulting in a sample that is skewed towards senior employees.
3. Stratified sampling
Stratified sampling involves dividing the population into subpopulations that may differ in important
ways. It allows you draw more precise conclusions by ensuring that every subgroup is properly
represented in the sample.
To use this sampling method, you divide the population into subgroups (called strata) based on the
relevant characteristic (e.g., gender identity, age range, income bracket, job role).
Based on the overall proportions of the population, you calculate how many people should be
sampled from each subgroup. Then you use random or systematic sampling to select a sample from
each subgroup.
Example: Stratified samplingThe company has 800 female employees and 200 male employees.
You want to ensure that the sample reflects the gender balance of the company, so you sort the
population into two strata based on gender. Then you use random sampling on each group,
selecting 80 women and 20 men, which gives you a representative sample of 100 people.
4. Cluster sampling
Cluster sampling also involves dividing the population into subgroups, but each subgroup should
have similar characteristics to the whole sample. Instead of sampling individuals from each
subgroup, you randomly select entire subgroups.
If it is practically possible, you might include every individual from each sampled cluster. If the
clusters themselves are large, you can also sample individuals from within each cluster using one of
the techniques above. This is called multistage sampling.
This method is good for dealing with large and dispersed populations, but there is more risk of error
in the sample, as there could be substantial differences between clusters. It’s difficult to guarantee
that the sampled clusters are really representative of the whole population.
Example: Cluster samplingThe company has offices in 10 cities across the country (all with roughly
the same number of employees in similar roles). You don’t have the capacity to travel to every office
to collect your data, so you use random sampling to select 3 offices – these are your clusters.
This type of sample is easier and cheaper to access, but it has a higher risk of sampling bias. That
means the inferences you can make about the population are weaker than with probability samples,
and your conclusions may be more limited. If you use a non-probability sample, you should still aim
to make it as representative of the population as possible.
Non-probability sampling techniques are often used in exploratory and qualitative research. In these
types of research, the aim is not to test a hypothesis about a broad population, but to develop an
initial understanding of a small or under-researched population.
1. Convenience sampling
A convenience sample simply includes the individuals who happen to be most accessible to the
researcher.
This is an easy and inexpensive way to gather initial data, but there is no way to tell if the sample is
representative of the population, so it can’t produce generalizable results. Convenience samples are
at risk for both sampling bias and selection bias.
Example: Convenience samplingYou are researching opinions about student support services in
your university, so after each of your classes, you ask your fellow students to complete a survey on
the topic. This is a convenient way to gather data, but as you only surveyed students taking the
same classes as you at the same level, the sample is not representative of all the students at your
university.
Voluntary response samples are always at least somewhat biased, as some people will inherently
be more likely to volunteer than others, leading to self-selection bias.
Example: Voluntary response samplingYou send out the survey to all students at your university and
a lot of students decide to complete it. This can certainly give you some insight into the topic, but the
people who responded are more likely to be those who have strong opinions about the student
support services, so you can’t be sure that their opinions are representative of all students.
3. Purposive sampling
This type of sampling, also known as judgement sampling, involves the researcher using their
expertise to select a sample that is most useful to the purposes of the research.
It is often used in qualitative research, where the researcher wants to gain detailed knowledge about
a specific phenomenon rather than make statistical inferences, or where the population is very small
and specific. An effective purposive sample must have clear criteria and rationale for inclusion.
Always make sure to describe your inclusion and exclusion criteria and beware of observer
bias affecting your arguments.
Example: Purposive samplingYou want to know more about the opinions and experiences of
disabled students at your university, so you purposefully select a number of students with different
support needs in order to gather a varied range of data on their experiences with student services.
4. Snowball sampling
If the population is hard to access, snowball sampling can be used to recruit participants via other
participants. The number of people you have access to “snowballs” as you get in contact with more
people. The downside here is also representativeness, as you have no way of knowing how
representative your sample is due to the reliance on participants recruiting others. This can lead
to sampling bias.
Example: Snowball samplingYou are researching experiences of homelessness in your city. Since
there is no list of all homeless people in the city, probability sampling isn’t possible. You meet one
person who agrees to participate in the research, and she puts you in contact with other homeless
people that she knows in the area.
5. Quota sampling
Quota sampling relies on the non-random selection of a predetermined number or proportion of
units. This is called a quota.
You first divide the population into mutually exclusive subgroups (called strata) and then recruit
sample units until you reach your quota. These units share specific characteristics, determined by
you prior to forming your strata. The aim of quota sampling is to control what or who makes up your
sample.
Essentials of sampling:
In order to reach a clear conclusion, the sampling should possess the following essentials:
It must be representative: The sample selected should possess the similar characteristics of
the original universe from which it has been drawn.
• Homogeneity: Selected samples from the universe should have similar nature and should not
have any difference when compared with the universe.
Adequate samples: In order to have a more reliable and representative result, a good number
of items are to be included in the sample.
• Optimization: All efforts should be made to get maximum results both in terms of cost as well as
efficiency. If the size of the sample is larger, there is better efficiency and at the same time the cost
is more. A proper size of sample is maintained in order to have optimized results in terms of cost
and efficiency.
2. Substitution: