Statistics and Probability - Notes
Statistics and Probability - Notes
Statistics and Probability - Notes
ALERIA
Logical decisions or solutions to a problem could be attained
through a statistical process.
CHAPTER 1: EXPLORING DATA
CHAPTER 1: EXPLORING DATA
LESSON 1: INTRODUCING STATISTICS
LESSON 2: DATA COLLECTION ACTIVITY
1. Statistics as a Tool in Decision-Making
- Statistics is defined as a science that studies data to be able to make
a decision. Hence, it is a tool in decision-making process.
- Statistics as a science involves the methods of collecting, processing,
summarizing and analyzing data in order to provide answers or
solutions to an inquiry.
- One also needs to interpret and communicate the results of the
methods identified above to support a decision that one makes when
faced with a problem or an inquiry.
Statistics enable us to
• characterize persons, objects, situations, and phenomena;
• explain relationships among variables;
• formulate objective assessments and comparisons; and, more
importantly
• make evidence-based decisions and predictions.
2. Statistical Process in Solving a Problem
a statistical process in making a decision or providing solutions to a
problem include the following:
Planning or designing the collection of data to answer statistical
questions in a way that maximizes information content and
minimizes bias;
Collecting the data as required in the plan;
Verifying the quality of the data after they were collected;
Summarizing the information extracted from the data; and
Examining the summary statistics so that insight and meaningful
information can be produced to support decision-making or
solutions to the question or problem at hand. A. Contextualization of Data
- Data are facts and figures that are presented, collected and analyzed.
KEY POINTS
Data are either numeric or non-numeric and must be contextualized.
Difference between questions that could be and those that could - To contextualize data, we must identify its six W’s or to put meaning on
not answered using Statistics. the data, we must know the following W’s of the data:
Statistics is a science that studies data. 1. Who? Who provided the data?
There are many uses of Statistics but its main use is in decision- 2. What? What are the information from the respondents and
making. What is the unit of measurement used for each of the
information (if there are any)?
3. When? When was the data collected?
4. Where? Where was the data collected?
MATH 2- STATISTICS AND PROBABILITY SENIOR HIGH SCHOOL GOLDEN SUCCESS COLLEGE PREPARED BY MR. JOVIT R. ALERIA
5. Why? Why was the data collected? 2. Broad Classification of Variables
6. HoW? HoW was the data collected?
Example:
KEY POINTS
A universe is a collection of units from which the data were gathered. 2. Methods of Data Collection
A variable is a characteristic we observed or measured from every Variables were observed or measured using any of the three methods
element of the universe. of data collection, namely: objective, subjective and use of existing
A population is a set of all possible values of a variable. records.
A sample is a subgroup of a universe or a population.
In a study there is only one universe but could have several o The objective and subjective methods - obtained the data
populations. directly from the source.
Variables could be classified as qualitative or quantitative, and the o The former uses any or combination of the five senses (sense of
latter could be further classified as discrete or continuous. sight, touch, hearing, taste and smell) to measure the variable
while the latter obtains data by getting responses through a
CHAPTER 1: EXPLORING DATA questionnaire.
LESSON 4: LEVELS OF MEASUREMENT o The resulting data from these two methods of data collection
is referred to as primary data.
1. Levels of Measurement o The data gathered in Lesson 2 are primary data and were
obtained using the subjective method.
Four levels of measurement of variables: nominal, ordinal, interval and o secondary data are obtained through the use of existing
ratio. These are hierarchical in nature and are described as follows: records or data collected by other entities for certain purposes.
A. Nominal level of measurement - arises when we have variables For example, when we use data gathered by the Philippine
that are categorical and nonnumeric or where the numbers have Statistics Authority, we are using secondary data and the
no sense of ordering. As an example, consider the numbers on the method we employ to get the data is the use of existing
uniforms of basketball players. records. Other data sources include administrative records,
B. Ordinal level - also deals with categorical variables like the nominal news articles, internet, and the like.
level, but in this level ordering is important, that is the values of the
variable could be ranked. KEY POINTS
C. Interval level - tells us that one unit differs by a certain amount of Four levels of measurement: Nominal, Ordinal, Interval and Ratio
degree from another unit. Knowing how much one unit differs from Knowing what level the variable was measured or observed will
another is an additional property of the interval level on top of guide us to know the type of analysis to apply.
having the properties possess by the ordinal level. Three methods of data collection include objective, subjective
D. Ratio level - also tells us that one unit has so many times as much of and use of existing records.
the property as does another unit. The ratio level possesses a Using the data collection method as basis, data can be classified
meaningful (unique and non-arbitrary) absolute, fixed zero point as either primary or secondary data.
and allows all arithmetic operations.
MATH 2- STATISTICS AND PROBABILITY SENIOR HIGH SCHOOL GOLDEN SUCCESS COLLEGE PREPARED BY MR. JOVIT R. ALERIA
- These methods include the (1) textual or narrative; (2) tabular; and (3)
graphical method of presentation.
(1) Textual or narrative - In presenting the data in textual or paragraph
or narrative form, one describes the data by enumerating some of
the highlights of the data set like giving the highest, lowest or the
average values. In case there are only few observations, say less
than ten observations, the values could be enumerated if there is
a need to do so. An example of which is shown below: