
Chapter 3, Part 1: Validity and Reliability

The document discusses various characteristics of effective assessment methods, including validity, reliability, practicability, comprehensiveness, and relevance. It defines validity as the extent to which an assessment measures what it intends to measure. There are four types of validity: face validity, content validity, construct validity, and criterion validity. Reliability refers to the consistency and stability of measurement and includes four types: inter-rater reliability, test-retest reliability, parallel-forms reliability, and internal consistency reliability. Effective assessments should not only be valid and reliable but also practical, comprehensive, and relevant to the learning outcomes being assessed.


by: LLOYD PSYCHE T. BALTAZAR
Characteristics of Assessment Methods
1. VALIDITY is the extent to which an instrument measures what it intends to measure.

2. RELIABILITY is the degree to which a test is consistent and stable in measuring what it is intended to measure.

3. PRACTICABILITY considers that the test must have practical value in terms of time, economy, scorability, and ease of administration.
Characteristics of Assessment Methods

4. COMPREHENSIVENESS considers that the test must cover the lessons taught and assess knowledge, skills, and values as adequately as possible.

5. RELEVANCE means that the test measures the desired learning outcomes.
Validity

Validity is the extent to which a measurement tool measures what it is supposed to measure.

Remember your thermometer? It is measuring the room temperature, not your body temperature. Since it is supposed to be measuring your body temperature, the thermometer is not valid.
Validity
VALIDITY is an indication of how sound your assessment is.

▪ Validity also refers to the usefulness of the instrument for a given purpose.

▪ A test is valid when it is aligned with the learning outcome.
4 Types of Validity
Face Validity
Content Validity
Construct Validity
Criterion Validity
4 Types of Validity
Face validity is the degree to which a test appears to be related to a specific construct, in the judgment of non-experts such as test takers.

▪ A test has face validity if its content simply looks relevant to the person taking the test.

▪ It evaluates the appearance of the questionnaire in terms of feasibility, readability, consistency of style and formatting, and the clarity of the language used.
4 Types of Validity

Content validity is the degree to which elements of an assessment instrument are relevant to and representative of the targeted construct for a particular assessment purpose (Haynes, Richard, & Kubany, 1995).
4 Types of Validity
Content Validity
▪ involves evaluation of a test to ensure that it includes all the items that are essential and eliminates undesirable items

▪ the teacher writes out the test based on the Table of Specifications (TOS)

▪ subject matter experts evaluate the extent to which our test adequately captures the content domain
4 Types of Validity
Content Validity

Examples of measurements that are content valid:

➢ If we want to test knowledge of Philippine History (construct), then the test must cover everything from the beginning to the present.

➢ AP Physics knowledge (construct) measured by the AP exam (measurement).
4 Types of Validity
Content Validity

Examples of measurements that have debatable content validity:

➢ The Bar Exam is not a good measure of the ability to practice law.

➢ IQ tests are not a good way to measure intelligence.

4 Types of Validity
Construct validity is defined as the experimental demonstration that a test is measuring the construct it claims to be measuring.

A construct, or psychological construct as it is also called, is an attribute, proficiency, ability, or skill that happens in the human brain and is defined by established theories.

Examples: English proficiency, Math ability, Anxiety, Intelligence

4 Types of Validity
Construct Validity
▪ Construct validity is established when relationships between our test and other variables confirm what is predicted by theory.

▪ For example, theory might indicate that the personality traits of conscientiousness and neuroticism should be negatively related. If we develop a test of conscientiousness and then demonstrate that scores on our test correlate negatively with scores on a test of neuroticism, this provides evidence of construct validity.
4 Types of Validity
Construct Validity

▪ Furthermore, theory might indicate that conscientiousness contains three specific dimensions. Statistical analysis of the items within our test could show that the items tend to cluster, or perform similarly, in three specific groups.
4 Types of Validity
Construct Validity

1. Convergent validity refers to the degree to which two measures of constructs that theoretically should be related are, in fact, related.

2. Discriminant validity tests whether constructs that should have no relationship are, in fact, unrelated.
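To make this concrete, here is a minimal sketch, not part of the original slides, of how convergent and discriminant evidence is often summarized with simple correlations in Python. All scores are invented, and the "shoe size" variable is a hypothetical stand-in for an unrelated construct: a clearly negative correlation with neuroticism supports the theoretical prediction, while a near-zero correlation with the unrelated variable supports discriminant validity.

```python
# Minimal sketch (hypothetical data): convergent and discriminant validity
# evidence summarized with Pearson correlations using numpy.
import numpy as np

# Made-up scores for the same 10 test takers on three measures.
conscientiousness = np.array([12, 15, 9, 18, 14, 11, 16, 13, 10, 17])
neuroticism       = np.array([14, 10, 17, 7, 11, 15, 9, 12, 16, 8])  # theory: negatively related
shoe_size         = np.array([9, 8, 10, 9, 11, 8, 9, 10, 9, 10])     # theory: unrelated construct

# Convergent evidence: measures of theoretically related constructs
# should correlate in the direction the theory predicts.
r_convergent = np.corrcoef(conscientiousness, neuroticism)[0, 1]

# Discriminant evidence: measures of unrelated constructs
# should show a correlation near zero.
r_discriminant = np.corrcoef(conscientiousness, shoe_size)[0, 1]

print(f"conscientiousness vs neuroticism: r = {r_convergent:.2f}")   # expect clearly negative
print(f"conscientiousness vs shoe size:   r = {r_discriminant:.2f}")  # expect near zero
```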
4 Types of Validity
Criterion validity is the extent to which a measure is related to an outcome. It measures how well one measure predicts an outcome for another measure.

▪ It is useful for predicting performance or behavior in another situation (past, present, or future).
4 Types of Validity
Examples of criterion validity:

➢ A job applicant takes a performance test during the interview process. If this test accurately predicts how well the employee will perform on the job, the test is said to have criterion validity.

➢ A graduate student takes the GRE (Graduate Record Exam). The GRE has been shown to be an effective tool (i.e., it has criterion validity) for predicting how well a student will perform in graduate studies.
4 Types of Validity
Criterion Validity

1. Predictive validity occurs when a test accurately predicts what it is supposed to predict.
4 Types of Validity
Criterion Validity
2. Concurrent validity refers to the extent to which the results of a particular test, or measurement, correspond to those of a previously established measurement of the same construct.
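As an illustrative sketch, not part of the original slides, both forms of criterion validity are commonly reported as a correlation between the test and its criterion. The scores below are invented: a hypothetical new admissions test, later GPA as a future criterion (predictive validity), and an already established test taken at about the same time (concurrent validity).

```python
# Minimal sketch (hypothetical data): criterion validity as the correlation
# between a test and a criterion measure, using scipy's Pearson correlation.
from scipy.stats import pearsonr

# Made-up scores for 8 students on a new admissions test.
new_test    = [62, 75, 81, 58, 90, 70, 66, 85]

# Predictive validity: criterion collected later (first-year GPA).
later_gpa   = [2.4, 3.0, 3.3, 2.1, 3.8, 2.9, 2.6, 3.5]

# Concurrent validity: criterion collected at about the same time
# (an already established measure of the same construct).
established = [60, 72, 84, 55, 93, 68, 63, 88]

r_pred, p_pred = pearsonr(new_test, later_gpa)
r_conc, p_conc = pearsonr(new_test, established)

print(f"predictive validity coefficient: r = {r_pred:.2f} (p = {p_pred:.3f})")
print(f"concurrent validity coefficient: r = {r_conc:.2f} (p = {p_conc:.3f})")
```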
Reliability
Reliability is the degree to which an assessment tool produces stable and consistent results.

➢ It is a measure of the stability or consistency of test scores. You can also think of it as the ability of a test or research findings to be repeatable.

▪ For example, a medical thermometer is a reliable tool that would measure the correct temperature each time it is used.
4 Types of Reliability
Inter-rater Reliability

Test-retest Reliability

Parallel-Forms Reliability

Internal Consistency Reliability


4 Types of Reliability
Inter-rater reliability is a measure of reliability used to assess the degree to which different judges or raters agree in their assessment decisions.

Type of Reliability: Inter-rater Reliability
When to Use: When you want to know whether there is consistency in the rating of some outcome.
How to Use: Examine the percent of agreement between raters.
An Example of What You Can Say When You're Done: The inter-rater reliability for the best-dressed football player judging was 0.9, which indicates a high degree of agreement among judges.
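A minimal sketch, with invented ratings, of the percent-of-agreement computation described above; the two raters and their "pass"/"fail" judgments are hypothetical.

```python
# Minimal sketch (hypothetical ratings): percent agreement between two raters
# who each classified the same 10 performances as "pass" or "fail".
rater_a = ["pass", "pass", "fail", "pass", "fail", "pass", "pass", "fail", "pass", "pass"]
rater_b = ["pass", "fail", "fail", "pass", "fail", "pass", "pass", "fail", "pass", "pass"]

# Percent agreement: the share of cases where both raters gave the same rating.
agreements = sum(a == b for a, b in zip(rater_a, rater_b))
percent_agreement = agreements / len(rater_a)

print(f"inter-rater agreement = {percent_agreement:.2f}")  # 9 of 10 -> 0.90
```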
4 Types of Reliability
Test-retest reliability is a measure of reliability obtained by administering the same test twice over a period of time to a group of individuals. The scores from Time 1 and Time 2 can then be correlated in order to evaluate the test for stability over time.

Type of Reliability: Test-retest Reliability
When to Use: When you want to know whether a test is reliable over time.
How to Use: Correlate the scores from a test given at Time 1 with the same test given at Time 2.
An Example of What You Can Say When You're Done: The Bonzo test of identity formation for adolescence is reliable over time.
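A minimal sketch, with invented scores, of correlating Time 1 and Time 2 results as described above; a correlation close to 1 would indicate that the test is stable over time.

```python
# Minimal sketch (hypothetical scores): test-retest reliability as the
# correlation between the same test given at Time 1 and Time 2.
import numpy as np

time1 = np.array([55, 60, 72, 48, 80, 66, 59, 75])
time2 = np.array([57, 58, 74, 50, 79, 64, 61, 77])

r_test_retest = np.corrcoef(time1, time2)[0, 1]
print(f"test-retest reliability = {r_test_retest:.2f}")  # close to 1 -> stable over time
```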
4 Types of Reliability
Parallel-forms reliability is a measure of reliability obtained by administering different versions of an assessment tool (both versions must contain items that probe the same construct, skill, knowledge base, etc.) to the same group of individuals.

Type of Reliability: Parallel-Forms Reliability
When to Use: When you want to know if several different forms of a test are reliable or equivalent.
How to Use: Correlate the scores from one form of the test with the scores from a second, different form of the same test of the same content.
An Example of What You Can Say When You're Done: Set A and Set B of the Math exams are equivalent to one another.
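The same correlation approach applies here as for test-retest reliability, except that Form A is correlated with Form B given to the same group. This sketch uses invented scores and hypothetical form names.

```python
# Minimal sketch (hypothetical scores): parallel-forms reliability as the
# correlation between Form A and Form B taken by the same group.
import numpy as np

form_a = np.array([18, 25, 30, 14, 27, 22, 19, 28])
form_b = np.array([17, 26, 29, 15, 28, 21, 20, 27])

r_parallel_forms = np.corrcoef(form_a, form_b)[0, 1]
print(f"parallel-forms reliability = {r_parallel_forms:.2f}")  # high r -> equivalent forms
```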
4 Types of Reliability
Internal consistency reliability is a measure of reliability used to evaluate the degree to which different test items that probe the same construct produce similar results. Cronbach's alpha (α) is usually used.

Type of Reliability: Internal Consistency Reliability
When to Use: When you want to know if the items on a test assess one, and only one, dimension.
How to Use: Correlate each individual item score with the total score.
An Example of What You Can Say When You're Done: All of the items on the emotional intelligence test assess the same construct.
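A minimal sketch, with an invented respondents-by-items score matrix, of computing Cronbach's alpha from the item variances and the variance of the total scores; values near 1 suggest the items consistently measure one construct.

```python
# Minimal sketch (hypothetical item scores): Cronbach's alpha for internal
# consistency, computed from a respondents-by-items score matrix.
import numpy as np

# Rows are 6 respondents; columns are 4 items meant to tap one construct.
scores = np.array([
    [4, 5, 4, 4],
    [3, 3, 2, 3],
    [5, 5, 5, 4],
    [2, 2, 3, 2],
    [4, 4, 4, 5],
    [3, 2, 3, 3],
])

k = scores.shape[1]                          # number of items
item_vars = scores.var(axis=0, ddof=1)       # variance of each item
total_var = scores.sum(axis=1).var(ddof=1)   # variance of the total scores

# Cronbach's alpha: (k / (k - 1)) * (1 - sum of item variances / total variance)
alpha = (k / (k - 1)) * (1 - item_vars.sum() / total_var)
print(f"Cronbach's alpha = {alpha:.2f}")     # values near 1 suggest consistent items
```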
4 Types of Reliability
Type of Reliability: Description
Inter-rater Reliability: Different people, same test
Test-retest Reliability: Same people, different times
Parallel-Forms Reliability: Same people, same time, different/equivalent tests
Internal Consistency Reliability: Different questions, same construct


Other Good Qualities of Assessment Instruments

1. Scorability
- ease of scoring
- the test should have clear directions for scoring

2. Administrability
- clear provisions for directions and test rules for the students to follow
Other Good Qualities of Assessment Instruments

3. Objectivity
- agreement of two or more raters with regard to scoring
- the test should not be influenced by personal bias

4. Fairness
- absence of discrimination in the test due to race, skin color, gender, religion, etc.
Other Good Qualities of Assessment Instruments

5. Adequacy
- an adequate test contains items that are representative of the concept to be measured
