Chapter 7 Interpreting Test Score
Chapter 7 Interpreting Test Score
Chapter 7 Interpreting Test Score
The scores from achievement, aptitude, attitude, psychological tests can not be compared directly
with each other unless the norming group is taken into consideration and the scale on which the
score is based. We addressed this issue in earlier modules on aptitude and achievement tests.
Raw scores -- the number of items correct or the number of points earned; not of much use by
themselves
Grade Equivalent scores --average score grade group in which student's raw score is average;
used to estimate or monitor growth
Standard scores -- terms of standard distance of student's raw score from the mean (average) in
terms of standard deviations; used to monitor growth; better at reflecting reality than grade
equivalent scores
Normal Curve Equivalent -- a normalized standard score; used to avoid problems with grade
equivalent scores and used to describe group performance and to show growth over time
Percentile Ranks -- student's relative position in a group in terms of the percentage of students
scoring lower than or equal to that student; used to determine relative areas of strengths and
weaknesses; can create profile analyses from these scores.
1. A test score should be interpreted in terms of the specific test from which it was derived.
2. A test score should be interpreted in light of all of the student's relevant characteristics.
4. A test score should be interpreted as a band of scores rather than as a specific score.
5. A test score should be verified by supplementary evidence.
6. Do NOT interpret a grade equivalent score as an estimate of the grade where a student should
be placed.
7. Do NOT assume that the units are equal at different parts of the scale.
Test publishers provide a variety of ways of presenting results. The figures in your text are just a
few of the presentations possible.
1. above average
2. average
3. below average
1. above average
2. average
3. below average
Scale scores vary from test to test and from grade to grade within the same test. The range,
standard deviations, and means vary by test, subtest, and grade. They are very often reported and
can be converted to cumulative frequency at midpoint which in turn can be converted to
percentile ranks which are much easier to interpret.
Given a scale score and the number of students earning below that and the number of students
earning exactly that scale score the cumulative frequency at midpoint can be calculated. The
definition of cumulative frequency at midpoint is all the students who earned scale scores below
a given score plus one half of the students who earned that scale score.
An example, if we know that 36 students earned scale scores lower than 400 and 6 students
earned scale scores of exactly 400, then we take one half of 6 and add that to 36, and we know
that the cumulative frequency at midpoint for a scale score of 400 is 40. If we then divide that by
the number of students who took the test, we have the percentile rank. Given that 50 students
took the test, we divide 40 by 50 and obtain a percentile rank of 80. Now we know that this
student has performed as well as or better than 80% of his/her peers. We would also say that this
student is average.