Formalization and Derivation of CTT Eqs. (3.1)–(3.5) in Chap. 3




Thus, we see that the observed score on the variable is the sum of two
other variables, the true and error scores. Both are unobserved and
therefore latent scores. In addition, they are real numbers,
whereas is typically an integer.

This is the second most relevant equation in CTT .
The next important concept developed in CTT is the formalization of reliability . It begins with the idea of two tests being administered to measure the same construct. These tests are often termed parallel tests. In some situations, there really are two tests, but we do not need them to develop a theory and then see what other ways we can calculate the reliability .
Each test will have its error of measurement , but the true score for a person will be the same. Here we have to use double subscripts briefly.
Let the score of person n on test 1 be
and on test
2 be
.


Because we are interested
in how consistent the observed scores are from the two tests, we
calculate the correlation between and
. We would want the correlation to be
high. We begin with the calculation of the covariance between
and
.
Derivation of Covariance
![$$ \begin{aligned} c_{12} & = \frac{{\sum\nolimits_{n = 1}^{N} {(y_{n1} - \overline{y}_{1} )(y_{n2} - \overline{y}_{2} )} }}{N - 1} \\ & = \frac{{\sum\nolimits_{n = 1}^{N} {[(t_{n} + e_{n1} ) - (\overline{t} + \overline{e}_{1} )][(t_{n} + e_{n2} ) - (\overline{t} + \overline{e}_{2} )]} }}{N - 1} \\ & = \frac{{\sum\nolimits_{n = 1}^{N} {[(t_{n} + e_{n1} ) - (\overline{t} )][(t_{n} + e_{n2} ) - (\overline{t} )]} }}{N - 1} \\ & = \frac{{\sum\nolimits_{n = 1}^{N} {[t_{n}^{2} + t_{n} e_{n2} - t_{n} \overline{t} + e_{n1} t_{n} + e_{n1} e_{n2} - e_{n1} \overline{t} - \overline{t} t_{n} - \overline{t} e_{n2} + \overline{t} \overline{t} ]} }}{N - 1} \\ & = \frac{{\sum\nolimits_{n = 1}^{N} {t_{n}^{2} + \sum\nolimits_{n = 1}^{N} {t_{n} e_{n2} } - \sum\nolimits_{n = 1}^{N} {t_{n} \overline{t} } + \sum\nolimits_{n = 1}^{N} {e_{n1} t_{n} } + \sum\nolimits_{n = 1}^{N} {e_{n1} e_{n2} } - \sum\nolimits_{n = 1}^{N} {e_{n1} \overline{t} } - \sum\nolimits_{n = 1}^{N} {\overline{t} t_{n} } - \sum\nolimits_{n = 1}^{N} {\overline{t} e_{n2} } + \sum\nolimits_{n = 1}^{N} {\overline{t}_{{}}^{2} } } }}{N - 1} \\ \end{aligned} $$](/epubstore/A/D-Andrich/A-Course-In-Rasch-Measurement-Theory/OEBPS/images/470896_1_En_25_Chapter/470896_1_En_25_Chapter_TeX_Equa.png)
Now because the error is assumed to be not correlated with the true score , nor with itself across two different tests, the sum of the products of all terms which contain an error term will be 0.

The second last step is proved as follows:




From this equation, we can derive another relevant relationship based on correlations.

However, and
.
Note that here we assume that the error
variance on both tests is the same—of course the true scores must
be the same. Therefore, the variance of the observed scores is the
same. That is, if is the variance of the observed
scores y of any two parallel
tests 1 and 2, then these variances should be equal:
.
Equation (25.6) represents the
proportion of the total variance that is
true variance. Thus the reliability
of the test
is equivalent to the proportion of the total
variance that is true score
variance .







Derivation of the Standard Error of Measurement

Now we need to appreciate what the error variance is; it is the variation of scores from test to test for persons with the same true score or from the same person on more than one occasion.

Equation (25.11) is known as the standard error of measurement .
Notice that if the reliability is 1, that is, the scores on the two parallel tests are perfect, then the standard error is 0; if the reliability is 0, then the standard error is the standard deviation of the original scores which means that all of the variations is error variance.
Derivation of the Equation for Predicting the True Score from the Observed Score
From the
observed score and the test ’s reliability , it
is possible to estimate the true score
and the standard error of this
estimate. This estimate is made using the equation for
regression.
It will be recalled from Statistics Review 4 that variable Y for
person n can be predicted from
variable X using the equation . Note that we now use the subscript
n for the person rather than
i to be consistent with the
presentation here. In this equation,
and
.







In this equation, we
apparently do not know the value of .
However, in the population . Therefore, we substitute
as an estimate of
.

Thus using Eq. (25.13) we can predict the true score from the observed score and from Eq. (25.11) we can estimate the error in this prediction.
Derivation of Coefficient α






This is the expression for reliability in Eq. (4.5).


The variances of the total scores and
the items, and
, are calculated simply as
and
where
is the number of persons
involved.