This chapter summarizes the most common non-Rasch models considered for analysing ordered response category items. These models fall into two distinct classes. The models of the first class have a structure consistent with the PRM but with a greater number of parameters. The models of the second class are structurally different from the PRM but can have the same or more parameters than the PRM. Models from both classes do not have the sufficient statistic properties of the PRM. The application of these models arises from the Item Response Theory (IRT) paradigm in which the main criterion for the choice of the model is that of statistical fit of the responses to the model. These models are chosen to describe or summarize the data, and do not arise from any fundamental principles that are independent of the data. The full class of models, and their connection to the respective paradigms, are summarized in Andrich (2011).
For efficiency of exposition, we begin with the class of models which specializes to the PRM.
The Nominal Response Model

Again, because the response
is of a single person to a single
item, we do not subscript the person and item parameters
and
, nor the two vectors
which characterize the categories of
the item. Here, the response variable
is simply the ordinal count of the
category of the response, beginning with the first category and
is again the normalizing factor which
is the sum of the numerators of Eq. (28.1). In the development
of the Rasch model, this same equation appeared earlier (Rasch,
1961), which was developed further by Andersen
(1977), and then interpreted in terms of thresholds and discrimination at the thresholds in Andrich (1978).
In these publications,
are called, respectively, the
category coefficient and the scoring
function and we use these terms in this chapter. In order to connect this model to the PRM, and better
understand it, we now summarize the original derivation of the
PRM.
Relationship Between the PRM and the NRM
This section follows the derivation of
the threshold form of the PRM shown in Chap. 27. However, there is one important
difference. Instead of specifying the dichotomous Rasch model for
the latent dichotomous responses at the thresholds in the full space , the 2PL
model (Birnbaum, 1968) we encountered in
Chap. 18 was specified. This specification
appeared in the original derivation of the threshold form of the
PRM in Andrich (1978).








Probabilities of responses in the Guttman
subspace when the dichotomous response at
threshold
is the 2PL
model
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|







We see that Eq. (28.6) is the form of the NRM of Eq. (28.1).
With the constraints on the categories of each item, the
number of independent parameters for each item are effectively
. Although they are not typically
viewed in this way, the parameters embody a location (difficulty)
and discrimination at each threshold, a
generalization of the 2PL . Where the
model is applied, the parameters
are attempted to be estimated without
consideration of what these parameters might characterize. It is
evident from Eqs. (28.4) and (28.5) that
is the sum of discriminations of
all thresholds up to threshold
in the required order , and that
is of the same cumulative structure but with the location and
discrimination parameters at the thresholds entangled. With only one response in
one of the
categories, this model is not easy to
implement and is not used routinely in major assessments.








Thus, the PRM is an algebraic specialization of the NRM expressed in the form of threshold locations and discriminations at these thresholds with the discriminations at the thresholds all constant. The equal discriminations at the thresholds give the integer scoring function.

Equation (28.11) is the partial credit parameterization of the PRM which we encountered in Eq. (21.6) in Chap. 21. The equal discriminations at the thresholds among all items give the total score of a person across all items, an integer, as the sufficient statistic for the person parameter. With different discriminations at the thresholds , the NRM does not have a sufficient statistic in the sense that the person and item parameters can be separated in the estimation as in the PRM.
The Generalized Partial Credit Model


The generalized partial credit model also does not have sufficient statistics of the form of the PRM, but because it has a smaller number of parameters than the NRM, it is more tractable than the NRM. As indicated earlier, it is applied from the perspective of the IRT paradigm .
We now turn to the second class of models which is structurally different from the PRM.
The Graded Response Model
The model now called the graded response model (GRM) for the analysis of ordered response categories has its origins in the work of Thurstone. The possibility of collecting data in the form which implied the model was mentioned at the end of Thurstone (1928) and then further developed in Edwards and Thurstone (1952). In modern psychometric form, it is presented in Samejima (1969, 2016), and in a contingency table context, where the dependent variable is in the form of ordered response categories, it is presented in Bock (1975). The GRM was the standard model for the analysis of ordered response categories before the advent of the PRM.
In the PRM, there is a distinct latent response process at each threshold which is then constrained by the category order . In contrast, in the GRM there is only one response process across the continuum and the outcome of this process is portioned into categories.











The cumulative response structure of the graded response model







It is possible to specialize the GRM so
that the discriminations, , are the same across items. Then, the
GRM and the PRM have the same number of parameters. However, the
scale of the GRM is different from PRM,
though in any data set, the estimates of the person parameters will
be highly correlated—that is a property of the data.
The structure of the GRM ensures that
its thresholds , which are different from
the thresholds in the PRM, are
necessarily in order . This results from
the feature that . This means that those using the GRM
tend not to focus on evidence that categories might not be
operating as intended. However, the points of intersection of the
adjacent categories in category characteristic
curves of the GRM may still show reversals—they will do so
if an analysis with the PRM shows reversals. An example of a data
set with respective threshold estimates from the PRM and the GRM is
shown in Andrich (2011).
Estimation of Parameters in the Non-Rasch Models
We saw in Chap. 7 how the person parameter can be eliminated in the dichotomous Rasch model and then the item parameters can be estimated independently of the person parameters. This is because the Rasch model has sufficient statistics for its parameters. Because the non-Rasch models do not have such sufficient statistics, it is not possible to separate the estimation of the item and person parameters in the same way. Therefore, some other assumptions or constraints are required. One approach is to assume a distribution of the person parameters, such as normal, and impose it as a constraint in the estimation. Another approach is to place a constraint on the observed distribution of total scores. In any case, these methods involve first estimating a set of item parameters, then estimating a set of person parameters given the estimates of the item parameters, and then returning to the estimates of the person parameters, and so on, until the estimates converge . In many cases, all estimates do not converge and some upper limit on an estimate of an item difficulty parameter or discrimination parameter may be imposed.
These methods of estimation may also be used with the Rasch model and are used in many Rasch model software packages. RUMM2030 uses a particular kind of conditional estimation which does eliminate the person parameters in the process of estimating the item parameters. In this method, the conditional responses to pairs of items are essential elements of the estimation. The method is described in more detail in Andrich and Luo (2003).