1 General Purpose
Categories are very common in medical
research. Examples include age classes, income classes, education
levels, drug dosages, diagnosis groups, disease severities, etc.
Statistics has generally difficulty to assess categories, and
traditional models require either binary or continuous variables.
If in the outcome, categories can be assessed with multinomial
regression (Chap. 44). If as predictors, they can be
assessed with linear regression for categorical predictors (Chap.
8). However, with multiple categories
or with categories both in the outcome and as predictors, random
intercept models may provide better sensitivity of testing. The
latter models assume that for each predictor category or
combination of categories x1, x2,…slightly
different a-values can be computed with a better fit for the
outcome category y than a single a-value.
We should add that, instead of the above linear equation, even
better results were obtained with log-transformed outcome variables
(log = natural logarithm).


2 Schematic Overview of Type of Data File

3 Primary Scientific Question
Are in a study of exposure and outcome
categories the exposure categories significant predictors of the
outcome categories. Does a random intercept provide better
test-statistics than does a fixed effects analysis.
4 Data Example
In a study, three hospital departments
(no surgery, little surgery, lot of surgery), and three patient age
classes (young, middle, old) were the predictors of the risk class
of falling out of bed (fall out of bed no, yes but no injury, yes
and injury). Are the predictor categories significant determinants
of the risk of falling out of bed with or without injury. Does a
random intercept provide better statistics.
Outcome fall out of bed
|
Predictor department
|
Predictor ageclass
|
Patient_id
|
1
|
0
|
1,00
|
1,00
|
1
|
0
|
1,00
|
2,00
|
1
|
0
|
2,00
|
3,00
|
1
|
0
|
1,00
|
4,00
|
1
|
0
|
1,00
|
5,00
|
1
|
0
|
,00
|
6,00
|
1
|
1
|
2,00
|
7,00
|
1
|
0
|
2,00
|
8,00
|
1
|
1
|
2,00
|
9,00
|
1
|
0
|
,00
|
10,00
|
5 Data Analysis with a Fixed Effect Generalized Linear Mixed Model
Only the first 10 patients of the 55
patient file is shown above. The entire data file is in
extras.springer.com and is entitled “chapter45randomintercept.sav”.
SPSS version 20 and up can be used for analysis. First, we will
perform a fixed intercept model.
The module Mixed Models consists of two
statistical models:
-
Linear,
-
Generalized Linear.
For analysis the statistical model
Generalized Linear Mixed Models is required.
First we will perform a fixed effects
model analysis, then a random effects model.
Command:
-
Click Analyze….Mixed Models....Generalized Linear Mixed Models....click Data Structure….click “patient_id” and drag to Subjects on the Canvas….click Fields and Effects….click Target….Target: select “fall with/out injury”….click Fixed Effects ….click “agecat” and “department” and drag to Effect Builder:….mark Include intercept….click Run.
The underneath results show that both
the various regression coefficients as well as the overall
correlation coefficients between the predictors and the outcome
are, generally, statistically significant.


6 Data Analysis with a Random Effect Generalized Linear Mixed Model
Subsequently, a random intercept
analysis is performed.
Command:
-
Analyze….Mixed Models....Generalized Linear Mixed Models....click Data Structure….click “patient_id” and drag to Subjects on the Canvas….click Fields and Effects….click Target….Target: select “fall with/out injury”….click Fixed Effects ….click “agecat” and “department” and drag to Effect Builder:….mark Include intercept….click Random Effects….click Add Block…mark Include intercept ….Subject combination: select patient_id….click OK….click Model Options….click Save Fields…mark PredictedValue….mark PredictedProbability….click Save ....click Run.
The underneath results show the
test-statistics of the random intercept model. The random intercept
model shows better statistics:
p = 0.007 and 0.013
|
overall for age,
|
p = 0.001 and 0.004
|
overall for department,
|
p = 0.003 and 0.005
|
regression coefficients for age class 0
versus 2,
|
p = 0.900 and 0.998
|
for age class 1 versus 2,
|
p = 0.004 and 0.008
|
for department 0 versus 2, and
|
p = 0.0001 and 0.0002
|
for department 1 versus 2.
|


In the random intercept model we have
also commanded predicted values (variable 7) and predicted
probabilities of having the predicted values as computed by the
software (variables 5 and 6).
1
|
2
|
3
|
4
|
5
|
6
|
7 (variables)
|
0
|
1
|
1,00
|
1,00
|
,224
|
,895
|
1
|
0
|
1
|
1,00
|
2,00
|
,224
|
,895
|
1
|
0
|
1
|
2,00
|
3,00
|
,241
|
,903
|
1
|
0
|
1
|
1,00
|
4,00
|
,224
|
,895
|
1
|
0
|
1
|
1,00
|
5,00
|
,224
|
,895
|
1
|
0
|
1
|
,00
|
6,00
|
,007
|
,163
|
2
|
1
|
1
|
2,00
|
7,00
|
,185
|
,870
|
1
|
0
|
1
|
2,00
|
8,00
|
,241
|
,903
|
1
|
1
|
1
|
2,00
|
9,00
|
,185
|
,870
|
1
|
0
|
1
|
,00
|
10,00
|
,007
|
,163
|
2
|
Like automatic linear regression (see
Chap. 7), and other generalized mixed
linear models (see Chap. 12), random intercept models include
the possibility to make XML files from the analysis, that can
subsequently be used for making predictions about the chance of
falling out of bed in future patients. However, SPSS uses here
slightly different software called winRAR ZIP files that are
“shareware”. This means that you pay a small fee and be registered
if you wish to use it. Note that winRAR ZIP files have an archive
file format consistent of compressed data used by Microsoft since
2006 for the purpose of filing XML (eXtended Markup Language)
files. They are only employable for a limited period of time like
e.g. 40 days.
7 Conclusion
Generalized linear mixed models are
suitable for analyzing data with multiple categorical variables.
Random intercept versions of these models provide better
sensitivity of testing than fixed intercept models.