Correlation Coefficients Top-Down Correlation for Comparison of Sensitivity Analysis Methods
Two-Dimensional Probabilistic Risk Assessment Model 1515
Table I. Input Variables and Corresponding Mean Values, 95 Probability Ranges, and Distribution Types in the Growth Estimation Part
of the E. coli Model
8
Distribution Type 95 Probability
Factor Mean Value
a
Range
a
Variability Uncertainty
b
Unit Storage temperature, retail Temp1
8.7 7.7, 13.1
EDF
c
Beta
◦
C Storage temperature, transportation Temp2
9.4 7.7, 15.5
EDF
c
Beta
◦
C Storage temperature, home Temp3
9.1 7.7, 19
EDF
c
Beta
◦
C Storage time, retail Time1
23.4 0.5, 92.6
Exponential Uniform
hr Storage time, transportation Time2
1.0 0, 2.0
EDF
c
Beta hr
Storage time, home Time3 25.2
0.6, 99.0 Exponential
Uniform hr
Maximum density MD 7.5
5.4, 9.6 Triangle
d
Uniform log
10
Lag period, retail LP1 73.5
19.6, 136.1 Exponential
Normal hr
Lag period, transportation LP2 64.9
12.5, 131.5 Exponential
Normal hr
Lag period, home LP3 72.0
8.2, 137.9 Exponential
Normal hr
Generation time, retail GT1 9.9
2.8, 15.9 Exponential
Normal hr
Generation time, transportation GT2 8.7
1.9, 15.5 Exponential
Normal hr
Generation time, home GT3 9.8
1.3, 16.0 Exponential
Normal hr
a
Values are based on the comingled analysis of variability and uncertainty see Section 3.4.2.
b
Uncertainty distributions are defined for parameters of variability distributions.
c
Empirical distribution function EDF based on FSIS data.
8
Beta distribution is used to define the corresponding cumulative frequency at each temperature as:
C F
i
= BetaINV
α, β where,
i = Cumulative rank of data associated with a temperature.
CF
i
= Cumulative frequency at value i of the empirical distribution.
BetaINV = Inverse of a beta distribution.
α =
Parameter α of the beta distribution:
i k=
1
n
k
. β
= Parameter β of the beta distribution: n
T
−
i k=
1
n
k
+ 1.
n
i
= Number of available data at the ith value of storage temperature.
n
T
= Total number of available survey data.
d
Uncertainty is defined for the most likely value of the triangular distribution.
the main effect of the jth level of Factor B, and the in- teraction effect between the two factors. If additional
factors are involved in the analysis, the concept will be the same.
ANOVA uses the F-test to determine whether a statistically significant difference exists among mean
responses for main effects or interactions between factors. The F-test is used to test the significance of
each main and interaction effect. For the example of Equation 2, the estimators of F values for main and
interaction effects are given in Table II.
35
The rela- tive magnitude of F values can be used to rank the fac-
tors in sensitivity analysis.
37
The higher the F value, the more sensitive the response variable is to the fac-
tor. Therefore, factors with higher F values are given higher rankings.
The coefficient of determination, R
2
, is used to judge the adequacy of the ANOVA model. R
2
mea- sures the proportionate reduction of total variation
in the response variable Y associated with the use of selected main and interaction effects of factors
in the ANOVA model.
33–35
Although the F values calculated for each effect indicate the statistical sig-
nificance of corresponding effect, the coefficient of determination indicates whether the selected effects
adequately capture variability in the output. Gen- erally, a high R
2
value implies that results are not compromised by incomplete specification of effects
or by inappropriate definition of the levels for a fac- tor. When ANOVA is applied to two-dimensional
simulations of variability and uncertainty, a CDF graph for R
2
indicates uncertainty in the adequacy of the ANOVA model for multiple realizations of the
model.