Distribution-Free ANOVA

15.4 Distribution-Free ANOVA

  The single-factor ANOVA model of Chapter 10 for comparing I population or treat- ment means assumed that for i 5 1, 2,…, I, a random sample of size J i was drawn

  from a normal population with mean m and variance s 2 i . This can be written as

  X ij 5m i 1e ij

  j5 1,…, J i ; i 5 1,…, I (15.10)

  where the e ij ’s are independent and normally distributed with mean zero and vari-

  ance s 2 . Although the normality assumption was required for the validity of the F test described in Chapter 10, the next procedure for testing equality of the m i ’s requires only that the e ij ’s have the same continuous distribution.

  the Kruskal-Wallis test

  Let N 5 oJ i , the total number of observations in the data set, and suppose we rank all N observations from 1 (the smallest X ij ) to N (the largest X ij ). When

  672 ChapTeR 15 Distribution-Free procedures

  H 0 :m 1 5m 2 5…5m I is true, the N observations all come from the same distribu-

  tion, in which case all possible assignments of the ranks 1, 2,…, N to the I samples are

  equally likely and we expect ranks to be intermingled in these samples. If, however, H 0

  is false, then some samples will consist mostly of observations having small ranks in the combined sample, whereas others will consist mostly of observations having large ranks. More specifically, if R ij denotes the rank of X ij among the N observations, and R i? and R i? denote, respectively, the total and average of the ranks in the ith sample,

  then when H 0 is true,

  sR i? d5

  E

  E sR ij d5 E

  i o sR

  The Kruskal-Wallis test statistic is a measure of the extent to which the R i? ’s deviate from their common expected value sN 1 1dy2.

  tEst stAtistic

  12 I N1 1

  N (N 1 1) j5 o 1 1 2

  K5 J i R i? 2

  N (N 1 1) o i5 1 J i

  12 I R 2 i?

  5 2 3(N 1 1)

  The second expression for K is the computational formula; it involves the rank totals (R i? ’s) rather than the averages and requires only one subtraction.

  Values of K at least as contradictory to H 0 as the calculated k are those that equal or exceed k. That is, the test is upper-tailed: P-value 5 P 0 (K k). Under H 0 ,

  each possible assignment of the ranks to the I samples is equally likely, so in theory all such assignments can be enumerated, the value of K determined for each one, and the null distribution obtained by counting the number of times each value of K occurs. Clearly, this computation is tedious, so even though there are tables of the exact null distribution and critical values for small values of the J i ’s, we will use the following “large-sample” approximation.

  pRoposition When H 0 is true and either I5 3 J i 6 (i 5 1, 2, 3)

  or

  I. 3 J i 5 (i 5 1,…, I)

  then K has approximately a chi-squared distribution with I 2 1 df. This implies

  that the approximate P-value is the area under the x 2 I2 1 curve to the right of k.

  Appendix Table A.11 gives a tabulation of chi-squared upper-tail curve areas.

  ExAmplE 15.9

  The accompanying observations (Table 15.6) on axial stiffness index resulted from

  a study of metal-plate connected trusses in which five different plate lengths—4 in.,

  6 in., 8 in., 10 in., and 12 in.—were used (“Modeling Joints Made with Light-

  Gauge Metal Connector Plates,” Forest Products J., 1979: 39–44) .

  15.4 Distribution-Free aNOVa 673

  Table 15.6 Data and Ranks for Example 15.9

  i51 s40d: 309.2 309.7 311.0 316.8 326.5 349.8 409.5 i52 s60d: 331.0 347.2 348.9 361.0 381.7 402.1 404.5 i53 s80d: 351.0 357.1 366.2 367.3 382.0 392.4 409.9 i54 s100d: 346.7 362.6 384.2 410.6 433.1 452.9 461.4 i55 s120d: 407.4 410.7 419.9 441.2 441.8 465.8 473.4

  r i? r i?

  i 5 1: 1 2 3 4 5 10 24 49 7.00 i 5 2: 6 8 9 13 17 21 22 96 13.71

  Ranks

  i 5 3: 11 12 15 16 18 20 25 117 16.71 i 5 4: 7 14 19 26 29 32 33 160 22.86 i 5 5: 23 27 28 30 31 34 35 208 29.71

  The computed value of K is

  12 2 2 2 2 s49d 2 s96d s117d s160d s208d

  35 s36d 3 7 7 7 7 7 4

  k5

  1 1 1 1 2 3 s36d

  Appendix Table A.11 shows that the area under the 4 df chi-squared curve to the right of 16.74 is .005 and the area under this curve to the right of 20.51 is .001. So the P -value for the test is slightly larger than .001 but much smaller than .005, and thus

  smaller than .01. Therefore H 0 is rejected at significance level .01, and we conclude

  that expected axial stiffness does depend on plate length.

  n

  Friedman’s test for a randomized Block Experiment

  Suppose X ij 5m1a i 1b j 1e ij , where a i is the ith treatment effect, b j is the jth

  block effect, and the e ij ’s are drawn independently from the same continuous (but not

  necessarily normal) distribution. Then to test H 0 :a 1 5a 2 5…5a I 5 0, the null

  hypothesis of no treatment effects, the observations are first ranked separately from

  1 to I within each block, and then the rank average r i? is computed for each of the

  I treatments. When H 0 is true, the r i? ’s should be close to one another, since within

  each block all I! assignments of ranks to treatments are equally likely. Friedman’s test statistic measures the discrepancy between the expected value sI 1 1dy2 of each rank average and the r i? ’s.

  12J

  I (I 1 1) o 1 i? 2 2 IJ (I 1 1) o i5 i?

  I I1 1 12

  tEst stAtistic

  F 5 R 2 5 R r 2 3 J (I 1 1)

  The test is again upper-tailed, because any value exceeding the calculated f r is even

  more contradictory to H 0 than is f r itself. For the cases I 5 3, J 5 2,…, 15 and

  I5 4, J 5 2,…, 8, Lehmann’s book (see the chapter bibliography) gives the upper- tail critical values from which P-value information can be obtained. Alternatively, for even moderate values of J, the test statistic F r has approximately a chi-squared

  674 ChapTeR 15 Distribution-Free procedures

  distribution with I 2 1 df when H 0 is true, so the approximate P-value is the area

  under the x 2 I2 1 curve to the right of f r .

  ExAmplE 15.10 The article “Physiological Effects During Hypnotically Requested Emotions” (Psychosomatic Med., 1963: 334–343) reports the following data (Table 15.7) on skin potential (mV) when the emotions of fear, happiness, depression, and calmness were requested from each of eight subjects.

  Table 15.7 Data and Ranks for Example 15.10

  Blocks (Subjects)

  x ij 12345678

  Fear

  Happiness 22.7 53.2 9.7 19.6 13.8 47.1 13.6 23.6 Depression 22.5 53.7 10.8 21.1 13.7 39.2 13.7 16.3 Calmness 22.6 53.1 8.3 21.6 13.3 37.0 14.8 14.8

  Ranks 1 2 3 4 5 6 7 8 r i? r 2 i?

  Fear

  Happiness 3 2 2 1 4 3 1 4 20 400 Depression 1 3 4 2 3 2 2 2 19 361 Calmness 2 1 1 3 2 1 3 1 14 196

  f r 5 s1686d 2 3s8ds5d 5 6.45

  4 s8ds5d

  The v = 3 column of Appendix Table A.11 shows that P-value < .09. Since this

  exceeds .05, H 0 cannot be rejected at that significance level. There is no evidence

  that average skin potential depends on which emotion is requested.

  n

  The book by Myles Hollander et. al. (see the chapter bibliography) discusses multiple comparisons procedures associated with the Kruskal-Wallis and Friedman tests, as well as other aspects of distribution-free ANOVA.

  ExERciSES section 15.4 (23–27)

  23. The accompanying data refers to concentration of the

  Test at level .10 to see whether true average strontium-90

  radioactive isotope strontium-90 in milk samples

  concentration differs for at least two of the regions.

  obtained from five randomly selected dairies in each of

  24. The article “Production of Gaseous Nitrogen in

  four different regions.

  Human Steady-State Conditions” (J. of Applied

  Physiology, 1972: 155–159) reports the following

  Region

  observations on the amount of nitrogen expired (in

  liters) under four dietary regimens: (1) fasting, (2) 23

  protein, (3) 32 protein, and (4) 67 protein. Use the

  Supplementary exercises 675

  Kruskal-Wallis test at level .05 to test equality of the

  on soil pretreated with Basic-H. Test at level .01 to see

  corresponding m i ’s.

  whether there are any effects due to the different treatments.

  25. The accompanying data on cortisol level was reported in

  the article “Cortisol, Cortisone, and 11-Deoxycortisol

  A 25.3 23.7 24.4 21.7 26.2

  Levels in Human Umbilical and Maternal Plasma in

  B 19.3 17.3 17.0 16.7 18.3

  Relation to the Onset of Labor” (J. of Obstetric

  C 48.8 47.8 40.2 44.0 46.4

  Gynaecology of the British Commonwealth, 1974:

  D 37.1 37.5 39.6 35.1 36.5

  737–745) . Experimental subjects were pregnant women whose babies were delivered between 38 and 42 weeks

  27. In an experiment to study the way in which different anes-

  gestation. Group 1 individuals elected to deliver by

  thetics affect plasma epinephrine concentration, ten dogs

  Caesarean section before labor onset, group 2 delivered by

  were selected and concentration was measured while they

  emergency Caesarean during induced labor, and group 3

  were under the influence of the anesthetics isoflurane, halo-

  individuals experienced spontaneous labor. Use the

  thane, and cyclopropane (“Sympathoadrenal and

  Kruskal-Wallis test at level .05 to test for equality of the

  Hemodynamic Effects of Isoflurane, Halothane, and

  three population means.

  Cyclopropane in Dogs,” Anesthesiology, 1974: 465–470) . Test at level .05 to see whether there is an anesthetic effect

  Group 1 262 307 211 323 454 339

  on concentration.

  26. In a test to determine whether soil pretreated with small

  Isoflurane

  amounts of Basic-H makes the soil more permeable to

  Halothane

  water, soil samples were divided into blocks, and each

  Cyclopropane 1.07 1.35

  block received each of the four treatments under study.

  The treatments were (A) water with .001 Basic-H flooded on control soil, (B) water without Basic-H on

  Isoflurane

  control soil, (C) water with Basic-H flooded on soil

  Halothane

  pretreated with Basic-H, and (D) water without Basic-H

  Cyclopropane 1.53 .49 .56 1.02 .30

  SuPPlEmENTARy ExERciSES (28–36)

  28. The article “Effects of a Rice-Rich Versus Potato-Rich

  significance level .05 to determine whether the true mean

  Diet on Glucose, Lipoprotein, and Cholesterol

  cholesterol-synthesis rate differs significantly for the two

  Metabolism in Noninsulin-Dependent Diabetics” (Amer.

  sources of carbohydrates.

  J. of Clinical Nutr., 1984: 598–606) gives the accompany- ing data on cholesterol-synthesis rate for eight diabetic subjects. Subjects were fed a standardized diet with potato

  Cholesterol-Synthesis Rate

  or rice as the major carbohydrate source. Participants received both diets for specified periods of time, with

  Subject 123 45 678

  cholesterol-synthesis rate (mmolday) measured at the end

  Potato 1.88 2.60 1.38 4.41 1.87 2.89 3.96 2.31

  of each dietary period. The analysis presented in this article

  Rice

  used a distribution-free test. Use such a test with

  676 ChapTeR 15 Distribution-Free procedures

  29. High-pressure sales tactics or door-to-door salespeople can

  that the underlying distribution is continuous. To illus-

  be quite offensive. Many people succumb to such tactics,

  trate, consider the following sample of 20 observations

  sign a purchase agreement, and later regret their actions. In

  on component lifetime (hr):

  the mid-1970s, the Federal Trade Commission imple- mented regulations clarifying and extending the rights of

  purchasers to cancel such agreements. The accompanying

  data is a subset of that given in the article “Evaluating the

  FTC Cooling-Off Rule” (J. of Consumer Affairs, 1977:

  We wish to test H 0 :m , 5 25.0 versus H a :m , . 25.0. The

  101–106) . Individual observations are cancellation rates

  for each of nine sales people during each of 4 years. Use an

  test statistic is Y 5 the number of observations that

  appropriate test at level .05 to see whether true average

  exceed 25.

  cancellation rate depends on the year.

  a. Determine the P-value of the test when Y 5 15. [Hint: Think of a “success” as a lifetime that exceeds

  Salesperson

  25.0. Then Y is the number of successes in the sam-

  ple. What kind of a distribution does Y have when m , 5 25.0?]

  b. For the given data, should H 0 be rejected at signifi-

  cance level .05?

  [Note: The test statistic is the number of differences

  X i 2 25 that have positive signs, hence the name sign

  test. ]

  30. The given data on phosphorus concentration in topsoil for four different soil treatments appeared in the article

  34. Refer to Exercise 33, and consider a confidence inter-

  “Fertilisers for Lotus and Clover Establishment on a

  val associated with the sign test: the sign interval.

  The relevant hypotheses are now H 0 :m , 5 m, 0 versus

  Sequence of Acid Soils on the East Otago Uplands” (N.

  H a :m , ± m, 0 .

  Zeal. J. of Exptl. Ag., 1984: 119–129) . Use a distribution-

  free procedure to test the null hypothesis of no difference

  a. Suppose we decide to reject H 0 if either Y 15 or

  in true mean phosphorus concentration (mgg) for the four

  Y 5. What is the smallest a for which this equiva-

  soil treatments.

  lent to rejecting H 0 if P-value a? b. The confidence interval will consist of all values m , 0

  I 8.1 5.9 7.0 8.0 9.0

  for which H 0 is not rejected. Determine the CI for the

  Treatment II 11.5 10.9 12.1 10.3 11.9

  given data, and state the confidence level.

  III 15.3 17.4 16.4 15.8 16.0 IV 23.0 33.0 28.4 24.6 27.7

  35. Suppose we wish to test.

  31. Refer to the data of Exercise 30 and compute a 95 CI

  H 0 : the X and Y distributions are identical

  for the difference between true average concentrations

  versus

  for treatments II and III.

  H a : the X distribution is less spread out than the Y

  32. The study reported in “Gait Patterns During Free

  distribution

  Choice Ladder Ascents” (Human Movement Sci.,

  The accompanying figure pictures X and Y distributions

  1983: 187–195) was motivated by publicity concern-

  for which H a is true. The Wilcoxon rank-sum test is not

  ing the increased accident rate for individuals climbing

  appropriate in this situation because when H a is true as

  ladders. A number of different gait patterns were used

  pictured, the Y’s will tend to be at the extreme ends of the

  by subjects climbing a portable straight ladder accord-

  combined sample (resulting in small and large Y ranks),

  ing to specified instructions. The ascent times for

  so the sum of X ranks will result in a W value that is nei-

  seven subjects who used a lateral gait and six subjects

  ther large nor small.

  who used a four-beat diagonal gait are given.

  X distribution

  Lateral

  Y distribution

  Diagonal 1.27 1.82 1.66 .85 1.45 1.24

  a. Carry out a test using a 5 .05 to see whether the data suggests any difference in the true average ascent

  times for the two gaits.

  “Ranks”:

  b. Compute a 95 CI for the difference between the

  Consider modifying the procedure for assigning ranks as

  true average gait times.

  follows: After the combined sample of m 1 n observations

  33. The sign test is a very simple procedure for testing

  is ordered, the smallest observation is given rank 1, the

  hypotheses about a population median assuming only

  largest observation is given rank 2, the second smallest is

  Bibliography 677

  given rank 3, the second largest is given rank 4, and so on.

  36. The ranking procedure described in Exercise 35 is some-

  Then if H a is true as pictured, the X values will tend to be in

  what asymmetric, because the smallest observation

  the middle of the sample and thus receive large ranks. Let

  receives rank 1, whereas the largest receives rank 2, and

  W9 denote the sum of the X ranks and consider an upper-

  so on. Suppose both the smallest and the largest receive

  tailed test based on this test statistic. When H 0 is true, every

  rank 1, the second smallest and second largest receive

  possible set of X ranks has the same probability, so W9

  rank 2, and so on, and let W0 be the sum of the X ranks.

  has the same distribution as does W when H 0 is true. The

  The null distribution of W0 is not identical to the null

  accompanying data refers to medial muscle thickness for

  distribution of W, so different tables are needed. Consider

  arterioles from the lungs of children who died from sudden

  the case m 5 3, n 5 4. List all 35 possible orderings of

  infant death syndrome (x’s) and a control group of children

  the three X values among the seven observations (e.g., 1,

  (y’s). Carry out the test of H 0 versus H a at level .05.

  3, 7 or 4, 5, 6), assign ranks in the manner described,

  SIDS 4.0 4.4 4.8 4.9

  compute the value of W0 for each possibility, and then

  Control 3.7 4.1 4.3

  5.1 5.6

  tabulate the null distribution of W0. What is the P-value if w0

  5 9? This is the Ansari-Bradley test; for additional

  Consult the Lehmann book (in the chapter bibliography)

  information, see the book by Hollander and Wolfe in the

  for more information on this test, called the Siegel-Tukey

  chapter bibliography.

  test.

  BiBliogRAphy

  Hollander, Myles, Douglas Wolfe, and Eric Chicken,

  Lehmann, Erich, Nonparametrics: Statistical Methods Based

  Nonparametric Statistical Methods (3rd ed.), Wiley, New

  on Ranks, Springer, New York, 2006. An excellent

  York, 2013. A very good reference on distribution-free

  discussion of the most important distribution-free methods,

  methods with an excellent collection of tables.

  presented with a great deal of insightful commentary.

  Quality Control Methods

Dokumen yang terkait

AN ALIS IS YU RID IS PUT USAN BE B AS DAL AM P E RKAR A TIND AK P IDA NA P E NY E RTA AN M E L AK U K A N P R AK T IK K E DO K T E RA N YA NG M E N G A K IB ATK AN M ATINYA P AS IE N ( PUT USA N N O MOR: 9 0/PID.B /2011/ PN.MD O)

0 82 16

Analisis Komparasi Internet Financial Local Government Reporting Pada Website Resmi Kabupaten dan Kota di Jawa Timur The Comparison Analysis of Internet Financial Local Government Reporting on Official Website of Regency and City in East Java

19 819 7

Anal isi s L e ve l Pe r tanyaan p ad a S oal Ce r ita d alam B u k u T e k s M at e m at ik a Pe n u n jang S MK Pr ogr a m Keahl ian T e k n ologi , Kese h at an , d an Pe r tani an Kelas X T e r b itan E r lan gga B e r d asarkan T ak s on om i S OL O

2 99 16

ANTARA IDEALISME DAN KENYATAAN: KEBIJAKAN PENDIDIKAN TIONGHOA PERANAKAN DI SURABAYA PADA MASA PENDUDUKAN JEPANG TAHUN 1942-1945 Between Idealism and Reality: Education Policy of Chinese in Surabaya in the Japanese Era at 1942-1945)

1 29 9

Improving the Eighth Year Students' Tense Achievement and Active Participation by Giving Positive Reinforcement at SMPN 1 Silo in the 2013/2014 Academic Year

7 202 3

Improving the VIII-B Students' listening comprehension ability through note taking and partial dictation techniques at SMPN 3 Jember in the 2006/2007 Academic Year -

0 63 87

The Correlation between students vocabulary master and reading comprehension

16 145 49

Improping student's reading comprehension of descriptive text through textual teaching and learning (CTL)

8 140 133

The correlation between listening skill and pronunciation accuracy : a case study in the firt year of smk vocation higt school pupita bangsa ciputat school year 2005-2006

9 128 37

Transmission of Greek and Arabic Veteri

0 1 22