The Wilcoxon Rank-Sum Test

15.2 The Wilcoxon Rank-Sum Test

  The two-sample t test is based on the assumption that both population distributions are normal. There are situations, though, in which an investigator would want to use

  a test that is valid even if the underlying distributions are quite nonnormal. We now describe such a test, called the Wilcoxon rank-sum test . An alternative name for the procedure is the Mann-Whitney test, though the Mann-Whitney test statistic is sometimes expressed in a slightly different form from that of the Wilcoxon test. The Wilcoxon test procedure is distribution-free because it will have the desired level of significance for a very large class of underlying distributions.

  Assumptions X 1 ,…, X m and Y 1 ,…, Y n are two independent random samples from continuous distributions with means m 1 and m 2 , respectively. The X and Y distributions

  have the same shape and spread, the only possible difference between the two

  being in the values of m 1 and m 2 .

  The null hypothesis H 0 :m 1 2m 2 5D 0 asserts that, the X distribution is shifted by

  the amount D 0 to the right of the Y distribution.

  development of the test When m 5 3, n 5 4

  Suppose the relevant hypotheses are H 0 :m 1 2m 2 5 0 versus H a :m 1 2m 2 . 0. If m 1 is actually much larger than m 2 , then most of the observed x’s will typically fall

  662 ChapTeR 15 Distribution-Free procedures

  to the right of the observed y’s. However, if H 0 is true, then the observed values

  from the two samples should be intermingled. The test statistic assesses how much intermingling there is in the two samples.

  Consider the case m 5 3, n 5 4. Then if all three observed x’s were to the

  right of all four observed y’s, this would provide strong evidence for rejecting H 0

  in favor of H a . The test procedure involves pooling the X’s and Y’s into a combined

  sample of size m 1 n 5 7 and ranking these observations from smallest to largest, with the smallest receiving rank 1 and the largest rank 7. If most of the largest ranks

  were associated with X observations, we would begin to doubt H 0 . The test statistic

  that quantifies this reasoning is

  W 5 the sum of the ranks in the combined sample

  associated with X observations

  For the values of m and n under consideration, the smallest possible value of W is w 5 1 1 2 1 3 5 6 (if all three x’s are smaller than all four y’s), and the largest possible value is w 5 5 1 6 1 7 5 18 (if all three x’s are larger than all four y’s).

  As an example, suppose x 1 52 3.10, x 2 5 1.67, x 3 5 2.01, y 1 5 2 5.27, y 5

  3 1.89, y 5 3.86, and y 4 5 .19. Then the pooled ordered sample and corresponding

  ranks are as follows: Ordered pooled sample:

  Thus w 5 1 1 3 1 5 5 9.

  For the alternative under consideration, a larger value of W provides more

  evidence against H 0 than does a smaller value. This implies that the test is upper-

  tailed: P-value 5 P 0 (W w), where again P 0 denotes the probability computed

  assuming that H 0 is true. So we need the “null distribution” of the test statistic. To

  this end, recall that when H 0 is true, all seven observations come from the same population. This means that under H 0 , any possible triple of ranks associated with

  the three x’s—such as (1, 4, 5), (3, 5, 6), or (5, 6, 7)—has the same probability as

  any other possible rank triple. Since there are 7 _ 3 + 5 35 possible rank triples, under

  H 0 each rank triple has probability 1 y35. From a list of all 35 rank triples and the w value associated with each, the probability distribution of W can immediately be determined. For example, there are four rank triples that have w value 11—(1, 3, 7), (1, 4, 6), (2, 3, 6), and (2, 4, 5)—so P(W 5 11) 5 4 y35. The computations are summarized in Table 15.4.

  Table 15.4 Probability Distribution of W( m 5 3, n 5 4) When H 0 Is True

  The null distribution of Table 15.4 is symmetric about w 5 (6 1 18) y2 5 12, which is the middle value in the ordered list of possible W values. This is because the two rank triples (r, s, t) (with r , s , t) and s8 2 t, 8 2 s, 8 2 rd have values of w symmetric about 12, so for each triple with w value below 12, there is a triple with w value above 12 by the same amount.

  For the alternative under consideration, the test statistic value w 5 16 results

  in P-value 5 P 0 (W 16) 5 2 y35 1 1y35 1 1y35 5 .114. We would then not be

  able to reject the null hypothesis at significance level .05. The test is lower-tailed if

  15.2 The Wilcoxon Rank-Sum Test 663

  the alternative hypothesis is H a :m 1 2m 2 , 0. The test statistic value w 5 7 gives P -value 5 P 0 (W 7) 5 1 y35 1 1y35 5 .057. In the case of the alternative hypoth- esis H a :m 1 2m 2 ? 0, the test is two-tailed. Suppose, for example, that w 5 17. Then

  17 and 18 at least as contradictory to H 0 as the obtained value of W, and so also are

  7 and 6; these latter two values are as far out in the lower tail of the null distribu-

  tion as the former two are in the upper tail. The P-value is then P 0 (W 17 or 7)

  5 1 y35 1 1y35 1 1y35 1 1y35 5 .114.

Dokumen yang terkait

AN ALIS IS YU RID IS PUT USAN BE B AS DAL AM P E RKAR A TIND AK P IDA NA P E NY E RTA AN M E L AK U K A N P R AK T IK K E DO K T E RA N YA NG M E N G A K IB ATK AN M ATINYA P AS IE N ( PUT USA N N O MOR: 9 0/PID.B /2011/ PN.MD O)

0 82 16

Analisis Komparasi Internet Financial Local Government Reporting Pada Website Resmi Kabupaten dan Kota di Jawa Timur The Comparison Analysis of Internet Financial Local Government Reporting on Official Website of Regency and City in East Java

19 819 7

Anal isi s L e ve l Pe r tanyaan p ad a S oal Ce r ita d alam B u k u T e k s M at e m at ik a Pe n u n jang S MK Pr ogr a m Keahl ian T e k n ologi , Kese h at an , d an Pe r tani an Kelas X T e r b itan E r lan gga B e r d asarkan T ak s on om i S OL O

2 99 16

ANTARA IDEALISME DAN KENYATAAN: KEBIJAKAN PENDIDIKAN TIONGHOA PERANAKAN DI SURABAYA PADA MASA PENDUDUKAN JEPANG TAHUN 1942-1945 Between Idealism and Reality: Education Policy of Chinese in Surabaya in the Japanese Era at 1942-1945)

1 29 9

Improving the Eighth Year Students' Tense Achievement and Active Participation by Giving Positive Reinforcement at SMPN 1 Silo in the 2013/2014 Academic Year

7 202 3

Improving the VIII-B Students' listening comprehension ability through note taking and partial dictation techniques at SMPN 3 Jember in the 2006/2007 Academic Year -

0 63 87

The Correlation between students vocabulary master and reading comprehension

16 145 49

Improping student's reading comprehension of descriptive text through textual teaching and learning (CTL)

8 140 133

The correlation between listening skill and pronunciation accuracy : a case study in the firt year of smk vocation higt school pupita bangsa ciputat school year 2005-2006

9 128 37

Transmission of Greek and Arabic Veteri

0 1 22