Chapter12 - Chi Square Tests and Nonparametric

Statistics for Managers Using Microsoft® Excel 5th Edition

Chapter 12 Chi Square Tests and Nonparametric Tests

Learning Objectives In this chapter, you learn:



How and when to use the chi-square test for

contingency tables

 How to use the Marascuilo procedure for determining pair-wise differences when evaluating more than two proportions

 How and when to use the McNemar test

 How and when to use nonparametric tests

Contingency Tables Contingency Tables

 Useful in situations involving multiple population proportions

 Used to classify sample observations according to two or more characteristics

 Also called a cross-classification table.

Contingency Table Example Hand Preference vs. Gender Dominant Hand: Left vs. Right Gender: Male vs. Female

 2 categories for each variable, so the table is called a 2 x 2 table

 Suppose we examine a sample of 300 college students

Contingency Table Example Sample results organized in a contingency table:

Hand Preference Gender Female Male

Left

36 Right 108 156 264 120 180 300 120 Females, 12 were left handed 180 Males, 24 were left handed sample size = n = 300:

Contingency Table Example

 If H is true, then the proportion of left-handed females should be the same as the proportion of left-handed males.

 The two proportions above should be the same as the proportion of left-handed people overall.

H : π ₁ = π ₂ (Proportion of females who are left handed is equal to the proportion of males who are left handed) H ₁ : π ₁ ≠ π ₂ (The two proportions are not the same – Hand preference is not independent of gender)

The Chi-Square Test Statistic where: f _o = observed frequency in a particular cell f _e = expected frequency in a particular cell if H is true  ² for the 2 x 2 case has 1 degree of freedom

   cells all

2 e o

2 f ) f (f

χ The Chi-square test statistic is:

Assumed: each cell in the contingency table has expected frequency of at least 5

The Chi-Square Test Statistic

test statistic approximately follows a chi-square The  distribution with one degree of freedom

Decision Rule: ₂ ₂ , reject H ,

If  >  U otherwise, do not reject



H _{Do not Reject H} _ _{reject H} ₂ 

 _U Computing the Average Proportion

 Here: 120 Females, 12 were left handed 180 Males, 24 were left handed

The proportion of left handers overall is 0.12, that is, 12% n

X n n

X X p  

 

1 12 .

300

36 180 120

12 p   

  The average proportion is: Finding Expected Frequencies

To obtain the expected frequency for left handed females,

 multiply the average proportion left handed (p) by the total number of females To obtain the expected frequency for left handed males,

 multiply the average proportion left handed (p) by the total number of males

If the two proportions are equal, then P(Left Handed | Female) = P(Left Handed | Male) = .12 i.e., we would expect (.12)(120) = 14.4 females to be left handed (.12)(180) = 21.6 males to be left handed

Observed vs. Expected Frequencies

Hand Preference

Gender Female Male

Left Observed = 12

Expected = 14.4 Observed = 24

Expected = 21.6

36 Right Observed = 108

Expected = 105.6 Observed = 156

Expected = 158.4 264 120 180 300 The Chi-Square Test Statistic Gender Hand

Female Male Preference Observed = 12 Observed = 24 Left

36 Expected = 14.4 Expected = 21.6 Observed = 108 Observed = 156 Right

264 Expected = 105.6 Expected = 158.4 The test statistic is: ₂ 120 180 300 _{2 } ( f f ) _{o e}

   _{all cells} f ₂ _e ₂ ₂ ₂

(

12 14 . 4 ) ( 108 105 . 6 ) (

24 21 . 6 ) ( 156 158 . 4 )     . 7576

    

14 . 4 105 .

6 21 . 6 158 .

4 The Chi-Square Test Statistic

2 The test statistic is . 7576 , with 1 d.f. 3.841 

  

U Decision Rule: ₂ > 3.841, reject H , otherwise, do

If 

not reject H Here, ₂ ₂ = 3.841,

=.05  = 0..7576 <  _U so you do not reject H and _{Do not Reject H}

_

conclude that there is _{reject H}  ₂ insufficient evidence that the =3.841

 _U two proportions are different.

2 Test for The Differences Among



More Than Two Proportions

 Extend the 

2 test to the case with more than two independent populations:

H : π ₁ = π ₂ = … = π _c H ₁ : Not all of the π _j are equal (j = 1, 2, …, c)

The Chi-Square Test Statistic The Chi-square test statistic is:

2 ( f f )

2  o e

 

 f all cells e where: _o f = observed frequency in a particular cell of the 2 x c table

 _e f = expected frequency in a particular cell if H is true

 ² for the 2 x c case has (2-1)(c-1) = c - 1 degrees of freedom

 

Assumed: each cell in the contingency table has expected frequency of at

least 1

Computing the Overall Proportion

X ...

X X   

The overall

1 2 c p

  proportion is: n n ... n n

  

1 2 c Expected cell frequencies for the c categories are



calculated as in the 2 x 2 case, and the decision rule

is the same: ₂ Decision Rule:

is from the ₂ ₂ Where  _U , reject H ,

If  >  _U chi-square distribution otherwise, do not with c – 1 degrees of reject H freedom

2 Test with More Than Two

 Proportions: Example The sharing of patient records is a



controversial issue in health care. A survey

of 500 respondents asked whether they objected to their records being shared by

insurance companies, by pharmacies, and by

medical researchers. The results are summarized on the following table:

2 Test with More Than Two

 Proportions: Example

Organization Object to

Insurance Pharmacies Medical Record

Companies Researchers Sharing Yes 410 295 335 No 90 205 165



2 Test with More Than Two

Proportions: Example 6933 .

500 500 500 410 335 295

...

₂ ₁ ₂ ₁   

        

 _c ^c n n n

X X

X p The overall proportion is:

Object to Record Sharing Organization

Insurance Companies Pharmacies Medical Researchers

Yes f _o = 410 f _e = 346.667 f _o = 295 f _e = 346.667 f _o = 335 f _e = 346.667 No f _o = 90 f _e = 153.333 f _o = 205 f _e = 153.333 f _o = 165 f _e = 153.333

2 Test with More Than Two

 Proportions: Example Organization Object Insurance Pharmacies Medical to Companies

Researchers Record Sharing ₂ ₂ ₂ f f



  _{o e f f} _{ f f} _

 o e   o e  11 . 571  _{ } 7 . 700 . 3926 f _e f f _e _e

Yes ₂ ₂ ₂ f f f f

 

    _{o e o e}

f f _  o e 

17 . 409 . 888   _ 26 . 159 f f f e e _e

No ₂ _{2 } ( f f ) _{o e} 64 . 1196

  

The Chi-square test statistic is:  _{all cells} f _e

2 Test with More Than Two

 Proportions: Example H : π = π = π ₁ ₂ ₃ H : Not all of the π are equal (j = 1, 2, 3) _{1 j} ₂ Decision Rule:

= 5.991 is from the chi- ₂ ₂  _U , reject H ,

If  >  _U square distribution with 2 otherwise, do not reject H degrees of freedom.

Conclusion: Since 64.1196 > 5.991, you reject H and you conclude that at least one proportion of respondents who object to their records being shared is different across the three organizations j’ among all c(c-1)/2 pairs.

Second, compute the corresponding critical range for the Marascuilo procedure.

The Marascuilo Procedure The Marascuilo procedure enables you to make comparisons between all pairs of groups. First, compute the observed differences p j

The Marascuilo Procedure

 p / ( 1 p / )  p ( 1 p ) ₂ _{j j j j}  Critical Range

Critical Range for the Marascuilo Procedure:

   _U n n / _j _j The Marascuilo Procedure Compute a different critical range for each



pair-wise comparison of sample proportions.

Compare each of the c(c - 1)/2 pairs of



sample proportions against its corresponding

critical range.

Declare a specific pair significantly different

 if the absolute difference in the sample

proportions |p – p | is greater than its critical

j j’ range.

The Marascuilo Procedure Example

Organization Object to

Insurance Pharmacies Medical Record

Companies Researchers Sharing Yes 410 295 335

P = 0.82 P = 0.59 P = 0.67 ₁ ₂ ₃ No 90 205 165 The Marascuilo Procedure Example

MARASCUILO TABLE

Absolute

Proportions Differences Critical Range | Group 1 - Group 2 |

0.23 0.06831808 | Group 1 - Group 3 | 0.15 0.0664689 | Group 2 - Group 3 | 0.08 0.074485617

Conclusion: Since each absolute difference is greater than the critical range, you conclude that each proportion is significantly different that the other two.

 Similar to the 

2 Test of Independence

 2 test for equality of more than two proportions, but extends the concept to contingency tables with r rows and c columns

H : The two categorical variables are independent (i.e., there is no relationship between them) H : The two categorical variables are dependent

1 (i.e., there is a relationship between them)

 The Chi-square test statistic is:

2 Test of Independence

2 ( f f )

2  o e  

 f all cells e where:

 f = observed frequency in a particular cell of the r x c table _o f = expected frequency in a particular cell if H is true _e

2 for the r x c case has (r-1)(c-1) degrees of freedom 

Assumed: each cell in the contingency table has expected frequency of at least 1)

Expected Cell Frequencies Expected cell frequencies:



row total column tot al

 f

 e

Where: row total = sum of all frequencies in the row column total = sum of all frequencies in the column n = overall sample size

Decision Rule

 The decision rule is

If  ² >  ² _U , reject H , otherwise, do not reject H

Where  ² _U is from the chi-square distribution with (r – 1)(c – 1) degrees of freedom

Example: Test of Independence

40 Total

30 Senior

60 Junior

 The meal plan selected by 200 students is shown below: Class

70 Soph.

Total 20/week 10/week none Fresh.

Standing Number of meals per week

88 42 200 Example: Test of Independence The hypothesis to be tested is:

 H : Meal plan and class standing are independent (i.e., there is no relationship between them) H : Meal plan and class standing are dependent

1 (i.e., there is a relationship between them)

Example: Test of Independence

Class

Standing

14.0

   n

30 al column tot x total row f _e 

10 200

5 .

Expected cell frequencies if H is true:

88 42 200

40 Total

8.4

17.6

30 Senior

Number of meals per week Total 20/wk 10/wk none

6.3

13.2

10.5

60 Junior

12.6

26.4

21.0

70 Soph.

14.7

30.8

24.5

Fresh.

Example for one cell:

Example: Test of Independence The test statistic value is: ₂

 ( f f ) ₂ _{o e}  



 _{all cells} f _e ₂ ₂ ₂ (

24 24 . 5 ) (

32 30 . 8 ) (

10 8 . 4 )     . 709

     24 .

5 30 .

8 8 .

= 12.592 for α = .05 from the chi-square  _U distribution with (4 – 1)(3 – 1) = 6 degrees of freedom

Example: Test of Independence

2 The test statistic is . 709 , with 6 d.f. 12.592    

Decision Rule: ₂ > 12.592, reject H , otherwise,

If  do not reject H

Here, ₂ ₂ = 12.592, = 0.709 <   _U

=0.05 so do not reject H _{Do not Reject H}

_

Conclusion: there is

 _{reject H} ₂ insufficient evidence that meal plan and class standing are

=12.592  _U related. McNemar Test Used to test for the difference between two

 proportions of related samples (not independent) You need to use a test statistic that follows

 the normal distribution

McNemar Test Contingency Table Condition (Group) 1 Condition (Group) 2

Yes No Totals Yes A B A+B No C D C+D Totals A+C B+D n

Where A = number of respondents who answered yes to condition 1 and condition 2 B = number of respondents who answered yes to condition 1 and no to 2 C = number of respondents who answered no to condition 1 and yes to 2 D = number of respondents who answered no to condition 1 and condition 2 n = number of respondents in the sample

McNemar Test Contingency Table Condition (Group) 2 Condition (Group) 1 Yes No Totals Yes A B A+B

No C D C+D Totals A+C B+D n

The sample proportions are:

A B A C   p p

 

2 n n McNemar Test Contingency Table Condition (Group) 2 Condition (Group) 1 Yes No Totals Yes A B A+B

No C D C+D Totals A+C B+D n

The population proportions are: π = proportion of the population who answer yes to condition 1 ₁

π = proportion of the population who answer yes to condition 2 ₂

McNemar Test Test Statistic To test the hypothesis: H : π = π

2 H : π ≠ π

2 Use the test statistic: B C

 Z

 B C

 McNemar Test Example Suppose you survey 300 homeowners and ask

 them if they are interested in refinancing their home. In an effort to generate business, a

mortgage company improved their loan terms and

reduced closing costs. The same homeowners were again surveyed. Determine if change in loan

terms was effective in generating business for the

mortgage company. The data are summarized as

follows:

McNemar Test Example Survey response after change Survey response Yes No Totals before change

Yes 118 2 120 No 22 158 180 Totals 140 160 300

Test the hypothesis (at the 0.05 level of significance): H : π ≥ π : The change in loan terms was ineffective ₁ ₂ H : π < π : The change in loan terms increased business ₁ ₁ ₂

McNemar Test Example Survey response after

The critical value (.05

change Survey

significance) is Z = -1.96

response Yes No Totals before change

The test statistic is:

B C

22   Z 4 .

08     Yes 118 2 120 B C

22   No 22 158 180 Totals 140 160 300

Since Z = -4.08 < -1.96, you reject H and conclude that the change in loan terms significantly increase business for the mortgage company.

Wilcoxon Rank-Sum Test Test two independent population medians



Populations need not be normally distributed

 Distribution free procedure

 Used when only rank data are available



Wilcoxon Rank-Sum Test

 Can use when both n ₁ , n ₂ ≤ 10

 Assign ranks to the combined n ₁ + n ₂ sample values

 If unequal sample sizes, let n ^{1 refer to smaller-sized} sample

 Smallest value rank = 1, largest value rank = n ^{1 + n} ²

 Assign average rank for ties

 Sum the ranks for each sample: T ^{1 and T} ²

 Obtain test statistic, T ^{1 (from smaller sample)}

Checking the Rankings The sum of the rankings must satisfy the formula

 below Can use this to verify the sums T and T

 1

2 n(n 1) 

T T 

where n = n + n ₁ ₂

Wilcoxon Rank-Sum Test

Hypothesis and Decision

H : M ₁ = M ₂ H ₁ : M ₁ ≠ M ₂ H : M ₁ ≤ M ₂ H ₁ : M ₁ > M ₂ H : M ₁ ≥ M ₂ H ₁ : M ₁ < M ₂ Two-Tail Test Left-Tail Test Right-Tail Test M ₁ = median of population 1; M ₂ = median of population 2

Reject

T _1L T _1U

Reject Do Not Reject Reject

T _1L

Do Not Reject

T _1U

Reject Do Not Reject

Test statistic = T ₁ (Sum of ranks from smaller sample)

Reject H if T ₁ ≤ T _1L or if T ₁ ≥ T _1U Reject H if T ₁ ≤ T _1L Reject H if T ₁ ≥ T _1U

Wilcoxon Rank-Sum Test Example with Small Samples

 Sample data are collected on the capacity rates (% of capacity) for two factories.

 Are the median operating rates for two factories the same?



For factory A, the rates are 71, 82, 77, 94, 88

 For factory B, the rates are 85, 82, 92, 97

 Test for equality of the population medians at the 0.05 significance level

Wilcoxon Rank-Sum Test Example with Small Samples Capacity Rank

Ranked

Factory A Factory B Factory A Factory B

Capacity

values: _rd

3.5 Tie in 3 and _th

3.5

4 places

9 Rank Sums:

20.5

24.5 Wilcoxon Rank-Sum Test Example with Small Samples Factory B has the smaller sample size, so the test statistic is the sum of the Factory B ranks:

T = 24.5

1 The sample sizes are:

n = 4 (factory B) ₁ n = 5 (factory A) ₂ The level of significance is α = .05

Wilcoxon Rank-Sum Test

Example with Small Samples

 _ n ₁ Lower and n ₂ One- Two-

Upper Critical

5 Values for T Tailed Tailed ₁ from

4 Appendix

Table E.8:

.05 .10 12, 28 19, 36 .025 .05 11, 29 17, 38

5 T = 11 and _1L .01 .02 10, 30 16, 39

T = 29 _1U

.005 .01 --, -- 15, 40

6 Wilcoxon Rank-Sum Test Example with Small Samples

  = .05

Test Statistic (Sum of ranks ₁ _{2 = 5}

n = 4 , n

 from smaller sample):

Two-Tail Test T = 24.5 ₁ H : M = M ₁ ₂ H : M ≠ M ₁ ₁ ₂ Decision:

Do not reject at a = 0.05 Do Not Reject Reject

Reject Conclusion: T =11 T =29 _1L _1U

There is not enough evidence to prove

Reject H if T ≤ T = 11 ₁ _1L that the medians are not equal. or if T ≥ T = 29 ₁ _1U

Wilcoxon Rank-Sum Test

Using Normal Approximation

 For large samples, the test statistic T ₁

is approximately normal

with mean and standard deviation:

1 T

 Must use the normal approximation if either n ₁ or n ₂ > 10

 Assign n ^{1 to be the smaller of the two sample sizes}

 Can use the normal approximation for small samples

2 ) ( 1 n n μ

₁  

12 ) ( 1 n n n σ

1 T ₁   Wilcoxon Rank-Sum Test Using Normal Approximation The Z test statistic is

 μ T 

1 T

₁ Z  σ

T ₁



Where Z approximately follows a standardized

normal distribution

Wilcoxon Rank-Sum Test Using Normal Approximation Use the setting of the prior example:

The sample sizes were: n ₁ = 4 (factory B) n ₂ = 5 (factory A) The level of significance was α = .05

The test statistic was T

1 = 24.5 Wilcoxon Rank-Sum Test Using Normal Approximation n ( n 1 ) 4 (

9 1 ) ₁   μ

20 T

₁   

2 n n ( n 1 ) 4 ( 5 ) (

9 1 ) ₁ ₂   σ 4 . 082 _T ₁   

 The test statistic is T μ

1 T

 24 .

₁  Z 1 .

10    σ 4.082

T ₁ Z = 1.10 is less than the critical Z value of 1.96 (for α = .05)

 so you do not reject H – there is not sufficient evidence that the medians are not equal

Kruskal-Wallis Rank Test



Tests the equality of more than 2 population medians

 Use when the normality assumption for one-way ANOVA is violated

 Assumptions:

 The samples are random and independent

 variables have a continuous distribution

 the data can be ranked

 populations have the same variability

 populations have the same shape Kruskal-Wallis Rank Test

 Obtain overall rankings for each value

 In event of tie, each of the tied values gets the average rank

 Sum the rankings for data from each of the c groups

 Compute the H test statistic Kruskal-Wallis Rank Test

 The Kruskal-Wallis H test statistic: (with c – 1 degrees of freedom)

) 1 ( 3 )

1 (

  

   

 

 n n

T n n H c j

j where: n = total number of values over the combined samples c = Number of groups T _j = Sum of ranks in the j ^th sample n _j = Size of the j ^th sample Kruskal-Wallis Rank Test

 Complete the test by comparing the calculated H value ₂ value from the chi-square distribution to a critical  with c – 1 degrees of freedom

 

Decision rule

Reject H _

 Otherwise do not reject H

2U if test statistic H > 

 _{reject H} _{Do not Reject H} ₂   _U Kruskal-Wallis Rank Test Example

 Do different branch offices have a different number of employees?

Office size (Chicago, C) Office size (Denver, D)

Office size (Houston, H)

44 Kruskal-Wallis Rank Test Example

 Do different branch offices have a different number of employees?

Office size (Chicago, C) Ranking Office size

(Denver, D) Ranking Class size (Houston, H)

Ranking

7  = 44  = 56  = 20 Kruskal-Wallis Rank Test Example

 The H statistic is 72 .

     

A H D C

Medians equal are population all Not : H Median Median Median : H

T n n H ^c _j _j ^j

 _ n n

 

   

  

   

  

2 ² ² ¹ ²

6 )

3 ) 1 (

12 ) 1 (

1 15 (

44 )

1 15 (

  Kruskal-Wallis Rank Test Example

Compare H = 6.72 to the critical value from the chi-square

 distribution for 3 – 1 = 2 degrees of freedom and  = .05:

2 χ 5.991

 U ₂

χ 5.991 _U 

Since H = 6.72 > , reject H There is evidence of a difference in the population median number of employees in the branch offices Chapter Summary

In this chapter, we have ²

 two proportions ² test for differences in more than Developed and applied the 

test for the difference between Developed and applied the 

 two proportions ²

 test for independence Examined the  Used the McNemar test for differences in two related

 proportions

Used the Wilcoxon rank sum test for two population medians

 Small Samples

 Large sample Z approximation

 Applied the Kruskal-Wallis H-test for multiple population

 medians

Chapter12 - Chi Square Tests and Nonparametric