Chapter 10 - Hypothesis Testing: Two-Sample Tests

Statistics for Managers Using Microsoft® Excel 5th Edition

Learning Objectives

In this chapter, you learn how to use hypothesis testing for comparing the difference between:

- The means of two independent populations
- The means of two related populations
- The proportions of two independent populations
- The variances of two independent populations

Two-Sample Tests Overview

Two-Sample Tests

Test                                 Example
Independent Population Means         Group 1 vs. Group 2
Means, Related Populations           Same group before vs. after treatment
Independent Population Proportions   Proportion 1 vs. Proportion 2
Independent Population Variances     Variance 1 vs. Variance 2

Two-Sample Tests: Independent Population Means

Goal: Test a hypothesis or form a confidence interval for the difference between two population means, μ₁ – μ₂.

The point estimate for the difference is the difference between the sample means: X̄₁ – X̄₂

Two cases are considered: σ₁ and σ₂ known, and σ₁ and σ₂ unknown.

Two-Sample Tests: Independent Populations

Independent Population Means

- Different data sources
- Independent: the sample selected from one population has no effect on the sample selected from the other population
- Use the difference between the 2 sample means
- Use a Z test, a pooled-variance t test, or a separate-variance t test

Two-Sample Tests: Independent Populations

Independent Population Means

- σ₁ and σ₂ known: use a Z test statistic
- σ₁ and σ₂ unknown: use S to estimate the unknown σ, and use a t test statistic

Two-Sample Tests: Independent Populations

Independent Population Means (σ₁ and σ₂ known)

Assumptions:

- Samples are randomly and independently drawn
- Population distributions are normal

Two-Sample Tests: Independent Populations

Independent Population Means (σ₁ and σ₂ known)

When σ₁ and σ₂ are known and both populations are normal, the test statistic is a Z value and the standard error of X̄₁ – X̄₂ is:

$$\sigma_{\bar{X}_1 - \bar{X}_2} = \sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}$$

Two-Sample Tests: Independent Populations

Independent Population Means (σ₁ and σ₂ known)

The test statistic is:

$$Z = \frac{(\bar{X}_1 - \bar{X}_2) - (\mu_1 - \mu_2)}{\sqrt{\dfrac{\sigma_1^2}{n_1} + \dfrac{\sigma_2^2}{n_2}}}$$

Two-Sample Tests: Independent Populations

Two Independent Populations, Comparing Means

Lower-tail test:
H₀: μ₁ ≥ μ₂, H₁: μ₁ < μ₂
i.e., H₀: μ₁ – μ₂ ≥ 0, H₁: μ₁ – μ₂ < 0

Upper-tail test:
H₀: μ₁ ≤ μ₂, H₁: μ₁ > μ₂
i.e., H₀: μ₁ – μ₂ ≤ 0, H₁: μ₁ – μ₂ > 0

Two-tail test:
H₀: μ₁ = μ₂, H₁: μ₁ ≠ μ₂
i.e., H₀: μ₁ – μ₂ = 0, H₁: μ₁ – μ₂ ≠ 0
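The Z statistic and the three sets of hypotheses for the known-σ case can be sketched in Python as follows. This is a minimal illustration, not part of the original slides: the function names and the input values are hypothetical, and the critical values come from the standard library's `statistics.NormalDist` rather than a printed Z table.

```python
from math import sqrt
from statistics import NormalDist


def two_sample_z(x1_bar, x2_bar, sigma1, sigma2, n1, n2, hypothesized_diff=0.0):
    """Z statistic for the difference between two means, both sigmas known."""
    std_error = sqrt(sigma1**2 / n1 + sigma2**2 / n2)
    return ((x1_bar - x2_bar) - hypothesized_diff) / std_error


def reject_h0(z, alpha, tail):
    """Apply the rejection rule for a lower-, upper-, or two-tail test."""
    std_normal = NormalDist()  # mean 0, standard deviation 1
    if tail == "lower":
        return z < std_normal.inv_cdf(alpha)
    if tail == "upper":
        return z > std_normal.inv_cdf(1 - alpha)
    if tail == "two":
        return abs(z) > std_normal.inv_cdf(1 - alpha / 2)
    raise ValueError("tail must be 'lower', 'upper', or 'two'")


# Hypothetical inputs, chosen only to exercise the functions.
z = two_sample_z(x1_bar=50.0, x2_bar=48.0, sigma1=4.0, sigma2=3.0, n1=40, n2=50)
print(round(z, 3), reject_h0(z, alpha=0.05, tail="two"))
```

Passing the tail as a string keeps the one decision rule per test type visible in a single place.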

Two-Sample Tests: Independent Populations

Two Independent Populations, Comparing Means

Lower-tail test: H₀: μ₁ – μ₂ ≥ 0, H₁: μ₁ – μ₂ < 0. Reject H₀ if Z < –Z_α (area α in the lower tail).

Upper-tail test: H₀: μ₁ – μ₂ ≤ 0, H₁: μ₁ – μ₂ > 0. Reject H₀ if Z > Z_α (area α in the upper tail).

Two-tail test: H₀: μ₁ – μ₂ = 0, H₁: μ₁ – μ₂ ≠ 0. Reject H₀ if Z < –Z_{α/2} or Z > Z_{α/2} (area α/2 in each tail).

Two-Sample Tests: Independent Populations

Independent Population Means (σ₁ and σ₂ unknown, assumed equal)

Assumptions:

- Samples are randomly and independently drawn
- Populations are normally distributed
- Population variances are unknown but assumed equal

Two-Sample Tests: Independent Populations

Independent Population Means (σ₁ and σ₂ unknown, assumed equal)

Forming interval estimates:

- The population variances are assumed equal, so use the two sample standard deviations and pool them to estimate σ
- The test statistic is a t value with (n₁ + n₂ – 2) degrees of freedom

Two-Sample Tests: Independent Populations

Independent Population Means (σ₁ and σ₂ unknown, assumed equal)

The pooled standard deviation is:

$$S_p = \sqrt{\frac{(n_1 - 1)S_1^2 + (n_2 - 1)S_2^2}{(n_1 - 1) + (n_2 - 1)}}$$

Two-Sample Tests: Independent Populations

The test statistic is:

$$t = \frac{(\bar{X}_1 - \bar{X}_2) - (\mu_1 - \mu_2)}{\sqrt{S_p^2\left(\dfrac{1}{n_1} + \dfrac{1}{n_2}\right)}}$$

where t has (n₁ + n₂ – 2) d.f., and

$$S_p^2 = \frac{(n_1 - 1)S_1^2 + (n_2 - 1)S_2^2}{(n_1 - 1) + (n_2 - 1)}$$

Two-Sample Tests: Independent Populations

You are a financial analyst for a brokerage firm. Is there a difference in dividend yield between stocks listed on the NYSE & NASDAQ? You collect the following data:

                 NYSE    NASDAQ
Number           21      25
Sample mean      3.27    2.53
Sample std dev   1.30    1.16

Assuming both populations are approximately normal with equal variances, is there a difference in average yield (α = 0.05)?

Two-Sample Tests: Independent Populations

The pooled variance is:

$$S_p^2 = \frac{(n_1 - 1)S_1^2 + (n_2 - 1)S_2^2}{(n_1 - 1) + (n_2 - 1)} = \frac{(21 - 1)1.30^2 + (25 - 1)1.16^2}{(21 - 1) + (25 - 1)} = 1.5021$$

The test statistic is:

$$t = \frac{(\bar{X}_1 - \bar{X}_2) - (\mu_1 - \mu_2)}{\sqrt{S_p^2\left(\dfrac{1}{n_1} + \dfrac{1}{n_2}\right)}} = \frac{(3.27 - 2.53) - 0}{\sqrt{1.5021\left(\dfrac{1}{21} + \dfrac{1}{25}\right)}} = 2.040$$

Two-Sample Tests: Independent Populations

- H₀: μ₁ – μ₂ = 0, i.e., μ₁ = μ₂
- H₁: μ₁ – μ₂ ≠ 0, i.e., μ₁ ≠ μ₂
- α = 0.05
- df = 21 + 25 – 2 = 44
- Critical values: t = ±2.0154
- Test statistic: t = 2.040

Reject H₀ if t < –2.0154 or t > 2.0154 (area .025 in each tail).

Decision: Reject H₀ at α = 0.05, because 2.040 > 2.0154.

Conclusion: There is evidence of a difference in the means.
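The arithmetic in the dividend-yield example can be checked with a short Python sketch. The function names are illustrative and not from the slides; the critical value 2.0154 is copied from the example rather than computed.

```python
from math import sqrt


def pooled_variance(s1, s2, n1, n2):
    """Pooled variance S_p^2 from two sample standard deviations."""
    return ((n1 - 1) * s1**2 + (n2 - 1) * s2**2) / ((n1 - 1) + (n2 - 1))


def pooled_t(x1_bar, x2_bar, s1, s2, n1, n2, hypothesized_diff=0.0):
    """Pooled-variance t statistic with n1 + n2 - 2 degrees of freedom."""
    sp2 = pooled_variance(s1, s2, n1, n2)
    std_error = sqrt(sp2 * (1 / n1 + 1 / n2))
    return ((x1_bar - x2_bar) - hypothesized_diff) / std_error


# NYSE (sample 1) vs. NASDAQ (sample 2), values from the example.
sp2 = pooled_variance(s1=1.30, s2=1.16, n1=21, n2=25)
t_stat = pooled_t(x1_bar=3.27, x2_bar=2.53, s1=1.30, s2=1.16, n1=21, n2=25)

t_crit = 2.0154  # two-tail critical value for alpha = 0.05, df = 44 (from the example)
reject_h0 = abs(t_stat) > t_crit

print(round(sp2, 4), round(t_stat, 3), reject_h0)
```

The statistic only just exceeds the critical value, so the decision here is sensitive to the equal-variance assumption and to rounding in the inputs.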

Independent Populations: Unequal Variance

- If you cannot assume population variances are equal, the pooled-variance t test is inappropriate
- Instead, use a separate-variance t test, which includes the two separate sample variances in the computation of the test statistic
- The computations are complicated and are best performed using Excel
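For readers working outside Excel, the separate-variance computation can be sketched in Python. The slides do not give the formula, so this assumes the standard Welch form of the statistic with the Welch-Satterthwaite approximation for the degrees of freedom; the function name is illustrative. The inputs reuse the NYSE/NASDAQ summary data purely as a comparison with the pooled result.

```python
from math import sqrt


def separate_variance_t(x1_bar, x2_bar, s1, s2, n1, n2, hypothesized_diff=0.0):
    """Separate-variance (Welch) t statistic and approximate degrees of freedom."""
    v1 = s1**2 / n1  # estimated variance of the first sample mean
    v2 = s2**2 / n2  # estimated variance of the second sample mean
    t_stat = ((x1_bar - x2_bar) - hypothesized_diff) / sqrt(v1 + v2)
    # Welch-Satterthwaite approximation (assumed here, not stated in the slides).
    df = (v1 + v2) ** 2 / (v1**2 / (n1 - 1) + v2**2 / (n2 - 1))
    return t_stat, df


t_stat, df = separate_variance_t(x1_bar=3.27, x2_bar=2.53, s1=1.30, s2=1.16, n1=21, n2=25)
print(round(t_stat, 3), round(df, 1))
```

The degrees of freedom are generally not an integer; a common convention is to round down before looking up a critical value.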

Two-Sample Tests: Independent Populations

Independent Population Means (σ₁ and σ₂ known)

The confidence interval for μ₁ – μ₂ is:

$$(\bar{X}_1 - \bar{X}_2) \pm Z\sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}$$

Two-Sample Tests: Independent Populations

Independent Population Means (σ₁ and σ₂ unknown, assumed equal)

The confidence interval for μ₁ – μ₂ is:

$$(\bar{X}_1 - \bar{X}_2) \pm t_{n_1 + n_2 - 2}\sqrt{S_p^2\left(\frac{1}{n_1} + \frac{1}{n_2}\right)}$$

where

$$S_p^2 = \frac{(n_1 - 1)S_1^2 + (n_2 - 1)S_2^2}{(n_1 - 1) + (n_2 - 1)}$$

Two-Sample Tests: Related Populations

Tests Means of 2 Related Populations

- Paired or matched samples
- Repeated measures (before/after)
- Use difference between paired values: D = X₁ – X₂
- Eliminates variation among subjects
- Assumptions: Both populations are normally distributed

Two-Sample Tests: Related Populations

The ith paired difference is Dᵢ, where

$$D_i = X_{1i} - X_{2i}$$

The point estimate for the population mean paired difference is D̄:

$$\bar{D} = \frac{\sum_{i=1}^{n} D_i}{n}$$

Suppose the population standard deviation of the difference scores, σ_D, is known.

Two-Sample Tests: Related Populations

The test statistic for the mean difference is a Z value:

$$Z = \frac{\bar{D} - \mu_D}{\sigma_D / \sqrt{n}}$$

where
μ_D = hypothesized mean difference
σ_D = population standard deviation of differences
n = the sample size (number of pairs)

Two-Sample Tests: Related Populations

If σ_D is unknown, you can estimate the unknown population standard deviation with a sample standard deviation:

$$S_D = \sqrt{\frac{\sum_{i=1}^{n} (D_i - \bar{D})^2}{n - 1}}$$

Two-Sample Tests: Related Populations

The test statistic for D̄ is now a t statistic:

$$t = \frac{\bar{D} - \mu_D}{S_D / \sqrt{n}}$$

where t has n – 1 d.f. and S_D is the sample standard deviation of the differences defined above.

Two-Sample Tests: Related Populations

Lower-tail test: H₀: μ_D ≥ 0, H₁: μ_D < 0. Reject H₀ if t < –t_α (area α in the lower tail).

Upper-tail test: H₀: μ_D ≤ 0, H₁: μ_D > 0. Reject H₀ if t > t_α (area α in the upper tail).

Two-tail test: H₀: μ_D = 0, H₁: μ_D ≠ 0. Reject H₀ if t < –t_{α/2} or t > t_{α/2} (area α/2 in each tail).

Two-Sample Tests: Related Populations Example

Assume you send your salespeople to a "customer service" training workshop. Has the training made a difference in the number of complaints? You collect the following data:

              Number of Complaints        Difference, Dᵢ
Salesperson   Before (1)    After (2)     (2 – 1)
C.B.          6             4             -2
T.F.          20            6             -14
M.H.          3             2             -1
R.K.          0             0             0
M.O.          4             0             -4
Total                                     -21

$$\bar{D} = \frac{\sum_{i=1}^{n} D_i}{n} = \frac{-21}{5} = -4.2$$

$$S_D = \sqrt{\frac{\sum_{i=1}^{n} (D_i - \bar{D})^2}{n - 1}} = 5.67$$

Two-Sample Tests: Related Populations Example

Has the training made a difference in the number of complaints (at the α = 0.01 level)?

H₀: μ_D = 0
H₁: μ_D ≠ 0

Critical value = ±4.604 (d.f. = n – 1 = 4)

Test statistic:

$$t = \frac{\bar{D} - \mu_D}{S_D / \sqrt{n}} = \frac{-4.2 - 0}{5.67 / \sqrt{5}} = -1.66$$

Reject H₀ if t < –4.604 or t > 4.604 (area α/2 in each tail).

Decision: Do not reject H₀ (the t statistic, –1.66, is not in the rejection region).

Conclusion: There is no evidence of a significant change in the number of complaints.

Two-Sample Tests: Related Populations

The confidence interval for μ_D (σ_D known) is:

$$\bar{D} \pm Z\frac{\sigma_D}{\sqrt{n}}$$

where n = the sample size (number of pairs in the paired sample)

Two-Sample Tests: Related Populations

The confidence interval for μ_D (σ_D unknown) is:

$$\bar{D} \pm t_{n-1}\frac{S_D}{\sqrt{n}}$$

where

$$S_D = \sqrt{\frac{\sum_{i=1}^{n} (D_i - \bar{D})^2}{n - 1}}$$
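The paired-sample test from the complaints example, together with a confidence interval of this form, can be reproduced with a small Python sketch. Two things here are assumptions rather than statements from the slides: the difference for salesperson R.K. is taken as 0 (the value implied by D̄ = –4.2 with n = 5), and the 99% interval is an added illustration that reuses the critical value 4.604 from the example.

```python
from math import sqrt
from statistics import mean, stdev

# Differences D_i = After - Before for the five salespeople.
# The R.K. value (0) is an assumption inferred from D-bar = -4.2 with n = 5.
differences = [-2, -14, -1, 0, -4]

n = len(differences)
d_bar = mean(differences)   # point estimate of the mean paired difference
s_d = stdev(differences)    # sample standard deviation (n - 1 denominator)
std_error = s_d / sqrt(n)

t_stat = (d_bar - 0) / std_error  # hypothesized mean difference is 0
t_crit = 4.604                    # two-tail value for alpha = 0.01, df = 4 (from the example)
reject_h0 = abs(t_stat) > t_crit

# 99% confidence interval for the mean difference (illustrative addition).
ci_lower = d_bar - t_crit * std_error
ci_upper = d_bar + t_crit * std_error

print(round(d_bar, 2), round(s_d, 2), round(t_stat, 2), reject_h0)
print(round(ci_lower, 2), round(ci_upper, 2))
```

The interval contains zero, which agrees with the decision not to reject the null hypothesis at the same significance level.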



Two Population Proportions

Goal: Test a hypothesis or form a confidence interval for the difference between two independent population proportions, π₁ – π₂

Assumptions:

n₁π₁ ≥ 5, n₁(1 – π₁) ≥ 5
n₂π₂ ≥ 5, n₂(1 – π₂) ≥ 5

The point estimate for the difference is p₁ – p₂

Two Population Proportions

Since you begin by assuming the null hypothesis is true, you assume π₁ = π₂ and pool the two sample (p) estimates.

The pooled estimate for the overall proportion is:

$$\bar{p} = \frac{X_1 + X_2}{n_1 + n_2}$$

where X₁ and X₂ are the number of successes in samples 1 and 2

Two Population Proportions

The test statistic for p₁ – p₂ is a Z statistic:

$$Z = \frac{(p_1 - p_2) - (\pi_1 - \pi_2)}{\sqrt{\bar{p}(1 - \bar{p})\left(\dfrac{1}{n_1} + \dfrac{1}{n_2}\right)}}$$

where

$$\bar{p} = \frac{X_1 + X_2}{n_1 + n_2}, \quad p_1 = \frac{X_1}{n_1}, \quad p_2 = \frac{X_2}{n_2}$$

Two Population Proportions

Hypothesis for Population Proportions

Lower-tail test:
H₀: π₁ ≥ π₂, H₁: π₁ < π₂
i.e., H₀: π₁ – π₂ ≥ 0, H₁: π₁ – π₂ < 0

Upper-tail test:
H₀: π₁ ≤ π₂, H₁: π₁ > π₂
i.e., H₀: π₁ – π₂ ≤ 0, H₁: π₁ – π₂ > 0

Two-tail test:
H₀: π₁ = π₂, H₁: π₁ ≠ π₂
i.e., H₀: π₁ – π₂ = 0, H₁: π₁ – π₂ ≠ 0

Two Population Proportions

Hypothesis for Population Proportions

Lower-tail test: H₀: π₁ – π₂ ≥ 0, H₁: π₁ – π₂ < 0. Reject H₀ if Z < –Z_α (area α in the lower tail).

Upper-tail test: H₀: π₁ – π₂ ≤ 0, H₁: π₁ – π₂ > 0. Reject H₀ if Z > Z_α (area α in the upper tail).

Two-tail test: H₀: π₁ – π₂ = 0, H₁: π₁ – π₂ ≠ 0. Reject H₀ if Z < –Z_{α/2} or Z > Z_{α/2} (area α/2 in each tail).

Two Independent Population Proportions: Example

- Is there a significant difference between the proportion of men and the proportion of women who will vote Yes on Proposition A?
- In a random sample of 72 men, 36 indicated they would vote Yes and, in a sample of 50 women, 31 indicated they would vote Yes
- Test at the .05 level of significance

Two Independent Population Proportions: Example

- H₀: π₁ – π₂ = 0 (the two proportions are equal)
- H₁: π₁ – π₂ ≠ 0 (there is a significant difference between proportions)

The sample proportions are:

- Men: p₁ = 36/72 = .50
- Women: p₂ = 31/50 = .62

The pooled estimate for the overall proportion is:

$$\bar{p} = \frac{X_1 + X_2}{n_1 + n_2} = \frac{36 + 31}{72 + 50} = \frac{67}{122} = .549$$

Two Independent Population Proportions: Example

The test statistic for π₁ – π₂ is:

$$z = \frac{(p_1 - p_2) - (\pi_1 - \pi_2)}{\sqrt{\bar{p}(1 - \bar{p})\left(\dfrac{1}{n_1} + \dfrac{1}{n_2}\right)}} = \frac{(.50 - .62) - 0}{\sqrt{.549(1 - .549)\left(\dfrac{1}{72} + \dfrac{1}{50}\right)}} = -1.31$$

Critical values = ±1.96 for α = .05. Reject H₀ if z < –1.96 or z > 1.96 (area .025 in each tail).

Decision: Do not reject H₀, because –1.31 lies between –1.96 and 1.96.

Conclusion: There is no evidence of a significant difference between men and women in the proportion who will vote Yes.

Two Independent Population Proportions

The confidence interval for π₁ – π₂ is:

$$(p_1 - p_2) \pm Z\sqrt{\frac{p_1(1 - p_1)}{n_1} + \frac{p_2(1 - p_2)}{n_2}}$$
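The Proposition A example and the confidence interval formula can be worked through in a few lines of Python. The function names are illustrative and not from the slides; the critical value 1.96 is the one quoted in the example, and the 95% interval is an added illustration of the formula using the same data.

```python
from math import sqrt


def two_proportion_z(x1, n1, x2, n2, hypothesized_diff=0.0):
    """Z statistic for the difference between two proportions (pooled estimate)."""
    p1, p2 = x1 / n1, x2 / n2
    p_bar = (x1 + x2) / (n1 + n2)  # pooled estimate under H0: pi1 = pi2
    std_error = sqrt(p_bar * (1 - p_bar) * (1 / n1 + 1 / n2))
    return ((p1 - p2) - hypothesized_diff) / std_error


def difference_interval(x1, n1, x2, n2, z_crit):
    """Confidence interval for pi1 - pi2 using the unpooled standard error."""
    p1, p2 = x1 / n1, x2 / n2
    margin = z_crit * sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)
    return (p1 - p2) - margin, (p1 - p2) + margin


# Men: 36 of 72 vote Yes; women: 31 of 50 vote Yes (from the example).
z_stat = two_proportion_z(x1=36, n1=72, x2=31, n2=50)
z_crit = 1.96  # two-tail critical value for alpha = .05 (from the example)
reject_h0 = abs(z_stat) > z_crit

lower, upper = difference_interval(x1=36, n1=72, x2=31, n2=50, z_crit=z_crit)
print(round(z_stat, 2), reject_h0, round(lower, 4), round(upper, 4))
```

The test pools the two samples because it assumes π₁ = π₂ under the null hypothesis, while the interval does not make that assumption and so uses each sample proportion separately.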

Testing Population Variances

Purpose: To determine if two independent populations have the same variability.

Two-tail test: H₀: σ₁² = σ₂², H₁: σ₁² ≠ σ₂²
Lower-tail test: H₀: σ₁² ≥ σ₂², H₁: σ₁² < σ₂²
Upper-tail test: H₀: σ₁² ≤ σ₂², H₁: σ₁² > σ₂²

Testing Population Variances

The F test statistic is:

$$F = \frac{S_1^2}{S_2^2}$$

- S₁² = Variance of Sample 1; n₁ – 1 = numerator degrees of freedom
- S₂² = Variance of Sample 2; n₂ – 1 = denominator degrees of freedom

Testing Population Variances

- The F critical value is found from the F table
- There are two appropriate degrees of freedom: numerator and denominator
- In the F table,
  - numerator degrees of freedom determine the column
  - denominator degrees of freedom determine the row

Testing Population Variances

Lower-tail test: H₀: σ₁² ≥ σ₂², H₁: σ₁² < σ₂². Reject H₀ if F < F_L (area α in the lower tail).

Upper-tail test: H₀: σ₁² ≤ σ₂², H₁: σ₁² > σ₂². Reject H₀ if F > F_U (area α in the upper tail).

Testing Population Variances

Two-tail test

H₀: σ₁² = σ₂²
H₁: σ₁² ≠ σ₂²

The rejection region for a two-tail test is:

$$F = \frac{S_1^2}{S_2^2} > F_U \quad \text{or} \quad F = \frac{S_1^2}{S_2^2} < F_L$$

with area α/2 in each tail.

Testing Population Variances

To find the critical F values:

1. Find F_U from the F table for n₁ – 1 numerator and n₂ – 1 denominator degrees of freedom.

2. Find F_L using the formula:

$$F_L = \frac{1}{F_{U*}}$$

where F_{U*} is from the F table with n₂ – 1 numerator and n₁ – 1 denominator degrees of freedom (i.e., switch the d.f. from F_U)

Testing Population Variances

You are a financial analyst for a brokerage firm. You want to compare dividend yields between stocks listed on the NYSE & NASDAQ. You collect the following data:

           NYSE    NASDAQ
Number     21      25
Mean       3.27    2.53
Std dev    1.30    1.16

Is there a difference in the variances between the NYSE & NASDAQ at the α = 0.05 level?

Testing Population Variances

Form the hypothesis test:

- H₀: σ₁² – σ₂² = 0 (there is no difference between variances)
- H₁: σ₁² – σ₂² ≠ 0 (there is a difference between variances)

Find the F critical values for α = 0.05:

F_U:
- Numerator: n₁ – 1 = 21 – 1 = 20 d.f.
- Denominator: n₂ – 1 = 25 – 1 = 24 d.f.
- F_U = F_{.025, 20, 24} = 2.33

F_L:
- Numerator: n₂ – 1 = 25 – 1 = 24 d.f.
- Denominator: n₁ – 1 = 21 – 1 = 20 d.f.
- F_L = 1/F_{.025, 24, 20} = 0.41

Testing Population Variances

The test statistic is:

$$F = \frac{S_1^2}{S_2^2} = \frac{1.30^2}{1.16^2} = 1.256$$

Reject H₀ if F < F_L = 0.41 or F > F_U = 2.33 (α/2 = .025 in each tail).

- F = 1.256 is not in the rejection region, so we do not reject H₀
- Conclusion: There is insufficient evidence of a difference in variances at α = .05
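The F test in the variance example can be checked with a brief Python sketch. The function names are illustrative and not from the slides, and both critical values are copied from the example rather than looked up or computed.

```python
def f_statistic(s1, s2):
    """F statistic: variance of sample 1 divided by variance of sample 2."""
    return s1**2 / s2**2


def lower_critical_value(f_upper_swapped):
    """F_L as the reciprocal of F_U* (the table value with the d.f. switched)."""
    return 1 / f_upper_swapped


# NYSE (sample 1, n = 21) vs. NASDAQ (sample 2, n = 25), from the example.
f_stat = f_statistic(s1=1.30, s2=1.16)

f_upper = 2.33  # F_{.025, 20, 24}, from the example
f_lower = 0.41  # 1 / F_{.025, 24, 20}, from the example

# Two-tail rule: reject H0 if F falls in either tail.
reject_h0 = f_stat < f_lower or f_stat > f_upper

print(round(f_stat, 3), reject_h0)
```

`lower_critical_value` is included only to show the reciprocal rule; it is not called above because the table value with the switched degrees of freedom is not printed in the example.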

Chapter Summary

In this chapter, we have

- Compared two independent samples
  - Performed Z test for the differences in two means
  - Performed pooled-variance t test for the differences in two means
  - Performed separate-variance t test
  - Formed confidence intervals for the differences between two means
- Compared two related samples (paired samples)
  - Performed paired sample Z and t tests for the mean difference
  - Formed confidence intervals for the paired difference
- Compared two population proportions
  - Formed confidence intervals for the difference between two population proportions
  - Performed Z-test for two population proportions
- Performed F tests for the difference between two population variances
- Used the F table to find F critical values
