The Z test

7.1 The Z test

  This test is used when the null hypothesis specifies the mean and the variance of the observations. Let’s see how the test works using the following example. We toss a coin 100 times and we obtain 55 heads. Is this enough evidence to say that the coin is loaded?

7.1.1 Two tailed Z test

  1. We have n random variables

  X 1 ,...,X n

  where X i commonly represents a measurements from subject i in a sample of n subjects. We assume the following:

  • The random variables are independent and identically distributed.

  68 CHAPTER 7. INTRODUCTION TO CLASSIC STATISTICAL TESTS

  • If n < 30 the random variables are Gaussian.

  In our example X 1 ,...,X n are Bernoulli random variables. If the outcome

  of toss number i is heads then X i takes value 1 otherwise it takes value 0.

  2. Formulate a null hypothesis that specifies the mean and standard deviation

  of the mean. We represent these as E( ¯ X |H n true) and Sd( ¯ X |H n true)

  In our example the null hypothesis H n is that the coin is fair. In such case

  E( ¯ X |H n true) = E(X i |H n true) = 0.5 and

  Sd(X i |H n true)

  Sd( ¯ X |H n true) =

  3. Define the random variable Z as follows

  X ¯ − E( ¯ X |H n true)

  X ¯ − E( ¯ X |H n true)

  Sd( ¯ X |H n true)

  Sd(X i |H n true) n

  4. Compute, the value taken by the random variable Z,

  In our example

  5. If Z "∈ [−1.96, 1.96] reject H n and report that the results were statistically

  significant. Otherwise withhold judgment and report that the resuts were

  not statistically significant. In our example Z ∈ [−1.96, 1.96] so we withhold judgment. We do not have

  enough evidence to say that the coin is loaded.

  Proof: Since this is a classical test we just need to show that with this procedure the type I error specification is no larger than 5 . In other words P (H n rejected |H n true) ≤

  0.05. To do so first we will show that if H n is true then Z is an standard Gaussian random variable. First note that Z is just a linear combination of ¯ X

  7.1. THE Z TEST

  Z=a+b¯ X (7.5)

  E( ¯ X |H n true)

  a= −

  Sd( ¯ X |H n true)

  Sd( ¯ X |H n true)

  If X 1 ,...,X n are Gaussian then ¯ X is also Gaussian (because the sum of Gaus-

  sian is Gaussian). If X 1 ,...,X n are not Gaussian but n > 30, by the Central

  limit theorem, then the cumulative of ¯ X is approximately Gaussian. Under these conditions Z is also Gaussian (because Z is a linear transformation of ¯ X and a linear transformation of a Gaussian rav is also Gaussian). Moreover

  E(Z |H n true) = a + bE( ¯ X |H n true) = 0

  Var(Z 2 |H

  n true) = b Var( ¯ X |H n true) = 1

  Thus, if the null hypothesis is true and the assumptions of the Z test are met, then Z is a standard Gaussian random variable and

  P (H n rejected |H n true) = P (Z "∈ [−1.96, 1.96]) = 2Φ(−1.96) = 120 (7.10)

7.1.2 One tailed Z test

  In this case the null hypothesis includes an entire range of values for the sample mean. For example, the null hypothesis could be that the expected value of the mean is smaller than 0. Or that the expected value of the mean is smaller than 3. The first thing we do is to pick the extreme case proposed by the null hypothesis and which is finite. For example if the null hypothesis is that the expected value of the sample mean is larger than 3, then the extreme case is that the expected value is exactly 3. I’m going to represent this extreme case of the null hypothesis as H x to distinguish it from the more general null hypothesis that we want to reject. For

  example if the null hypothesis says E( ¯ X |H n true) ≤ 3 the the extreme case of the null hypothesis would claim E( ¯ X |H x true) = 3.

  70 CHAPTER 7. INTRODUCTION TO CLASSIC STATISTICAL TESTS

  1. Formulate the null hypothesis and the extreme hypothesis.

  2. Compute the value taken by the random variable Z as in the two-tailed case,

  only in this case use expected values given by H x instead of H n .

  3. If positive values of Z are the only ones inconsistent with H n and if Z >

  1.64 reject H n . Report that the results were statistically significant. If neg- ative values of Z are the only ones inconsistent with H n and if Z < −1.64 then reject the null hypothesis. Report that the results are statistically sig-

  nificant. Otherwise withhold judgment. Report that the results were not

  statistically significant. To see that this test also has the desired type I error specification note that

  P (H n rejected |H n true) ≤ P (H n rejected |H x true)

  = P (Z > 1.64) = Φ( −1.64) = 120

  Thus we have proven that the type I error specification is not larger than 120, making the test acceptable from a classical point of view.

  Example We want to show that bees prefer red flowers rather than yellow flow- ers. We construct 100 plastic flowers 50 of which are yellow and 50 red. Other than the color the flowers are indistinguishable in appearance. We then measure for 200 bees the amount of time they spend in yellow versus red flowers. We find that 160 out of the 200 bees spent more time in the red flowers. Is this enough evidence to say that bees prefer red flowers?

  We can model the preference of each bee as independent Bernoulli random

  variables X 1 ,...,X 200 , where X i = 1 represents the event that bee number i

  in the sample spent more time on the red flowers. In our case the sample mean takes the following value ¯ X = 160200 = 0.8. The null hypothesis says that

  E( ¯ X |H n true) ≤ 0.5. This hypothesis covers an entire range of values so a one tailed test is appropriate. The extreme case of the null hypothesis, symbolized as

  H x says that E( ¯ X |H x true) = 0.5. Thus,

Dokumen yang terkait

Analisis Komparasi Internet Financial Local Government Reporting Pada Website Resmi Kabupaten dan Kota di Jawa Timur The Comparison Analysis of Internet Financial Local Government Reporting on Official Website of Regency and City in East Java

19 819 7

Anal isi s L e ve l Pe r tanyaan p ad a S oal Ce r ita d alam B u k u T e k s M at e m at ik a Pe n u n jang S MK Pr ogr a m Keahl ian T e k n ologi , Kese h at an , d an Pe r tani an Kelas X T e r b itan E r lan gga B e r d asarkan T ak s on om i S OL O

2 99 16

ANTARA IDEALISME DAN KENYATAAN: KEBIJAKAN PENDIDIKAN TIONGHOA PERANAKAN DI SURABAYA PADA MASA PENDUDUKAN JEPANG TAHUN 1942-1945 Between Idealism and Reality: Education Policy of Chinese in Surabaya in the Japanese Era at 1942-1945)

1 29 9

Improving the Eighth Year Students' Tense Achievement and Active Participation by Giving Positive Reinforcement at SMPN 1 Silo in the 2013/2014 Academic Year

7 202 3

Improving the VIII-B Students' listening comprehension ability through note taking and partial dictation techniques at SMPN 3 Jember in the 2006/2007 Academic Year -

0 63 87

The Correlation between students vocabulary master and reading comprehension

16 145 49

Improping student's reading comprehension of descriptive text through textual teaching and learning (CTL)

8 140 133

The Effectiveness of Computer-Assisted Language Learning in Teaching Past Tense to the Tenth Grade Students of SMAN 5 Tangerang Selatan

4 116 138

The correlation between listening skill and pronunciation accuracy : a case study in the firt year of smk vocation higt school pupita bangsa ciputat school year 2005-2006

9 128 37

Transmission of Greek and Arabic Veteri

0 1 22