31
3.6.2 Reliability
Harris in Sisbiyanto and Trisanti 2009: 14 state that “by reliability is meant the stability of test scores. A test cannot measure anything well unless it measures
consistently.” It might be assessed in many ways. In this research, to find out the reliability of the research, Alpha formula as stated by Arikunto 2006: 196 was
applied, as the following: = − 1 1 −
∑
where, r
11
= reliability of the instrument, k
= the number of the items of aspect evaluation, Σαb² = the sum of item variance,
and αt² = total variance Total Variance:
= ∑
− ∑
The Item Variance:
= ∑
− ∑
3.6.3 Difficulty Level
Good test items are the items which are not very difficult and also not very easy Arikunto, 2002:207. Very easy test items cannot stimulate the students to solve
the test, and very difficult test items will make the students give up finding the
32
answer of the test. To find out the difficulty level of the test items, this formula is used:
DL =
Sum of Score x 100 Maximum Score
3.6.4 Discriminating Power
Discriminating power is the power of the test items to discriminate between smart students and the others Arikunto, 2002:211. The index of discriminating power
can range from -1.00 to 1.00. The negative symbol signs that the test fails in measuring the quality of the test. It means that the high quality of the test is
judged as the poor item. To find out the discriminating level for the test items, I used this formula:
= −
∑ + ∑
− 1 Criteria: The item test is significant if t t
table.
Here is the index discrimination to evaluate the test items can be shown in the following Table 3.3:
Table 3.3 Index of Discriminating Power Index of Discrimination
Item Evaluation
0.40 and up Very good items
0.30 to 0.39 Reasonably good, but possibly subject to
improvement 0.20 to 0.29
Marginal items, usually needing and being subject to improvement
Below 0.19
The test i students, those test ite
And they are also not cannot answer them.
correctly.
3.6.5 T-test