Summative Test Test 1. The Definition of Test
the important characteristic of the good test can be classified into three main aspects, they are validity, reliability, and practicality.
33
a. Validity Validity is one of the criteria in a good test. Test validity presupposes about
is to tested. J.B. Heaton points out: the validity of a test is measure what is supposed to be measure.
34
It means that the validity is the extent to which a test measures what it is intended to measure. It is vital for a test to be valid in order
for the results to be accurately applied and interpreted. As stated by Suharsimi: A test is valid if the test was able to accurately
measure what is to be measured.
35
The validity of a test must be considered in measurement in this case there must be seen whether the test used really measures what are supposed to
measure. A test is categorized as valid test; if the test measure what should be measured.
Validity contains for types called content validy, concurrent validy, predective validity, and construct validity.
Content validity is concerned with the total of sampling which is used for content. It means that
Concurrent validity is concerned the relationship between the test score and the variable of the test that will be measured. It means that
33
Arifin, op. cit., p. 246.
34
Heaton, op cit., p. 159.
35
Suharsimi Arikunto, Dasar-dasar Evaluasi Pendidikan, Jakarta: PT. Bumi Aksara, 2009, revision edition, p. 59.
Predictive validity is concerned with the test score as a function to score the performance at certain time. It means that
Constructive validity is an addition to measure the validity if three of measurement above is not ample to be measured.
b. Reliability Reliability refers to the consistency of the test scores in which a test
measures the same thing all the time. In other words, the reliability of a test refers to its consistency in which it yields the same rank for an individual taking
the test for several time. To have confidence in a measuring instrument, we would need to be assured, for example, that approximately the same result would
be obtained 1 if we tested a group on Tuesday instead of Monday. 2 if we gave two parallel forms of the test to the same group on Monday and Tuesday.
3 if we scored particular tets on Tuesday and Monday. 4 if two or more scorers scored the test independently. It is clear from the foregoing that two
somewhat different types of consistency or reability are involved: reability of the test itself, and reability of the scoring of test. The writer concludes test is reliable
if it consistently yields the same, or nearly the same rating over repeated administration.
According to Wilmar Tinambunan, “there are several ways of estimating the reliability of a test. The three basic methods and the type of information each
provides are as follow: