
Point biserial (D),       D ≤ 0.199        Very low
discrimination power      0.200 – 0.299    Low
                          0.300 – 0.399    Average
                          D ≥ 0.400        High
Prop. endorsing,          0.000 – 0.010    Low
quality of options        0.011 – 0.050    Sufficient
                          0.051 – 1.000    Good
Alpha reliability         0.000 – 0.400    Low
                          0.401 – 0.700    Average
                          0.701 – 1.000    High

To make it easier to choose which test items need to be revised or dropped, the following criteria are recommended:

Table 2. The criteria to classify the quality of the test items

Criteria                  Index            Classification    Interpretation
Prop. correct (p),        0.000 – 0.099    Very difficult    Drop/needs total revising
level of difficulty       0.100 – 0.299    Difficult         Needs revising
                          0.300 – 0.700    Average           Good
                          0.701 – 0.900    Easy              Needs revising
                          0.901 – 1.000    Very easy         Drop/needs total revising
Point biserial (D),       D ≤ 0.199        Very low          Drop/needs total revising
discrimination power      0.200 – 0.299    Low               Needs revising
                          0.300 – 0.399    Quite average     Without revision
                          D ≥ 0.400        High              Very good
Prop. endorsing,          0.000 – 0.010    Least             Drop/needs revising
quality of options        0.011 – 0.050    Sufficient        Good enough
                          0.051 – 1.000    Good              Very good
Alpha reliability         0.000 – 0.400    Low               Not sufficient
                          0.401 – 0.700    Average           Sufficient
                          0.701 – 1.000    High              Good
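The criteria in Table 2 can be applied mechanically once the indices for an item are known. The following is a minimal sketch in Python, not part of the original analysis procedure. The names classify, DIFFICULTY, DISCRIMINATION, OPTION_QUALITY, and RELIABILITY are hypothetical. The sketch assumes that the first discrimination band means D of at most 0.199, that the last means D of at least 0.400, and that each index is rounded to three decimal places before it is compared with the table.

```python
from typing import List, Tuple

# Each band is (inclusive upper bound, classification, interpretation),
# copied from Table 2. Bands are listed in ascending order.
Band = Tuple[float, str, str]

DIFFICULTY: List[Band] = [
    (0.099, "Very difficult", "Drop/needs total revising"),
    (0.299, "Difficult", "Needs revising"),
    (0.700, "Average", "Good"),
    (0.900, "Easy", "Needs revising"),
    (1.000, "Very easy", "Drop/needs total revising"),
]

# Assumption: the lowest band has no lower limit and the highest band
# has no upper limit (D <= 0.199 and D >= 0.400).
DISCRIMINATION: List[Band] = [
    (0.199, "Very low", "Drop/needs total revising"),
    (0.299, "Low", "Needs revising"),
    (0.399, "Quite average", "Without revision"),
    (float("inf"), "High", "Very good"),
]

OPTION_QUALITY: List[Band] = [
    (0.010, "Least", "Drop/needs revising"),
    (0.050, "Sufficient", "Good enough"),
    (1.000, "Good", "Very good"),
]

RELIABILITY: List[Band] = [
    (0.400, "Low", "Not sufficient"),
    (0.700, "Average", "Sufficient"),
    (1.000, "High", "Good"),
]


def classify(value: float, bands: List[Band]) -> Tuple[str, str]:
    """Return (classification, interpretation) for an index value."""
    # Assumption: round to the three decimals used in Table 2 so that
    # no value falls between two adjacent bands.
    rounded = round(value, 3)
    for upper, label, interpretation in bands:
        if rounded <= upper:
            return label, interpretation
    raise ValueError(f"value {value} is above the highest band")


if __name__ == "__main__":
    # Illustrative values only, not data from this research.
    print(classify(0.45, DIFFICULTY))
    print(classify(0.25, DISCRIMINATION))
    print(classify(0.03, OPTION_QUALITY))
    print(classify(0.72, RELIABILITY))
```

An item whose proportion correct and discrimination power both fall in a band interpreted as "Good" or better would be kept, while an item in a "Drop" band would be removed or rewritten.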

III. RESEARCH DESIGN

This chapter discusses the design of this research and how the data were collected from the research participants. The writer also describes the data collecting technique, the procedures of this research, the scoring system, and how the data were analyzed.

3.1 The Research Design

The design of the research was descriptive analytic. This research was intended to determine whether or not the first semester English test for the first year students of SMK Negeri 1 Gedong Tataan in the 2012/2013 academic year met such criteria as face validity, content validity, construct validity, reliability, level of difficulty, discrimination power, and the quality of options. The descriptive analytic method is used to evaluate a document without reducing it or adding to it. The data were authentic and were analyzed as they were.

3.2 Setting of the Research

1. Time

The research was conducted over one week. It was administered during the English lesson, the subject being tested, after the students had finished their English semester test, and the writer asked for their time to complete a questionnaire about face validity.