94 H. Bonesrønning Economics of Education Review 18 1999 89–105
grades, the equation would be misspecified. Eq. 14 should instead be thought of as a definitional relationship
that provides a measure of the divergence between exam results and teachers’ grades for each school.
To deal with the problem that students sort themselves into the specialization subjects, we apply the standard
Heckman 1979 procedure. That is, the probability that a student chooses mathematics is estimated from a probit
equation. The inverse Mill’s ratio is calculated from the probit equation and thereafter included in the education
production function to correct for selectivity bias.
4. Data and Results
The analysis is based on a sample of 1092 academic students from 15 upper secondary schools in one
Norwegian county. All students taking their final exam in 1990 are included. Table 1 presents the sample means
and the standard deviations of the variables used in this study. Appendix A provides variable definitions. The
teachers’ grades as well as the grades from the nationally administred exams are given on a 0–6 scale, where 0 and
1 represent failure and 6 is the best result. The table reports the teachers’ grades and the exam results for
Norwegian language and the specialization subjects mathematics, social sciences or foreign language. As
can be seen, the teachers’ grades are set well above the exam results, indicating that the latent variable G
3
is larger than 3, or alternatively, that the latent variable g
s
is larger than 1. The students’ initial level of knowledge
Table 1 Data means and standard deviations
Variable Mean
Std.dev. Min
Max Exam results Norwegian
3.03 0.91
6 language
Teacher gradingNorwegian
3.51 0.94
1 6
language Exam
resultsspecialization 2.81
1.26 6
subjects Teacher
gradingspecialization 3.51
1.19 5.5
subjects Aggregate gradeslower
42.26 3.18
32 53
secondary Teachers’ education
46.00 13.30
17 65
Laudable teachers 47.00
10.91 22
59 Teachers’ age
42.27 3.11
36 48
students 318.8
123 84
480 departments
3.66 1.33
2 6
Teachers per 100 7.37
1.03 6.30
9.80 students
is measured by an aggregate of their grades from the lower secondary school. 11 subjects are included, and
the scale is 0–5. The variation in the average initial level of knowledge across schools basically reflects that some
schools are in excess demand while others are in excess supply.
The teachers are characterized by experience, edu- cation and laudability. The two former characteristics are
the conventional indicators for teacher quality in edu- cation production function studies, while teacher lauda-
bility has not been utilized in any analysis we are aware of. The teachers’ teaching experience school averages
varies between 10 and 20 years approximately. The tea- chers’ education is measured by the fraction of teachers
having a master’s degree, i.e. we have essentially separ- ated between teachers with 4 and 6 years of education
beyond secondary school. As can be seen, 47 percent of the teachers have laudable results from the universities
and the variation between the schools is substantial.
Three school characteristics are reported; the number of teachers per 100 students, the number of students in
the academic department and the total number of depart- ments within the school. The separation between the
number of students within the academic department and the number of departments carries over from Bonesrøn-
ning 1996, and reflects the upper secondary school organization with academic and vocational departments
side by side.
No socioeconomic background variables have been available for this study. Omitted variable bias is thus a
potential problem. All earlier studies show that the stu-
95 H. Bonesrønning Economics of Education Review 18 1999 89–105
dents’ socioeconomic background simultanously influ- ence the students’ initial level of knowledge and their
final achievement. We therefore expect the coefficient for the initial level of knowledge to be biased upwards.
We do not expect that the other independent variables will be seriously affected by the omission of socioecon-
omic background variables. This expectation is sup- ported by Hanushek and Taylor 1990, who provide evi-
dence that serious misspecification problems are likely to be avoided when the students’ initial level of knowl-
edge is included.
4.1. Measuring G
3
and g
s
We have estimated Eq. 16 for the Norwegian langu- age, the specialization subjects and mathematics. The
results are reported in Tables 2 and 3. As can be seen from Table 2, the estimated G
3 j
coefficients are above 3 without exception, and all G
3 j
are sharply determined. The t-statistics are in the ranges above 16.0, which indi-
cate that the coefficients are significantly different from zero at the 0.5 percent level. However, the t-statistics
Table 2 The grading parameter G
3
.t-statistics in parentheses School
Norwegian language Specialization subjects
Mathematics High achievers
1 3.26
4.08 4.60
4.38 40.3
26.7 16.0
25.5 2
3.38 3.44
3.04 3.42
57.8 50.7
37.4 39.8
3 3.40
3.64 3.44
3.91 74.2
51.3 31.2
36.7 4
3.70 3.40
3.37 3.52
40.6 46.8
11.5 22.1
5 3.55
3.37 3.35
3.21 54.2
50.2 33.2
28.2 6
3.20 3.76
3.73 3.75
46.3 40.6
22.5 30.8
7 4.10
3.74 3.74
4.00 51.9
35.6 30.7
27.1 8
3.33 3.85
3.38 3.72
51.2 45.8
20.4 27.9
9 4.01
3.42 3.09
3.41 36.0
21.9 18.7
17.3 10
3.90 3.93
4.29 3.94
34.9 36.4
23.1 28.6
11 3.18
3.52 3.57
3.68 64.9
64.5 29.8
47.4 12
3.24 3.66
3.57 3.73
36.8 56.0
27.1 38.8
13 3.97
4.09 4.14
4.14 40.1
41.9 21.6
31.8 14
3.40 3.60
3.15 3.42
50.7 46.9
25.6 38.4
15 3.78
3.72 3.81
3.63 55.9
54.1 29.6
32.3
give irrelevant information in this case. We therefore tested the equality of school means. The null hypothesis
of the equality of school means is rejected for all sub- jects. Note that across subjects, mathematics shows the
largest variation in G
3 j
. A student who gets the grade 3 on the external mathematics exam has the expected tea-
chers’ grades 4.6 and 3.1 in the schools with the easiest and hardest grading respectively. This is somewhat sur-
prising, since we intuitively think of mathematics as rather objective, with little leeway for teachers to evalu-
ate equal achievements differently.
The estimated g
s j
are below 1 with a few exceptions. The partial correlations between G
3 j
and g
s j
are negative but rather small for all subjects, implying that the teach-
ers’ grades vary less with achievement when the teachers practice easy grading. The partial correlation coefficient
is very small in absolute value 0.03 for mathematics.
The differences in teachers’ grading practices docu- mented in Tables 2 and 3 are of such magnitudes that
they clearly affect the students’ probabilities of getting admission to the universities. Our concerns are whether
these differences are systematically associated with
96 H. Bonesrønning Economics of Education Review 18 1999 89–105
Table 3 The grading parameter g
s
t-statistics in parentheses School
Norwegian language Specialization subjects
Mathematics High achievers
1 0.84
0.87 1.03
0.89 8.3
8.3 6.2
7.9 2
0.71 0.73
0.80 0.78
10.8 14.3
17.1 13.8
3 0.79
0.70 0.76
0.54 17.3
13.4 9.5
6.4 4
0.70 0.82
0.73 0.77
7.3 14.6
3.7 7.1
5 0.81
0.80 0.91
0.94 10.7
15.9 12.3
10.8 6
0.81 0.98
0.86 0.95
10.7 13.1
4.7 7.0
7 0.58
0.71 0.81
0.68 5.9
7.2 9.2
4.3 8
0.96 0.84
1.00 1.05
13.0 11.5
7.6 8.3
9 0.80
0.79 0.77
0.68 5.9
6.8 6.8
4.3 10
0.75 0.68
0.59 0.63
5.8 8.3
3.1 5.3
11 0.78
0.78 0.91
0.72 14.8
16.0 8.2
10.3 12
0.77 0.81
0.94 0.81
8.0 14.2
8.7 9.3
13 0.62
0.62 0.59
0.56 6.0
7.3 3.4
4.8 14
0.69 0.76
0.73 0.85
9.0 12.9
7.5 12.3
15 0.68
0.83 0.84
0.91 8.7
16.1 8.6
10.6
teacher and student characteristics, and further, how the grading practices are related to student achievement. We
start the investigation of these questions by estimating the standard education production function. This is the
correct reduced form when grading is endogenous, and it serves as our benchmark model.
4.2. The Standard Education Production Function The estimation of Eq. 14 for the Norwegian langu-
age, for the specialization subjects, for mathematics and for high achievers in the specialization subjects are
reported in Table 4. Table 4 confirms three important previous findings
from education production function studies. First, the best predictor of individual students’ achievement is
their initial level of knowledge, and the coefficient is very sharply determined. Note however that since no
socioeconomic variables are included, the estimated coefficient for the initial level of knowledge is most
likely biased upwards. Second, the number of teachers per student does not seem to influence student achieve-
ment. We have been careful in specifying the teacher input exclusive all extra classroom activities, but have
been restricted to treat the teacher input as a school spe- cific variable. The latter is a problem if for instance low
achievers are assigned to smaller classes, but the student allocation to different class sizes is very much a random
process in the academic departments included in this study. It remains of course that if measurement error
exists, the estimated coefficient of the teacher-student ratio will be biased towards zero. Our third finding con-
sistent with earlier studies is that more educated teachers will not improve upon student achievement. One
interpretation of the results is that simple accumulation of credits with no regard for the subjects being taught
has no positive effect on student achievement. However, the significant negative sign of some of the estimated
teachers’ education coefficients remains a puzzle.
The importance of teachers’ experience is not well established by previous studies. Hanushek 1986 has
surveyed 109 education production function studies including teachers’ experience as an independent vari-
able. 33 of the estimates show a statistically significant
97 H. Bonesrønning Economics of Education Review 18 1999 89–105
Table 4 The standard education production functiont-statistics in parentheses
Variable Norwegian language
Specialization subjects Mathematics
High achievers Lower secondary grades
0.17 0.19
0.21 0.19
24.6 17.8
8.5 7.2
Student’s sex 0.06
2 0.11
2 0.31
2 0.08
1.4 1.6
2.2 0.8
Teachers’ education 2
0.08 2
0.01 2
0.004 2
0.01 3.0
1.8 0.5
1.0 Laudable teachers
0.01 0.03
0.04 0.03
3.4 6.3
4.7 4.1
Teachers’ age 0.02
0.07 0.12
0.07 1.6
3.8 3.2
2.5 students
2 0.0004
2 0.0002
2 0.002
2 0.001
1.5 3.1
2.8 1.5
departments 0.06
0.06 0.04
2 0.05
3.0 1.9
0.7 1.2
Teachers per 100 0.00
0.02 0.00
0.00 students
0.5 0.3
0.5 0.8
Lower secondary 2
0.01 2
0.08 2
0.01 2
0.01 gradesaverage
0.1 0.8
0.5 0.7
Constant 2
2.23 5.58
2 8.12
2 11.88
0.8 1.4
1.1 2.0
R
2 adj
0.38 0.25
0.26 0.12
N 1092
1092 332
559
relationship of the expected positive sign, while 7 dis- play a statistically negative relationship. 69 studies are
not significant at the 5 percent level. Our analysis shows that teacher experience has a significant and positive
influence on student achievement in all the reported specifications. Teacher experience seems to matter least
in Norwegian language and most in mathematics. In mathematics, the difference between the school with the
least experienced and the most experienced teachers is 1.44 grades, all else equal. Since this is a substantial
effect, we should try to sort out the reasons why experi- ence show a strong positive association with student
achievement in this study.
First, the positive correlation may result from more senior teachers having the ability to select schools and
classrooms with better students. This possibility is men- tioned by Hanushek 1986. We have investigated
whether teacher experience is associated with school and community characteristics, but it turns out that no stat-
istically significant relationships exist.
4
4
In Bonesrønning 1996, where the relationship between school size and student achievement is investigated empirically,
I find evidence that the teacher input is endogenous. The sample used in this paper is a subsample of the sample used in
Bonesrønning 1996. Thus, teacher behavior may differ between the Norwegian counties.
Second, we have tested the hypothesis that there exists an optimal number of teaching years, but the hypothesis
has been rejected. Recall that teachers’ experience varies little across the schools in our sample. This variation
may very well be within a relative linear part of a con- cave relationship. Further investigations are needed to
settle this question.
5
Third, the importance of teacher experience may have been obfuscated in previous analyses due to omitted vari-
ables. We have estimated, but not reported, Eq. 14 without the fraction of laudable teachers included. It
turns out that the significant association between teacher experience and student achievement disappears when the
laudability variable is excluded. For this sample at least, the importance of purchased teacher characteristics is
revealed only when appropriate controls are made for relevant non-purchased teacher characteristics.
The “new” indicator, the fraction of laudable teachers
5
Many of the students from the 68-generation became teach- ers in the upper secondary school. Born approximately in 1950,
these teachers were 38 years old in 1988. That is, the teachers from this cohort are among the youngest teachers in our sample.
Many researchers have pointed out that the 68- generation rep- resents a shift downwards in teaching quality. Our results are
consistent with this hypothesis, but without additional data the explanation is speculative.
98 H. Bonesrønning Economics of Education Review 18 1999 89–105
to all teachers, shows a significant and positive associ- ation with student achievement in all specifications
reported in Table 4. In mathematics, the difference in student achievement between the schools with the lowest
and the highest fractions of laudable teachers is 1.48 grades, all else equal. The positive association between
laudability and student achievement is an intuitive plaus- ible result, since laudability indicates that the teachers
have good understanding of the subjects they teach. Note however that the theoretical analysis has focused on two
mechanisms —teaching quality and teacher strength — through which teachers can influence student achieve-
ment. We are of course unable to separate between the influences from these two factors on the basis of the
results in Table 4, but will return to this issue below.
Now we investigate the relationships between grading practices and teacher characteristics.
4.3. The Grading Equation Eq. 15 is estimated by ordinary least squares with
G
3
as the dependent variable. The results are reported in Table 5. As can be seen, the grading in Norwegian lang-
uage is not significantly associated with any of the teacher characteristics, while the grading in the speciali-
zation subjects and mathematics is strongly correlated with all the teacher characteristics. For the latter subjects,
it is evident that laudable and experienced teachers prac- tice hard grading, while the most educated teachers prac-
tice easy grading. Moreover, in the specialization sub- jects
and mathematics the
grading parameter is
negatively associated with the students’ average initial level of knowledge. The null hypothesis that the teach-
Table 5 The grading equation. Dependent variable G
3
, t-statistics in parentheses Variable
Norwegian language Specialization subjects
Mathematics Lower secondary gradesaverage
0.33 2
0.18 2
0.23 1.9
2.9 1.7
Teachers’ age 0.04
2 0.07
2 0.09
1.2 5.7
3.4 Laudable teachers
0.01 2
0.01 2
0.25 1.2
4.1 3.8
Teachers’ education 0.001
0.001 0.02
0.2 4.5
3.9 students
2 0.001
0.001 0.0002
1.4 2.7
0.2 departments
2 0.05
2 0.06
2 0.08
0.7 2.4
1.6 Teachers per 100 students
0.04 2
0.06 0.001
0.4 1.9
1.8 Constant
2 12.26
15.10 16.63
1.6 5.5
2.7 R
2 adj
0.32 0.83
0.77
ers’ grading practices are not affected by the students’ average initial level of knowledge is formally rejected
for the specialization subjects, but not for mathematics. Note that we have not corrected for the multilevel
character of the data involved in this analysis. In general, such correction implies more conservative estimates.
Table 6 reports the OLS-estimation results with g
s j
as the dependent variable. There is basically no evidence
that the degree to which the teachers’ grades vary with achievement correlates with teacher characteristics.
Note that Table 5, columns 2 and 3, provide evi- dence that the teacher characteristics are associated with
hard grading in a way very similar to the association between teacher characteristics and student achievement
in Table 4. The crucial question then is whether these teacher
characteristics affect
student achievement
directly, or whether the teachers’ influence on student achievement is mediated through the grading practices.
This issue is now investigated within the education pro- duction function framework.
As noted in Section 2, the specification of the edu- cation production function incorporating grading effects
depends on where hard grading origins. This is related to the following question: How should the results in
Table 5, columns 2 and 3, be interpreted? It is clear that the results are consistent with the rent seeking
model. That is, the estimated coefficients might be inter- preted as showing that low achieving students press for
easy grading and that the teachers’ ability to withstand pressure is associated with laudability and experience.
We have interacted the students’ initial level of knowl- edge with the fraction of laudable teachers. The results
for mathematics, which are not reported, indicate that
99 H. Bonesrønning Economics of Education Review 18 1999 89–105
Table 6 The grading equation. Dependent variable g
s
, t-statistics in parentheses Variable
Norwegian language Specialization subjects
Mathematics Lower secondary gradesaverage
2 0.07
2 0.06
2 0.20
0.8 1.0
2.5 Teachers’ age
2 0.01
0.01 2
0.004 0.7
0.8 0.3
Laudable teachers 2
0.003 2
0.001 0.002
0.7 0.4
0.5 Teachers’ education
0.001 0.001
2 0.003
0.3 0.4
0.6 students
2 0.00003
2 0.002
2 0.00003
0.1 0.7
0.1 departments
0.02 0.04
0.03 0.5
1.9 1.0
Teachers per 100 students 2
0.01 0.01
0.3 0.4
Constant 4.38
2.56 9.12
1.2 1.0
2.6 R
2 adj
2 0.51
2 0.03
0.06
low achievers are less capable of affecting grading when the fraction of laudable teachers increases. These results
are consistent the model predictions: With increasing teacher strength, the rent seeking frontier shifts inwards,
which implies that the marginal returns to student rent seeking decreases.
However, the results may as well be interpreted as showing i that laudable and experienced teachers know
more about grading than other teachers admittedly, this interpretation take into account empirical results not
presented yet, and ii that all types of teachers conjec- ture that it is efficient to practice easy grading for low
achievers.
These interpretations
motivate two
different approaches to the education production function, to be
presented below. 4.4. The Education Production Function Revisited I
We start with the interpretation that grading is a policy instrument for the teacher. Thus grading should be
regarded as an exogenous variable to be included in the education production function.
Table 7 reports the results from estimation of the edu- cation producton function augmented with the grading
parameters G
3
and g
s
. For Norwegian language, G
3
and g
s
are both significantly associated with student achieve- ment when included one at a time, while neither of the
grading parameters turn out to be significant predictors of student achievement when included simultanously in
the equation. The signs of the coefficients indicate that students i respond to harder grading by increasing their
studying time and ii respond to increased marginal returns to achievement by increasing their studying time.
The magnitude of the grading effect the G
3
effect is illustrated by the following example. A student in the
school who practice the hardest grading performs 0.28 grades better than a student in the school which practice
the easiest grading, all else equal.
The G
3
coefficient is not significant different from zero for the specialization subjects, but when restricted
to the sample of high achieving students, the G
3
coef- ficient turns out to be a significant predictor of student
achievement. The negative sign indicates that high achieving students treat leisure as a normal good. More-
over, the grading effect is substantial. The sample vari- ation in G
3
multiplied by the estimated coefficient is equal to 1.3 grades. The g
s
coefficient is not significantly associated with student achievement, but the sign of the
g
s
coefficient indicates that students allocate less time to studying when the marginal returns to real achievement
increases. Since the g
s
effect should be interpreted as a substitution effect, this is a puzzling result. It might indi-
cate that our estimation strategy of treating the speciali- zation subjects as one subject is inappropriate.
To mitigate these problems we highlight mathematics which is one of the specialization subjects. We consider
the self-selection issue by estimating a selection equation with the dependent variable equal to 1 if the student
choose mathematics and 0 otherwise. The independent variables are individual student characteristics and two
grading parameters which are the G
3
parameters for mathematics and for all the specialization subjects net of
mathematics. The results are reported in Table 8, col- umn 1.
As expected, males are more likely to choose math-
100 H. Bonesrønning Economics of Education Review 18 1999 89–105
Table 7 The education production function revisited It-statistics in parentheses
Specialization Variable
Norwegian language Norwegian language Norwegian language High achievers
subjects Lower secondary
0.17 0.17
0.17 0.19
0.19 grades
24.6 24.7
24.7 17.8
7.2 Student’s sex
0.07 0.07
0.07 2
0.10 2
0.08 1.5
1.5 1.5
1.5 0.8
Teachers’ education 2
0.01 2
0.01 2
0.01 0.001
0.01 3.3
2.5 2.5
0.2 1.1
Laudable teachers 0.01
0.01 0.01
0.02 0.01
3.6 3.9
3.9 2.0
1.4 Teachers’ age
0.03 0.03
0.03 0.04
2 0.002
2.2 2.3
2.3 0.9
0.2 students
2 0.0004
2 0.001
2 0.001
2 0.002
2 0.002
1.4 2.0
2.4 2.4
2.2 departments
0.06 0.05
0.05 2
0.07 2
0.07 3.1
2.4 2.4
1.7 1.5
Teachers per 100 2
0.002 2
0.005 2
0.006 2
0.001 2
0.07 students
0.1 0.2
0.2 0.1
0.9 Lower secondary
2 0.02
0.03 0.03
2 0.23
2 0.27
gradesaverage 0.4
0.4 0.4
1.4 1.2
g
s
0.49 0.12
2 0.92
2 0.87
1.8 0.3
1.2 1.2
G
3
2 0.27 1.6
2 0.30 2.4
2 0.64 0.9
2 1.13 2.2
Intercept 2
4.91 1.6 2
5.90 1.9 2
5.69 1.9 5.51 0.5
1.20 0.9 R
2 adj
0.38 0.38
0.38 0.25
0.13 N
1092 1092
1092 1092
559
ematics than females, and high achievers are more likely to choose mathematics than low achievers. There are
some indications, although insignificant, that the prob- ability for choosing mathematics increases when the
grading in mathematics gets easier, which seems like a intuitive plausible result. A surprising result, at least at
first glance, is that the probability for choosing math- ematics increases when the grading in other specializa-
tion subjects get easier. At second thought however, we already know that the teacher characteristics associated
with easy grading also are those teacher characteristics associated with poor student performance. The students
then face a trade-off between easy grading and teaching quality, and their subject choices cannot easily be pre-
dicted. It might be argued that measures of teaching quality should be included in the selection equation, but
unfortunately we have been unable to separate the teach- ers into the relevant categories.
We apply the Heckman 1979 model to correct for selectivity bias, which implies that the inverse Mill’s
ratio is included in the mathematics education production function. The results are reported in Table 8, columns
2 and 3. The coefficient of the selection-bias variable l
is significant, which indicate that selectivity bias is an issue. Table 8, column 4, reports the results for the
mathematics education production function exclusive the selection-bias variable. Note that the gender differences
appear to be much larger when the l variable is included. This is consistent with earlier findings showing that the
effect of self-selection is particularily strong for females.
In column 2 the G
3
coefficient is positive and insig- nificant. In column 3, where we have interacted the
students’ initial level of knowledge with the G
3
coef- ficient, the grading parameter is significantly associated
with student achievement at the 10 percent level. Stu- dents with very high initial levels of knowledge are posi-
tively affected by harder grading, while students with lower initial levels of knowledge seem to respond to
harder grading by decreasing their studying time. In one interpretation, these results indicate that high achievers
treat leisure as a normal good, while low achievers treat leisure as an inferior good. The g
s
coefficient is insig- nificant, but it still has the “wrong” sign.
The grading effects do not seem to be very much affected by the controls made for selectivity bias. As can
be seen from column 4, the students’ response to hard
101 H. Bonesrønning Economics of Education Review 18 1999 89–105
Table 8 Grading effects in mathematicst-statistics in parentheses
Variable Probit
Mathematics Mathematics
Mathematics Lower secondary grades
0.18 0.52
0.82 0.58
12.1 4.2
3.6 2.8
Student’s sex 2
0.95 2
1.96 2
1.87 2
0.35 10.3
3.0 2.9
2.4 Teachers’ education
2 0.002
2 0.002
2 0.01
0.2 0.1
0.8 Laudable teachers
0.04 0.04
0.05 2.6
2.6 3.5
Teachers’ age 0.08
0.08 0.14
1.3 1.3
2.5 students
2 0.004
2 0.004
2 0.002
3.6 3.6
2.7 departments
0.05 0.04
0.06 0.7
0.6 0.8
Teachers per 100 2
0.002 1.7 2
0.002 1.6 2
0.008 0.7 students
Lower secondary 2
0.12 2
0.15 2
0.08 gradesaverage
0.5 0.6
0.3 1.3
0.9 1.7
1.9 G
3 spec
0.66 2.7
g
s math
2 0.62 0.7
2 0.71 0.8
2 0.30 0.4
Lower secondary 2
0.88 2
1.02 grades G
3 math
1.6 1.8
l 2.58 2.6
2.41 2.4 R
2 adj
0.27 0.27
0.26 N
1092 1092
1092 332
grading is somewhat more sensitive to the students’ initial level of knowledge when no controls for selec-
tivity bias are made. This might reflect that the high achieving students who choose mathematics when grad-
ing is hard, are those students who respond to harder grading by increasing their effort.
There are two reasons why the results regarding the effects of teachers’ grading reported in this section may
be spurious. First, the positive correlation between hard grading and student achievement might be generated by
omitted variables of teaching quality that are positively correlated both with hard grading and student achieve-
ment. That is, no grading effect may actually exist. A related argument could be made about the “wrong” sign
of the g
s
coefficient. At this stage, we have no solution to offer to this potential omitted variable problem. The
second reason the results may be spurious is that grading is determined by rent seeking students. Then the positive
relationship between hard grading and student achieve- ment is mediated at least partially through the students’
time constraint. This problem is discussed in the next section.
4.5. The Education Production Function Revisited II The alternative interpretation of the estimation results
from Eq. 15 is that rent seeking occurs for the speciali- zation subjects. In this case the estimated grading para-
meter coefficients reported in Section 4.3 will be biased. The bias is due to correlation between the grading para-
meter and the residual, generated by the unobserved student characteristics which determine the grading para-
meter. To deal with this problem, instruments for the teacher grading parameter are generated from Eq. 15
and included in Eq. 14. Since the students’ initial aver- age level of knowledge is significantly associated with
teachers’ grading, but not with student achievement, identification might be possible.
The instruments are used to test whether the assump- tion that the grading parameter can be treated as exogen-
ous is valid. We apply the Durbin–Wu–Hausman speci- fication test in a fashion similar to Ehrenberg and Brewer
1995. That is, both the grading parameter and the instrument are included in Eq. 14. The results for math-
ematics and for the high achievers in the specialization
102 H. Bonesrønning Economics of Education Review 18 1999 89–105
subjects are reported in Table 9. We do not consider that endogeneity of the grading parameter is a problem
for Norwegian language. As can be seen, the coefficients of the instruments are
small and insignificant in all specifications. Formal F- tests suggest that one cannot reject the hypothesis that
the coefficients of the instruments are equal to zero. We have generated alternative instruments by including
interaction terms in Eq. 15, but in no case the instru- ments turn out to be significantly associated with student
achievement. Formally, the tests imply that the grading parameters should be treated as exogenous, that is, the
equation estimated in Section 4.3 seems to be the cor- rect equation.
Unfortunately, the evidence provided by these tests cannot be regarded as being very decisive. As can be
seen from Table 9, the coefficients of the teacher charac- teristics become insignificant when the instruments are
included in the estimated equation. This is an indication that the identification problems are not properly solved.
The sample does not provide any additional variables that could help solve the identification problems.
Table 9 The education production function revisited IIt-statistics in parentheses
Variable Mathematics
High achievers Lower secondary grades
0.78 0.19
3.4 7.2
Student’s sex 2
1.99 2
0.08 3.0
0.8 Teachers’ education
0.02 0.01
0.9 1.2
Laudable teachers 0.01
0.01 0.5
1.4 Teachers’ age
2 0.01
2 0.01
0.1 0.2
students 2
0.004 2
0.002 3.6
2.2 departments
2 0.07
2 0.07
0.7 1.5
Teachers per 100 students 2
0.11 2
0.07 0.7
0.9 G
3
13.6 2
1.11 1.6
2.1 g
s
2 0.29
2 0.86
0.3 1.2
G
3 IV
2 11.2
2 0.03
1.3 0.2
Lower secondary gradesG
3
2 0.29
1.5 Lower secondary gradesG
3 IV
0.02 1.2
R
2 avg
0.27 0.13
N 332
559
5. Concluding Remarks