Directory UMM :Data Elmu:jurnal:E:Economics Letters:Vol70.Issue2.Feb2001:
www.elsevier.com / locate / econbase
Change point estimation in regressions with I(d ) variables
*
Chih-Chiang Hsu
Department of Economics, National Central University, Chung-Li 32054, Taiwan Received 18 February 2000; accepted 15 July 2000
Abstract
In this paper we study the least-squares change-point estimator in regressions with stationary and invertible
I(d ) regressors and disturbances. We find that the least-squares estimator remains consistent when there is a
one-time break, but it may identify a spurious change when there is none. 2001 Elsevier Science B.V. All rights reserved.
Keywords: Change point estimation; Fractional Brownian motion; Spurious change JEL classification: C22
1. Introduction
The change-point problem has long been a leading research topic in the statistics and econometrics literature. For a mean shift in weakly dependent I(0) processes, Bai (1994) established the consistency and rate of convergence results for the least-squares change-point estimator under fairly general conditions. These results were subsequently extended by Kuan and Hsu (1998) for fractionally integrated I(d ), 20.5,d,0.5, processes with a mean change. Similar results were also obtained for stationary regression models, time trend models, and cointegration models; see e.g., Nunes et al. (1995) and Bai (1997, 1998). On the other hand, Nunes et al. (1995) and Kuan and Hsu (1998) observed that, for the data that are nonstationary I(1) series or have long memory (I(d ) with 0,d,0.5), the least-squares estimator may suggest a spurious change point when there is none. In this paper, we extend the results of Kuan and Hsu (1998) to regression models with stationary and invertible I(d ) regressors and disturbances. It is shown that the least-squares estimator remains consistent when there is a one-time change. When there is no change and the disturbances have long memory, we demonstrate that existing structural-change tests will reject the null with probability
*Tel.:1886-342-271-51; fax: 1886-342-228-76. E-mail address: [email protected] (C.-C. Hsu).
0165-1765 / 01 / $ – see front matter 2001 Elsevier Science B.V. All rights reserved. P I I : S 0 1 6 5 - 1 7 6 5 ( 0 0 ) 0 0 3 6 1 - X
(2)
approaching one and hence may incorrectly suggest that a change has occurred. Moreover, the least-squares estimator may also identify a spurious change point in the middle of the sample. Interestingly, these phenomena are irrelevant to the dependence structure of regressors. In contrast with the spurious regression results of Tsay and Chung (2000), our finding indicates that a spurious change may still occur even when there is no spurious regression. This paper proceeds as follows. Least-squares consistency is established in Section 2. The spurious-change results of test and estimation are discussed in Section 3. Section 4 gives some concluding remarks. All proofs are given in Appendix A.
2. Consistency
We consider the data generating process (DGP) of y which has a one-time change at unknownt point k :0
9
9
yt5xtb11xtl1ht.k0j1´t, t51, . . . ,T, (1)
where xt5[1,x ]t 9, bi5[a ,b ]i i 9, i51,2, l5b22b15[l1,l2]9 denotes the magnitude of parameter changes, and 1 is the indicator function. Also let k05[Tt0], the integer part of Tt0, so that t0
represents the relative position of k in the sample. Given (1), we impose the following conditions.0
] ] ]
[A1]. t0[[t],t], where t],t and [t],t] is a proper subset of [0,1].
[A2]. Both x andt ´t are stationary and invertible ARFIMA (autoregressive, fractionally integrated, moving average) processes:
dx d´
F(L )(12L ) x 5Q(L )v, F(L )(12L ) ´ 5Q(L )u ,
x t x t ´ t ´ t
where F(L ) andQ(L ) are polynomials in the lag operator L such that their roots lie outside the unit circle, and 20.5,d ,dx ´,0.5.
[A3]. The processes hvj andhuj are two mutually independent sequences of i.i.d. random variables
t t
2 2 d1
with zero mean, finite variances s and s , Euvu , ` with d $maxh4,28d /(112d )j, and
v u t 1 x x
d2
Euutu , ` with d2$maxh4,28d /(1´ 12d )´ j.
Conditions [A2] and [A3] ensure that x andt ´t obey the functional central limit theorem (FCLT):
[Tt] [Tt]
1 1
]]]T0.51dx
O
xt⇒kxB (dxt), ]]]T0.51d´O
´t,⇒k´B (d´t), (2)t51 t51
where ⇒ denotes weak convergence, k is a positive constant, and B is the fractional Browniand
P D
motion. In what follows we also write → as convergence in probability and → as convergence in distribution.
ˆ
For each hypothetical change point k, the least-squares change-point estimator solves k5 argmin1#k#T RSS(k), where
(3)
k T
2 2
ˆ ˆ
9
9
RSS(k)5
O
( yt2xtb1(k)) 1O
( yt2xtb2(k)) , t51 t5k11ˆ ˆ
andb1(k) and b2(k) are the pre- and post-change least-squares estimators ofb1 andb2. The estimated
ˆ
break fraction, t, is defined as
]
ˆ
* *
t5inf
h
t :t 5argmint[[t],t] RSS([Tt]) ,j
(3)ˆ ˆ
and we may write t5k /T. It is easy to verify that T
2 ˜
9
RSS([Tt])5
O
(´t1xtl1ht.[Tt]j) 2H ([TT t]), 0t51
where
21 21
˜ ˜
9
˜ ˜9
˜H ([TT t])5E (0,T t)MT (0,t)E (0,T t)1E (Tt,1)MT (t,1)E (Tt,1), (4) with
[Tt2] [Tt2]
˜
9
9
E (Tt1,t2)5
O
xts
´t1xtl1ht.[Tt0]jd
, M (Tt1,t2)5O
x x .t t t5[Tt1]11 t5[Tt1]11Hence, the least-squares estimator of (3) is the same as that obtained from maximizing (4). Bai (1997)
ˆ
establishes the consistency result for t in regression models with weakly dependent data; Kuan and Hsu (1998) also prove consistency for the location model with ARFIMA data. In this section, we generalize these consistency results to regression models with ARFIMA regressors and disturbances.
P
ˆ
Theorem 2.1. Given DGP (1), suppose that Conditions [A1]–[A3] hold. Then t→t as T→`, 0
ˆ
where t is given by (3).
We report some Monte Carlo results in Fig. 1. The data y are generated according to (1) witht
b15[1,0.5]9, b25[2,1]9, and t050.5. Four combinations of d and d are considered: (d ,d )x ´ x ´ 5 (0.4,0.4), (20.4,0.4), (0.4,20.4), (20.4,20.4). In each experiment, T5100, and the number of
ˆ
replications is 50000. Fig. 1 shows that the empirical distributions of k are more concentrated around the true change point, although their precisions vary with d and d . These simulations confirm thex ´
result of Theorem 2.1. However, it seems that the value of d is more crucial in determining the´
ˆ
performance of k. In view of Fig. 1 (a) and (c) (or (b) and (d)), we can see that for dx50.4
ˆ
(dx5 20.4), k performs better when d is negative. On the other hand, comparing Fig. 1 (a) and (b),´
ˆ
we observe that given d´50.4, k performs similarly for dx50.4 and dx5 20.4; in fact, positive dx yields a slightly better result. The same conclusion holds for Fig. 1 (c) and (d).
3. Spurious change
We have shown that the least-squares estimator is capable of locating the change point when it exists. In this section, however, we demonstrate that it is difficult to know if there is indeed a
(4)
ˆ
Fig. 1. Empirical distributions of k for regression models when k0550. (a) dx50.4, d´50.4 (b) dx5 20.4, d´50.4 (c) dx50.4,
d´5 20.4 (d) dx5 20.4, d´5 20.4.
structural change in I(d ) data. In practice, researchers usually first test the existence of a change and then decide if they should proceed to estimate the change point. Unfortunately, as noted by Kuan and Hsu (1998), many well-known tests for structural change have significant size distortions when data have long memory. In contrast with Tang and MacNeill (1993), these size distortions do not result from ignoring nuisance parameters of weak dependence. In fact, the null of parameter constancy will be rejected with probability approaching one when data have strong positive correlations. In regressions with I(d ) variables, consider the Wald statistic for the null hypothesisb15b2 against the alternative (1). For each possible break k, the Wald statistic is
21
ˆ ˆ ˆ ˆ ˆ ˆ
W (k)T 5(b1(k)2b2(k))9(V(1,k)1V(k,T )) (b1(k)2b2(k)),
21
ˆ
where V(t1,t2)5MT (t1,t2)RSS /T. Under the null hypothesis, l50, and hence [Tt2]
˜ ˜
E (Tt1,t2)5E (Tt1,t2)5
O
xt´t, H ([TT t])5H ([TT t]) t5[Tt1]1121 21
9
9
5E (0,T t)MT (0,t)E (0,T t)1E (Tt,1)MT (t,1)E (Tt,1). It follows that
(5)
H ([TT t])2HT
]]]]]]]
W ([TT t])5 T ,
2
O
´ 1H ([Tt]) /TS
t TD
t51
21
9
where HT5E (0,1)MT T (0,1)E (0,1). The next result shows that W ([TT T t]) diverges when d´.0, regardless of the value of d .x
Theorem 3.1. Given DGP (1) with l50, suppose that Conditions [A2] and [A3] hold with
] 0,d´,0.5. Then for each t[[t],t],
2 2
k´[B (d t)2tB (1)]d
D ´ ´
22d´ ]]]]]]]
T W ([TT t])→ g(0)t(12t) (5)
´
as T→`, whereg(0) is the variance of ´.
´ t
Theorem 3.1 shows that for a positive d , W ([T´ T t]) diverges in probability. By the continuous mapping theorem, the supremum Wald test of Andrews (1993) and mean Wald and exponential Wald tests of Andrews and Ploberger (1994) also diverge to infinity. Thus, when there is no change, the Wald-type tests would reject the null of parameter constancy with probability approaching one as long as the disturbances have long memory (i.e., 0,d´,0.5), and hence may incorrectly suggest that a change has taken place. As the result of Theorem 3.1 does not depend on d , this conclusion holdsx even when the regressor is antipersistent (i.e., 20.5,dx,0). The Lagrange multiplier and
ˆ
likelihood ratio tests also suffer from the same problem. Moreover, when V is replaced with the Newey-West estimator to account for weak dependence, we can show that W still diverges as dT ´.0. Our simulations also confirm the asymptotic analysis of Theorem 3.1. The DGP is generated according to (1) with b15b25[1,2]9 for different combinations of d and d : dx ´ x5 20.45, . . . ,0.45 and d´50.1, . . . ,0.45 with the increment 0.05. Thus, there are 162 experiments; for each experiment,
T5200, the number of replications is 5,000, and the nominal size is 5%. We compute the supremum
ˆ
Wald test of Andrews (1993). In computing the Wald statistic, we estimate V by the Newey-West estimator. The bandwidth is determined by the data-dependent formula of Andrews (1991) with AR(1) specification. The rejection frequencies of the supremum Wald test are summarized in Fig. 2. It can be seen that the type I errors increase from approximately 25% for d´50.1 to about 95% for
d´50.45, regardless of the value of d . Therefore, as long as d is positive, the existing tests sufferx ´
from serious size distortions.
The previous results suggest that, whether or not there is a change, it is likely that one would be led to estimate the change point after conducting standard structural-change tests. From the proof of Theorem 3.1, we know that
22d´ ]
ˆ
t 5argmaxt[[t,t] T H ([TT t])
]
2 2 2 2
k´B (d t) k´[B (1)d 2B (d t)]
D ´ ´ ´
] ]]] ]]]]]]
→argmaxt[[t,t] 1
t (12t)
] ]
;argmaxt[[t,t] G (d´ t),
(6)
Fig. 2. Empirical sizes of sup-Wald-type tests for regression models.
]
for any t[[t],t]. By invoking the functional law of iterated logarithm (Taqqu, 1977), we have
2 2
B (d t) [B (1)d 2B (d t)]
´ ´ ´
]] ]]]]]]
lim sup 5lim sup 50 a.s.
t 12t
t→0 t→1
2 2
Thus, the asymptotic behavior of G (d t) is well-defined on [0,1] with G (0)d 5G (1)d 5k´B (1) . Itd
´ ´ ´ ´
D
ˆ ˆ
follows thatt has a limiting distribution with support equal to [0,1], and thatt→argmaxt[[0,1] G (d t).
´
However, the maximum of G (d t) is not attained at 0 or 1 with probability one, since
´
2 2
kd´[tB (1)d´ 2B (d´ t)] ]]]]]]]
G (0)d 2G (d t)5G (1)d 2G (d t)5 2 ,0,
´ ´ ´ ´ t(12t)
for any t[(0,1). This proves the following theorem.
Theorem 3.2. Given DGP (1) with l50, suppose that Conditions [A2] and [A3] hold with
D
] ˆ
0,d´,0.5. If ]t50 and t51, then t→argmaxt[[0,1] G (d´ t) and G (0)d´ 5G (1)d´ ,G (d´t) for any
t[(0,1) with probability one.
Ideally, when there is no change, we expect that the probability mass of the change-point estimator
P
ˆ
concentrates around the two end points of the sample; i.e., t→h0, 1j. Theorem 3.2 shows that,when
ˆ ˆ
d´.0,t is unlikely to be close to 0 and 1; it is more likely thatt incorrectly identifies a change point in the middle of the sample even when there is none. This is the problem of spurious change, in the sense of Nunes et al. (1995). Nunes et al. (1995) and Bai (1998) showed that the I(1) regressors and errors are responsible for the occurrence of spurious break and spurious regression. Tsay and Chung (2000) also pointed out that the spurious regression may occur when dx1d´.0.5. From Theorem 3.2, we can see that the spurious-change problem may still exist even if the regressors and disturbances are stationary and dx1d´,0.5. That is, a spurious change may arise when there is no spurious regression.
We examine some simulations which are conducted as that in Section 2 butb15b25[1,0.5]9. Four cases are considered: (d ,d )x ´ 5(0.35,0.35), (20.35,0.35), (0.35,20.35), (20.35,20.35). Fig. 3
(7)
ˆ
Fig. 3. Empirical distributions of k for regression models when there is no change. (a) dx50.35, d´50.35 (b) dx5 20.35, d´50.35 (c)
dx50.35, d´5 20.35 (d) dx5 20.35, d´5 20.35. ˆ
shows that, for cases (a) and (b), k has the lowest rejection frequencies at the two end points (1 and
T ). Hence, the least-square estimator will most likely find a change point in the middle of the sample
rather than two end points; this confirms the result of Theorem 3.2. Note that when dx1d´,0.5 (Fig. 3 (b)), the spurious-change problem still occurs. Moreover, we also consider the cases that d´,0. In
ˆ
contrast with Fig. 3 (a) and (b), (c) and (d) show that the empirical distributions of k now have more probability mass at the two end points. Although we do not give a formal proof, it seems that the spurious-change problem occurs only for a positive d . As the results of Section 2, the fractionally´
differencing parameter of disturbances, d , is also crucial for a spurious change.´
4. Conclusions
In this paper we study the asymptotic properties of the least-squares estimator of the change point for regressions with I(d ) regressors and disturbances. The consistency of the least-squares change-point estimator is established for stationary I(d ) regressions. It is also found that a spurious change may arise when the disturbances have long memory; whether the regressors have long memory is irrelevant here. Thus, one should be extremely careful in drawing inferences from existing structural-change tests and structural-change-point estimators when data have long memory. A procedure that can properly test and estimate the unknown structural change in a long-memory environment is currently being investigated.
(8)
Acknowledgements
I am deeply indebted to C.-M. Kuan for enormous advice and comments.
Appendix A
Proof of Theorem 2.1. If d ±0, d ±0, and in the light of Tsay and Chung (2000), we know
x ´
dx1d´
Op
sT
d
, dx1d´.0.5,[Tt]
]]
0.5
O
sT
logTd
, d 1d 50.5,O
xt t´ 5 pœ
x ´ (A.1)t51
5
0.5s
d
O Tp , otherwise.
This result, the FCLT (2), and the Borel–Cantelli lemma imply that
[Tt] T
1 P 1 P
]
O
xt´t→0, ]O
xt´t→0.Tt51 Tt5[Tt]11
Furthermore, by a suitable law of large numbers, it can be seen that
[Tt]
0, t#t ,
1 P 0
]T
O
x xt9
tl1ht.[Tt0]j→H
(t2t )V l, t.t ,t51 0 x 0
and
T (12t )V l, t#t ,
1 P 0 x 0
]T
O
x xt9
tl1ht.[Tt0]j→H
(12t)V l, t.t ,t5[Tt]11 x 0
]
uniformly in t[[]t,t], where Vx5diag [1,gx(0)] and gx(0) is the variance of x . Combining theset results, we have
2
(12t0) 2 2
]]](l11gx(0)l2), t#t0,
1 ˜ P 12t
]H ([TT t])→ 2
T
5
t0 2 2
]
(122t01 )(l11gx(0)l2), t.t0,
t
2 2
where the limit reaches the global maximum (12t0)(l11gx(0)l2) at t5t0. h
Proof of Theorem 3.1. Because of the FCLT (2) and (6), we have [Tt] [Tt]
1 1 D
2( 0.51d )´ ]]] ]]]
T E (0,T t)5[ 0.51d´
O
´t, 0.51d´O
xt´t]9→[k´B (d´ t), o (1)]p 9,T t51 T t51
D
2( 0.51d )
] ´
for anyt[[]t,t]. Similarly, T E (Tt,1)→[k´(B (1)d´ 2B (d´ t)), o (1)]p 9. Moreover, by the law of
P P
(9)
22d´ 2( 0.51d )´
9
21 2( 0.51d )´T H ([TT t])5(T E (0,T t))(T MT (0,t))(T E (0,T t))
2 2 k´B (d t)
D ´
2( 0.51d )´ 21 2( 0.51d )´
9
]]]1(T E (Tt,1))(T MT (t,1))(T E (Tt,1))→
t
2 2
k´[B (1)d´ 2B (d´ t)] ]]]]]]
1 ,
(12t) D
22d 2 2
] ´
for any t[[]t,t]. Similarly, T HT→k´B (1) . Under the null hypothesis, it can then be shownd´
P
T 2
that [ot51´t 1H ([TT t])] /T→g´(0). Combining these limits and using the continuous mapping theorem, we obtain the result of (5). h
References
Andrews, D.W.K., 1991. Heteroskedasticity and autocorrelation consistent covariance matrix estimation. Econometrica 59, 817–858.
Andrews, D.W.K., 1993. Tests for parameter instability and structural change with unknown change point. Econometrica 61, 821–856.
Andrews, D.W.K., Ploberger, W., 1994. Optimal tests when a nuisance parameter is present only under the alternative. Econometrica 62, 1383–1414.
Bai, J., 1994. Least squares estimation of a shift in linear processes. Journal of Time Series Analysis 15, 453–472. Bai, J., 1997. Estimation of a change point in multiple regression models. The Review of Economic and Statistics 79,
551–563.
Bai, J., 1998. A note on spurious break. Econometric Theory 14, 663–669.
Kuan, C.-M., Hsu, C.-C., 1998. Change-point estimation of fractionally integrated processes. Journal of Time Series Analysis 19, 693–708.
Nunes, L.C., Kuan, C.-M., Newbold, P., 1995. Spurious Break. Econometric Theory 11, 736–749.
Tang, S.M., MacNeill, I.B., 1993. The effect of serial correlation on tests for parameter change at unknown time. Annals of Statistics 21, 552–575.
Taqqu, M.S., 1977. Law of the iterated logarithm for sums of non-linear functions of Gaussian random variables. Zeitschrift ¨
fur Wahrscheinlichkeitstheorie und verwandte Gebiete 40, 203–238.
Tsay, W.-J., Chung, C.-F., 2000. The spurious regression of fractionally integrated processes. Journal of Econometrics 96, 155–182.
(1)
ˆ
Fig. 1. Empirical distributions of k for regression models when k0550. (a) dx50.4, d´50.4 (b) dx5 20.4, d´50.4 (c) dx50.4,
d´5 20.4 (d) dx5 20.4, d´5 20.4.
structural change in I(d ) data. In practice, researchers usually first test the existence of a change and then decide if they should proceed to estimate the change point. Unfortunately, as noted by Kuan and Hsu (1998), many well-known tests for structural change have significant size distortions when data have long memory. In contrast with Tang and MacNeill (1993), these size distortions do not result from ignoring nuisance parameters of weak dependence. In fact, the null of parameter constancy will be rejected with probability approaching one when data have strong positive correlations. In regressions with I(d ) variables, consider the Wald statistic for the null hypothesisb15b2 against the alternative (1). For each possible break k, the Wald statistic is
21
ˆ ˆ ˆ ˆ ˆ ˆ
W (k)T 5(b1(k)2b2(k))9(V(1,k)1V(k,T )) (b1(k)2b2(k)),
21 ˆ
where V(t1,t2)5MT (t1,t2)RSS /T. Under the null hypothesis, l50, and hence
[Tt2]
˜ ˜
E (Tt1,t2)5E (Tt1,t2)5
O
xt´t, H ([TT t])5H ([TT t]) t5[Tt1]1121 21
9
9
5E (0,T t)MT (0,t)E (0,tT )1E (Tt,1)MT (t,1)E (Tt,1). It follows that
(2)
H ([TT t])2HT ]]]]]]]
W ([TT t])5 T ,
2
O
´ 1H ([Tt]) /TS
t TD
t51
21
9
where HT5E (0,1)MT T (0,1)E (0,1). The next result shows that W ([TtT T ]) diverges when d´.0, regardless of the value of d .x
Theorem 3.1. Given DGP (1) with l50, suppose that Conditions [A2] and [A3] hold with ]
0,d´,0.5. Then for each t[[t,] t],
2 2
k´[B (t)d 2tB (1)]d
D ´ ´
22d´ ]]]]]]]
T W ([TT t])→ g(0)t(12t) (5)
´
as T→`, whereg(0) is the variance of ´.
´ t
Theorem 3.1 shows that for a positive d , W ([Tt´ T ]) diverges in probability. By the continuous mapping theorem, the supremum Wald test of Andrews (1993) and mean Wald and exponential Wald tests of Andrews and Ploberger (1994) also diverge to infinity. Thus, when there is no change, the Wald-type tests would reject the null of parameter constancy with probability approaching one as long as the disturbances have long memory (i.e., 0,d´,0.5), and hence may incorrectly suggest that a change has taken place. As the result of Theorem 3.1 does not depend on d , this conclusion holdsx even when the regressor is antipersistent (i.e., 20.5,dx,0). The Lagrange multiplier and
ˆ
likelihood ratio tests also suffer from the same problem. Moreover, when V is replaced with the Newey-West estimator to account for weak dependence, we can show that W still diverges as dT ´.0. Our simulations also confirm the asymptotic analysis of Theorem 3.1. The DGP is generated according to (1) with b15b25[1,2]9 for different combinations of d and d : dx ´ x5 20.45, . . . ,0.45 and d´50.1, . . . ,0.45 with the increment 0.05. Thus, there are 162 experiments; for each experiment, T5200, the number of replications is 5,000, and the nominal size is 5%. We compute the supremum
ˆ
Wald test of Andrews (1993). In computing the Wald statistic, we estimate V by the Newey-West estimator. The bandwidth is determined by the data-dependent formula of Andrews (1991) with AR(1) specification. The rejection frequencies of the supremum Wald test are summarized in Fig. 2. It can be seen that the type I errors increase from approximately 25% for d´50.1 to about 95% for d´50.45, regardless of the value of d . Therefore, as long as d is positive, the existing tests sufferx ´
from serious size distortions.
The previous results suggest that, whether or not there is a change, it is likely that one would be led to estimate the change point after conducting standard structural-change tests. From the proof of Theorem 3.1, we know that
22d´
]
ˆ
t 5argmaxt[[t,t] T H ([TT t])
]
2 2 2 2
k´B (d t) k´[B (1)d 2B (d t)]
D ´ ´ ´
] ]]] ]]]]]]
→argmaxt[[t,t] 1
t (12t)
] ]
;argmaxt[[t,t] G (d´ t),
(3)
Fig. 2. Empirical sizes of sup-Wald-type tests for regression models.
]
for any t[[t,] t]. By invoking the functional law of iterated logarithm (Taqqu, 1977), we have
2 2
B (d t) [B (1)d 2B (d t)]
´ ´ ´
]] ]]]]]]
lim sup 5lim sup 50 a.s.
t 12t
t→0 t→1
2 2
Thus, the asymptotic behavior of G (t) is well-defined on [0,1] with G (0)d d 5G (1)d 5k´B (1) . Itd
´ ´ ´ ´
D
ˆ ˆ
follows thatt has a limiting distribution with support equal to [0,1], and thatt→argmaxt[[0,1] G (d t). ´ However, the maximum of G (t) is not attained at 0 or 1 with probability one, sinced
´
2 2
kd´[tB (1)d´ 2B (d´ t)] ]]]]]]]
G (0)d 2G (d t)5G (1)d 2G (d t)5 2 ,0,
´ ´ ´ ´ t(12t)
for any t[(0,1). This proves the following theorem.
Theorem 3.2. Given DGP (1) with l50, suppose that Conditions [A2] and [A3] hold with D
] ˆ
0,d´,0.5. If ]t50 and t51, then t→argmaxt[[0,1] G (d´ t) and G (0)d´ 5G (1)d´ ,G (d´t) for any t[(0,1) with probability one.
Ideally, when there is no change, we expect that the probability mass of the change-point estimator
P
ˆ
concentrates around the two end points of the sample; i.e., t→h0, 1j. Theorem 3.2 shows that,when
ˆ ˆ
d´.0,t is unlikely to be close to 0 and 1; it is more likely thatt incorrectly identifies a change point in the middle of the sample even when there is none. This is the problem of spurious change, in the sense of Nunes et al. (1995). Nunes et al. (1995) and Bai (1998) showed that the I(1) regressors and errors are responsible for the occurrence of spurious break and spurious regression. Tsay and Chung (2000) also pointed out that the spurious regression may occur when dx1d´.0.5. From Theorem 3.2, we can see that the spurious-change problem may still exist even if the regressors and disturbances are stationary and dx1d´,0.5. That is, a spurious change may arise when there is no spurious regression.
We examine some simulations which are conducted as that in Section 2 butb15b25[1,0.5]9. Four cases are considered: (d ,d )x ´ 5(0.35,0.35), (20.35,0.35), (0.35,20.35), (20.35,20.35). Fig. 3
(4)
ˆ
Fig. 3. Empirical distributions of k for regression models when there is no change. (a) dx50.35, d´50.35 (b) dx5 20.35, d´50.35 (c)
dx50.35, d´5 20.35 (d) dx5 20.35, d´5 20.35. ˆ
shows that, for cases (a) and (b), k has the lowest rejection frequencies at the two end points (1 and T ). Hence, the least-square estimator will most likely find a change point in the middle of the sample rather than two end points; this confirms the result of Theorem 3.2. Note that when dx1d´,0.5 (Fig. 3 (b)), the spurious-change problem still occurs. Moreover, we also consider the cases that d´,0. In
ˆ
contrast with Fig. 3 (a) and (b), (c) and (d) show that the empirical distributions of k now have more probability mass at the two end points. Although we do not give a formal proof, it seems that the spurious-change problem occurs only for a positive d . As the results of Section 2, the fractionally´
differencing parameter of disturbances, d , is also crucial for a spurious change.´
4. Conclusions
In this paper we study the asymptotic properties of the least-squares estimator of the change point for regressions with I(d ) regressors and disturbances. The consistency of the least-squares change-point estimator is established for stationary I(d ) regressions. It is also found that a spurious change may arise when the disturbances have long memory; whether the regressors have long memory is irrelevant here. Thus, one should be extremely careful in drawing inferences from existing structural-change tests and structural-change-point estimators when data have long memory. A procedure that can properly test and estimate the unknown structural change in a long-memory environment is currently being investigated.
(5)
Acknowledgements
I am deeply indebted to C.-M. Kuan for enormous advice and comments.
Appendix A
Proof of Theorem 2.1. If d ±0, d ±0, and in the light of Tsay and Chung (2000), we know
x ´
dx1d´
Op
s
Td
, dx1d´.0.5,[Tt]
]]
0.5
O
s
T logTd
, d 1d 50.5,O
xt t´ 5 pœ
x ´ (A.1)t51
5
0.5s
d
O Tp , otherwise.
This result, the FCLT (2), and the Borel–Cantelli lemma imply that
[Tt] T
1 P 1 P
]
O
xt´t→0, ]O
xt´t→0.Tt51 Tt5[Tt]11
Furthermore, by a suitable law of large numbers, it can be seen that
[Tt]
0, t#t ,
1 P 0
]T
O
x xt9
tl1ht.[Tt0]j→H
(t2t )V l, t.t ,t51 0 x 0
and
T (12t )V l, t#t ,
1 P 0 x 0
]T
O
x xt9
tl1ht.[Tt0]j→H
(12t)V l, t.t ,t5[Tt]11 x 0
]
uniformly in t[[t,t], where] Vx5diag [1,gx(0)] and gx(0) is the variance of x . Combining theset results, we have
2
(12t0) 2 2
]]](l11gx(0)l2), t#t0,
1 ˜ P 12t
]H ([TT t])→ 2
T
5
t0 2 2
]
(122t01 )(l11gx(0)l2), t.t0, t
2 2
where the limit reaches the global maximum (12t0)(l11gx(0)l2) at t5t0. h
Proof of Theorem 3.1. Because of the FCLT (2) and (6), we have
[Tt] [Tt]
1 1 D
2( 0.51d )´ ]]] ]]]
T E (0,T t)5[ 0.51d´
O
´t, 0.51d´O
xt´t]9→[k´B (d´ t), o (1)]p 9,T t51 T t51
D
2( 0.51d )
] ´
for anyt[[t,] t]. Similarly, T E (Tt,1)→[k´(B (1)d´ 2B (d´ t)), o (1)]p 9. Moreover, by the law of
P P
(6)
22d´ 2( 0.51d )´
9
21 2( 0.51d )´T H ([TT t])5(T E (0,T t))(T MT (0,t))(T E (0,T t))
2 2
k´B (d t)
D ´
2( 0.51d )´ 21 2( 0.51d )´
9
]]]1(T E (Tt,1))(T MT (t,1))(T E (Tt,1))→ t
2 2
k´[B (1)d´ 2B (d´ t)] ]]]]]]
1 ,
(12t) D
22d 2 2
] ´
for any t[[t,] t]. Similarly, T HT→k´B (1) . Under the null hypothesis, it can then be shownd´
P
T 2
that [ot51´t 1H ([TT t])] /T→g´(0). Combining these limits and using the continuous mapping theorem, we obtain the result of (5). h
References
Andrews, D.W.K., 1991. Heteroskedasticity and autocorrelation consistent covariance matrix estimation. Econometrica 59, 817–858.
Andrews, D.W.K., 1993. Tests for parameter instability and structural change with unknown change point. Econometrica 61, 821–856.
Andrews, D.W.K., Ploberger, W., 1994. Optimal tests when a nuisance parameter is present only under the alternative. Econometrica 62, 1383–1414.
Bai, J., 1994. Least squares estimation of a shift in linear processes. Journal of Time Series Analysis 15, 453–472. Bai, J., 1997. Estimation of a change point in multiple regression models. The Review of Economic and Statistics 79,
551–563.
Bai, J., 1998. A note on spurious break. Econometric Theory 14, 663–669.
Kuan, C.-M., Hsu, C.-C., 1998. Change-point estimation of fractionally integrated processes. Journal of Time Series Analysis 19, 693–708.
Nunes, L.C., Kuan, C.-M., Newbold, P., 1995. Spurious Break. Econometric Theory 11, 736–749.
Tang, S.M., MacNeill, I.B., 1993. The effect of serial correlation on tests for parameter change at unknown time. Annals of Statistics 21, 552–575.
Taqqu, M.S., 1977. Law of the iterated logarithm for sums of non-linear functions of Gaussian random variables. Zeitschrift ¨
fur Wahrscheinlichkeitstheorie und verwandte Gebiete 40, 203–238.
Tsay, W.-J., Chung, C.-F., 2000. The spurious regression of fractionally integrated processes. Journal of Econometrics 96, 155–182.