A state space model in small area estimation

1. Introduction

larger geographical areas. Sample survey data provide reliable estimators of totals and means for large areas and domains. However, the usual direct survey estimators for a small area have unacceptably large standard errors, because the sample size in the area is small. Small-area sample sizes are small because the overall sample size of a survey is usually chosen to provide a specified accuracy at a macro level of aggregation, such as national territories, regions, and so on (Datta and Lahiri, 2000). Demand for reliable small area statistics has steadily increased in recent years, which has prompted considerable research on efficient small area estimation. Direct small area estimators from survey data fail to borrow strength from related small areas, since they are based solely on the sample data from the corresponding area. As a result, they are likely to yield unacceptably large standard errors unless the sample size for the small area is reasonably large (Rao, 2003). In addition, efficient small area statistics support local estimation of population, farms, and other characteristics of interest in postcensal years.

2. Indirect Estimation in Small Area

A domain (area) is regarded as large or major if the domain-specific sample is large enough to yield direct estimates of adequate precision. A domain is regarded as small if the domain-specific sample is not large enough to support direct estimates of adequate precision. Other terms used to denote a domain with a small sample size include local area, sub-domain, small subgroup, sub-province, and minor domain. In some applications, many domains of interest, such as counties, may have zero sample size.

To produce small area estimates with an adequate level of precision, it is often necessary to use indirect estimators that borrow strength by using values of the variable of interest, y, from related areas and/or time periods, and thus increase the effective sample size. These values are brought into the estimation process through a model (either implicit or explicit) that provides a link to related areas and/or time periods through supplementary information related to y, such as recent census counts and current administrative records (Pfeffermann, 2002; Rao, 2003).

Methods of indirect estimation are based on explicit small area models that make specific allowance for between-area variation. In particular, we introduce mixed models involving random area-specific effects that account for between-area variation beyond that explained by the auxiliary variables included in the model. We assume that $\theta_i = g(\bar{Y}_i)$, for some specified $g(\cdot)$, is related to area-specific auxiliary data $z_i = (z_{1i}, \ldots, z_{pi})^T$ through a linear model

$$\theta_i = z_i^T \beta + b_i v_i, \quad i = 1, \ldots, m,$$

where the $b_i$ are known positive constants and $\beta$ is the $p \times 1$ vector of regression coefficients. Further, the $v_i$ are area-specific random effects assumed to be independent and identically distributed (iid) with $E_m(v_i) = 0$ and $V_m(v_i) = \sigma_v^2 \; (\ge 0)$, or $v_i \sim \text{iid}\,(0, \sigma_v^2)$.
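As a rough illustration of the linking model above, the following sketch draws area parameters as a regression part plus a scaled random area effect. The function name `simulate_linking_model` and the toy inputs are hypothetical, and the area effects are drawn from a normal distribution purely for convenience; the model itself only specifies their mean and variance.

```python
import random


def simulate_linking_model(z, beta, b, sigma_v, seed=0):
    """Draw theta_i = z_i' beta + b_i v_i for each area i.

    Assumption for illustration only: v_i is normal with mean 0 and
    standard deviation sigma_v (the model only fixes mean and variance).
    """
    rng = random.Random(seed)
    theta = []
    for z_i, b_i in zip(z, b):
        v_i = rng.gauss(0.0, sigma_v)  # area-specific random effect
        fixed = sum(z_ij * beta_j for z_ij, beta_j in zip(z_i, beta))
        theta.append(fixed + b_i * v_i)
    return theta


# Toy inputs (hypothetical): m = 3 areas, intercept plus one covariate.
z = [[1.0, 2.0], [1.0, 3.0], [1.0, 5.0]]
beta = [0.5, 2.0]
b = [1.0, 1.0, 1.0]
theta = simulate_linking_model(z, beta, b, sigma_v=0.3)
```

With `sigma_v = 0` the area effects vanish and each simulated value reduces to the regression part alone, which is the case where auxiliary data explain all between-area variation.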

3. Generalized Linear Mixed Model

Datta and Lahiri (2000) and Rao (2003) considered a general linear mixed model (GLMM) that covers the univariate unit level model as a special case:

$$y^P = X^P \beta + Z^P v + e^P.$$

The random vectors $v$ and $e^P$ are independent, with $e^P \sim N(0, \sigma^2 \psi^P)$ and $v \sim N(0, \sigma^2 D(\lambda))$, where $\psi^P$ is a known positive definite matrix and $D(\lambda)$ is a positive definite matrix that is structurally known except for some parameters $\lambda$, typically involving ratios of variance components of the form $\sigma_i^2 / \sigma^2$. Further, $X^P$ and $Z^P$ are known design matrices and $y^P$ is the $N \times 1$ vector of population y-values.

ICCS-IX, 12-14 Dec 2007

In partitioned form, the GLMM is

$$y^P = \begin{bmatrix} y \\ y^* \end{bmatrix} = \begin{bmatrix} X \\ X^* \end{bmatrix} \beta + \begin{bmatrix} Z \\ Z^* \end{bmatrix} v + \begin{bmatrix} e \\ e^* \end{bmatrix},$$

where the asterisk denotes non-sampled units. The vector of small area totals $Y_i$ is of the form $Ay + Cy^*$ with $A = \oplus_{i=1}^{m} 1_{n_i}^T$ and $C = \oplus_{i=1}^{m} 1_{N_i - n_i}^T$, where $\oplus_{u=1}^{m} A_u = \mathrm{blockdiag}(A_1, \ldots, A_m)$.

We are interested in estimating a linear combination, $\mu = l^T \beta + m^T v$, of the regression parameters $\beta$ and the realization of $v$, for specified vectors of constants $l$ and $m$. For known $\delta$, the best linear unbiased prediction (BLUP) estimator of $\mu$ is given by (Rao, 2003)

$$\tilde{\mu}^H = t(\delta, y) = l^T \tilde{\beta} + m^T \tilde{v} = l^T \tilde{\beta} + m^T G Z^T V^{-1} (y - X \tilde{\beta}).$$

The indirect estimation model, $\hat{\theta}_i = z_i^T \beta + b_i v_i + e_i$, $i = 1, \ldots, m$, where the sampling error $e_i$ has variance $\psi_i$, is a special case of the GLMM with block diagonal covariance structure. Substituting this structure into the general form of the BLUP estimator of $\mu_i$, we get the BLUP estimator of $\theta_i$ as

$$\tilde{\theta}_i^H = z_i^T \tilde{\beta} + \gamma_i (\hat{\theta}_i - z_i^T \tilde{\beta}),$$

where $\gamma_i = \sigma_v^2 b_i^2 / (\psi_i + \sigma_v^2 b_i^2)$, and

$$\tilde{\beta} = \tilde{\beta}(\sigma_v^2) = \left[ \sum_{i=1}^{m} \frac{z_i z_i^T}{\psi_i + \sigma_v^2 b_i^2} \right]^{-1} \left[ \sum_{i=1}^{m} \frac{z_i \hat{\theta}_i}{\psi_i + \sigma_v^2 b_i^2} \right].$$
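The area-level BLUP can be read as a shrinkage of each direct estimate toward a regression-synthetic value, with weight equal to the share of model variance in the total variance. The sketch below computes the weighted least squares coefficient vector, the shrinkage weights, and the BLUP for a known model variance. It is a minimal illustration under stated assumptions: the function names `solve` and `blup_area_level`, the argument names, and the toy data are hypothetical, and the variance of the area effects is treated as known rather than estimated.

```python
def solve(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def blup_area_level(theta_hat, z, psi, b, sigma_v2):
    """BLUP under theta_hat_i = z_i' beta + b_i v_i + e_i with sigma_v2 known.

    theta_hat: direct estimates; z: covariate rows; psi: sampling variances;
    b: known positive constants; sigma_v2: variance of the area effects.
    """
    p = len(z[0])
    # Inverse total variances 1 / (psi_i + sigma_v^2 b_i^2).
    w = [1.0 / (ps + sigma_v2 * bi ** 2) for ps, bi in zip(psi, b)]
    # Normal equations of the weighted least squares estimator of beta.
    lhs = [[sum(wi * zi[r] * zi[c] for wi, zi in zip(w, z)) for c in range(p)]
           for r in range(p)]
    rhs = [sum(wi * zi[r] * th for wi, zi, th in zip(w, z, theta_hat))
           for r in range(p)]
    beta = solve(lhs, rhs)
    # Shrinkage weights gamma_i = sigma_v^2 b_i^2 / (psi_i + sigma_v^2 b_i^2).
    gamma = [sigma_v2 * bi ** 2 * wi for bi, wi in zip(b, w)]
    synth = [sum(zr * br for zr, br in zip(zi, beta)) for zi in z]
    blup = [s + g * (th - s) for s, g, th in zip(synth, gamma, theta_hat)]
    return beta, gamma, blup


# Toy data (hypothetical): three areas, intercept-only model.
beta, gamma, blup = blup_area_level(
    theta_hat=[10.0, 12.0, 20.0],
    z=[[1.0], [1.0], [1.0]],
    psi=[1.0, 1.0, 4.0],
    b=[1.0, 1.0, 1.0],
    sigma_v2=1.0,
)
```

In this toy case the third area has the largest sampling variance, so its direct estimate receives the smallest weight and is pulled hardest toward the synthetic value.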

4. State Space Models

Many sample surveys are repeated in time with partial replacement of the sample elements. For such repeated surveys, considerable gain in efficiency can be achieved by borrowing strength across both small areas and time. The model consists of a sampling error model

$$\hat{\theta}_{it} = \theta_{it} + e_{it}, \quad t = 1, \ldots, T; \; i = 1, \ldots, m,$$

$$\theta_{it} = z_{it}^T \beta_{it},$$

where the coefficients $\beta_{it} = (\beta_{it0}, \beta_{it1}, \ldots, \beta_{itp})^T$ are allowed to vary cross-sectionally and over time, and the sampling errors $e_{it}$ for each area $i$ are assumed to be serially uncorrelated with mean 0 and variance $\psi_{it}$. The variation of $\beta_{it}$ over time is specified by the following model:

$$\begin{bmatrix} \beta_{itj} \\ \beta_{ij} \end{bmatrix} = T_j \begin{bmatrix} \beta_{i,t-1,j} \\ \beta_{ij} \end{bmatrix} + \begin{bmatrix} 1 \\ 0 \end{bmatrix} v_{itj}, \quad j = 0, 1, \ldots, p,$$

with $T_j$ a $2 \times 2$ matrix. It is a special case of the general state-space model, which may be expressed in the form

$$y_t = Z_t \alpha_t + \varepsilon_t; \quad E(\varepsilon_t) = 0, \; E(\varepsilon_t \varepsilon_t^T) = \Sigma_t,$$

$$\alpha_t = H_t \alpha_{t-1} + A \eta_t; \quad E(\eta_t) = 0, \; E(\eta_t \eta_t^T) = \Gamma,$$

where $\varepsilon_t$ and $\eta_t$ are uncorrelated contemporaneously and over time. The first equation is known as the measurement equation, and the second equation is known as the transition equation. The vector $\alpha_t$ is known as the state vector. This model is a special case of the general linear mixed model, but the state-space form permits updating of the estimates over time, using the Kalman filter equations, and smoothing of past estimates as new data become available, using an appropriate smoothing algorithm.

Let $\tilde{\alpha}_{t-1}$ be the BLUP estimator of $\alpha_{t-1}$ based on all data observed up to time $t-1$, so that $\tilde{\alpha}_{t|t-1} = H_t \tilde{\alpha}_{t-1}$ is the BLUP of $\alpha_t$ at time $t-1$. Further,

$$P_{t|t-1} = H_t P_{t-1} H_t^T + A \Gamma A^T$$

is the covariance matrix of the prediction errors $\tilde{\alpha}_{t|t-1} - \alpha_t$, where

$$P_{t-1} = E\left[ (\tilde{\alpha}_{t-1} - \alpha_{t-1}) (\tilde{\alpha}_{t-1} - \alpha_{t-1})^T \right].$$
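The one-step prediction described above can be sketched in a few lines: the previous state estimate is propagated through the transition matrix, and its error covariance is propagated and inflated by the transition noise. This is only an illustration of the prediction step, not a full Kalman filter. The helper names (`matmul`, `transpose`, `matadd`, `kalman_predict`) and the toy matrices are hypothetical, and `gamma` is assumed here to denote the covariance matrix of the transition noise.

```python
def matmul(a, b):
    """Matrix product of two lists-of-rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def transpose(a):
    """Transpose of a list-of-rows matrix."""
    return [list(col) for col in zip(*a)]


def matadd(a, b):
    """Elementwise sum of two matrices of equal shape."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def kalman_predict(alpha_prev, p_prev, h, a, gamma):
    """One-step-ahead prediction of the state and its error covariance.

    alpha_prev: estimate of the state at time t-1
    p_prev:     covariance of its estimation error
    h, a:       transition matrix and noise loading matrix
    gamma:      covariance of the transition noise (assumed notation)
    """
    # Predicted state: h times the previous state estimate.
    alpha_pred = [sum(hij * aj for hij, aj in zip(row, alpha_prev))
                  for row in h]
    # Prediction error covariance: h P h' + a Gamma a'.
    p_pred = matadd(matmul(matmul(h, p_prev), transpose(h)),
                    matmul(matmul(a, gamma), transpose(a)))
    return alpha_pred, p_pred


# Toy example (hypothetical values): a level-plus-slope state.
h = [[1.0, 1.0], [0.0, 1.0]]
a = [[1.0, 0.0], [0.0, 1.0]]
gamma = [[0.1, 0.0], [0.0, 0.2]]
alpha_pred, p_pred = kalman_predict([1.0, 2.0], [[1.0, 0.0], [0.0, 1.0]],
                                    h, a, gamma)
```

Plain nested lists keep the sketch dependency-free; with larger state vectors a numerical array library would be the more natural choice.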
