271
Table IV.2 Interviewed households, by year, and by season Shamsi calendar
Season Year
Total 1392
1393 Winter
5,067 -
5,067 Spring
4,449 4,449
Summer 4,880
4,880 Autumn
6,390 6,390
Total 5,067
15,719 20,786
Non-response within clusters was very limited. Only 845 4.1 percent of the households in the visited clusters were not available or refused or were unable to participate. In 841 of these cases households
were replaced by reserve households listed in the cluster reserve list, leaving 4 households unaccounted for 0.02 percent.
IV.8 Calculation of sampling weights and post-stratification
Sample weights are the scaling factors that are required to inflate the sampled households to the number of households that they represent in the survey. The use of stratification in the ALCS 2014 sample design
requires that sample weights are separately calculated for each stratum. Due to imperfections in the survey implementation, the design weights are adjusted in order to achieve optimal representation of the
survey results. This section separately addresses the calculation of the sampling weights for the resident population and that for the Kuchi population.
IV.8.1 Resident population
Calculation of the design weight
The first step in calculating the sample weights is calculating the weights that would inflate the sampled households to the number of households in the sampling frame. This calculation follows from the
selection probability of the households as defined in the sampling design. In the two-stage sampling design of ALCS 2013-14, the PSUs were the EAs as defined in the sample
frame, made up of the 2009 household listing and the available household listings from the SDS. The selection of PSUs in the first sampling stage was implemented in accordance with:
stratification by province an optimum allocation distribution for provinces, which minimises the standard error
selection with probability proportional to the number of households PPS. The probability of selecting a PSU in stage 1 is
p
1
= c
s
h
ps
H
s09
where p
1
is the probability of selecting PSU or EA p in stratum s, c
s
is the number of clusters selected in stratum s, h
ps
is the number of households in EA p from stratum s and H
s09
is the number of households in stratum s as reported in the sampling frame.
For EAs encompassing two or more villages, a second sampling stage was introduced in order to reduce travel time and costs. The selection of the village to be included was done with probability proportional
to the number of households, with
272
p
2
= m
vs
h
ps
where p
2
is the probability of selecting one village out of all villages in EA p in stratum s, m
vs
is the number of households in that village and h
ps
is the number of households in EA p from stratum s. For EAs without village segmentation, m
vs
= h
ps
and p
vp
= 1. The Ultimate Sampling Units in the survey were households. The sampling design specified a fixed
number of 10 households per selected EA. Therefore, the probability of selecting a household in an EA or in the selected village in the EA in the third sampling stage is
p
3
= 10 m
vs
The overall probability of selecting a household is the product of the selection probabilities in each stage for any stratum.
p
123
= p
1
p
2
p
3
= c
s
h
ps
H
s09
m
vs
h
ps
10 m
ps
= 10 c
s
H
s09
The design weight for each sampled household is the reciprocal of the selection probability, thus
d
w
hs
= 1 p
123
= H
s09
10c
s
where
d
w
hs
is the design weight for households in stratum s. The weighted sample total – the sum of the
products of sampled households and their respective design weights – is equal to the total population of
households in each stratum in the sample frame: ∑ h
ps d
w
hs
= H
s09
Calculation of non-coverage adjustment factors
Two main reasons exist in survey taking for exclusion of households in the collected data: Non-response – households not willing to be interviewed or not available for being interviewed
Non-coverage – households that cannot be reached if areas are inaccessible because of reasons such as the local security situation or road conditions.
66
Non-response in ALCS in not a major issue: overall non-response was 4 percent. Very few household refuse to collaborate in the survey and most of the non-response was due to non-available households.
As ALCS adopted the strategy of addressing non-response by substituting households from a reserve list, there is no need to adjust for non-response.
Non-coverage, on the other hand, was a more serious problem in the survey, especially because of the security situation in the country. Non-coverage was partly addressed by replacing inaccessible clusters
by clusters from a reserve list. Since a number of inaccessible clusters could not be replaced during the fieldwork of ALCS, the sampled households weighted by the design weight
∑h
ps d
w
hs
do not add up to the total population of households H
s
in the sample frame. To compensate for non-covered households, the design weight was adjusted.
To obtain the non-coverage adjustment factor, first the non-coverage rate was calculated. This is the ratio between the number of actually interviewed households and the number of sampled households:
nc
s
=
i
h
s s
h
s 66
In addition, some surveys exclude some areas on beforehand because the relevance of information from these – e.g. very thinly populated areas – does not compensate the costs of getting there.
273
where nc
s
is the non-coverage rate in stratum s,
i
h
s
is the number of interviewed households in stratum s and
s
h
s
is the number of sampled households in stratum s. The adjustment factor for non-coverage in stratum s
n
w
hs
is the reciprocal of the non-coverage rate nc:
n
w
hs
= 1 nc
s
The sample weight that is required to scale-up the sampled households to the total population of households in the sample frame
dn
w
hs
now becomes the product of the design weight and the non- coverage factor. For each stratum s this is:
dn
w
hs
=
d
w
hs n
w
hs
The newly weighted sample total ∑ h
ps dn
w
hs
is again equal to the sample frame population H
s09
.
Calculation of post-stratification factors
Additional expansion factors are required to re-scale the number of households in the sample frame to the number in the period in which the survey was conducted. As in the previous survey round, the
estimated number of households was derived from the CSO population projections by province P
s14
.
67
For the settled households, the provincial population was divided by the average household size for that province, which was obtained in the current survey by applying
dn
w
s
the combined design weight and non-coverage factor in order to reduce distortion by sampling and coverage effects.
Since the re-scaling of the number of households is done at province level, this normalisation exercise implies post-stratification of the sample. The re-scaling factors are calculated as the ratio between the
CSO estimate of the number of households in 2014 in a stratum and the number of households in the sampling frame:
r
w
hs
= H
s14
H
s09
and the combined sampling weight becomes
dnr
w
hs
=
dn
w
hs r
w
hs
Seasonal distribution
Because the interview implementation was not entirely uniform across seasons quarters, uncorrected annual estimates would place relatively larger weights on those seasons which had a large sample
winter and autumn, thereby distorting the representativeness of national results. Because the sample was stratified by season, and imposing the assumption that the level of seasonal, international migration
is negligible, the weighted distribution can be smoothed out to ensure that the estimated population size by quarter is the same. This adjustment is implemented as:
w
hsq
= 1
dnr
w
hs
0.25 P
s14 dnr
p
sq
where w
hsq
is the factor that standardises across seasons quarters and
dnr
p
sq
is the sampled population in stratum s and season q, weighted by the weights for the sampling design, non-coverage and re-scaling.
The denominator gives the total number of sampled, settled individuals in each stratum by quarter. The adjustment term in the numerator gives the population of individuals for each stratum by quarter
according to the CSO 2014 population estimate.
67
The 1393 population projections are considered a sufficient approximation for the mid-survey population.
274
The final household sampling weight hw
hsq
is the product of all weighting factors: hw
sq
=
dnr
w
hs
w
hsq
Individual weights
In order to obtain the expansion factor for individuals the following calculation was made: iw
hsq
= hw
hsq
hs
hsq
the term hs
hsq
being the household size of household h in stratum s and quarter q.
IV.8.2 Kuchi population The Kuchi sample was designed on basis of the 2003-04 National Multi-sectoral Assessment of Kuchi
NMAK-2004. For this separate Kuchi stratum a community selection was implemented with PPS and a second stage selection with again a constant cluster size of ten households. The 66 clusters 660
households for this stratum were divided between the summer 30 clusters and winter 36 clusters periods in 1393 2014.
In the absence of up-to-date information about the actual number of Kuchis and the poltical sensitivity of addressing this issue, the present position taken by CSO is that the Kuchi pupolation is stable at a
number close to 1.5 million people. Apart from the sampling frame, the restriction to two seasons and the absence of the need to
accommodate population growth, the procedures for the calculation of the sampling weights for the Kuchi stratum are the same as those for the resident population
IV.8.3 Weights variables The values of the final household sample weight hw