Appendix G
Mathematical Proof of Why the Potential for Bias Resulting from not Refreshing our Sample to Include “New Births” in 2004 is Small
Assume that we
are estimating a population proportion of some characteristic of
interest (yes/no type) relating to school districts. We can think
of the population of school districts as being divided into two
strata. The first stratum would consist of school districts that
existed in 1997 and the second stratum consists of school districts
that existed in 2004 but not in 1997 (“new births”).
Let the number of school districts in the population be
.
Let the number of school districts in the first stratum be
and the number of school districts in the second stratum be
.
We have:
The overall population proportion of interest can be written as a weighted average of the proportion among existing districts and the population proportion among new births. Let
be the overall proportion,
the proportion among existing districts and
the proportion among new births. The overall proportion can be
written as:
The sample proportion we have in 2004 can only be
computed from the responding school districts that existed in 1997.
Let this proportion be
.
The bias in the sample proportion
because of not having any data from the new school districts is:
The bias in the estimate is the difference between the expected value of the estimate and population proportion. The expected value is the average of sample proportions of all possible samples that we can draw from the population of respondents. We have
Therefore, the bias in the estimate is
That is, the bias is the difference between the proportion among the respondents minus the overall proportion. This can be written as:
Alternatively, this can be written as:
Since we have
,
we can write
as
The bias in the estimate due to not including the
new births is small if either (1)
which is the proportion of new births is small or (2) the difference
between the proportion among existing districts and the proportion
among new births is small.
In this case
is small (2%). Therefore, the bias is not likely to be large as this
difference gets multiplied by a small number.
Abt Associates Inc.
File Type | application/msword |
File Title | Supporting Statement for Paperwork Reduction Act Submission to OMB: |
Author | Cay Bradley |
Last Modified By | doritha.ross |
File Modified | 2008-09-18 |
File Created | 2008-09-18 |