Sampling and Estimation from Finite Populations. Yves Tille

Чтение книги онлайн.

Читать онлайн книгу Sampling and Estimation from Finite Populations - Yves Tille страница 19

Sampling and Estimation from Finite Populations - Yves Tille

Скачать книгу

target="_blank" rel="nofollow" href="#fb3_img_img_da88fa42-4287-5eb2-8309-a6c1d1c7f846.png" alt="equation"/>

      The bias is zero if and only if images for all images

      If some inclusion probabilities are zero, it is impossible to estimate images without bias. It is said that the sampling design does not cover the population or is vitiated by a coverage problem. We sometimes hear that a sample is biased, but this terminology should be avoided because bias is a property of an estimator and not of a sample. In what follows, we will consider that all the sampling designs have nonzero first‐order inclusion probabilities.

      For estimating the mean

equation

      we can simply divide the expansion estimator of the total by the size of the population. We obtain

equation

      However, the population size is not necessarily known.

      (2.1)equation

      Indeed, there is no reason that

equation

      is equal to 1, even if the expectation of this quantity is equal to 1. The expansion estimator of the mean is thus vitiated by an error that does not depend on the variability of variable images. Even if images images remains random.

      We refer to the following definition:

      Definition 2.7

      An estimator of the mean images is said to be linearly invariant if, for all images, when images then images

      For this reason, even when the size of the population images is known, it is recommended to use the estimator of Hájek (1971) (HAJ) which consists of dividing the total by the sum of the inverses of the inclusion probabilities:

      (2.2)equation

      When images then images Therefore, the bad property of the expansion estimator is solved because the Hájek estimator is linearly invariant. However, images is usually biased because it is a ratio of two random variables. In some cases, such as simple random sampling without replacement with fixed sample size, the Hájek ratio is equal to the expansion estimator.

      

      Result 2.5

      The variance of the expansion estimator of the total is

      Proof:

equation

      It is also possible to write the expansion estimator with a vector notation. Let images be the vector of dilated values:

equation equation

      The calculation of the variance is then immediate:

equation

      If the sample is of fixed sample size, Sen (1953) and Yates & Grundy (1953) have shown the following result:

      Result 2.6

      If the sampling design is of fixed sample size, then the variance of the expansion estimator can also be written as

      Proof:

      Under a design with fixed sample size, it has been proved in Result 2.2, page 16, that

equation

      The first two terms of Expression (2.5) are therefore null and we find the expression of Result 2.5.

      In

Скачать книгу