Sampling and Estimation from Finite Populations. Yves Tille

Чтение книги онлайн.

Читать онлайн книгу Sampling and Estimation from Finite Populations - Yves Tille страница 16

Sampling and Estimation from Finite Populations - Yves Tille

Скачать книгу

alt="images"/> are not random. Indeed, under this approach, the only source of randomness is the way of selecting the sample.

      The objective is to estimate parameters in this population. These parameters are also called functions of interest because they do not correspond to the usual definition of parameter used in inferential statistics for a parametric model. Parameters are simply functions of images or images. For example, the goal may be to estimate totals,

equation

      means,

equation

      or population variances,

equation

      The population size images is not necessarily known and therefore can also be an estimation objective. However, as one can write

equation

      the estimation of the population size is a problem of the same nature as the estimation of images or images.

equation

      the population covariance,

equation

      or the correlation coefficient,

equation

      These parameters are unknown and will therefore be estimated using a sample.

      A sample without replacement images is simply a subset of the population images We also consider the set images of all the possible samples. For instance, if images, then

equation

      where images denotes the empty set.

      Definition 2.1

      A sampling design without replacement images is a probability distribution on images such that

equation

      Definition 2.2

      A random sample images is a random variable whose values are the samples:

equation

      A random sample can also be defined as a discrete random vector composed of non‐negative integer variables images. The variable images represents the number of times unit images is selected in the sample. If the sample is without replacement then variable images can only take the values 0 or 1 and therefore has a Bernoulli distribution. In general, random variables images are not independent except in very special cases. The use of indicator variables images was introduced by Cornfield (1944) and greatly simplified the notation in survey sampling theory because it allows us to clearly separate the values of the variables images or images from the source of randomness images.

      Often, we try to select the sample as randomly as possible. The usual measure of randomness of a probability distribution is the entropy.

      Definition 2.3

      The entropy of a sampling design is the quantity

equation

      We suppose that images

      The sample size images is the number of units selected in the sample. We can write

equation

      When the sample size is not random, we say that the sample is of fixed sample size and we simply denote it by images.

      The variables are observed only on the units selected in the sample. A statistic images is a function of the values images that are observed on the random sample: images. This statistic takes the value

Скачать книгу