Читать онлайн книгу - Industrial Data Analytics for Diagnosis and Prognosis. Yong Chen. Математика. LiveLib

Новинки Лучшее Рекомендации

Информация о книге:

Название:

Автор:

Жанр:

Серия:

Издательство:

Industrial Data Analytics for Diagnosis and Prognosis - Yong Chen

Скачать книгу

with respect to μ and setting it to be equal to one, it is easy to see that

table row cell f left parenthesis D right parenthesis equals integral f left parenthesis D vertical line bold mu right parenthesis g left parenthesis bold mu right parenthesis text d end text bold mu. end cell end table

A point estimate of μ can be obtained by maximizing the posterior distribution. This method is called the maximum a posteriori (MAP) estimate. The MAP estimate of μ can be written as

(3.27)

From (3.27), it can be seen that the MAP estimate is closely related to MLE. Without the prior g(μ), the MAP is the same as the MLE. So if the prior follows a uniform distribution, the MAP and MLE will be equivalent. Following this argument, if the prior distribution has a flat shape, we expect that the MAP and MLE are similar.

We first consider a simple case where the data follow a univariate normal distribution with unknown mean μ and known variance σ². The likelihood function based on a random sample of independent observations D = {x₁, x₂,…, xn} is given by

$table row cell f left parenthesis D vertical line mu right parenthesis equals product from i equals 1 to n of f left parenthesis x subscript i vertical line mu right parenthesis equals 1 over left parenthesis 2 pi sigma squared right parenthesis to the power of n divided by 2 end exponent e to the power of negative fraction numerator 1 over denominator 2 sigma squared end fraction sum from i equals 1 to n of left parenthesis x subscript i minus mu right parenthesis squared end exponent. end cell end table$

Based on (3.26), we have

table row cell f left parenthesis mu vertical line D right parenthesis proportional to f left parenthesis D vertical line mu right parenthesis g left parenthesis mu right parenthesis comma end cell end table

where g(μ) is the probability density function of the prior distribution. We choose a normal distribution N(μ₀, σ₀²) as the prior for μ. This prior is a conjugate prior because the resulting posterior distribution will also be normal. By completing the square in the exponent of the likelihood and prior, the posterior distribution can be obtained as

table row cell mu vertical line D tilde N left parenthesis mu subscript n comma sigma subscript n superscript 2 right parenthesis comma end cell end table

where

$table row cell mu subscript n end cell cell equals fraction numerator sigma squared over denominator n sigma subscript 0 superscript 2 plus sigma squared end fraction mu subscript 0 plus fraction numerator n sigma subscript 0 superscript 2 over denominator n sigma subscript 0 superscript 2 plus sigma squared end fraction top enclose x end cell end table$ (3.28)

$table row cell fraction numerator 1 over denominator sigma subscript n superscript 2 end fraction end cell cell equals fraction numerator 1 over denominator sigma subscript 0 superscript 2 end fraction plus n over sigma squared. end cell end table$ (3.29)

The posterior mean given in (3.28) can be understood as a weighted average of the prior mean μ₀ and the sample mean x̄, which is the MLE of μ. When the sample size n is very large, the weight for x̄ is close to one and the weight for μ₀ is close to 0, and the posterior mean is very close to the MLE, or the sample mean. On the other hand, when n is very small, the posterior mean is very close the prior mean μ₀. Similarly, if the prior variance σ₀² is very large, the prior distribution has a flat shape and the posterior mean is close to the MLE. Note that because the mode of a normal distribution is equal to the mean, the MAP of μ is exactly μn. Consequently, when n is very large, or when the prior is flat, the MAP is close to the MLE.

Equation (3.29) shows the relationship between the posterior variance and the prior variance. It is easier to understand the relationship if we consider the inverse of the variance, which is called the precision. A high (low) precision corresponds to a low (high) variance. Equation (3.29) basically says that the posterior precision is equal to the prior precision with an added precision contribution proportional to n. Each observation adds a contribution of 1 over sigma squared comma , the precision of xn, to the posterior precision. When n is very large, the posterior precision becomes very high, or equivalently the posterior variance becomes very small. On the other hand, when n is very small, the posterior precision and variance will be very close to the prior precision and variance. Specifically, when n = 0, the posterior distribution is the same as the prior distribution. We illustrate the posterior distribution of the mean with known variance under various sample sizes in Figure 3.3, where the data are generated from N(2, 1) and the prior distribution of the mean is N(0, 1). It is clear from Figure 3.3 that with sample size n getting larger, the posterior distribution of the mean becomes more and more concentrated at the true mean.

Figure 3.3 Posterior distribution of the mean with various sample sizes

When the data follow a p-dimensional multivariate normal distribution with unknown mean μ and known covariance matrix Σ, the posterior distribution based on a random sample of independent observations D = {x₁, x₂,…, x_n} is given by

f left parenthesis bold mu vertical line D right parenthesis proportional to f left parenthesis D vertical line bold mu right parenthesis g left parenthesis bold mu equals product from i equals 1 to n of f left parenthesis bold x subscript i vertical line bold mu right parenthesis g left parenthesis bold mu right parenthesis comma

where g(μ) is the density of the conjugate prior distribution Np(μ₀, Σ₀). Similar to the univariate case, the posterior distribution of μ can be obtained as

table row cell bold mu vertical line D tilde N subscript p left parenthesis mu subscript n comma capital sigma subscript n right parenthesis comma end cell end table

where

Скачать книгу

Industrial Data Analytics for Diagnosis and Prognosis. Yong Chen

Чтение книги онлайн.

Читать онлайн книгу Industrial Data Analytics for Diagnosis and Prognosis - Yong Chen страница 25

Информация о книге: