Читать онлайн книгу - Industrial Data Analytics for Diagnosis and Prognosis. Yong Chen. Математика. LiveLib

Новинки Лучшее Рекомендации

Информация о книге:

Название:

Автор:

Жанр:

Серия:

Издательство:

Industrial Data Analytics for Diagnosis and Prognosis - Yong Chen

Скачать книгу

with mean vector μ and covariance matrix Σ, the probability density function of X has the form

$table row cell f left parenthesis bold x right parenthesis equals fraction numerator 1 over denominator left parenthesis 2 pi right parenthesis to the power of p divided by 2 end exponent vertical line capital sigma vertical line to the power of 1 divided by 2 end exponent end fraction e to the power of negative left parenthesis bold x minus bold italic mu right parenthesis to the power of T capital sigma to the power of negative 1 end exponent left parenthesis bold x minus bold italic mu right parenthesis end exponent. end cell end table$ (3.8)

We denote the p-dimensional normal distribution by Np(μ, Σ).

From (3.8), the density of a p-dimensional normal distribution depends on x through the term (x − μ)^T Σ⁻¹ (x − μ), which is the square of the distance from x to Σ standardized by the covariance matrix. Then it is clear that the set of x values yielding a constant height for the density form an ellipsoid. The set of points with the same height for the density is called a contour. The constant probability density contour of a p-dimensional normal distribution is:

left curly bracket bold x vertical line left parenthesis bold x minus bold italic mu right parenthesis to the power of T capital sigma to the power of negative 1 end exponent left parenthesis bold x minus bold italic mu right parenthesis equals c squared right curly bracket comma

which forms the surface of an ellipsoid centered at μ with standardized distance between x and μ equal to c. And the contour with larger distance c has a smaller height value for the density. It can be shown that the axes of the ellipsoid contours of constant density for the p-dimensional normal distribution are in the directions of the eigenvectors of Σ with lengths proportional to the square roots of the corresponding eigenvalues of Σ.

Example 3.1: Consider a bivariate (p = 2) normally distributed random vector X = (X₁ X₂)T. Suppose the mean vector is μ = (0 0)T and the covariance matrix is

table row cell bold capital sigma equals open parentheses table row 1 rho row rho 1 end table close parentheses. end cell end table

So the variance of both variables is equal to one and the covariance matrix coincides with the correlation matrix. The inverse of the covariance matrix is

$table row cell capital sigma to the power of negative 1 end exponent equals fraction numerator 1 over denominator 1 minus rho squared end fraction open parentheses table row 1 cell negative rho end cell row cell negative rho end cell 1 end table close parentheses end cell end table$

and |Σ| = 1 − ρ². Substituting Σ⁻¹ and |Σ| in (3.8), we have

$f open parentheses x subscript 1 comma space x subscript 2 close parentheses space equals space fraction numerator 1 over denominator 2 pi square root of 1 minus rho squared end root end fraction exp space open curly brackets negative fraction numerator 1 over denominator 2 open parentheses 1 minus straight rho squared close parentheses end fraction open parentheses x subscript italic 1 superscript italic 2 plus straight x subscript 2 superscript 2 minus 2 rho x subscript italic 1 x subscript italic 2 close parentheses close curly brackets$ (3.9)

From (3.9), if ρ = 0, the joint density can be written as f(x₁,x₂) = f(x₁)f(x₂), where f(x) is the univariate normal density as given in (3.7), with μ = 0 and σ = 1. So in this case X₁ and X₂ are independent. This result is true for general multivariate normal distribution, as discussed later in this section.

By solving the characteristic equation |Σ − λI| = 0, the two eigenvalues of Σ are λ₁ = 1 + ρ and λ₂ = 1 – ρ. Based on Σv = λv, the corresponding eigenvectors can be obtained as

$table row cell bold v subscript 1 equals open parentheses table row cell fraction numerator square root of 2 over denominator 2 end fraction end cell row cell fraction numerator square root of 2 over denominator 2 end fraction end cell end table close parentheses comma space bold v subscript 2 equals open parentheses table row cell fraction numerator negative square root of 2 over denominator 2 end fraction end cell row cell fraction numerator square root of 2 over denominator 2 end fraction end cell end table close parentheses. end cell end table$

So the major axis of the ellipse contour of constant density is along the line x₁ = x₂ and the minor axis is orthogonal to the major axis. The larger the correlation coefficient ρ, the more elongated the ellipse contour. As an example, two bivariate normal distributions with ρ = 0 and ρ = 0.75 are shown in Figure 3.1(a) and Figure 3.1(b), respectively. Notice how the presence of correlation causes the probability distribution to concentrate along the line x₁ = x₂. When ρ = 0, it is easy to see that the constant-density contour is a circle, as shown in Figure 3.2(a). For ρ = 0.75, the constant-density contour is an ellipse shown in Figure 3.2(b).

Figure 3.1 Two bivariate normal distributions, (a) ρ = 0 (b) ρ = 0.75

Figure 3.2 Contour plots for the distributions in Figure 3.1

Properties of the Multivariate Normal Distribution

We list some of the most useful properties of the multivariate normal distribution. These properties make it convenient to manipulate normal distributions, which is one of the reasons for the popularity of the normal distribution. Suppose the random vector X follows a p-dimensional normal distribution Np(μ,Σ).

Normality of linear combinations of the variables in X. Let c be a vector of constants. From (3.3) and (3.4), we have E(cT X) = cT μ and var(cT X) (cT Σc. This is true for any random vector X. When X follows a multivariate normal distribution, we have the additional property that cT X also follows a (univariate) normal distribution. That is, if X ∼ Np(μ,Σ, then cT X ∼ N(cT μ, cT Σc). In general, if C is a q × p matrix, CX still follows a multivariate normal distribution. From (

Скачать книгу

Industrial Data Analytics for Diagnosis and Prognosis. Yong Chen

Чтение книги онлайн.

Читать онлайн книгу Industrial Data Analytics for Diagnosis and Prognosis - Yong Chen страница 20

Информация о книге:

Properties of the Multivariate Normal Distribution