Industrial Data Analytics for Diagnosis and Prognosis. Yong Chen

Чтение книги онлайн.

Читать онлайн книгу Industrial Data Analytics for Diagnosis and Prognosis - Yong Chen страница 23

Industrial Data Analytics for Diagnosis and Prognosis - Yong Chen

Скачать книгу

alt="table row cell H subscript 0 colon bold mu equals mu subscript 0 text vs. end text H subscript 1 colon bold mu not equal to mu subscript 0. end cell end table"/> (3.20)

      Let X1, X2,…, Xn denote a random sample from a multivariate normal population. The test statistic in (3.19) can be naturally generalized to the multivariate distribution as

      where and S are the sample mean vector and the sample covariance matrix of X1, X2,…, Xn. The T2 statistic in (3.19) is called Hotelling’s T2 in honor of Harold Hotelling who first obtained its distribution. Assuming H0 is true, we have the following result about the distribution of the T2-statistic:

table row cell fraction numerator bold n bold minus bold p over denominator bold left parenthesis bold n bold minus bold 1 bold right parenthesis bold p end fraction bold T to the power of bold 2 bold tilde bold F subscript bold p bold comma bold n bold minus bold p end subscript bold comma end cell end table

      where Fp,n−p denotes the F-distribution with p and np degrees of freedom. Based on the results on the distribution of T2, we reject H0 at the significance level of α if

      where Fp,n−p denotes the upper (100α)th percentile of the F-distribution with p and np degrees of freedom. The p-value of the test based on the T2-statistic is

table row cell straight P equals text Pr end text left parenthesis straight F left parenthesis straight p comma straight n minus straight p right parenthesis > semicolon fraction numerator straight n minus straight p over denominator left parenthesis straight n minus 1 right parenthesis straight p end fraction straight T squared right parenthesis comma end cell end table

      where F(p,np) denotes a random variable distributed as Fp,n−p.

      The T2 statistic can also be written as

table row cell left curly bracket straight mu vertical line straight n left parenthesis bold x with bold bar on top minus bold mu right parenthesis to the power of straight T bold S to the power of negative 1 end exponent left parenthesis bold x with bold bar on top minus bold mu right parenthesis less or equal than fraction numerator left parenthesis straight n minus 1 right parenthesis straight p over denominator straight n minus straight p end fraction straight F subscript straight alpha comma straight p comma straight n minus straight p end subscript right curly bracket. end cell end table

      The concept of univariate confidence interval can be extended to multivariate confidence region. For p-dimensional normal distribution, the 100(1 − α)% confidence region for μ is defined as

table row cell left curly bracket mu vertical line n left parenthesis bold x with bold bar on top minus bold italic mu right parenthesis to the power of T bold S to the power of negative 1 end exponent left parenthesis bold x with bold bar on top minus bold italic mu right parenthesis less or equal than fraction numerator left parenthesis n minus 1 right parenthesis p over denominator n minus p end fraction F subscript alpha comma p comma n minus p end subscript right curly bracket. end cell end table

      It is clear that the confidence region for μ is an ellipsoid centered at . Similar to the univariate case, the null hypothesis H0 :μ = μ0 is not rejected at level α if and only if μ0 is in the 100(1 − α)% confidence region for μ.

      The T2-statistic can also be derived as the likelihood ratio test of the hypotheses in (3.20). The likelihood ratio test is a general principle of constructing statistical test procedures and having several optimal properties for reasonably large samples. The detailed study of the likelihood ratio test theory is beyond the scope of this book.

      Substituting the MLE of μ and Σ in (3.16) and (3.17), respectively, into the likelihood function in (3.13), it is easy to see

table row cell max with bold italic mu comma bold capital sigma below L left parenthesis bold italic mu comma bold capital sigma right parenthesis equals fraction numerator 1 over denominator left parenthesis 2 pi right parenthesis to the power of n p divided by 2 end exponent vertical line bold capital sigma with bold hat on top vertical line to the power of n divided by 2 end exponent end fraction e to the power of negative n p divided by 2 end exponent comma end cell end table

      where is the MLE of Σ given in (3.17). Under the null hypothesis H0 : μ = μ0, the MLE of Σ with μ = μ0 fixed can be obtained as

table row cell bold capital sigma with bold hat on top subscript 0 equals 1 over n sum from i equals 1 to n of left parenthesis bold x subscript i minus bold italic mu subscript 0 right parenthesis left parenthesis x subscript i minus bold italic mu subscript 0 right parenthesis to the power of T. end cell end table

      It can be seen that stack sum subscript 0 with hat on top is the same as

Скачать книгу