Читать онлайн книгу - Data Science in Theory and Practice. Maria Cristina Mariani. Математика. LiveLib

Новинки Лучшее Рекомендации

Информация о книге:

Название:

Автор:

Жанр:

Серия:

Издательство:

Data Science in Theory and Practice - Maria Cristina Mariani

Скачать книгу

href="#fb3_img_img_88a4c458-1aef-51a7-b963-ce9fab490ff8.png" alt="upper X"/> is said to have a binomial distribution with parameters

and

if it has a pmf shown below

upper P left-parenthesis x semicolon p comma n right-parenthesis equals StartBinomialOrMatrix n Choose k EndBinomialOrMatrix left-parenthesis p right-parenthesis Superscript x Baseline left-parenthesis 1 minus p right-parenthesis Superscript left-parenthesis n minus x right-parenthesis Baseline for x equals 0 comma 1 comma ellipsis comma n comma

where is the probability of success on an individual trial and is number of trials in the binomial experiment.

The multinomial distribution is a generalization of the binomial distribution. Specifically, assume that independent distributions may result in one of the outcomes generically labeled , each with corresponding probabilities . Now define a vector , where each of the counts the number of outcomes in the resulting sample of size . The joint distribution of the vector is

f left-parenthesis x 1 comma ellipsis comma x Subscript k Baseline right-parenthesis equals StartFraction n factorial Over x 1 factorial ellipsis x Subscript k Baseline factorial EndFraction p 1 Superscript x 1 Baseline ellipsis p Subscript k Superscript x Super Subscript k Superscript Baseline bold 1 Subscript left-brace bold x bold 1 bold plus bold midline-horizontal-ellipsis bold plus bold x Sub Subscript bold k Subscript bold equals bold n right-brace Baseline period

In the same way as the binomial probabilities appear as coefficients in the binomial expansion of , the multinomial probabilities are the coefficients in the multinomial expansion , so they sum to 1. This expansion in fact gives the name of the distribution.

If we label the outcome as a success and everything else a failure, then simply counts successes in independent trials and thus . Thus, the first moment of the random vector and the diagonal elements in the covariance matrix are easy to calculate as and , respectively. The off‐diagonal elements (covariances) are not that complicated to calculate either. However, for multinomial random vectors, the first two moments are difficult to compute. The one‐dimensional marginal distributions are binomial; however, the joint distribution of , the first components, is not multinomial. Instead, suppose we group the first categories into 1 and we let . Because the categories are linked, that is, , we also have that . We can easily verify that the vector , or equivalently , will have a multinomial distribution with associated probabilities .

Next consider the conditional distribution of the first components given the last components. That is, the distribution of

left-parenthesis upper X 1 comma ellipsis comma upper X Subscript r Baseline right-parenthesis bar upper X Subscript r plus 1 Baseline equals n Subscript r plus 1 Baseline comma ellipsis comma upper X Subscript k Baseline equals n Subscript k Baseline period

This distribution is also multinomial with the number of elements n minus n Subscript r plus 1 Baseline minus midline-horizontal-ellipsis minus n Subscript k and probabilities left-parenthesis p prime 1 comma ellipsis comma p prime Subscript r right-parenthesis , where p prime Subscript i Baseline equals StartFraction p Subscript i Baseline Over p 1 plus midline-horizontal-ellipsis plus p Subscript r Baseline EndFraction .

Data Science in Theory and Practice. Maria Cristina Mariani

Чтение книги онлайн.

Читать онлайн книгу Data Science in Theory and Practice - Maria Cristina Mariani страница 20

Информация о книге:

2.3.3 Multivariate Normal Distribution

Скачать книгу

Data Science in Theory and Practice. Maria Cristina Mariani

Чтение книги онлайн.

Читать онлайн книгу Data Science in Theory and Practice - Maria Cristina Mariani страница 20

Информация о книге:

2.3.3 Multivariate Normal Distribution Скачать книгу

2.3.3 Multivariate Normal Distribution

Скачать книгу