Data Science in Theory and Practice. Maria Cristina Mariani

Чтение книги онлайн.

Читать онлайн книгу Data Science in Theory and Practice - Maria Cristina Mariani страница 23

Data Science in Theory and Practice - Maria Cristina Mariani

Скачать книгу

bold upper X right-parenthesis equals upper E Start 4 By 1 Matrix 1st Row x 1 2nd Row x 2 3rd Row vertical-ellipsis 4th Row x Subscript p Baseline EndMatrix equals Start 4 By 1 Matrix 1st Row upper E left-parenthesis x 1 right-parenthesis 2nd Row upper E left-parenthesis x 2 right-parenthesis 3rd Row vertical-ellipsis 4th Row upper E left-parenthesis x Subscript p Baseline right-parenthesis EndMatrix period"/>

      More generally, if bold upper Z Subscript n times p Baseline equals left-bracket z Subscript j k Baseline right-bracket is a matrix of random variables, then the upper E left-parenthesis bold upper Z right-parenthesis is the matrix of expectations with elements left-bracket upper E left-parenthesis z Subscript j k Baseline right-parenthesis right-bracket, i.e.:

StartLayout 1st Row 1st Column bold upper Z 2nd Column equals upper E Start 6 By 6 Matrix 1st Row 1st Column z Subscript 1 comma 1 Baseline 2nd Column z Subscript 1 comma 2 Baseline 3rd Column midline-horizontal-ellipsis 4th Column z Subscript 1 comma k Baseline 5th Column midline-horizontal-ellipsis 6th Column z Subscript 1 comma p Baseline 2nd Row 1st Column z Subscript 2 comma 1 Baseline 2nd Column z Subscript 2 comma 2 Baseline 3rd Column midline-horizontal-ellipsis 4th Column z Subscript 2 comma k Baseline 5th Column midline-horizontal-ellipsis 6th Column z Subscript 2 comma p Baseline 3rd Row 1st Column vertical-ellipsis 2nd Column vertical-ellipsis 3rd Column Blank 4th Column vertical-ellipsis 5th Column vertical-ellipsis 6th Column vertical-ellipsis 4th Row 1st Column z Subscript j comma 1 Baseline 2nd Column z Subscript j comma 2 Baseline 3rd Column midline-horizontal-ellipsis 4th Column z Subscript j comma k Baseline 5th Column midline-horizontal-ellipsis 6th Column z Subscript j comma p Baseline 5th Row 1st Column vertical-ellipsis 2nd Column vertical-ellipsis 3rd Column Blank 4th Column vertical-ellipsis 5th Column vertical-ellipsis 6th Column vertical-ellipsis 6th Row 1st Column z Subscript n comma 1 Baseline 2nd Column z Subscript n comma 2 Baseline 3rd Column midline-horizontal-ellipsis 4th Column z Subscript n comma k Baseline 5th Column midline-horizontal-ellipsis 6th Column z Subscript n comma p Baseline EndMatrix 2nd Row 1st Column Blank 2nd Column equals Start 6 By 6 Matrix 1st Row 1st Column upper E left-parenthesis z Subscript 1 comma 1 Baseline right-parenthesis 2nd Column upper E left-parenthesis z Subscript 1 comma 2 Baseline right-parenthesis 3rd Column midline-horizontal-ellipsis 4th Column upper E left-parenthesis z Subscript 1 comma k Baseline right-parenthesis 5th Column midline-horizontal-ellipsis 6th Column upper E left-parenthesis z Subscript 1 comma p Baseline right-parenthesis 2nd Row 1st Column upper E left-parenthesis z Subscript 2 comma 1 Baseline right-parenthesis 2nd Column upper E left-parenthesis z Subscript 2 comma 2 Baseline right-parenthesis 3rd Column midline-horizontal-ellipsis 4th Column upper E left-parenthesis z Subscript 2 comma k Baseline right-parenthesis 5th Column midline-horizontal-ellipsis 6th Column upper E left-parenthesis z Subscript 2 comma p Baseline right-parenthesis 3rd Row 1st Column vertical-ellipsis 2nd Column vertical-ellipsis 3rd Column Blank 4th Column vertical-ellipsis 5th Column vertical-ellipsis 6th Column vertical-ellipsis 4th Row 1st Column upper E left-parenthesis z Subscript j comma 1 Baseline right-parenthesis 2nd Column upper E left-parenthesis z Subscript j comma 2 Baseline right-parenthesis 3rd Column midline-horizontal-ellipsis 4th Column upper E left-parenthesis z Subscript j comma k Baseline right-parenthesis 5th Column midline-horizontal-ellipsis 6th Column upper E left-parenthesis z Subscript j comma p Baseline right-parenthesis 5th Row 1st Column vertical-ellipsis 2nd Column vertical-ellipsis 3rd Column Blank 4th Column vertical-ellipsis 5th Column vertical-ellipsis 6th Column vertical-ellipsis 6th Row 1st Column upper E left-parenthesis z Subscript n comma 1 Baseline right-parenthesis 2nd Column upper E left-parenthesis z Subscript n comma 2 Baseline right-parenthesis 3rd Column midline-horizontal-ellipsis 4th Column upper E left-parenthesis z Subscript n comma k Baseline right-parenthesis 5th Column midline-horizontal-ellipsis 6th Column upper E left-parenthesis z Subscript n comma p Baseline right-parenthesis EndMatrix period EndLayout

      For a random vector bold upper X Superscript upper T Baseline equals left-bracket x 1 comma x 2 comma ellipsis comma x Subscript p Baseline right-bracket, the mean vector consists of the means of each variable:

upper E left-parenthesis bold upper X right-parenthesis equals upper E Start 4 By 1 Matrix 1st Row x 1 2nd Row x 2 3rd Row vertical-ellipsis 4th Row x Subscript p Baseline EndMatrix equals Start 4 By 1 Matrix 1st Row upper E left-parenthesis x 1 right-parenthesis 2nd Row upper E left-parenthesis x 2 right-parenthesis 3rd Row vertical-ellipsis 4th Row upper E left-parenthesis x Subscript p Baseline right-parenthesis EndMatrix equals Start 4 By 1 Matrix 1st Row mu 1 2nd Row mu 2 3rd Row vertical-ellipsis 4th Row mu Subscript p Baseline EndMatrix equals mu comma x overbar Subscript 1 Baseline equals StartFraction 1 Over n EndFraction sigma-summation Underscript j equals 1 Overscript n Endscripts x Subscript j Baseline 1 Baseline period

      The sample mean can be computed from the n measurements on each of the p variables. Therefore, in general for p sample means, we have:

x overbar Subscript k Baseline equals StartFraction 1 Over n EndFraction sigma-summation Underscript j equals 1 Overscript n Endscripts x Subscript j k Baseline comma k equals 1 comma 2 comma ellipsis comma p period

      Example 3.2 Consider the following data matrix introduced in Example 3.1:

bold upper X equals Start 3 By 2 Matrix 1st Row 1st Column 48 2nd Column 3 2nd Row 1st Column 22 2nd Column 1 3rd Row 1st Column 50 2nd Column 2 EndMatrix period

      Each receipt yields a pair of measurements, total dollar sales, and number of movies sold. To find the sample mean x overbar, we calculate the average of each column as follows:

StartLayout 1st Row 1st Column x overbar Subscript 1 2nd Column equals one third sigma-summation Underscript j equals 1 Overscript 3 Endscripts x Subscript j Baseline 1 Baseline equals one third left-parenthesis 48 plus 22 plus 50 right-parenthesis equals 40 comma 2nd Row 1st Column x overbar Subscript 2 2nd Column equals one third sigma-summation Underscript j equals 1 Overscript 3 Endscripts x Subscript j Baseline 2 Baseline equals one third left-parenthesis 3 plus 1 plus 2 right-parenthesis equals 2 period EndLayout

      Therefore,

bold upper X overbar equals StartBinomialOrMatrix x overbar Subscript 1 Baseline Choose x overbar Subscript 2 Baseline EndBinomialOrMatrix equals StartBinomialOrMatrix 40 Choose 2 EndBinomialOrMatrix period

Скачать книгу