Data Science in Theory and Practice. Maria Cristina Mariani
Чтение книги онлайн.
Читать онлайн книгу Data Science in Theory and Practice - Maria Cristina Mariani страница 22
2.4 Problems
1 If and are two matrices, prove the following properties of the trace of a matrix..., for a any constant .
2 If and are two matrices, prove the following properties of the determinant of a matrix.det = det .det = det det = det .
3 LetFind .Find .Find .Find .
4 LetFind .Find .Compare and .
5 LetFind .Find .
6 Show that the real symmetric matrixis positive definite for any non‐zero column vector.
7 Prove that if and are positive definite matrices then so is .
8 For what values of is the following matrix positive semidefinite?
9 Decide whether the following matrices are positive definite, negative definite, or neither. Please explain your reasoning.
10 For random variables and , show thatThe variance is the variance of the random variable , while the same holds for the random variable .
3 Multivariate Analysis
3.1 Introduction
Multivariate analysis is the statistical analysis of several variables at once. This is when multiple measurements are made on each experimental unit, and for which the relationship among multivariate measurements and their structure are important to the experiment's understanding. Experimental units are what you apply the treatments to. Many problems in the analysis of life science are multivariate in nature. However the analysis of large multivariable data sets is a major challenge for many research fields. Applications of multivariate techniques are vast. Some includes behavioral and biological sciences, finance, geophysics, medicine, ecology, and many other fields. The materials in this chapter will form the basis of discussion for what will be discussed later in this text.
3.2 Multivariate Analysis: Overview
We begin with the formal definition of multivariate analysis.
Definition 3.1 (Multivariate analysis) Multivariate analysis consists of a collection of techniques that can be used when several measurements are made on each experimental unit.
These measurements (i.e. data) must frequently be arranged and displayed in various ways. We now discuss the concepts underlying the first steps of data organization.
Multivariate data arise whenever an investigator, practitioner, or researcher seeks to study some physical phenomenon and selects a number
The rectangular array
Example 3.1 (A data array) A selection of three receipts from Bestbuy was obtained in order to investigate the nature of movie sales. Each receipt provided, among other things, the number of movies sold and the total amount of each sale. Let the first variable be total dollar sales and the second variable be number of movies sold. Then we can take the corresponding numbers on the receipts as three measurements on two variables. From the above description, we obtain the tabular form of the data as follows:
Then the data matrix
with three rows and two columns.
We now present some descriptive statistics. We will begin with the mean vectors.
3.3 Mean Vectors
Let