Читать онлайн книгу - Methods in Psychological Research. Annabel Ness Evans. Зарубежная психология. LiveLib

Новинки Лучшее Рекомендации

Информация о книге:

Название:

Автор:

Жанр:

Серия:

Издательство:

Methods in Psychological Research - Annabel Ness Evans

Скачать книгу

the color of paper will influence reading comprehension, but our participants vary considerably in age. This could pose a serious confound because reading comprehension changes with age. If we measured age, we can use ANCOVA to remove variability in reading comprehension that is due to age and then test the effect of color. The procedure removes the variance due to age from the DV before the F is calculated for the effect of color. Consequently, we are testing the effect of color after we have taken into account the effect of age.

The statistics described here are useful for comparing group means, but you may come across research where the variables are categories and the data are summarized by counting the frequency of things. When there are frequency counts instead of scores, you may see a chi-square test.

Chi-Square Test.

Do people prefer Coke or Pepsi? Suppose we have offered both drinks and asked people to declare a preference. We count the number of people preferring each drink. These data are not measures, and means cannot be calculated. If people’s preference did not differ between the two drinks, we would expect about the same number of people to pick each, and we could use a chi-square test, called the goodness-of-fit test, to test our hypothesis. In chi-square, our null hypothesis is that our observed frequencies will not be different from those we would expect by chance.

In the literature, you will likely see the data summarized by reporting the frequencies of each category either as total counts or perhaps as percentages of the total. Then you may read a statement that the frequencies in the groups are statistically significant, followed by a report of the chi-square statistic and p value.

Chi-square is called a nonparametric or distribution-free test because the test does not make the assumption that the population is distributed normally. Indeed, hypotheses about the shape of the population distribution are exactly what we are testing with chi-square.

There are two common chi-square tests: the goodness-of-fit test and the test for independence. The goodness-of-fit test is used when there are categorical data on one variable, as we had in the soft drink preference example. Perhaps a researcher is interested in the relationship between two categorical variables. In this case, you might see the chi-square test for independence. Imagine that our researcher has asked cola tasters to indicate their choice of cola and has also categorized them by age. The research hypothesis might be that preference for cola depends on age. The researcher might think that younger people prefer Pepsi, for example, and older people prefer Coke. Or perhaps older people have no preference. The chi-square statistic is the same for this test as for the goodness-of-fit test. The difference is in the hypothesis. The null is that the two variables are independent (i.e., there is no relationship between them). In a research article, you will likely see a table of frequencies (or percentages), a statement as to whether a relationship was found between the variables, and the chi-square statistic and p value.

Conceptual Exercise 2B

For each of the following, decide whether a t test, an F test, or a chi-square test might be appropriate:

1 A new teacher decides to put some of the principles he learned in school to the test. He randomly selects half of his class and consistently praises each student for being on the task for a minimum period of time. With the other half of the class, he periodically gives praise for on-task behavior. He wants to know if periodic praise produces more on-task behavior than consistent praise.

2 Psychiatric walk-in clients are randomly assigned to five therapists for short-term counseling. One therapist specializes in psychoanalytic techniques, one in client-centered techniques, one in behavioral techniques, and one in cognitive techniques. The fifth therapist is eclectic, using techniques from each of the therapies. All clients are rated on various scales designed to measure improvement. The researcher compares mean improvement ratings of the clients for each therapist.

3 A statistics professor wants to know if generally there are more or less equal numbers of psychology, sociology, and business students in her classes. She keeps a tally.

Other Nonparametric Tests.

In addition to chi-square, there are numerous other nonparametric tests that you will see in the literature. We have not tried to present a complete list here; instead, we have included the more common tests.

A nonparametric alternative to a t test for independent groups is the Mann-Whitney U test, which detects differences in central tendency and differences in the entire distributions of rank-ordered data. The Wilcoxon signed-ranks test is an alternative to a t test for dependent groups for rank-ordered data on the same or matched participants.

A nonparametric alternative to the one-way ANOVA is the Kruskal–Wallis H test, used when the data are rank orders of three or more independent groups. When those groups are dependent (i.e., repeated measures), a nonparametric test is the Friedman test.

Pearson’s r Test.

If you earned a lot of money, would you be happy? Is there a relationship between income and happiness? If a researcher were interested in investigating a linear relationship between two continuous variables, he or she would use the Pearson product–moment test to calculate the correlation r. If you are getting a sense of déjà vu, it is probably because we talked about r as a descriptive statistic, but here we are talking about it as an inferential statistic. The important distinction is that the r reported as an inferential statistic will have an associated p value. For example, in a research article, you will read that a positive relationship was found between a measure of need for achievement and years of education and that the relationship was statistically significant. If the relationship was statistically significant, you will also see a p value reported.

Regression.

Regression is related to correlation, but in regression, we are interested in using a predictor variable to predict a criterion variable. Continuing with the example of need for achievement and education, perhaps the researcher was also interested in predicting the need for achievement from education level. If the correlation between the two variables is statistically significant, it is a simple matter of fitting a line through the data and using the equation for the line to predict need for achievement from education level. We say “simple matter” because the calculations are all done by computer, but, certainly, the equation for a straight line is simple:

Y = mX + b

where Y is the criterion variable, X is the predictor variable, m is the slope of the line, and b is the value of Y where the line intercepts the y-axis. Be sure to keep in mind as you read the research that the accuracy of the predicted values will be as good as the correlation is. That is, the closer the correlation is to +1 (or −1), the better the predictions will be.

The statistical procedures we have been discussing all involve an a priori hypothesis about the nature of the population. Hypothesis testing is used a lot in psychology. Some other disciplines tend to prefer post hoc procedures, and you will find confidence interval estimates quite often in the literature you will be reading.

Confidence Intervals

Confidence