Chi-Square
 
 
What do you do when your data are not continuous or normally distributed?

How the chi-square test works.

How does the chi-square test relate to previous tests?


Here's an example problem for a one-way chi-square:

In Monty Hall's game show there are three doors A, B, and C.  The null hypothesis is that contestants will choose each door with equal frequency. What is the expected cell count if there are 60 contestants?

Expected counts (60 contestants / 3 doors = 20 per door):

A   B   C
20  20  20

Here is the observed data:
 
A   B   C
10  10  40

Calculate the chi-square statistic: $\chi^2 = \frac{(10-20)^2}{20} + \frac{(10-20)^2}{20} + \frac{(40-20)^2}{20} = 5 + 5 + 20 = 30$

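Is this significant? There are c - 1 = 2 degrees of freedom, where c = 3 is the number of categories. At α = 0.05 the critical value for χ² is 5.99. Since 30 > 5.99, we reject the null hypothesis.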
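The calculation above can be sketched in a few lines of Python. This is only an illustration: the function name one_way_chi_square is made up for this example, and the closed-form tail area exp(-x/2) used for the critical value and p-value holds only for 2 degrees of freedom.

```python
import math

# Observed door choices from the example (60 contestants)
observed = {"A": 10, "B": 10, "C": 40}

def one_way_chi_square(observed_counts):
    """Return (statistic, df) under a null of equal frequencies."""
    n = sum(observed_counts)
    k = len(observed_counts)
    expected = n / k  # same expected count in every cell
    stat = sum((o - expected) ** 2 / expected for o in observed_counts)
    return stat, k - 1

stat, df = one_way_chi_square(list(observed.values()))

# Assumption: df == 2, where the right-tail area is exactly exp(-x / 2)
alpha = 0.05
critical = -2 * math.log(alpha)   # about 5.99
p_value = math.exp(-stat / 2)

print(f"chi-square = {stat:.2f}, df = {df}")
print(f"critical value = {critical:.2f}, p = {p_value:.2g}")
print("reject null" if stat > critical else "fail to reject null")
```

For other degrees of freedom the critical value would come from a chi-square table or a statistics library rather than this shortcut.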



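Like the t and F distributions, the chi-square distribution has a different shape for each number of degrees of freedom. Only the right tail matters: any departure from the expected counts makes χ² larger, so large values are the evidence against the null hypothesis.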
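To see how the right-tail area depends on the degrees of freedom, here is a small sketch. The helper chi2_right_tail is hypothetical and uses a closed form that is valid only for an even number of degrees of freedom; the value 9.49 for 4 degrees of freedom is the standard table value at α = 0.05, not a number from these notes.

```python
import math

def chi2_right_tail(x, df):
    """P(X >= x) for a chi-square variable with an even number of df.

    Closed form: exp(-x/2) * sum_{k < df/2} (x/2)**k / k!
    """
    if df <= 0 or df % 2 != 0:
        raise ValueError("this sketch only handles even, positive df")
    half = x / 2
    terms = (half ** k / math.factorial(k) for k in range(df // 2))
    return math.exp(-half) * sum(terms)

# The same cutoff leaves different tail areas under different distributions
for df in (2, 4):
    area = chi2_right_tail(5.99, df)
    print(f"df = {df}: area to the right of 5.99 is {area:.3f}")

# With 4 df the 0.05 cutoff moves out to roughly 9.49
print(f"df = 4: area to the right of 9.49 is {chi2_right_tail(9.49, 4):.3f}")
```

The cutoff of 5.99 leaves about 5% in the tail with 2 degrees of freedom but considerably more with 4, which is why each degrees-of-freedom value needs its own critical value.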

[Figure: chi-square distribution with 2 degrees of freedom]

[Figure: chi-square distribution with 4 degrees of freedom]



Chi-square tests can also be done with two categorical variables (a two-way chi-square).


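Estimated chi-square statistic: $\chi^2 = \sum \frac{(O - E)^2}{E}$, where O is the observed count and E is the expected count in each cell.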


Here are some data (sex vs. four sport motivations):

                  Female  Male  Total
Popularity            14    31     45
Fitness                7    18     25
Self-Confidence       21     5     26
Entertainment         25    13     38
Total                 67    67    134

The null hypothesis is that there is no association between the rows (motivation) and the columns (sex).
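A rough sketch of how this null hypothesis could be tested on the table above. It assumes the usual expected count for each cell, (row total × column total) / grand total, and (rows - 1) × (columns - 1) degrees of freedom. The function name two_way_chi_square is made up for illustration, and 7.81 is the standard table critical value for 3 degrees of freedom at α = 0.05, which these notes do not state.

```python
# Observed counts from the table: (Female, Male) for each motivation
table = {
    "Popularity": (14, 31),
    "Fitness": (7, 18),
    "Self-Confidence": (21, 5),
    "Entertainment": (25, 13),
}

def two_way_chi_square(rows):
    """Return (statistic, df) for a table given as a list of rows."""
    row_totals = [sum(row) for row in rows]
    col_totals = [sum(col) for col in zip(*rows)]
    n = sum(row_totals)
    stat = 0.0
    for r, row in enumerate(rows):
        for c, obs in enumerate(row):
            # Expected count if rows and columns were unrelated
            exp = row_totals[r] * col_totals[c] / n
            stat += (obs - exp) ** 2 / exp
    df = (len(rows) - 1) * (len(col_totals) - 1)
    return stat, df

stat, df = two_way_chi_square(list(table.values()))

CRITICAL_DF3 = 7.81  # assumed table value for df = 3, alpha = 0.05

print(f"chi-square = {stat:.2f}, df = {df}")
print("reject null" if stat > CRITICAL_DF3 else "fail to reject null")
```

Because both column totals are 67, each expected count here is simply half of its row total (22.5, 12.5, 13, and 19).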