MathBench > Statistical Tests

Chi-squared Tests

Detour Stop #3: What are "degrees of freedom"?

On the last page, I said you should look up the chi-squared critical value under "number of data rows minus one". Why?

When I told you that 42 out of 100 sick days were on Mondays or Fridays, you automatically knew that 58 had to be in the middle of the week, right? I was "free" to specify how many were on Monday or Friday, but then I was NOT "free" to decide how many were on non-Monday or Friday. So we say that, in this problem, there is only 1 degree of freedom (or df).

Say you flip a coin 100 times and record how many tosses are heads and how many are tails. If we want to do a chi-squared test to determine whether a coin is fair (lands equally on heads and tails), how many degrees of freedom would the test have?

 

(To make this problem interactive, turn on javascript!)

I think I have the answer: There are two variables here -- number of
heads and number of tails. But only 1 is free to vary -- once I tell you
how many heads there were, you know how many tails there were,
and vice versa. So df = 1.

 

It is possible to do chi-squared tests using more than 2 variables. For example, let's say I have data on how many sick days fell on EACH of the five weekdays:

Day Observed Expected
Mon 22 20
Tues 19 20
Wed 19 20
Thurs 20 20
Fri 20 20

We could do a chi-squared test to check whether the distribution of sick days matched our expectations for ALL FIVE weekdays

How many degrees of freedom would this test have?

 

(To make this problem interactive, turn on javascript!)

I think I have the answer: Once I know how many sick days occurred
on 4 of the 5 days, the fifth day is no longer "free" to vary.
Therefore there are only 4 degrees of freedom.

end detour sign