EXAMPLE 4.9
Students who seek therapy at university counseling centers often do not attend many sessions. For example, in one study, the median number of therapy sessions was 3 and the mean was 4.6 (Hatchett, 2003). Let’s examine the spread of fictional scores for a sample of five students: 1, 2, 4, 4, and 10 therapy sessions, with a mean of 4.2. We find out how far each score deviates from the mean by subtracting the mean from every score. First, we label with anX the column that lists the scores. Here, the second column includes the results we get when we subtract the mean from each score, orX −M. We call each of these a deviation from the mean (or just adeviation)—the amount that a score in a sample differs from the mean of the sample.
X
|
X −M
|
---|
1 |
− 3.2 |
2 |
− 2.2 |
4 |
− 0.2 |
4 |
− 0.2 |
10 |
5.8 |
But we can’t just take the mean of the deviations. If we do (and if you try this, don’t forget the signs—negative and positive), we get 0—every time. Are you surprised? Remember, the mean is the point at which all scores are perfectly balanced. Mathematically, the scoreshave to balance out. Yet we know that thereis variability among these scores. The number representing the amount of variability is certainly not 0!
When we ask students for ways to eliminate the negative signs, two suggestions typically come up: (1) Take the absolute value of the deviations, thus making them all positive, or (2) square all the scores, again making them all positive. It turns out that the latter, squaring all the deviations, is how statisticians solve this problem. Once we square the deviations, we can take their average and get a measure of variability. Later (using a beautifully descriptive term created by our students), we will “unsquare” those deviations in order to calculate the standard deviation