Classical Biases in Common Statistics |
Written 2001 Formatted 2009 |
|||||||||
|
Although statistics can be very valuable, we do have to be clear on what they can tell us, and what they can't. Understanding where the biases come from will help promote understanding that distinction. As you look at these examples ask, "How can I be sure not to read in more than is actually there?" This page is laid out in a form that would make an easy discussion for a classroom lesson. |
||||||||||
Example 1: Economic ComparisonsLook at the following two groups and ask yourself which group is better off. Each group has four members.
If you look only at the average incomes which group do you consider better off? Group A has the higher average. Is Group A really better off? Which is better poor or dead? Poor or in jail? |
Related pages at this site:
|
|||||||||
Example 2: More Economic ComparisonsHow do these two groups compare?
|
||||||||||
Example 3: Economic Combined EffectsWhat happens to the numbers when you combine the two effects above?
Statisticians frequently talk about normalizing data, that is correcting for intrinsic errors. How would you normalize this data to account for the dead and jailed persons being removed from the data set? |
||||||||||
Example 4: Life expectancy - Remote LocationImagine a 40 year old pregnant woman, discouraged in life, retreating to a remote desert and dying immediately after labor. Only two people, mother and child, have settled in this place, so it is easy to calculate the life expectancy (40 + 0) / 2 = 20. If you go to that location should you expect to die when you are 20? Example 5: Life Expectancy - Childhood IllnessImagine a small town with a high infant mortality rate. The recorded deaths have occurred at these ages: ten infants have died in their first year, and five adults at the ages 60, 70, 75, an d 80. This towns life expectancy will be calculated as 20. Should the young start worrying as they approach the age of 20? Do life expectancy numbers describe something that any individual within that group should expect? |
Life expectancy | |||||||||
Example 6: Generational ChangesHow do these two generations compare?
|
||||||||||
Comparing real data to our examplesIn many locations and times in history life expectancy has been reported to be less than 30. How should we interpret this? Did most people really die when they were 30? How would this have affected families? How old would most children have been when their parents died? Who would have raised the children? Many observers make powerful claims about the members of different groups after average incomes are compared? Do averages really represent the individuals? Imagine what the average income data will look like for any group that Bill Gates is a member of. During the first decade of the 21st century, the average income rose, but the median income stagnated. What did this mean? How would you generalize these examples to other statistical data sets? What alternatives to averaging would be more informative? How would you represent the difference between the lowest and the average? Or the highest and the average? Do some biased data sets lend themselves well to normalizing? If so, How would you do it? |
The Flaw of Averages | |||||||||
|
||||||||||