Skip to main content

Section Avoiding Common Interpretation Pitfalls

There are predictable mistakes that people make when interpreting data. Teaching students to recognize and avoid these pitfalls helps them become more sophisticated consumers and creators of data-based arguments.

Checkpoint 66.

Students find that overall, boys in their school have higher math test scores than girls. However, when they look at each grade level separately, girls have higher scores in every single grade. How is this possible?
Hint.
Think about how the composition of the groups might affect overall averages.
Solution.
This is an example of Simpson’s Paradox—when overall trends reverse when you look at subgroups. It might occur if older students (who generally score higher) include more boys, while younger students include more girls. This teaches students that aggregate data can be misleading and that breaking data into subgroups can reveal important patterns.

Exploration 26. Try This Week: Bias Spotting Practice.

Time needed: 20 minutes
Activity: Present students with scenarios and have them identify potential sources of bias:
Elementary Scenarios:
• A survey about favorite school lunch foods is conducted only in the cafeteria line
• A poll about bedtime is given only to students who arrive early to school
• A survey about weekend activities is conducted only on Monday morning
Secondary Scenarios:
• A study about social media usage surveys only students who have smartphones
• Research on exercise habits is conducted only at a fitness center
• A poll about college plans surveys only students in advanced placement classes
Follow-up Questions: Who might be missing from each sample? How might this affect the results? How could the study be improved?

Checkpoint 67.

A student wants to argue that homework should be eliminated. They find one study showing no relationship between homework and grades, but ignore five other studies showing positive relationships. What’s the problem with this approach?
Hint.
Think about the importance of considering all available evidence, not just evidence that supports your preferred conclusion.
Solution.
This is cherry-picking—selecting only data that supports your preferred conclusion. Good data interpretation requires considering all available evidence and explaining why some studies might show different results. Students should learn to look for patterns across multiple sources of evidence rather than searching for single sources that confirm their beliefs.

Checkpoint 68.

Students conduct a survey in their suburban school and find that 80% of students have access to high-speed internet at home. A student concludes: “Most teenagers have good internet access.” What’s problematic about this generalization?
Hint.
Think about whether their sample represents all teenagers everywhere.
Solution.
The sample comes from one geographic area and socioeconomic context. Teenagers in rural areas, different countries, or lower-income communities might have very different internet access rates. Students should learn to be specific about what populations their data represents and avoid broad generalizations that go beyond their evidence.
This video highlights some common data interpretation mistakes.

Exploration 27. Common Interpretation Pitfalls and Prevention Strategies.

Pitfall 1: Assuming causation from correlation
Prevention: Always ask “What else could explain this relationship?”
Pitfall 2: Overgeneralizing from limited samples
Prevention: Be specific about who your data represents
Pitfall 3: Ignoring missing data or excluded groups
Prevention: Ask “Who is not represented in this data?”
Pitfall 4: Cherry-picking supportive evidence
Prevention: Look for patterns across multiple sources
Pitfall 5: Treating all data as equally reliable
Prevention: Consider data quality, source, and methods