Use this checklist to evaluate potential datasets for your project. A suitable dataset should meet most of these criteria:
- Contains at least 100 records (rows)
- Includes at least 8 variables (columns)
- Has a mix of categorical and numerical variables
- Is on a topic that genuinely interests you
- Has documentation about data collection methods and meanings of variables
- Is reasonably clean but offers some opportunities for data cleaning practice
- Can be easily imported into CODAP
- Supports at least three meaningful statistical questions
- Contains variables that might have interesting relationships
- Is publicly available or properly licensed for educational use
Review the dataset you’re considering and evaluate how many of these criteria it meets. A good dataset for your project should satisfy at least 7-8 of these items.
Solution.
This checklist serves as a reference for dataset evaluation. There is no single correct answer, as the suitability of a dataset depends on your specific project needs. However, datasets meeting more criteria will generally provide better opportunities for meaningful analysis.
