Overview
This is a fictional data set and should only be used for data science training purposes.
This data set includes scores from three exams, along with a variety of personal, social, and economic factors that have interaction effects on those scores.
Sample Data
gender | race/ethnicity | parental level of education | lunch | test preparation course | math score | reading score | writing score |
---|---|---|---|---|---|---|---|
female | group E | some college | standard | completed | 81 | 84 | 85 |
female | group A | some high school | free/reduced | completed | 37 | 47 | 50 |
male | group D | high school | standard | none | 55 | 58 | 52 |
male | group D | associate's degree | free/reduced | none | 61 | 49 | 57 |
female | group B | high school | free/reduced | completed | 51 | 77 | 73 |
male | group B | some high school | standard | none | 79 | 75 | 66 |
female | group B | associate's degree | standard | none | 40 | 49 | 47 |
male | group C | some college | standard | none | 75 | 71 | 70 |
male | group D | bachelor's degree | free/reduced | none | 44 | 48 | 44 |
female | group C | bachelor's degree | standard | none | 81 | 87 | 80 |
... |
Example Research Questions
- How effective is the test preparation course?
- Which major factors contribute to test outcomes?
- What would be the best way to improve student scores on each test?
What patterns and interactions in the data can you find? Let me know in the comments section below.
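As a starting point for the first two questions, one option is to compare mean scores across the levels of a single factor. The sketch below is only an illustration: it uses the ten sample rows from the table above, which is far too few to support any conclusion, and the names `SAMPLE_CSV`, `SCORE_COLUMNS`, and `mean_scores_by_group` are hypothetical helpers introduced here, not part of the data set. It also assumes the data is available as comma-separated text with the column names shown in the table.

```python
import csv
import io
from collections import defaultdict
from statistics import mean

# The ten sample rows from the table, rewritten as CSV text (assumed format).
SAMPLE_CSV = """gender,race/ethnicity,parental level of education,lunch,test preparation course,math score,reading score,writing score
female,group E,some college,standard,completed,81,84,85
female,group A,some high school,free/reduced,completed,37,47,50
male,group D,high school,standard,none,55,58,52
male,group D,associate's degree,free/reduced,none,61,49,57
female,group B,high school,free/reduced,completed,51,77,73
male,group B,some high school,standard,none,79,75,66
female,group B,associate's degree,standard,none,40,49,47
male,group C,some college,standard,none,75,71,70
male,group D,bachelor's degree,free/reduced,none,44,48,44
female,group C,bachelor's degree,standard,none,81,87,80
"""

SCORE_COLUMNS = ("math score", "reading score", "writing score")


def mean_scores_by_group(rows, factor):
    """Return {factor level: {score column: mean score}} for the given rows."""
    grouped = defaultdict(lambda: defaultdict(list))
    for row in rows:
        for col in SCORE_COLUMNS:
            grouped[row[factor]][col].append(int(row[col]))
    return {
        level: {col: mean(values) for col, values in cols.items()}
        for level, cols in grouped.items()
    }


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))

# Compare students who completed the preparation course with those who did not.
by_prep = mean_scores_by_group(rows, "test preparation course")
for level, means in sorted(by_prep.items()):
    print(level, {col: round(m, 2) for col, m in means.items()})
```

Passing a different column name, such as `"lunch"` or `"parental level of education"`, gives the same comparison for another factor. A difference in group means alone does not show that a factor causes the difference, and it does not capture interactions between factors, so treat this as a first look rather than an answer.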
Download



All data sets are generated on the fly, so you can increase your sample size (n) by downloading a data set multiple times and combining the files.
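Combining several downloads could look something like the sketch below. It assumes the downloaded files are CSV files that all share the same header row, which this page does not state explicitly. The function name `combine_csv_files` and the file names in the usage comment are hypothetical.

```python
import csv


def combine_csv_files(paths, out_path):
    """Concatenate CSV files that share a header.

    Writes the header once, then every data row from every file.
    Returns the number of data rows written.
    """
    header = None
    n_rows = 0
    with open(out_path, "w", newline="") as out:
        writer = csv.writer(out)
        for path in paths:
            with open(path, newline="") as f:
                reader = csv.reader(f)
                file_header = next(reader, None)
                if file_header is None:
                    continue  # skip empty files
                if header is None:
                    header = file_header
                    writer.writerow(header)
                elif file_header != header:
                    raise ValueError(f"{path} has a different header")
                for row in reader:
                    writer.writerow(row)
                    n_rows += 1
    return n_rows


# Hypothetical usage, assuming two downloads saved under these names:
# total = combine_csv_files(["exams_1.csv", "exams_2.csv"], "exams_combined.csv")
```

The header check is a small safeguard: if one of the files was downloaded from a different data set, the function raises an error instead of silently mixing columns.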
Disclaimer
All data sets are fictional and should be used for educational purposes only.