Exam Scores

Exam scores for students at a public school


This is a fictional dataset and should only be used for data science training purposes.

This data set includes scores from three exams and a variety of personal, social, and economic factors that have interaction effects upon them.

Sample Data

Sample Output
genderrace/ethnicityparental level of educationlunchtest preparation coursemath scorereading scorewriting score
malegroup Bsome high schoolstandardnone544946
femalegroup Dhigh schoolfree/reducednone495860
malegroup Csome high schoolstandardnone626060
femalegroup Dmaster's degreefree/reducedcompleted617177
femalegroup Csome collegefree/reducednone365451
femalegroup Csome high schoolfree/reducednone404941
femalegroup Csome collegefree/reducednone435552
malegroup Csome collegestandardnone635353
femalegroup Chigh schoolfree/reducedcompleted678784
malegroup Bhigh schoolfree/reducedcompleted847077

Example Research Questions

  1. How effective is the test preparation course?
  2. Which major factors contribute to test outcomes?
  3. What would be the best way to improve student scores on each test?

What patterns and interactions in the data can you find? Let me know in the comments section below.


Download a Small Sample - Download a .csv file with 10 results as a sample set (n=10) Download a Medium Sample - Download a .csv file with 100 results as a sample set (n=100) Download a Large Sample - Download a .csv file with 1000 results as a sample set (n=1000)
All data sets are generated on-the-fly. So, you can increase your n by downloading a data set multiple times and combining the files.


All data sets are fictional and should be used for educational purposes only.

Share this page: