Exam Scores

Exam scores for students at a public school


This is a fictional dataset and should only be used for data science training purposes.

This data set includes scores from three exams (math, reading, and writing) and a variety of personal, social, and economic factors that have interaction effects on those scores.

Sample Data

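| gender | race/ethnicity | parental level of education | lunch | test preparation course | math score | reading score | writing score |
|---|---|---|---|---|---|---|---|
| female | group E | some college | standard | completed | 81 | 84 | 85 |
| female | group A | some high school | free/reduced | completed | 37 | 47 | 50 |
| male | group D | high school | standard | none | 55 | 58 | 52 |
| male | group D | associate's degree | free/reduced | none | 61 | 49 | 57 |
| female | group B | high school | free/reduced | completed | 51 | 77 | 73 |
| male | group B | some high school | standard | none | 79 | 75 | 66 |
| female | group B | associate's degree | standard | none | 40 | 49 | 47 |
| male | group C | some college | standard | none | 75 | 71 | 70 |
| male | group D | bachelor's degree | free/reduced | none | 44 | 48 | 44 |
| female | group C | bachelor's degree | standard | none | 81 | 87 | 80 |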

Example Research Questions

  1. How effective is the test preparation course?
  2. Which major factors contribute to test outcomes?
  3. What would be the best way to improve student scores on each test?
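As a starting point for the first two questions, here is a minimal sketch that averages each score by one factor. It uses only the ten sample rows shown on this page and assumes the downloaded .csv uses the same column names as the sample table. The helper name `mean_scores_by` is made up for illustration. With so few rows, the averages are not evidence of a real effect.

```python
import csv
import io
from collections import defaultdict
from statistics import mean

# The ten sample rows from this page, written out as CSV text.
# Assumption: the downloadable files use these same column names.
SAMPLE_CSV = """gender,race/ethnicity,parental level of education,lunch,test preparation course,math score,reading score,writing score
female,group E,some college,standard,completed,81,84,85
female,group A,some high school,free/reduced,completed,37,47,50
male,group D,high school,standard,none,55,58,52
male,group D,associate's degree,free/reduced,none,61,49,57
female,group B,high school,free/reduced,completed,51,77,73
male,group B,some high school,standard,none,79,75,66
female,group B,associate's degree,standard,none,40,49,47
male,group C,some college,standard,none,75,71,70
male,group D,bachelor's degree,free/reduced,none,44,48,44
female,group C,bachelor's degree,standard,none,81,87,80
"""

SCORE_COLUMNS = ["math score", "reading score", "writing score"]


def mean_scores_by(rows, factor):
    """Return {factor value: {score column: mean score}} (hypothetical helper)."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[factor]].append(row)
    return {
        value: {col: mean(int(r[col]) for r in members) for col in SCORE_COLUMNS}
        for value, members in groups.items()
    }


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
by_prep = mean_scores_by(rows, "test preparation course")
for value, means in by_prep.items():
    print(value, {col: round(m, 1) for col, m in means.items()})
```

Passing a different column name, such as `"lunch"` or `"parental level of education"`, groups the same rows by that factor instead. Studying interactions between factors would need a larger sample than these ten rows.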

What patterns and interactions in the data can you find? Let me know in the comments section below.


Download a Small Sample - Download a .csv file with 10 results as a sample set (n=10)
Download a Medium Sample - Download a .csv file with 100 results as a sample set (n=100)
Download a Large Sample - Download a .csv file with 1000 results as a sample set (n=1000)
All data sets are generated on the fly, so you can increase your n by downloading a data set multiple times and combining the files.
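Combining several downloads can be done with the standard library alone. The sketch below is one possible approach, not a prescribed method. It assumes every file starts with the same header row, and the function name `combine_csv_files` and the file names in the usage comment are hypothetical.

```python
import csv


def combine_csv_files(paths, out_path):
    """Concatenate CSV files that share a header row.

    Writes the header once, then every data row from each file.
    Returns the number of data rows written.
    """
    header = None
    total = 0
    with open(out_path, "w", newline="") as out_file:
        writer = csv.writer(out_file)
        for path in paths:
            with open(path, newline="") as in_file:
                reader = csv.reader(in_file)
                file_header = next(reader, None)
                if file_header is None:
                    continue  # skip empty files
                if header is None:
                    header = file_header
                    writer.writerow(header)
                elif file_header != header:
                    raise ValueError(f"{path} has a different header")
                for row in reader:
                    if row:  # ignore blank lines
                        writer.writerow(row)
                        total += 1
    return total


# Example call with hypothetical file names:
# n = combine_csv_files(["exams_1.csv", "exams_2.csv"], "exams_combined.csv")
```

The header check is a deliberate guard: it stops the merge if one of the files has a different column layout, rather than silently mixing mismatched rows.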


All data sets are fictional and should be used for educational purposes only.
