This is a fictional dataset and should only be used for data science training purposes.
This data set includes scores from three exams, along with a variety of personal, social, and economic factors that may interact in their effects on those scores.
| gender | race/ethnicity | parental level of education | lunch | test preparation course | math score | reading score | writing score |
|---|---|---|---|---|---|---|---|
| female | group E | some college | standard | completed | 81 | 84 | 85 |
| female | group A | some high school | free/reduced | completed | 37 | 47 | 50 |
| male | group D | high school | standard | none | 55 | 58 | 52 |
| male | group D | associate's degree | free/reduced | none | 61 | 49 | 57 |
| female | group B | high school | free/reduced | completed | 51 | 77 | 73 |
| male | group B | some high school | standard | none | 79 | 75 | 66 |
| female | group B | associate's degree | standard | none | 40 | 49 | 47 |
| male | group C | some college | standard | none | 75 | 71 | 70 |
| male | group D | bachelor's degree | free/reduced | none | 44 | 48 | 44 |
| female | group C | bachelor's degree | standard | none | 81 | 87 | 80 |
Example Research Questions
- How effective is the test preparation course?
- Which major factors contribute to test outcomes?
- What would be the best way to improve student scores on each test?
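A first pass at the first question could compare mean scores between students who completed the test preparation course and those who did not. The sketch below is one possible starting point, not a definitive analysis. It embeds the ten sample rows shown above as CSV text and assumes the downloaded files use the same column names as the table; `SAMPLE_CSV`, `SCORE_COLUMNS`, and `mean_scores_by` are hypothetical names introduced for illustration.

```python
import csv
import io
from collections import defaultdict
from statistics import mean

# The ten sample rows from the table above, written as CSV text.
# Assumption: the downloadable .csv files use these same column names.
SAMPLE_CSV = """gender,race/ethnicity,parental level of education,lunch,test preparation course,math score,reading score,writing score
female,group E,some college,standard,completed,81,84,85
female,group A,some high school,free/reduced,completed,37,47,50
male,group D,high school,standard,none,55,58,52
male,group D,associate's degree,free/reduced,none,61,49,57
female,group B,high school,free/reduced,completed,51,77,73
male,group B,some high school,standard,none,79,75,66
female,group B,associate's degree,standard,none,40,49,47
male,group C,some college,standard,none,75,71,70
male,group D,bachelor's degree,free/reduced,none,44,48,44
female,group C,bachelor's degree,standard,none,81,87,80
"""

SCORE_COLUMNS = ["math score", "reading score", "writing score"]


def mean_scores_by(rows, factor):
    """Return {factor level: {score column: mean score}} for the given rows."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[factor]].append(row)
    return {
        level: {col: mean(float(r[col]) for r in members) for col in SCORE_COLUMNS}
        for level, members in groups.items()
    }


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
prep_means = mean_scores_by(rows, "test preparation course")
for level, scores in prep_means.items():
    print(level, {col: round(value, 1) for col, value in scores.items()})
```

The same function works for any other factor column, for example `mean_scores_by(rows, "lunch")`. With only ten rows the group means are very noisy, so a larger sample is a better basis for any conclusion.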
What patterns and interactions in the data can you find? Let me know in the comments section below.
Download
- Download a Small Sample: a .csv file with 10 results as a sample set (n=10)
- Download a Medium Sample: a .csv file with 100 results as a sample set (n=100)
- Download a Large Sample: a .csv file with 1000 results as a sample set (n=1000)
All data sets are generated on the fly, so you can increase your n by downloading a data set multiple times and combining the files.
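Combining several downloads could look something like the following minimal sketch. It assumes every file starts with the same header row; `combine_csv_files` is a hypothetical helper written for this example, not something provided with the data sets.

```python
import csv


def combine_csv_files(paths, out_path):
    """Concatenate CSV files that share a header; return the number of data rows written."""
    header = None
    total = 0
    with open(out_path, "w", newline="", encoding="utf-8") as out_file:
        writer = csv.writer(out_file)
        for path in paths:
            with open(path, newline="", encoding="utf-8") as in_file:
                reader = csv.reader(in_file)
                file_header = next(reader, None)
                if file_header is None:
                    continue  # skip empty files
                if header is None:
                    # The first non-empty file defines the header, written once.
                    header = file_header
                    writer.writerow(header)
                elif file_header != header:
                    raise ValueError(f"{path} has a different header")
                for row in reader:
                    writer.writerow(row)
                    total += 1
    return total
```

For example, with two downloads saved under the hypothetical names `exams_1.csv` and `exams_2.csv`, calling `combine_csv_files(["exams_1.csv", "exams_2.csv"], "exams_combined.csv")` would write one file containing the rows of both and return the combined n.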
All data sets are fictional and should be used for educational purposes only.